Fereshteh R. Dastjerdi , David A. Robinson , Liming Cai
{"title":"α-HMM和序列数据的最优解码高阶结构","authors":"Fereshteh R. Dastjerdi , David A. Robinson , Liming Cai","doi":"10.1016/j.jcmds.2022.100065","DOIUrl":null,"url":null,"abstract":"<div><p>Decoding higher-order structure on sequential data is an indispensable task in data science. It requires models to have the capability to characterize interdependencies among hidden events that have generated observable data. However, to be able to decode arbitrary structures, such models would need to cope with the intractability arising from computing context-sensitive relations, likely compromising the quality of answers. To address this important issue, the current paper introduces the <em>arbitrary order hidden Markov model</em> (<span><math><mi>α</mi></math></span>-HMM), an extension of the HMM that permits decoding of the optimal higher-order structure with an assurance of computational tractability. The advantage of the <span><math><mi>α</mi></math></span>-HMM<!--> <!-->is made possible by an identified principle on how random variables influence each other in a stochastic process. In particular, it is shown that decoding the optimal structure with an <span><math><mi>α</mi></math></span>-HMM<!--> <!-->can be computed in <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msup><mrow><mi>n</mi></mrow><mrow><mn>3</mn></mrow></msup><mo>)</mo></mrow></mrow></math></span>-time for any stochastic process of <span><math><mi>n</mi></math></span> random variables. As an application, it is demonstrated the decoding algorithm inspires a simple yet effective algorithm for RNA secondary structure prediction.</p></div>","PeriodicalId":100768,"journal":{"name":"Journal of Computational Mathematics and Data Science","volume":"5 ","pages":"Article 100065"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772415822000256/pdfft?md5=dbdeefc075425bee6733e90b348391d6&pid=1-s2.0-S2772415822000256-main.pdf","citationCount":"0","resultStr":"{\"title\":\"α-HMM and optimal decoding higher-order structures on sequential data\",\"authors\":\"Fereshteh R. Dastjerdi , David A. Robinson , Liming Cai\",\"doi\":\"10.1016/j.jcmds.2022.100065\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Decoding higher-order structure on sequential data is an indispensable task in data science. It requires models to have the capability to characterize interdependencies among hidden events that have generated observable data. However, to be able to decode arbitrary structures, such models would need to cope with the intractability arising from computing context-sensitive relations, likely compromising the quality of answers. To address this important issue, the current paper introduces the <em>arbitrary order hidden Markov model</em> (<span><math><mi>α</mi></math></span>-HMM), an extension of the HMM that permits decoding of the optimal higher-order structure with an assurance of computational tractability. The advantage of the <span><math><mi>α</mi></math></span>-HMM<!--> <!-->is made possible by an identified principle on how random variables influence each other in a stochastic process. In particular, it is shown that decoding the optimal structure with an <span><math><mi>α</mi></math></span>-HMM<!--> <!-->can be computed in <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msup><mrow><mi>n</mi></mrow><mrow><mn>3</mn></mrow></msup><mo>)</mo></mrow></mrow></math></span>-time for any stochastic process of <span><math><mi>n</mi></math></span> random variables. As an application, it is demonstrated the decoding algorithm inspires a simple yet effective algorithm for RNA secondary structure prediction.</p></div>\",\"PeriodicalId\":100768,\"journal\":{\"name\":\"Journal of Computational Mathematics and Data Science\",\"volume\":\"5 \",\"pages\":\"Article 100065\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772415822000256/pdfft?md5=dbdeefc075425bee6733e90b348391d6&pid=1-s2.0-S2772415822000256-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Mathematics and Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772415822000256\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Mathematics and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772415822000256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
α-HMM and optimal decoding higher-order structures on sequential data
Decoding higher-order structure on sequential data is an indispensable task in data science. It requires models to have the capability to characterize interdependencies among hidden events that have generated observable data. However, to be able to decode arbitrary structures, such models would need to cope with the intractability arising from computing context-sensitive relations, likely compromising the quality of answers. To address this important issue, the current paper introduces the arbitrary order hidden Markov model (-HMM), an extension of the HMM that permits decoding of the optimal higher-order structure with an assurance of computational tractability. The advantage of the -HMM is made possible by an identified principle on how random variables influence each other in a stochastic process. In particular, it is shown that decoding the optimal structure with an -HMM can be computed in -time for any stochastic process of random variables. As an application, it is demonstrated the decoding algorithm inspires a simple yet effective algorithm for RNA secondary structure prediction.