{"title":"利用信息准则寻找隐马尔可夫模型中潜在状态的数量","authors":"Jodie Buckby, Ting Wang, David Fletcher, Jiancang Zhuang, Akiko Takeo, Kazushige Obara","doi":"10.1007/s10651-023-00584-5","DOIUrl":null,"url":null,"abstract":"<p>Hidden Markov models (HMMs) are often used to model time series data and are applied in many fields of research. However, estimating the unknown number of hidden states in the Markov chain is a non-trivial component of HMM model selection and an area of active research. Currently, AIC and BIC are commonly used for this purpose, despite theoretical issues and some evidence of poor performance in the literature. Here, motivated by the HMMs developed to model seismic tremor data, we use simulation studies to compare the performance of a number of model selection information criteria when used to select the number of hidden states in HMMs, including an adjusted BIC not previously used with HMMs. We find that AIC and BIC are not always reliable tools for selecting the number of hidden states in HMMs and that other information criteria such as adjusted BIC can actually perform better, depending on factors such as sample size and sojourn times in each state. We apply the information criteria to a set of HMMs fitted to seismic tremor data and compare the models selected by the different criteria.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"101 1 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Finding the number of latent states in hidden Markov models using information criteria\",\"authors\":\"Jodie Buckby, Ting Wang, David Fletcher, Jiancang Zhuang, Akiko Takeo, Kazushige Obara\",\"doi\":\"10.1007/s10651-023-00584-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Hidden Markov models (HMMs) are often used to model time series data and are applied in many fields of research. However, estimating the unknown number of hidden states in the Markov chain is a non-trivial component of HMM model selection and an area of active research. Currently, AIC and BIC are commonly used for this purpose, despite theoretical issues and some evidence of poor performance in the literature. Here, motivated by the HMMs developed to model seismic tremor data, we use simulation studies to compare the performance of a number of model selection information criteria when used to select the number of hidden states in HMMs, including an adjusted BIC not previously used with HMMs. We find that AIC and BIC are not always reliable tools for selecting the number of hidden states in HMMs and that other information criteria such as adjusted BIC can actually perform better, depending on factors such as sample size and sojourn times in each state. We apply the information criteria to a set of HMMs fitted to seismic tremor data and compare the models selected by the different criteria.</p>\",\"PeriodicalId\":50519,\"journal\":{\"name\":\"Environmental and Ecological Statistics\",\"volume\":\"101 1 1\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2023-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental and Ecological Statistics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s10651-023-00584-5\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental and Ecological Statistics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10651-023-00584-5","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Finding the number of latent states in hidden Markov models using information criteria
Hidden Markov models (HMMs) are often used to model time series data and are applied in many fields of research. However, estimating the unknown number of hidden states in the Markov chain is a non-trivial component of HMM model selection and an area of active research. Currently, AIC and BIC are commonly used for this purpose, despite theoretical issues and some evidence of poor performance in the literature. Here, motivated by the HMMs developed to model seismic tremor data, we use simulation studies to compare the performance of a number of model selection information criteria when used to select the number of hidden states in HMMs, including an adjusted BIC not previously used with HMMs. We find that AIC and BIC are not always reliable tools for selecting the number of hidden states in HMMs and that other information criteria such as adjusted BIC can actually perform better, depending on factors such as sample size and sojourn times in each state. We apply the information criteria to a set of HMMs fitted to seismic tremor data and compare the models selected by the different criteria.
期刊介绍:
Environmental and Ecological Statistics publishes papers on practical applications of statistics and related quantitative methods to environmental science addressing contemporary issues.
Emphasis is on applied mathematical statistics, statistical methodology, and data interpretation and improvement for future use, with a view to advance statistics for environment, ecology and environmental health, and to advance environmental theory and practice using valid statistics.
Besides clarity of exposition, a single most important criterion for publication is the appropriateness of the statistical method to the particular environmental problem. The Journal covers all aspects of the collection, analysis, presentation and interpretation of environmental data for research, policy and regulation. The Journal is cross-disciplinary within the context of contemporary environmental issues and the associated statistical tools, concepts and methods. The Journal broadly covers theory and methods, case studies and applications, environmental change and statistical ecology, environmental health statistics and stochastics, and related areas. Special features include invited discussion papers; research communications; technical notes and consultation corner; mini-reviews; letters to the Editor; news, views and announcements; hardware and software reviews; data management etc.