Item response theory models are commonly adopted in educational assessment and psychological measurement. Such models need to be modified to accommodate practical situations when statistical sampling assumptions are violated. Omission is a common phenomenon in educational testing. In modern computer-based testing, we have not only examinees' responses but also their response times. This paper utilizes response time and develops a joint model of responses and response times. The new approach is analogous to those developed in survival analysis for dealing with right-censored data. In particular, a key ingredient is the introduction of the omission time (OT), which corresponds to the censoring time in survival analysis. By competing risk formulation, the proposed method provides an alternative narrative to how an item becomes answered versus omitted, depending on the competing relationship of response time and OT, so that the likelihood function can be constructed properly. The maximum likelihood estimator can be computed via the expectation-maximization algorithm. Simulation studies were conducted to evaluate the performance of the proposed method and its robustness against various mis-specifications. The method was applied to a dataset from the PISA 2015 Science Test.
{"title":"Jointly modeling responses and omitted items by a competing risk model: A survival analysis approach","authors":"Jinxin Guo, Xin Xu, Guanhua Fang, Zhiliang Ying, Susu Zhang","doi":"10.1111/bmsp.12382","DOIUrl":"10.1111/bmsp.12382","url":null,"abstract":"<p>Item response theory models are commonly adopted in educational assessment and psychological measurement. Such models need to be modified to accommodate practical situations when statistical sampling assumptions are violated. Omission is a common phenomenon in educational testing. In modern computer-based testing, we have not only examinees' responses but also their response times. This paper utilizes response time and develops a joint model of responses and response times. The new approach is analogous to those developed in survival analysis for dealing with right-censored data. In particular, a key ingredient is the introduction of the omission time (OT), which corresponds to the censoring time in survival analysis. By competing risk formulation, the proposed method provides an alternative narrative to how an item becomes answered versus omitted, depending on the competing relationship of response time and OT, so that the likelihood function can be constructed properly. The maximum likelihood estimator can be computed via the expectation-maximization algorithm. Simulation studies were conducted to evaluate the performance of the proposed method and its robustness against various mis-specifications. The method was applied to a dataset from the PISA 2015 Science Test.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"804-829"},"PeriodicalIF":1.8,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12382","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143069619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Recent technological advancements have enabled the collection of intensive longitudinal data (ILD), consisting of repeated measurements from the same individual. The threshold autoregressive (TAR) model is often used to capture the dynamic outcome process in ILD, with autoregressive parameters varying based on outcome variable levels. For ILD from multiple individuals, multilevel TAR (ML-TAR) models have been proposed, with Bayesian approaches typically used for parameter estimation. However, fitting ML-TAR models can be computationally challenging. This study introduces a mean-field variational Bayes (MFVB) algorithm as an alternative to traditional Bayesian inference. By optimizing to approximate posterior densities, variational Bayes aims to find the best approximation within a defined set of distributions. Simulation results demonstrate that our MFVB algorithm is significantly faster than the standard Markov chain Monte Carlo (MCMC) approach. Moreover, increasing the number of individuals or time points enhances the accuracy of the parameter estimates using MFVB, suggesting that sufficient data are crucial for accurate estimation in complex models like ML-TAR models. When applied to real-world data, the MFVB algorithm was significantly more efficient than MCMC and maintained similar accuracy. Thus, the MFVB algorithm is a faster and more consistent alternative to MCMC for large-scale inference in ILD models.
{"title":"Efficient and accurate variational inference for multilevel threshold autoregressive models in intensive longitudinal data","authors":"Azizur Rahman, Depeng Jiang, Lisa M. Lix","doi":"10.1111/bmsp.12381","DOIUrl":"10.1111/bmsp.12381","url":null,"abstract":"<p>Recent technological advancements have enabled the collection of intensive longitudinal data (ILD), consisting of repeated measurements from the same individual. The threshold autoregressive (TAR) model is often used to capture the dynamic outcome process in ILD, with autoregressive parameters varying based on outcome variable levels. For ILD from multiple individuals, multilevel TAR (ML-TAR) models have been proposed, with Bayesian approaches typically used for parameter estimation. However, fitting ML-TAR models can be computationally challenging. This study introduces a mean-field variational Bayes (MFVB) algorithm as an alternative to traditional Bayesian inference. By optimizing to approximate posterior densities, variational Bayes aims to find the best approximation within a defined set of distributions. Simulation results demonstrate that our MFVB algorithm is significantly faster than the standard Markov chain Monte Carlo (MCMC) approach. Moreover, increasing the number of individuals or time points enhances the accuracy of the parameter estimates using MFVB, suggesting that sufficient data are crucial for accurate estimation in complex models like ML-TAR models. When applied to real-world data, the MFVB algorithm was significantly more efficient than MCMC and maintained similar accuracy. Thus, the MFVB algorithm is a faster and more consistent alternative to MCMC for large-scale inference in ILD models.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"785-803"},"PeriodicalIF":1.8,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143016865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Elisa Frutos-Bernal, Eva Ceulemans, Purificación Galindo-Villardón, Tom F. Wilderjans
In various areas of science, researchers try to gain insight into important processes by jointly analysing different datasets containing information regarding common aspects of these processes. For example, to explain individual differences in personality, researchers collect, for the same set of persons, data regarding behavioural signatures (i.e., the reaction profile of a person across different situations), on the one hand, and traits or dispositions, on the other hand. To uncover the processes underlying such coupled data, to all N-way