{"title":"多种输入模型的诊断检查","authors":"Yang Zhao","doi":"10.1007/s10182-021-00429-1","DOIUrl":null,"url":null,"abstract":"<div><p>Model checking in multiple imputation (MI, Rubin in Multiple imputation for nonresponse in surveys, Wiley, New York, 1987) becomes increasingly important with the recent developments in MI and its widespread use in statistical analysis with missing data (e.g. van Buuren et al. in J Stat Comput Simul 76(12):1049–1064, 2006; van Buuren and Groothuis-Oudshoorn in J Stat Soft 45(3):1–67, 2011; Chen et al. in Biometrics 67:799–809, 2011; Nguyen et al. in Emerg Themes Epidemiol 14(8):1–12, 2017). The currently recommended posterior predictive checking method (He and Zaslavsky in Stat Med 31:1–18, 2012; Nguyen et al. in Biom J 4:676–694, 2015) is less effective when the proportion of missing values increases and its produced posterior predictive <i>p</i> value is not supported by a null distribution as a standard <i>p</i> value (Meng in Annu Stat 22:1142–1160, 1994). This research develops a new diagnostic method for checking MI models and proposes a test statistic with a standard <i>p</i> value. The new diagnostic checking method is effective and flexible. It does not depend on the proportion of missing values and can deal with data sets with arbitrary nonmonotone missing data patterns. We examine the performance of the proposed method in a simulation study and illustrate the method in a study of coronary disease and associated factors.</p></div>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-021-00429-1.pdf","citationCount":"2","resultStr":"{\"title\":\"Diagnostic checking of multiple imputation models\",\"authors\":\"Yang Zhao\",\"doi\":\"10.1007/s10182-021-00429-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Model checking in multiple imputation (MI, Rubin in Multiple imputation for nonresponse in surveys, Wiley, New York, 1987) becomes increasingly important with the recent developments in MI and its widespread use in statistical analysis with missing data (e.g. van Buuren et al. in J Stat Comput Simul 76(12):1049–1064, 2006; van Buuren and Groothuis-Oudshoorn in J Stat Soft 45(3):1–67, 2011; Chen et al. in Biometrics 67:799–809, 2011; Nguyen et al. in Emerg Themes Epidemiol 14(8):1–12, 2017). The currently recommended posterior predictive checking method (He and Zaslavsky in Stat Med 31:1–18, 2012; Nguyen et al. in Biom J 4:676–694, 2015) is less effective when the proportion of missing values increases and its produced posterior predictive <i>p</i> value is not supported by a null distribution as a standard <i>p</i> value (Meng in Annu Stat 22:1142–1160, 1994). This research develops a new diagnostic method for checking MI models and proposes a test statistic with a standard <i>p</i> value. The new diagnostic checking method is effective and flexible. It does not depend on the proportion of missing values and can deal with data sets with arbitrary nonmonotone missing data patterns. We examine the performance of the proposed method in a simulation study and illustrate the method in a study of coronary disease and associated factors.</p></div>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2022-01-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10182-021-00429-1.pdf\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10182-021-00429-1\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10182-021-00429-1","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 2
摘要
多重插补中的模型检查(MI,Rubin在调查中无响应的多重插补中,Wiley,New York,1987)随着MI的最新发展及其在缺失数据的统计分析中的广泛使用而变得越来越重要(例如,van Buuren等人在J Stat Comput Simul 76(12):1049–10642006;van Buuren和Groothuis Oudshoorn在J Stat Soft 45(3):1–672011;Chen等人在《生物计量学》67:799–8092011;Nguyen等人在《新兴主题流行病学》第14(8)期:2017年1月12日)。当前推荐的后验预测检查方法(He和Zaslavsky在《Stat Med》31:1-182012;Nguyen等人在《Biom J》4:676–6942015中)在缺失值比例增加时效果较差,并且其产生的后验预报p值没有作为标准p值的零分布支持(Meng在《Annu Stat》22:1142–11601994中)。本研究开发了一种检查MI模型的新诊断方法,并提出了一种标准p值的检验统计量。新的诊断检查方法有效且灵活。它不依赖于缺失值的比例,并且可以处理具有任意非单调缺失数据模式的数据集。我们在模拟研究中检验了所提出的方法的性能,并在冠状动脉疾病和相关因素的研究中说明了该方法。
Model checking in multiple imputation (MI, Rubin in Multiple imputation for nonresponse in surveys, Wiley, New York, 1987) becomes increasingly important with the recent developments in MI and its widespread use in statistical analysis with missing data (e.g. van Buuren et al. in J Stat Comput Simul 76(12):1049–1064, 2006; van Buuren and Groothuis-Oudshoorn in J Stat Soft 45(3):1–67, 2011; Chen et al. in Biometrics 67:799–809, 2011; Nguyen et al. in Emerg Themes Epidemiol 14(8):1–12, 2017). The currently recommended posterior predictive checking method (He and Zaslavsky in Stat Med 31:1–18, 2012; Nguyen et al. in Biom J 4:676–694, 2015) is less effective when the proportion of missing values increases and its produced posterior predictive p value is not supported by a null distribution as a standard p value (Meng in Annu Stat 22:1142–1160, 1994). This research develops a new diagnostic method for checking MI models and proposes a test statistic with a standard p value. The new diagnostic checking method is effective and flexible. It does not depend on the proportion of missing values and can deal with data sets with arbitrary nonmonotone missing data patterns. We examine the performance of the proposed method in a simulation study and illustrate the method in a study of coronary disease and associated factors.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.