诊断潜类模型中检测偏离条件独立假设：一项模拟研究。

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES BMC Medical Research Methodology Pub Date : 2024-12-05 DOI:10.1186/s12874-024-02432-x

Yasin Okkaoglu, Nicky J Welton, Hayley E Jones

{"title":"诊断潜类模型中检测偏离条件独立假设：一项模拟研究。","authors":"Yasin Okkaoglu, Nicky J Welton, Hayley E Jones","doi":"10.1186/s12874-024-02432-x","DOIUrl":null,"url":null,"abstract":"Background: Latent class models can be used to estimate diagnostic accuracy without a gold standard test. Early studies often assumed independence between tests given the true disease state, however this can lead to biased estimates when there are inter-test dependencies. Residual correlation plots and chi-squared statistics have been commonly utilized to assess the validity of the conditional independence assumption and, when it does not hold, identify which test pairs are conditionally dependent. We aimed to assess the performance of these tools with a simulation study covering a wide range of scenarios.Methods: We generated data sets from a model with four tests and a dependence between tests 1 and 2 within the diseased group. We varied sample size, prevalence, covariance, sensitivity and specificity, with 504 combinations of these in total, and 1000 data sets for each combination. We fitted the conditional independence model in a Bayesian framework, and reported absolute bias, coverage, and how often the residual correlation plots, <math> <msup><mrow><mi>G</mi></mrow> <mn>2</mn></msup> </math> and <math> <msup><mrow><mi>χ</mi></mrow> <mn>2</mn></msup> </math> statistics indicated lack-of-fit globally or for each test pair.Results: Across all settings, residual correlation plots, pairwise <math> <msup><mrow><mi>G</mi></mrow> <mn>2</mn></msup> </math> and <math> <msup><mrow><mi>χ</mi></mrow> <mn>2</mn></msup> </math> detected the correct correlated pair of tests only 12.1%, 10.3%, and 10.3% of the time, respectively, but incorrectly suggested dependence between tests 3 and 4 64.9%, 49.7%, and 49.5% of the time. We observed some variation in this across parameter settings, with these tools appearing to perform more as intended when tests 3 and 4 were both much more accurate than tests 1 and 2. Residual correlation plots, <math> <msup><mrow><mi>G</mi></mrow> <mn>2</mn></msup> </math> and <math> <msup><mrow><mi>χ</mi></mrow> <mn>2</mn></msup> </math> statistics identified a lack of overall fit in 74.3%, 64.5% and 67.5% of models, respectively. The conditional independence model tended to overestimate the sensitivities of the correlated tests (median bias across all scenarios 0.094, 2.5th and 97.5th percentiles -0.003, 0.397) and underestimate prevalence and the specificities of the uncorrelated tests.Conclusions: Residual correlation plots and chi-squared statistics cannot be relied upon to identify which tests are conditionally dependent, and also have relatively low power to detect lack of overall fit. This is important since failure to account for conditional dependence can lead to highly biased parameter estimates.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"299"},"PeriodicalIF":3.9000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11619692/pdf/","citationCount":"0","resultStr":"{\"title\":\"Detecting departures from the conditional independence assumption in diagnostic latent class models: a simulation study.\",\"authors\":\"Yasin Okkaoglu, Nicky J Welton, Hayley E Jones\",\"doi\":\"10.1186/s12874-024-02432-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Latent class models can be used to estimate diagnostic accuracy without a gold standard test. Early studies often assumed independence between tests given the true disease state, however this can lead to biased estimates when there are inter-test dependencies. Residual correlation plots and chi-squared statistics have been commonly utilized to assess the validity of the conditional independence assumption and, when it does not hold, identify which test pairs are conditionally dependent. We aimed to assess the performance of these tools with a simulation study covering a wide range of scenarios.Methods: We generated data sets from a model with four tests and a dependence between tests 1 and 2 within the diseased group. We varied sample size, prevalence, covariance, sensitivity and specificity, with 504 combinations of these in total, and 1000 data sets for each combination. We fitted the conditional independence model in a Bayesian framework, and reported absolute bias, coverage, and how often the residual correlation plots, <math> <msup><mrow><mi>G</mi></mrow> <mn>2</mn></msup> </math> and <math> <msup><mrow><mi>χ</mi></mrow> <mn>2</mn></msup> </math> statistics indicated lack-of-fit globally or for each test pair.Results: Across all settings, residual correlation plots, pairwise <math> <msup><mrow><mi>G</mi></mrow> <mn>2</mn></msup> </math> and <math> <msup><mrow><mi>χ</mi></mrow> <mn>2</mn></msup> </math> detected the correct correlated pair of tests only 12.1%, 10.3%, and 10.3% of the time, respectively, but incorrectly suggested dependence between tests 3 and 4 64.9%, 49.7%, and 49.5% of the time. We observed some variation in this across parameter settings, with these tools appearing to perform more as intended when tests 3 and 4 were both much more accurate than tests 1 and 2. Residual correlation plots, <math> <msup><mrow><mi>G</mi></mrow> <mn>2</mn></msup> </math> and <math> <msup><mrow><mi>χ</mi></mrow> <mn>2</mn></msup> </math> statistics identified a lack of overall fit in 74.3%, 64.5% and 67.5% of models, respectively. The conditional independence model tended to overestimate the sensitivities of the correlated tests (median bias across all scenarios 0.094, 2.5th and 97.5th percentiles -0.003, 0.397) and underestimate prevalence and the specificities of the uncorrelated tests.Conclusions: Residual correlation plots and chi-squared statistics cannot be relied upon to identify which tests are conditionally dependent, and also have relatively low power to detect lack of overall fit. This is important since failure to account for conditional dependence can lead to highly biased parameter estimates.\",\"PeriodicalId\":9114,\"journal\":{\"name\":\"BMC Medical Research Methodology\",\"volume\":\"24 1\",\"pages\":\"299\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11619692/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Research Methodology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12874-024-02432-x\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-024-02432-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

摘要

背景：潜在类别模型可用于估计诊断准确性，而无需金标准检验。早期的研究通常假设在给定真实疾病状态的测试之间是独立的，然而，当存在测试之间的依赖关系时，这可能导致有偏差的估计。残差相关图和卡方统计量通常用于评估条件独立假设的有效性，当它不成立时，确定哪些检验对是条件相关的。我们的目标是通过涵盖广泛场景的模拟研究来评估这些工具的性能。方法：我们从一个包含四项测试的模型中生成数据集，并且在患病组中测试1和测试2之间存在依赖性。我们改变了样本量、患病率、协方差、敏感性和特异性，总共有504种组合，每种组合有1000个数据集。我们在贝叶斯框架中拟合了条件独立模型，并报告了绝对偏差、覆盖率以及残差相关图、g2和χ 2统计量显示全局或每个检验对缺乏拟合的频率。结果：在所有设置中，残差相关图、成对g2和χ 2分别仅在12.1%、10.3%和10.3%的时间内检测到正确的相关检验对，但错误地提示检验3和4之间的相关性为64.9%、49.7%和49.5%。我们观察到不同参数设置之间存在一些差异，当测试3和4都比测试1和2更准确时，这些工具似乎更符合预期。残差相关图、g2和χ 2统计分别发现74.3%、64.5%和67.5%的模型缺乏整体拟合。条件独立模型倾向于高估相关检验的敏感性（所有情景的中位偏差为0.094、2.5和97.5百分位数-0.003、0.397），低估不相关检验的患病率和特异性。结论：残差相关图和卡方统计量不能用来确定哪些检验是有条件依赖的，而且检测总体拟合缺乏的能力也相对较低。这一点很重要，因为不考虑条件依赖性可能导致参数估计高度偏倚。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Detecting departures from the conditional independence assumption in diagnostic latent class models: a simulation study.

Background: Latent class models can be used to estimate diagnostic accuracy without a gold standard test. Early studies often assumed independence between tests given the true disease state, however this can lead to biased estimates when there are inter-test dependencies. Residual correlation plots and chi-squared statistics have been commonly utilized to assess the validity of the conditional independence assumption and, when it does not hold, identify which test pairs are conditionally dependent. We aimed to assess the performance of these tools with a simulation study covering a wide range of scenarios.

Methods: We generated data sets from a model with four tests and a dependence between tests 1 and 2 within the diseased group. We varied sample size, prevalence, covariance, sensitivity and specificity, with 504 combinations of these in total, and 1000 data sets for each combination. We fitted the conditional independence model in a Bayesian framework, and reported absolute bias, coverage, and how often the residual correlation plots, $G^{2}$ and $χ^{2}$ statistics indicated lack-of-fit globally or for each test pair.

Results: Across all settings, residual correlation plots, pairwise $G^{2}$ and $χ^{2}$ detected the correct correlated pair of tests only 12.1%, 10.3%, and 10.3% of the time, respectively, but incorrectly suggested dependence between tests 3 and 4 64.9%, 49.7%, and 49.5% of the time. We observed some variation in this across parameter settings, with these tools appearing to perform more as intended when tests 3 and 4 were both much more accurate than tests 1 and 2. Residual correlation plots, $G^{2}$ and $χ^{2}$ statistics identified a lack of overall fit in 74.3%, 64.5% and 67.5% of models, respectively. The conditional independence model tended to overestimate the sensitivities of the correlated tests (median bias across all scenarios 0.094, 2.5th and 97.5th percentiles -0.003, 0.397) and underestimate prevalence and the specificities of the uncorrelated tests.

Conclusions: Residual correlation plots and chi-squared statistics cannot be relied upon to identify which tests are conditionally dependent, and also have relatively low power to detect lack of overall fit. This is important since failure to account for conditional dependence can lead to highly biased parameter estimates.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

BMC Medical Research Methodology 医学-卫生保健

CiteScore

6.50

自引率

2.50%

发文量

298

审稿时长

3-8 weeks

期刊介绍： BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.