首页 > 最新文献

Biostatistics and Epidemiology最新文献

英文 中文
Classification accuracy goals for diagnostic tests based on risk stratification 基于风险分层的诊断测试分类准确性目标
Q3 Medicine Pub Date : 2021-03-17 DOI: 10.1080/24709360.2021.1878406
G. Pennello
In diagnostic test evaluation, performance goals are often set for classification accuracy measures such as specificity, sensitivity and diagnostic likelihood ratio. For tests that detect rare conditions, classification accuracy goals are attractive because they can be evaluated in case-control studies enriched for the condition. A neglected area of research is determining classification accuracy goals that confer clinical usefulness of a test. We determine classification accuracy goals based on desired risk stratification, i.e. the post-test risk of having the condition compared with the pre-test risk. We determine goals for rule-out tests, rule-in tests, and those that do both. Goals for negative and positive likelihood ratios (NLR, PLR) are emphasized because of their natural relationships with risk stratification via Bayes Theorem. Goals for specificity and sensitivity are implied by goals on NLR and PLR. Goals that confer superiority or non-inferiority of a test to a comparator are based on approximating risk differences and relative risks by functions of likelihood ratios. Inference is based on Wald confidence intervals for ratios of likelihood ratios. To illustrate, we consider hypothetical data on a fetal fibronectin assay for ruling out risk of pre-term birth and two human papillomavirus assays for detecting cervical cancer. Trial registration ClinicalTrials.gov identifier: NCT01931566.
在诊断测试评估中,通常为分类准确性指标设定性能目标,如特异性、敏感性和诊断似然比。对于检测罕见疾病的测试,分类准确性目标很有吸引力,因为它们可以在针对该疾病的病例对照研究中进行评估。一个被忽视的研究领域是确定分类准确性目标,从而赋予测试临床实用性。我们根据期望的风险分层来确定分类准确性目标,即与测试前风险相比,测试后出现这种情况的风险。我们确定排除测试、规则引入测试以及两者兼而有之的测试的目标。负似然比和正似然比(NLR,PLR)的目标被强调,因为它们通过贝叶斯定理与风险分层有着天然的关系。NLR和PLR的目标隐含了特异性和敏感性的目标。将测试的优缺点赋予比较者的目标是基于通过似然比函数近似风险差异和相对风险。推理是基于似然比的Wald置信区间。为了说明这一点,我们考虑了一项用于排除早产风险的胎儿纤连蛋白检测和两项用于检测宫颈癌症的人乳头瘤病毒检测的假设数据。试验注册ClinicalTrials.gov标识符:NCT01931566。
{"title":"Classification accuracy goals for diagnostic tests based on risk stratification","authors":"G. Pennello","doi":"10.1080/24709360.2021.1878406","DOIUrl":"https://doi.org/10.1080/24709360.2021.1878406","url":null,"abstract":"In diagnostic test evaluation, performance goals are often set for classification accuracy measures such as specificity, sensitivity and diagnostic likelihood ratio. For tests that detect rare conditions, classification accuracy goals are attractive because they can be evaluated in case-control studies enriched for the condition. A neglected area of research is determining classification accuracy goals that confer clinical usefulness of a test. We determine classification accuracy goals based on desired risk stratification, i.e. the post-test risk of having the condition compared with the pre-test risk. We determine goals for rule-out tests, rule-in tests, and those that do both. Goals for negative and positive likelihood ratios (NLR, PLR) are emphasized because of their natural relationships with risk stratification via Bayes Theorem. Goals for specificity and sensitivity are implied by goals on NLR and PLR. Goals that confer superiority or non-inferiority of a test to a comparator are based on approximating risk differences and relative risks by functions of likelihood ratios. Inference is based on Wald confidence intervals for ratios of likelihood ratios. To illustrate, we consider hypothetical data on a fetal fibronectin assay for ruling out risk of pre-term birth and two human papillomavirus assays for detecting cervical cancer. Trial registration ClinicalTrials.gov identifier: NCT01931566.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"149 - 168"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1878406","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44791322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Case fatality risk estimated from routinely collected disease surveillance data: application to COVID–19 根据常规收集的疾病监测数据估计的病死率风险:在COVID-19中的应用
Q3 Medicine Pub Date : 2021-01-02 DOI: 10.1080/24709360.2021.1913708
I. Marschner
Case fatality risk (CFR) is the probability of death among cases of a disease. A crude CFR estimate is the ratio of the number deaths to the number of cases of the disease. This estimate is biased, however, particularly during outbreaks of emerging infectious diseases such as COVID-19, because the death time of recent cases is subject to right censoring. Instead, we propose deconvolution methods applied to routinely collected surveillance data of unlinked case and death counts over time. We begin by considering the death series to be the convolution of the case series and the fatality distribution, which is the subdistribution of the time between diagnosis and death. We then use deconvolution methods to estimate this fatality distribution. This provides a CFR estimate together with information about the distribution of time to death. Importantly, this information is extracted without the need to make strong assumptions used in previous analyses. The methods are applied to COVID-19 surveillance data from a range of countries illustrating substantial CFR differences. Simulations show that crude approaches lead to underestimation, particularly early in an outbreak, and that the proposed approach can rectify this bias. An R package called covidSurv is available for implementing the analyses.
病死率风险(CFR)是指疾病病例中死亡的概率。粗略估计的病死率是指死亡人数与患病人数之比。然而,这一估计有偏差,特别是在COVID-19等新出现的传染病爆发期间,因为最近病例的死亡时间需要进行正确的审查。相反,我们建议将反卷积方法应用于常规收集的随时间变化的无关联病例和死亡计数的监测数据。我们首先考虑死亡序列是病例序列和病死率分布的卷积,病死率分布是诊断和死亡之间时间的子分布。然后,我们使用反卷积方法来估计这种死亡率分布。这提供了病死率估计值以及有关死亡时间分布的信息。重要的是,提取这些信息时不需要像以前的分析那样做出强有力的假设。这些方法应用于来自一系列国家的COVID-19监测数据,这些数据显示病死率存在巨大差异。模拟表明,粗糙的方法会导致低估,特别是在爆发初期,而提出的方法可以纠正这种偏差。一个名为covid - surv的R包可用于实施分析。
{"title":"Case fatality risk estimated from routinely collected disease surveillance data: application to COVID–19","authors":"I. Marschner","doi":"10.1080/24709360.2021.1913708","DOIUrl":"https://doi.org/10.1080/24709360.2021.1913708","url":null,"abstract":"Case fatality risk (CFR) is the probability of death among cases of a disease. A crude CFR estimate is the ratio of the number deaths to the number of cases of the disease. This estimate is biased, however, particularly during outbreaks of emerging infectious diseases such as COVID-19, because the death time of recent cases is subject to right censoring. Instead, we propose deconvolution methods applied to routinely collected surveillance data of unlinked case and death counts over time. We begin by considering the death series to be the convolution of the case series and the fatality distribution, which is the subdistribution of the time between diagnosis and death. We then use deconvolution methods to estimate this fatality distribution. This provides a CFR estimate together with information about the distribution of time to death. Importantly, this information is extracted without the need to make strong assumptions used in previous analyses. The methods are applied to COVID-19 surveillance data from a range of countries illustrating substantial CFR differences. Simulations show that crude approaches lead to underestimation, particularly early in an outbreak, and that the proposed approach can rectify this bias. An R package called covidSurv is available for implementing the analyses.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"49 - 68"},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1913708","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"60127878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Methods for detecting outlying regions and influence diagnosis in multi-regional clinical trials 多区域临床试验中边缘区域的检测方法及影响诊断
Q3 Medicine Pub Date : 2021-01-02 DOI: 10.1080/24709360.2021.1921944
M. Aoki, H. Noma, M. Gosho
Due to the globalization of drug development, multi-regional clinical trials (MRCTs) have been increasingly adopted in clinical evaluations. In MRCTs, the primary objective is to demonstrate the efficacy of new drugs in all participating regions, but heterogeneity of various relevant factors across these regions can cause inconsistency of treatment effects. In particular, outlying regions with extreme profiles can influence the overall conclusions of these studies. In this article, we propose quantitative methods to detect these outlying regions and to assess their influences in MRCTs. The approaches are as follows: (1) a method using a dfbeta-like measure, a studentized residual obtained by a leave-one-out cross-validation (LOOCV) scheme; (2) a model-based significance testing method using a mean-shifted model; (3) a method using a relative change measure for the variance estimate of the overall effect estimator; and (4) a method using a relative change measure for the heterogeneity variance estimate in a random-effects model. Parametric bootstrap schemes are proposed to accurately assess the statistical significance and variabilities of the aforementioned influence diagnostic tools. We illustrate the effectiveness of these proposed methods via applications to two MRCTs, the RECORD and RENAAL studies.
由于药物开发的全球化,多区域临床试验(MRCT)越来越多地被用于临床评估。在MRCT中,主要目标是证明新药在所有参与区域的疗效,但这些区域的各种相关因素的异质性可能导致治疗效果的不一致。特别是,具有极端剖面的边远地区可能会影响这些研究的总体结论。在这篇文章中,我们提出了定量方法来检测这些边缘区域,并评估它们在MRCT中的影响。方法如下:(1)使用类dfbeta测度的方法,即通过留一交叉验证(LOOCV)方案获得的学生化残差;(2) 使用均值偏移模型的基于模型的显著性测试方法;(3) 将相对变化度量用于所述总体效应估计器的方差估计的方法;以及(4)一种在随机效应模型中使用相对变化度量来估计异质性方差的方法。提出了参数自举方案,以准确评估上述影响诊断工具的统计显著性和可变性。我们通过应用于两项MRCT,即RECORD和RENAL研究来说明这些拟议方法的有效性。
{"title":"Methods for detecting outlying regions and influence diagnosis in multi-regional clinical trials","authors":"M. Aoki, H. Noma, M. Gosho","doi":"10.1080/24709360.2021.1921944","DOIUrl":"https://doi.org/10.1080/24709360.2021.1921944","url":null,"abstract":"Due to the globalization of drug development, multi-regional clinical trials (MRCTs) have been increasingly adopted in clinical evaluations. In MRCTs, the primary objective is to demonstrate the efficacy of new drugs in all participating regions, but heterogeneity of various relevant factors across these regions can cause inconsistency of treatment effects. In particular, outlying regions with extreme profiles can influence the overall conclusions of these studies. In this article, we propose quantitative methods to detect these outlying regions and to assess their influences in MRCTs. The approaches are as follows: (1) a method using a dfbeta-like measure, a studentized residual obtained by a leave-one-out cross-validation (LOOCV) scheme; (2) a model-based significance testing method using a mean-shifted model; (3) a method using a relative change measure for the variance estimate of the overall effect estimator; and (4) a method using a relative change measure for the heterogeneity variance estimate in a random-effects model. Parametric bootstrap schemes are proposed to accurately assess the statistical significance and variabilities of the aforementioned influence diagnostic tools. We illustrate the effectiveness of these proposed methods via applications to two MRCTs, the RECORD and RENAAL studies.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"30 - 48"},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1921944","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45719376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Region as a risk factor for asthma prevalence: statistical evidence from administrative data 地区作为哮喘患病率的危险因素:来自行政数据的统计证据
Q3 Medicine Pub Date : 2021-01-02 DOI: 10.1080/24709360.2021.1924495
R. Wesonga, Khidir M. Abdelbasit
Geographical regions may have an influence on asthma exacerbation. No conclusive study has been conducted to fully support or dissipate this assertion. We sought to use a data-driven approach to investigate the possible effect of geographical location on asthma. This study was based on data collected by the Ministry of Health over a 6-year period from 2010 to 2015 and presented in their annual reports. Prevalence rates for 11 regions were computed using the analysis of variance and regression models to determine the proximal nature of the region as a risk factor for asthma. Our results show a statistically significant difference in prevalence rates of asthma among the 11 regions. The asthma prevalence rate among the male population was 18% (OR = 1.18, p = .011) more than for the female population. There was a positive marginal increase in the asthma prevalence over the period. Further, five groups were derived based on asthma prevalence rates and trends. The region has proximal risk factor and significantly associated with asthma prevalence over the period. We recommend the creation of a control mechanism that targets regions with higher prevalence and increasing trends.
地理区域可能对哮喘恶化有影响。没有进行任何结论性研究来完全支持或消除这一说法。我们试图使用数据驱动的方法来研究地理位置对哮喘的可能影响。这项研究基于卫生部在2010年至2015年6年期间收集的数据,并在其年度报告中提出。使用方差分析和回归模型计算了11个地区的患病率,以确定该地区作为哮喘风险因素的近端性质。我们的研究结果显示,11个地区的哮喘患病率存在统计学上的显著差异。男性人群哮喘患病率为18%(OR = 1.18,p = .011)比女性人口多。在这段时间内,哮喘患病率有一个正的边际增长。此外,根据哮喘患病率和趋势得出了五组。该地区具有近端风险因素,并与该时期的哮喘患病率显著相关。我们建议建立一个控制机制,针对流行率较高和呈上升趋势的地区。
{"title":"Region as a risk factor for asthma prevalence: statistical evidence from administrative data","authors":"R. Wesonga, Khidir M. Abdelbasit","doi":"10.1080/24709360.2021.1924495","DOIUrl":"https://doi.org/10.1080/24709360.2021.1924495","url":null,"abstract":"Geographical regions may have an influence on asthma exacerbation. No conclusive study has been conducted to fully support or dissipate this assertion. We sought to use a data-driven approach to investigate the possible effect of geographical location on asthma. This study was based on data collected by the Ministry of Health over a 6-year period from 2010 to 2015 and presented in their annual reports. Prevalence rates for 11 regions were computed using the analysis of variance and regression models to determine the proximal nature of the region as a risk factor for asthma. Our results show a statistically significant difference in prevalence rates of asthma among the 11 regions. The asthma prevalence rate among the male population was 18% (OR = 1.18, p = .011) more than for the female population. There was a positive marginal increase in the asthma prevalence over the period. Further, five groups were derived based on asthma prevalence rates and trends. The region has proximal risk factor and significantly associated with asthma prevalence over the period. We recommend the creation of a control mechanism that targets regions with higher prevalence and increasing trends.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"19 - 29"},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1924495","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42433147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative analysis of epidemiological models for COVID-19 pandemic predictions COVID-19大流行预测流行病学模型的比较分析
Q3 Medicine Pub Date : 2021-01-02 DOI: 10.1080/24709360.2021.1913709
Rajan Gupta, G. Pandey, S. Pal
Epidemiological modeling is an important problem around the world. This research presents COVID-19 analysis to understand which model works better for different regions. A comparative analysis of three growth curve fitting models (Gompertz, Logistic, and Exponential), two mathematical models (SEIR and IDEA), two forecasting models (Holt's exponential and ARIMA), and four machine/deep learning models (Neural Network, LSTM Networks, GANs, and Random Forest) using three evaluation criteria on ten prominent regions around the world from North America, South America, Europe, and Asia has been presented. The minimum and median values for RMSE were 1.8 and 5372.9; the values for the mean absolute percentage error were 0.005 and 6.63; and the values for AIC were 87.07 and 613.3, respectively, from a total of 125 experiments across 10 regions. The growth curve fitting models worked well where flattening of the cases has started. Based on region's growth curve, a relevant model from the list can be used for predicting the number of infected cases for COVID-19. Some other models used in forecasting the number of cases have been added in the future work section, which can help researchers to forecast the number of cases in different regions of the world.
流行病学建模是世界范围内的一个重要问题。本研究提出了COVID-19分析,以了解哪种模式更适合不同地区。在北美、南美、欧洲和亚洲的10个主要地区,采用3种评价标准,对3种增长曲线拟合模型(Gompertz、Logistic和Exponential)、2种数学模型(SEIR和IDEA)、2种预测模型(Holt’s Exponential和ARIMA)和4种机器/深度学习模型(Neural Network、LSTM Networks、gan和Random Forest)进行了比较分析。RMSE的最小值和中位数分别为1.8和5372.9;平均绝对百分比误差分别为0.005和6.63;10个地区共125个试验的AIC值分别为87.07和613.3。增长曲线拟合模型在情况开始趋于平缓的地方效果很好。根据区域增长曲线,利用列表中的相关模型预测新冠肺炎感染病例数。在未来工作部分增加了一些用于预测病例数的其他模型,可以帮助研究人员预测世界不同地区的病例数。
{"title":"Comparative analysis of epidemiological models for COVID-19 pandemic predictions","authors":"Rajan Gupta, G. Pandey, S. Pal","doi":"10.1080/24709360.2021.1913709","DOIUrl":"https://doi.org/10.1080/24709360.2021.1913709","url":null,"abstract":"Epidemiological modeling is an important problem around the world. This research presents COVID-19 analysis to understand which model works better for different regions. A comparative analysis of three growth curve fitting models (Gompertz, Logistic, and Exponential), two mathematical models (SEIR and IDEA), two forecasting models (Holt's exponential and ARIMA), and four machine/deep learning models (Neural Network, LSTM Networks, GANs, and Random Forest) using three evaluation criteria on ten prominent regions around the world from North America, South America, Europe, and Asia has been presented. The minimum and median values for RMSE were 1.8 and 5372.9; the values for the mean absolute percentage error were 0.005 and 6.63; and the values for AIC were 87.07 and 613.3, respectively, from a total of 125 experiments across 10 regions. The growth curve fitting models worked well where flattening of the cases has started. Based on region's growth curve, a relevant model from the list can be used for predicting the number of infected cases for COVID-19. Some other models used in forecasting the number of cases have been added in the future work section, which can help researchers to forecast the number of cases in different regions of the world.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"69 - 91"},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1913709","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48622498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Concordance Measures and Time-Dependent ROC Methods. 一致性测量和时间相关 ROC 方法。
Q3 Medicine Pub Date : 2021-01-01 Epub Date: 2021-05-25 DOI: 10.1080/24709360.2021.1926189
Norberto Pantoja-Galicia, Olivia I Okereke, Deborah Blacker, Rebecca A Betensky

The receiver operating characteristic (ROC) curve displays sensitivity versus 1-specificity over a set of thresholds. The area under the ROC curve (AUC) is a global scalar summary of this curve. In the context of time-dependent ROC methods, we are interested in global scalar measures that summarize sequences of time-dependent AUCs over time. The concordance probability is a candidate for such purposes. The concordance probability can provide a global assessment of the discrimination ability of a test for an event that occurs at random times and may be right censored. If the test adequately differentiates between subjects who survive longer times and those who survive shorter times, this will assist clinical decisions. In this context the concordance probability may support assessment of precision medicine tools based on prognostic biomarkers models for overall survival. Definitions of time-dependent sensitivity and specificity are reviewed. Some connections between such definitions and concordance measures are also reviewed and we establish new connections via new measures of global concordance. We explore the relationship between such measures and their corresponding time-dependent AUC. To illustrate these concepts, an application in the context of Alzheimer's disease is presented.

接收者操作特征曲线(ROC)显示一组阈值的灵敏度与特异性的关系。ROC 曲线下面积(AUC)是该曲线的全局标量总结。对于随时间变化的 ROC 方法,我们感兴趣的是能总结随时间变化的 AUC 序列的全局标量指标。一致性概率就是这样一个候选指标。一致性概率可以全面评估测试对随机发生且可能是右删失的事件的区分能力。如果测试能充分区分存活时间较长的受试者和存活时间较短的受试者,这将有助于临床决策。在这种情况下,一致性概率可支持对基于预后生物标志物模型的精准医疗工具进行评估,以确定总生存期。本文回顾了与时间相关的敏感性和特异性的定义。我们还回顾了此类定义与一致性测量之间的一些联系,并通过全局一致性的新测量方法建立了新的联系。我们还探讨了这些指标与其相应的随时间变化的 AUC 之间的关系。为了说明这些概念,我们介绍了在阿尔茨海默病中的应用。
{"title":"Concordance Measures and Time-Dependent ROC Methods.","authors":"Norberto Pantoja-Galicia, Olivia I Okereke, Deborah Blacker, Rebecca A Betensky","doi":"10.1080/24709360.2021.1926189","DOIUrl":"10.1080/24709360.2021.1926189","url":null,"abstract":"<p><p>The receiver operating characteristic (ROC) curve displays sensitivity versus 1-specificity over a set of thresholds. The area under the ROC curve (AUC) is a global scalar summary of this curve. In the context of time-dependent ROC methods, we are interested in global scalar measures that summarize sequences of time-dependent AUCs over time. The concordance probability is a candidate for such purposes. The concordance probability can provide a global assessment of the discrimination ability of a test for an event that occurs at random times and may be right censored. If the test adequately differentiates between subjects who survive longer times and those who survive shorter times, this will assist clinical decisions. In this context the concordance probability may support assessment of precision medicine tools based on prognostic biomarkers models for overall survival. Definitions of time-dependent sensitivity and specificity are reviewed. Some connections between such definitions and concordance measures are also reviewed and we establish new connections via new measures of global concordance. We explore the relationship between such measures and their corresponding time-dependent AUC. To illustrate these concepts, an application in the context of Alzheimer's disease is presented.</p>","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":" ","pages":"232-249"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523576/pdf/nihms-1701256.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40389965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Number needed to test: quantifying risk stratification provided by diagnostic tests and risk predictions 需要测试的数量:通过诊断测试和风险预测提供的量化风险分层
Q3 Medicine Pub Date : 2020-08-07 DOI: 10.1080/24709360.2020.1796176
H. Katki, R. Dey, P. Saha-Chaudhuri
Risk stratification is the ability of a test or model to separate those at high vs. low risk of disease. There is no risk stratification metric that is in terms of the number of people requiring testing, which would help with considering the benefits, harms, and costs associated with the test and interventions. We introduce the expected number needed to test (NNtest) to identify one more disease case than by randomly selecting people for disease ascertainment. We show that NNtest measures risk stratification, allowing us to decompose NNtest into components that contrast the increase in risk upon testing positive (‘concern’) versus the decrease in risk upon testing negative (‘reassurance’). A graph of the reciprocals of concern vs. reassurance have linear contours of constant NNtest, visualizing the relative importance and tradeoff of each component to better understand the properties of risk thresholds with equal NNtest. We apply NNtest to the controversy over the risk threshold for who should get testing for BRCA1/2 mutations that cause high risks of breast and ovarian cancers. We show that risk thresholds between 0.78% and 5% optimize NNtest. At these thresholds, people will require risk-model evaluation to find one more mutation-carrier. However, these thresholds of equal NNtest provide very different concern and reassurance, with 0.78% providing much more reassurance (and thus much less concern) than 5%. Given that genetic testing costs are declining rapidly, the greater reassurance provided by the 0.78% threshold might be deemed more important than the greater concern provided by the 5% threshold.
风险分层是一种测试或模型将疾病高风险人群与低风险人群区分开来的能力。没有以需要检测的人数为单位的风险分层指标,这将有助于考虑与检测和干预措施相关的益处、危害和成本。我们引入了预期需要检测的人数(NNtest),以识别比随机选择人群进行疾病确定多的一个疾病病例。我们表明,NNtest测量风险分层,使我们能够将NNtest分解为对比检测呈阳性时风险增加(“癌症”)与检测呈阴性时风险降低(“保证”)的成分。关注与保证的倒数图具有恒定NNtest的线性轮廓,可视化了每个组成部分的相对重要性和权衡,以更好地理解具有相等NNtest的风险阈值的性质。我们将NNtest应用于关于谁应该接受BRCA1/2突变检测的风险阈值的争议,BRCA1/2变异会导致乳腺癌和卵巢癌的高风险。我们发现,0.78%和5%之间的风险阈值优化了NNtest。在这些阈值下,人们将需要风险模型评估来找到另一个突变携带者。然而,这些相同NNtest的阈值提供了非常不同的担忧和保证,0.78%提供了比5%更多的保证(因此更少的担忧)。鉴于基因检测成本正在迅速下降,0.78%的门槛所提供的更大保证可能被认为比5%的门槛所带来的更大担忧更重要。
{"title":"Number needed to test: quantifying risk stratification provided by diagnostic tests and risk predictions","authors":"H. Katki, R. Dey, P. Saha-Chaudhuri","doi":"10.1080/24709360.2020.1796176","DOIUrl":"https://doi.org/10.1080/24709360.2020.1796176","url":null,"abstract":"Risk stratification is the ability of a test or model to separate those at high vs. low risk of disease. There is no risk stratification metric that is in terms of the number of people requiring testing, which would help with considering the benefits, harms, and costs associated with the test and interventions. We introduce the expected number needed to test (NNtest) to identify one more disease case than by randomly selecting people for disease ascertainment. We show that NNtest measures risk stratification, allowing us to decompose NNtest into components that contrast the increase in risk upon testing positive (‘concern’) versus the decrease in risk upon testing negative (‘reassurance’). A graph of the reciprocals of concern vs. reassurance have linear contours of constant NNtest, visualizing the relative importance and tradeoff of each component to better understand the properties of risk thresholds with equal NNtest. We apply NNtest to the controversy over the risk threshold for who should get testing for BRCA1/2 mutations that cause high risks of breast and ovarian cancers. We show that risk thresholds between 0.78% and 5% optimize NNtest. At these thresholds, people will require risk-model evaluation to find one more mutation-carrier. However, these thresholds of equal NNtest provide very different concern and reassurance, with 0.78% providing much more reassurance (and thus much less concern) than 5%. Given that genetic testing costs are declining rapidly, the greater reassurance provided by the 0.78% threshold might be deemed more important than the greater concern provided by the 5% threshold.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"134 - 148"},"PeriodicalIF":0.0,"publicationDate":"2020-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2020.1796176","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42483253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrast-specific propensity scores 对比特定倾向得分
Q3 Medicine Pub Date : 2020-07-02 DOI: 10.1080/24709360.2021.1936421
Shasha Han, D. Rubin
Basic propensity score methodology is designed to balance the distributions of multivariate pre-treatment covariates when comparing one active treatment with one control treatment. However, practical settings often involve comparing more than two treatments, where more complicated contrasts than the basic treatment-control one, , are relevant. Here, we propose the use of contrast-specific propensity scores (CSPS), which allows the creation of treatment groups of units that are balanced with respect to bifurcations of the specified contrasts and the multivariate space spanned by these bifurcations.
基本倾向评分方法的目的是在比较一种积极治疗与一种对照治疗时平衡多变量预处理协变量的分布。然而,实际情况往往涉及比较两种以上的治疗方法,在这种情况下,比基本的治疗对照更复杂的对比是相关的。在这里,我们建议使用对比特定倾向评分(CSPS),它允许创建相对于指定对比的分支和这些分支所跨越的多变量空间平衡的单元治疗组。
{"title":"Contrast-specific propensity scores","authors":"Shasha Han, D. Rubin","doi":"10.1080/24709360.2021.1936421","DOIUrl":"https://doi.org/10.1080/24709360.2021.1936421","url":null,"abstract":"Basic propensity score methodology is designed to balance the distributions of multivariate pre-treatment covariates when comparing one active treatment with one control treatment. However, practical settings often involve comparing more than two treatments, where more complicated contrasts than the basic treatment-control one, , are relevant. Here, we propose the use of contrast-specific propensity scores (CSPS), which allows the creation of treatment groups of units that are balanced with respect to bifurcations of the specified contrasts and the multivariate space spanned by these bifurcations.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"1 - 8"},"PeriodicalIF":0.0,"publicationDate":"2020-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1936421","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46223106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A new regression model for the forecasting of COVID-19 outbreak evolution: an application to Italian data 预测新冠肺炎疫情演变的新回归模型:在意大利数据中的应用
Q3 Medicine Pub Date : 2020-06-12 DOI: 10.1080/24709360.2021.1978270
D. Sisti, E. Rocchi, S. Peluso, S. Amatori, M. Carletti
The novel coronavirus SARS-CoV-2 was first identified in China in December 2019. In just over five months, the virus affected over 4 million people and caused about 300,000 deaths. This study aimed to model new COVID-19 cases in Italian regions using a new curve. A new empirical curve is proposed to model the number of new cases of COVID-19. It resembles a known exponential growth curve, which has a straight line as an exponent, but in the growth curve proposed, the exponent is a logistic curve multiplied for a straight line. This curve shows an initial phase, the expected exponential growth, then rises to the maximum value and finally reaches zero. We characterized the epidemic growth patterns for the entire Italian nation and each of the 20 Italian regions. The estimated growth curve has been used to calculate the expected time of the beginning, the time related to peak, and the end of the epidemics. Our analysis explores the development of the outbreaks in Italy and the impact of the containment measures. Data obtained are useful to forecast future scenarios and the possible end of the epidemic.
新型冠状病毒SARS-CoV-2于2019年12月在中国首次被发现。在短短五个多月内,该病毒影响了400多万人,造成约30万人死亡。这项研究旨在使用一条新曲线对意大利地区新的新冠肺炎病例进行建模。提出了一种新的经验曲线来模拟新冠肺炎新增病例数。它类似于一条已知的指数增长曲线,它有一条直线作为指数,但在所提出的增长曲线中,指数是一条乘以直线的逻辑曲线。这条曲线显示了一个初始阶段,即预期的指数增长,然后上升到最大值,最终达到零。我们描述了整个意大利国家和意大利20个地区的疫情增长模式。估计的增长曲线已用于计算流行病开始的预期时间、与峰值相关的时间和结束时间。我们的分析探讨了意大利疫情的发展以及遏制措施的影响。所获得的数据有助于预测未来的情况和疫情可能结束的情况。
{"title":"A new regression model for the forecasting of COVID-19 outbreak evolution: an application to Italian data","authors":"D. Sisti, E. Rocchi, S. Peluso, S. Amatori, M. Carletti","doi":"10.1080/24709360.2021.1978270","DOIUrl":"https://doi.org/10.1080/24709360.2021.1978270","url":null,"abstract":"The novel coronavirus SARS-CoV-2 was first identified in China in December 2019. In just over five months, the virus affected over 4 million people and caused about 300,000 deaths. This study aimed to model new COVID-19 cases in Italian regions using a new curve. A new empirical curve is proposed to model the number of new cases of COVID-19. It resembles a known exponential growth curve, which has a straight line as an exponent, but in the growth curve proposed, the exponent is a logistic curve multiplied for a straight line. This curve shows an initial phase, the expected exponential growth, then rises to the maximum value and finally reaches zero. We characterized the epidemic growth patterns for the entire Italian nation and each of the 20 Italian regions. The estimated growth curve has been used to calculate the expected time of the beginning, the time related to peak, and the end of the epidemics. Our analysis explores the development of the outbreaks in Italy and the impact of the containment measures. Data obtained are useful to forecast future scenarios and the possible end of the epidemic.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"6 1","pages":"48 - 56"},"PeriodicalIF":0.0,"publicationDate":"2020-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44476898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression with incomplete multivariate surrogate responses for a latent covariate 潜在协变量的不完全多元替代响应回归
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2020.1794705
Hua Shen, R. Cook
ABSTRACT We consider the setting in which a categorical exposure variable of interest can only be measured subject to misclassification via surrogate variables. These surrogate variables may represent the classification of an individual via imperfect diagnostic tests. In such settings, a random number of diagnostic tests may be ordered at the discretion of a treating physician with the decision to order further tests made in a sequential fashion based on the results of preliminary test results. Because the underlying latent status is not ascertainable these cheaper but imperfect surrogate test results are used in lieu of the definitive classification in a model for a long-term outcome. Naive use of a single surrogate or functions of the available surrogates can lead to biased estimators of the association and invalid inference. We propose a likelihood-based approach for modeling the effect of the latent variable in the absence of validation data with estimation based on an expectation–maximization (EM) algorithm. The method yields consistent and efficient estimates and is shown to out-perform several common alternative approaches. The performance of the proposed method is demonstrated in simulation studies and its utility is illustrated by applying the proposed method to the stimulating study on breast cancer.
摘要:我们考虑的环境是,感兴趣的分类暴露变量只能通过替代变量进行错误分类来衡量。这些替代变量可能代表通过不完善的诊断测试对个体的分类。在这样的设置中,可以由治疗医生决定随机数目的诊断测试,并决定基于初步测试结果以顺序方式进行进一步的测试。因为潜在的潜在状态是不可确定的,所以使用这些更便宜但不完美的替代测试结果来代替长期结果模型中的最终分类。天真地使用单个代理或可用代理的函数可能导致关联的有偏估计和无效推理。我们提出了一种基于似然的方法,用于在没有验证数据的情况下对潜在变量的影响进行建模,并基于期望最大化(EM)算法进行估计。该方法产生了一致且有效的估计,并被证明优于几种常见的替代方法。在模拟研究中验证了该方法的性能,并通过将该方法应用于癌症的刺激研究来说明其实用性。
{"title":"Regression with incomplete multivariate surrogate responses for a latent covariate","authors":"Hua Shen, R. Cook","doi":"10.1080/24709360.2020.1794705","DOIUrl":"https://doi.org/10.1080/24709360.2020.1794705","url":null,"abstract":"ABSTRACT We consider the setting in which a categorical exposure variable of interest can only be measured subject to misclassification via surrogate variables. These surrogate variables may represent the classification of an individual via imperfect diagnostic tests. In such settings, a random number of diagnostic tests may be ordered at the discretion of a treating physician with the decision to order further tests made in a sequential fashion based on the results of preliminary test results. Because the underlying latent status is not ascertainable these cheaper but imperfect surrogate test results are used in lieu of the definitive classification in a model for a long-term outcome. Naive use of a single surrogate or functions of the available surrogates can lead to biased estimators of the association and invalid inference. We propose a likelihood-based approach for modeling the effect of the latent variable in the absence of validation data with estimation based on an expectation–maximization (EM) algorithm. The method yields consistent and efficient estimates and is shown to out-perform several common alternative approaches. The performance of the proposed method is demonstrated in simulation studies and its utility is illustrated by applying the proposed method to the stimulating study on breast cancer.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"247 - 264"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2020.1794705","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45075884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics and Epidemiology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1