首页 > 最新文献

Biostatistics and Epidemiology最新文献

英文 中文
The analysis and forecasting COVID-19 cases in the United States using Bayesian structural time series models 基于贝叶斯结构时间序列模型的美国新冠肺炎病例分析与预测
Q3 Medicine Pub Date : 2021-07-31 DOI: 10.1080/24709360.2021.1948380
Liming Xie
In this paper, the Bayesian structural time series model (BSTS) is used to analyze and predict total confirmed cases who infected COVID-19 in the United States from February 28, 2020 through April 6, 2020 using the collect data from CDC (Center of Disease Control) in the United States. It includes variables of days, total confirmed cases, confirmed cases daily, death cases daily, and fatality rates. The author exploits the flexibility of Local Linear Trend, Seasonality, Contemporaneous covariates of dynamic coefficients in the Bayesian structural time series models. In addition, Causal Impact function in R programming is applied to analyze the model and read report of model. The results of the model show that the total confirmed cases who infected COVID-19 will be still most likely to increase straightly, the total numbers infected COVID-19 would be broken through 600,000 in the United States in near future (in the subsequent months). And then arrive at the peak around mid-May 2020. Also, the model suggests that the probability of variable Recovered cases daily is 0.07.
本文利用美国疾病控制中心(CDC)收集的数据,采用贝叶斯结构时间序列模型(BSTS)对2020年2月28日至2020年4月6日美国新冠肺炎确诊病例总数进行了分析和预测。它包括天数、总确诊病例、每日确诊病例、每日死亡病例和死亡率等变量。作者利用贝叶斯结构时间序列模型中动态系数的局部线性趋势、季节性、同期协变量的灵活性。并运用R编程中的因果影响函数对模型进行分析,读取模型报告。模型结果显示,感染COVID-19的确诊病例总数仍有可能直线上升,在不久的将来(随后的几个月),美国感染COVID-19的总人数将突破60万。然后在2020年5月中旬左右达到峰值。同时,该模型表明,每天恢复的可变病例的概率为0.07。
{"title":"The analysis and forecasting COVID-19 cases in the United States using Bayesian structural time series models","authors":"Liming Xie","doi":"10.1080/24709360.2021.1948380","DOIUrl":"https://doi.org/10.1080/24709360.2021.1948380","url":null,"abstract":"In this paper, the Bayesian structural time series model (BSTS) is used to analyze and predict total confirmed cases who infected COVID-19 in the United States from February 28, 2020 through April 6, 2020 using the collect data from CDC (Center of Disease Control) in the United States. It includes variables of days, total confirmed cases, confirmed cases daily, death cases daily, and fatality rates. The author exploits the flexibility of Local Linear Trend, Seasonality, Contemporaneous covariates of dynamic coefficients in the Bayesian structural time series models. In addition, Causal Impact function in R programming is applied to analyze the model and read report of model. The results of the model show that the total confirmed cases who infected COVID-19 will be still most likely to increase straightly, the total numbers infected COVID-19 would be broken through 600,000 in the United States in near future (in the subsequent months). And then arrive at the peak around mid-May 2020. Also, the model suggests that the probability of variable Recovered cases daily is 0.07.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"6 1","pages":"1 - 15"},"PeriodicalIF":0.0,"publicationDate":"2021-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1948380","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47094435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Joint modeling of two count variables using a shared random effect model in the presence of clusters for complex data 在复杂数据存在聚类的情况下,使用共享随机效应模型对两个计数变量进行联合建模
Q3 Medicine Pub Date : 2021-07-22 DOI: 10.1080/24709360.2021.1948381
M. Sooriyarachchi
In epidemiology, it is often the case that two or more correlated count response variables are encountered. Under this scenario, it is more efficient to model the data using a joint model. Besides, if one of these count variables has an excess of zeros (spike at zero) the log link cannot be used in general. The situation is more complicated when the data is grouped into clusters. A Generalized Linear Mixed Model (GLMM) is used to accommodate this cluster covariance. The objective of this research is to develop a new modeling approach that can handle this situation. The method is illustrated on a global data set of Covid 19 patients. The important conclusions are that the new model was successfully implemented both in theory and practice. A plot of the residuals indicated a well-fitting model to the data.
在流行病学中,通常会遇到两个或多个相关的计数反应变量。在这种情况下,使用联合模型对数据进行建模更有效。此外,如果这些计数变量中的一个超过了零(峰值为零),则通常不能使用日志链接。当数据被分组到集群中时,情况会更加复杂。使用广义线性混合模型(GLMM)来适应这种聚类协方差。本研究的目的是开发一种能够处理这种情况的新建模方法。该方法在新冠肺炎19名患者的全球数据集中进行了说明。重要的结论是,新模式在理论和实践上都得到了成功的实施。残差图表明模型与数据拟合良好。
{"title":"Joint modeling of two count variables using a shared random effect model in the presence of clusters for complex data","authors":"M. Sooriyarachchi","doi":"10.1080/24709360.2021.1948381","DOIUrl":"https://doi.org/10.1080/24709360.2021.1948381","url":null,"abstract":"In epidemiology, it is often the case that two or more correlated count response variables are encountered. Under this scenario, it is more efficient to model the data using a joint model. Besides, if one of these count variables has an excess of zeros (spike at zero) the log link cannot be used in general. The situation is more complicated when the data is grouped into clusters. A Generalized Linear Mixed Model (GLMM) is used to accommodate this cluster covariance. The objective of this research is to develop a new modeling approach that can handle this situation. The method is illustrated on a global data set of Covid 19 patients. The important conclusions are that the new model was successfully implemented both in theory and practice. A plot of the residuals indicated a well-fitting model to the data.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"6 1","pages":"16 - 30"},"PeriodicalIF":0.0,"publicationDate":"2021-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1948381","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48934622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Special issue introduction: Statistical Methods in Precision Medicine: Diagnostic, Prognostic, Predictive and Therapeutic 特刊简介:精准医学中的统计方法:诊断、预后、预测和治疗
Q3 Medicine Pub Date : 2021-07-03 DOI: 10.1080/24709360.2021.1953942
G. Pennello, Xiting Yang
We are delighted to offer this special issue of Biostatistics & Epidemiology on ‘Statistical Methods in Precision Medicine: Diagnostic, Prognostic, Predictive and Therapeutic.’ Precision medicine, often referred to as personalized medicine, has a relatively short history but presents great opportunities and challenges. As former US Health and Human Services Secretary Michael Leavitt said in a 2007 meeting of the Personalized Medicine Coalition, advances in science and technology present an unprecedented ‘opportunity to bring health care to a new level of effectiveness and safety’ [1]. In particular, recent advances have been made in omicsbased in vitro measurements [2–4], quantitative imaging biomarkers [5], artificial intelligence/ machine learning [6], and electronic health record keeping [7]. These advances and others have led to a surge in medical research activity into personalized medicine, which has been described as ‘providing the right drug for the right patient at the right time’ [8]. As a result, the potential has never been greater to obtain powerful information for individualizing medical decision making, including but not limited to information on diagnosis, prognosis, and treatment selection, and for predicting dose, monitoring disease, modifying behavior, and aiding the development of a tailored therapy, that is, a drug or a medical device [9, 10]. The recognition that advances in science, technology, mathematics, and data collection could revolutionize healthcare has led to many important government initiatives. In 2015, the US launched the Precision Medicine Initiative (PMI), with the mission ‘to enable a new era of medicine through research, technology, and policies that empower patients, researchers, and providers to work together toward development of individualized care.’ This announcement was followed by the 21st Century Cures Act [11], which provided funding for PMI to drive research into the genetic, lifestyle and environmental variations of disease. Prior to PMI, the US Food and Drug Administration (FDA) had already made personalized medicine a top priority, issuing the discussion paper Paving the Way for Personalized Medicine: FDA’s Role in a New Era of Medical Product Development [12]. The FDA and the National Institutes of Health (NIH) published a working glossary of terminology for Biomarkers, EndpointS, and other Tools (BEST) [13]. The European Union Council [14] provided discussions on personalizedmedicine, including a formal definition. The EuropeanMedicinesAgency (EMA) provided a perspective on pharmacogenomic information in drug labeling [15]. The first goal of EMA’s vision of Regulatory Science Strategy to 2025 [16] is ‘Catalysing the integration of science and technology in medicines development,’ under which the first core recommendation is to ‘support developments in precision medicine, biomarkers and omics’. These are just a few selected examples of regulatory efforts being made across the globe to facilita
{"title":"Special issue introduction: Statistical Methods in Precision Medicine: Diagnostic, Prognostic, Predictive and Therapeutic","authors":"G. Pennello, Xiting Yang","doi":"10.1080/24709360.2021.1953942","DOIUrl":"https://doi.org/10.1080/24709360.2021.1953942","url":null,"abstract":"We are delighted to offer this special issue of Biostatistics & Epidemiology on ‘Statistical Methods in Precision Medicine: Diagnostic, Prognostic, Predictive and Therapeutic.’ Precision medicine, often referred to as personalized medicine, has a relatively short history but presents great opportunities and challenges. As former US Health and Human Services Secretary Michael Leavitt said in a 2007 meeting of the Personalized Medicine Coalition, advances in science and technology present an unprecedented ‘opportunity to bring health care to a new level of effectiveness and safety’ [1]. In particular, recent advances have been made in omicsbased in vitro measurements [2–4], quantitative imaging biomarkers [5], artificial intelligence/ machine learning [6], and electronic health record keeping [7]. These advances and others have led to a surge in medical research activity into personalized medicine, which has been described as ‘providing the right drug for the right patient at the right time’ [8]. As a result, the potential has never been greater to obtain powerful information for individualizing medical decision making, including but not limited to information on diagnosis, prognosis, and treatment selection, and for predicting dose, monitoring disease, modifying behavior, and aiding the development of a tailored therapy, that is, a drug or a medical device [9, 10]. The recognition that advances in science, technology, mathematics, and data collection could revolutionize healthcare has led to many important government initiatives. In 2015, the US launched the Precision Medicine Initiative (PMI), with the mission ‘to enable a new era of medicine through research, technology, and policies that empower patients, researchers, and providers to work together toward development of individualized care.’ This announcement was followed by the 21st Century Cures Act [11], which provided funding for PMI to drive research into the genetic, lifestyle and environmental variations of disease. Prior to PMI, the US Food and Drug Administration (FDA) had already made personalized medicine a top priority, issuing the discussion paper Paving the Way for Personalized Medicine: FDA’s Role in a New Era of Medical Product Development [12]. The FDA and the National Institutes of Health (NIH) published a working glossary of terminology for Biomarkers, EndpointS, and other Tools (BEST) [13]. The European Union Council [14] provided discussions on personalizedmedicine, including a formal definition. The EuropeanMedicinesAgency (EMA) provided a perspective on pharmacogenomic information in drug labeling [15]. The first goal of EMA’s vision of Regulatory Science Strategy to 2025 [16] is ‘Catalysing the integration of science and technology in medicines development,’ under which the first core recommendation is to ‘support developments in precision medicine, biomarkers and omics’. These are just a few selected examples of regulatory efforts being made across the globe to facilita","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"93 - 99"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49355065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A statistical review: why average weighted accuracy, not accuracy or AUC? 统计综述:为什么平均加权准确度,而不是准确度或AUC?
Q3 Medicine Pub Date : 2021-07-03 DOI: 10.1080/24709360.2021.1975255
Yunyun Jiang, Q. Pan, Ying Liu, S. Evans
Sensitivity and specificity are key aspects in evaluating the performance of diagnostic tests. Accuracy and AUC are commonly used composite measures that incorporate sensitivity and specificity. Average Weighted Accuracy (AWA) is motivated by the need for a statistical measure that compares diagnostic tests from the medical costs and clinical impact point of view, while incorporating the relevant prevalence range of the disease as well as the relative importance of false-positive versus false-negative cases. We illustrate the testing procedures in four different scenarios: (i) one diagnostic test vs. the best random test, (ii) two diagnostic tests from two independent samples, (iii) two diagnostic tests from the same sample, and (iv) more than two diagnostic tests from different or the same samples. The impacts of sample size, prevalence, and relative importance on power and average medical costs/clinical loss are examined through simulation studies. Accuracy has the highest power while AWA provides a consistent criterion in selecting the optimal threshold and better diagnostic tests with direct clinical interpretations. The use of AWA is illustrated on a three-arm clinical trial evaluating three different assays in detecting Neisseria gonorrhoeae and Chlamydia trachomatis in the rectum and pharynx.
敏感性和特异性是评估诊断测试性能的关键方面。准确度和AUC是通常使用的综合指标,包括敏感性和特异性。平均加权准确度(AWA)的动机是需要一种统计测量方法,从医疗成本和临床影响的角度对诊断测试进行比较,同时考虑疾病的相关流行范围以及假阳性与假阴性病例的相对重要性。我们说明了四种不同情况下的测试程序:(i)一次诊断测试与最佳随机测试,(ii)来自两个独立样本的两次诊断测试,(iii)来自同一样本的两项诊断测试,以及(iv)来自不同或相同样本的两个以上诊断测试。通过模拟研究检验了样本量、患病率和相对重要性对功率和平均医疗成本/临床损失的影响。准确度最高,而AWA在选择最佳阈值和更好的诊断测试时提供了一致的标准,并具有直接的临床解释。AWA的使用在一项三组临床试验中得到了说明,该试验评估了在直肠和咽部检测淋球菌和沙眼衣原体的三种不同检测方法。
{"title":"A statistical review: why average weighted accuracy, not accuracy or AUC?","authors":"Yunyun Jiang, Q. Pan, Ying Liu, S. Evans","doi":"10.1080/24709360.2021.1975255","DOIUrl":"https://doi.org/10.1080/24709360.2021.1975255","url":null,"abstract":"Sensitivity and specificity are key aspects in evaluating the performance of diagnostic tests. Accuracy and AUC are commonly used composite measures that incorporate sensitivity and specificity. Average Weighted Accuracy (AWA) is motivated by the need for a statistical measure that compares diagnostic tests from the medical costs and clinical impact point of view, while incorporating the relevant prevalence range of the disease as well as the relative importance of false-positive versus false-negative cases. We illustrate the testing procedures in four different scenarios: (i) one diagnostic test vs. the best random test, (ii) two diagnostic tests from two independent samples, (iii) two diagnostic tests from the same sample, and (iv) more than two diagnostic tests from different or the same samples. The impacts of sample size, prevalence, and relative importance on power and average medical costs/clinical loss are examined through simulation studies. Accuracy has the highest power while AWA provides a consistent criterion in selecting the optimal threshold and better diagnostic tests with direct clinical interpretations. The use of AWA is illustrated on a three-arm clinical trial evaluating three different assays in detecting Neisseria gonorrhoeae and Chlamydia trachomatis in the rectum and pharynx.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"267 - 286"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41893870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The role of statistics in the design and analysis of companion diagnostic (CDx) studies 统计学在伴随诊断(CDx)研究设计和分析中的作用
Q3 Medicine Pub Date : 2021-05-10 DOI: 10.1080/24709360.2021.1913706
G. Campbell
Companion diagnostic tests are crucial in the development of precision medicine. These tests provide information that is essential for the safe and effective use of specific therapeutic products. Statistics plays a key role in the design and analysis of studies to demonstrate the safety and effectiveness of the companion diagnostics. This article can serve as an introduction to companion diagnostics for therapeutic statisticians and for diagnostic ones as well as a discussion of some of the statistical challenges. The topics include biomarker development, diagnostic performance, misclassification, prospective-retrospective validation, bridging studies, missing data, follow-on diagnostics and complex signatures.
辅助诊断测试对精准医学的发展至关重要。这些测试提供了对安全有效使用特定治疗产品至关重要的信息。统计学在研究的设计和分析中发挥着关键作用,以证明伴随诊断的安全性和有效性。这篇文章可以为治疗统计学家和诊断统计学家介绍伴随诊断,并讨论一些统计挑战。主题包括生物标志物开发、诊断性能、错误分类、前瞻性回顾性验证、桥接研究、缺失数据、后续诊断和复杂特征。
{"title":"The role of statistics in the design and analysis of companion diagnostic (CDx) studies","authors":"G. Campbell","doi":"10.1080/24709360.2021.1913706","DOIUrl":"https://doi.org/10.1080/24709360.2021.1913706","url":null,"abstract":"Companion diagnostic tests are crucial in the development of precision medicine. These tests provide information that is essential for the safe and effective use of specific therapeutic products. Statistics plays a key role in the design and analysis of studies to demonstrate the safety and effectiveness of the companion diagnostics. This article can serve as an introduction to companion diagnostics for therapeutic statisticians and for diagnostic ones as well as a discussion of some of the statistical challenges. The topics include biomarker development, diagnostic performance, misclassification, prospective-retrospective validation, bridging studies, missing data, follow-on diagnostics and complex signatures.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"218 - 231"},"PeriodicalIF":0.0,"publicationDate":"2021-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1913706","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41503728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A bivariate Bayesian framework for simultaneous evaluation of two candidate companion diagnostic assays in a new drug clinical trial 在新药临床试验中同时评估两种候选伴随诊断分析的双变量贝叶斯框架
Q3 Medicine Pub Date : 2021-04-25 DOI: 10.1080/24709360.2021.1913705
R. Simon, Songbai Wang
Companion diagnostic tests play an important role in precision medicine. With the advancement of new technologies, multiple companion diagnostic tests can be rapidly developed in multiple platforms and use different samples to select patients for new treatments. Analytically validated assays must be clinically evaluated before they can be implemented in patient management. The status quo design for validating candidate assays is to employ one candidate assay to select patients for new drug clinical trial and then further evaluate the 2nd candidate assay in a bridging study. We propose a new enrollment strategy that employs two assays to select patients. We then develop a bivariate Bayesian approach that enables the totality of data to be used in evaluating whether these assays can be used independently or in a composite procedure in selecting right patients for new treatment. We demonstrate through simulations that when proper priors are available, the Bayesian approach is superior to classical methods in terms of statistical power.
辅助诊断测试在精准医学中发挥着重要作用。随着新技术的进步,可以在多个平台上快速开发多种伴随诊断测试,并使用不同的样本选择患者进行新的治疗。分析验证的化验必须进行临床评估,然后才能在患者管理中实施。验证候选检测的现状设计是使用一种候选检测来选择患者进行新药临床试验,然后在桥接研究中进一步评估第二种候选检测。我们提出了一种新的入组策略,采用两种测定法来选择患者。然后,我们开发了一种双变量贝叶斯方法,使整个数据能够用于评估这些测定是否可以独立使用或在选择合适患者进行新治疗的复合程序中使用。我们通过仿真证明,当有合适的先验时,贝叶斯方法在统计能力方面优于经典方法。
{"title":"A bivariate Bayesian framework for simultaneous evaluation of two candidate companion diagnostic assays in a new drug clinical trial","authors":"R. Simon, Songbai Wang","doi":"10.1080/24709360.2021.1913705","DOIUrl":"https://doi.org/10.1080/24709360.2021.1913705","url":null,"abstract":"Companion diagnostic tests play an important role in precision medicine. With the advancement of new technologies, multiple companion diagnostic tests can be rapidly developed in multiple platforms and use different samples to select patients for new treatments. Analytically validated assays must be clinically evaluated before they can be implemented in patient management. The status quo design for validating candidate assays is to employ one candidate assay to select patients for new drug clinical trial and then further evaluate the 2nd candidate assay in a bridging study. We propose a new enrollment strategy that employs two assays to select patients. We then develop a bivariate Bayesian approach that enables the totality of data to be used in evaluating whether these assays can be used independently or in a composite procedure in selecting right patients for new treatment. We demonstrate through simulations that when proper priors are available, the Bayesian approach is superior to classical methods in terms of statistical power.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"207 - 217"},"PeriodicalIF":0.0,"publicationDate":"2021-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1913705","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44584920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Estimating the AUC with a graphical lasso method for high-dimensional biomarkers with LOD 用带LOD的高维生物标志物的图形套索法估计AUC
Q3 Medicine Pub Date : 2021-03-17 DOI: 10.1080/24709360.2021.1898731
Jirui Wang, Yunpeng Zhao, L. Tang
This manuscript estimates the area under the receiver operating characteristic curve (AUC) of combined biomarkers in a high-dimensional setting. We propose a penalization approach to the inference of precision matrices in the presence of the limit of detection. A new version of expectation-maximization algorithm is then proposed for the penalized likelihood, with the use of numerical integration and the graphical lasso method. The estimated precision matrix is then applied to the inference of AUCs. The proposed method outperforms the existing methods in numerical studies. We apply the proposed method to a data set of brain tumor study. The results show a higher accuracy on the estimation of AUC compared with the existing methods.
本文估计了在高维环境中组合生物标志物的接收器工作特征曲线(AUC)下的面积。在存在检测极限的情况下,我们提出了一种精度矩阵推理的惩罚方法。利用数值积分法和图形套索法,提出了一种新的惩罚似然期望最大化算法。然后将估计的精度矩阵应用于auc的推理。该方法在数值研究中优于现有方法。我们将所提出的方法应用于脑肿瘤研究数据集。结果表明,与现有方法相比,该方法对AUC的估计具有更高的精度。
{"title":"Estimating the AUC with a graphical lasso method for high-dimensional biomarkers with LOD","authors":"Jirui Wang, Yunpeng Zhao, L. Tang","doi":"10.1080/24709360.2021.1898731","DOIUrl":"https://doi.org/10.1080/24709360.2021.1898731","url":null,"abstract":"This manuscript estimates the area under the receiver operating characteristic curve (AUC) of combined biomarkers in a high-dimensional setting. We propose a penalization approach to the inference of precision matrices in the presence of the limit of detection. A new version of expectation-maximization algorithm is then proposed for the penalized likelihood, with the use of numerical integration and the graphical lasso method. The estimated precision matrix is then applied to the inference of AUCs. The proposed method outperforms the existing methods in numerical studies. We apply the proposed method to a data set of brain tumor study. The results show a higher accuracy on the estimation of AUC compared with the existing methods.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"189 - 206"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1898731","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44107211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bootstrapping inference of average treatment effect in completely randomized experiments with high-dimensional covariates 具有高维协变量的完全随机实验中平均治疗效果的自举推理
Q3 Medicine Pub Date : 2021-03-17 DOI: 10.1080/24709360.2021.1898269
Hanzhong Liu
Investigators often use regression adjustment methods to analyze the results of randomized experiments when baseline covariates are available. Their aim is to improve the estimation efficiency of treatment effects by adjusting for imbalance of covariates. Under mild conditions, the regression-adjusted average treatment effect estimator is asymptotically normal with asymptotic variance no greater than that of the unadjusted estimator. The asymptotic variance can be estimated conservatively based on residual sum of squares. This article studies alternative inference methods based on the bootstrap and investigates their asymptotic properties under the Neyman–Rubin causal model and randomization-based inference framework. We show that the weighted, residual and paired bootstrap methods provide asymptotically conservative variance estimators that perform at least as good as the estimator based on residual sum of squares. We further provide counterexamples, where the original estimator is asymptotically normal, but the bootstrap counterpart is inconsistent for estimating its limiting distribution. Simulation studies indicate that the paired bootstrap method is preferable, in terms of preserving type I errors, for a small sample size. Finally, our methods analyze HER2+ breast cancer data from the NeOAdjuvant Herceptin trial to examine the effectiveness of trastuzumab in combination with neoadjuvant chemotherapy.
当基线协变量可用时,研究人员经常使用回归调整方法来分析随机实验的结果。他们的目的是通过调整协变量的不平衡来提高治疗效果的估计效率。在温和条件下,回归调整平均治疗效果估计量是渐近正态的,渐近方差不大于未调整估计量的渐近方差。渐近方差可以根据残差平方和保守估计。本文研究了基于bootstrap的替代推理方法,并在Neyman-Rubin因果模型和基于随机化的推理框架下研究了它们的渐近性质。我们证明了加权、残差和配对自举方法提供了渐近保守的方差估计量,其性能至少与基于残差平方和的估计量一样好。我们进一步提供了反例,其中原始估计器是渐近正态的,但bootstrap对应物在估计其极限分布时是不一致的。仿真研究表明,对于小样本量,在保留I型误差方面,配对自举方法是优选的。最后,我们的方法分析了NeOAdjuvant Herceptin试验的HER2+乳腺癌症数据,以检查曲妥珠单抗联合新辅助化疗的有效性。
{"title":"Bootstrapping inference of average treatment effect in completely randomized experiments with high-dimensional covariates","authors":"Hanzhong Liu","doi":"10.1080/24709360.2021.1898269","DOIUrl":"https://doi.org/10.1080/24709360.2021.1898269","url":null,"abstract":"Investigators often use regression adjustment methods to analyze the results of randomized experiments when baseline covariates are available. Their aim is to improve the estimation efficiency of treatment effects by adjusting for imbalance of covariates. Under mild conditions, the regression-adjusted average treatment effect estimator is asymptotically normal with asymptotic variance no greater than that of the unadjusted estimator. The asymptotic variance can be estimated conservatively based on residual sum of squares. This article studies alternative inference methods based on the bootstrap and investigates their asymptotic properties under the Neyman–Rubin causal model and randomization-based inference framework. We show that the weighted, residual and paired bootstrap methods provide asymptotically conservative variance estimators that perform at least as good as the estimator based on residual sum of squares. We further provide counterexamples, where the original estimator is asymptotically normal, but the bootstrap counterpart is inconsistent for estimating its limiting distribution. Simulation studies indicate that the paired bootstrap method is preferable, in terms of preserving type I errors, for a small sample size. Finally, our methods analyze HER2+ breast cancer data from the NeOAdjuvant Herceptin trial to examine the effectiveness of trastuzumab in combination with neoadjuvant chemotherapy.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"6 1","pages":"203 - 220"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1898269","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45298163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative study of statistical methods for clustered ROC data: nonparametric methods and multiple outputation methods ROC聚类数据统计方法的比较研究:非参数方法和多重输出方法
Q3 Medicine Pub Date : 2021-03-17 DOI: 10.1080/24709360.2021.1880224
Zhuang Miao, L. Tang, Ao Yuan
In clustered receiver operating characteristic (ROC) data each patient has several normal and abnormal observations. Within the same cluster, observations are naturally correlated. Several nonparametric methods have been proposed in the literature to handle clustered data structure, but their performances on simulated and real datasets have not been compared. Recently, a multiple outputation method has been considered for clustered data in areas other than diagnostic accuracy to account for within-cluster correlation. The multiple outputation method offers a resampling-based alternative for one sample clustered data with or without covariates, or for hypothesis testing in two sample clustered data. The method does not require a specific within-cluster correlation structure and yields a valid estimator while accounting for the within-cluster correlations. This paper contributes to the literature by introducing the multiple outputation method to the ROC setting, and empirically comparing the performance of these clustered ROC curve methods. The performance of these methods is also evaluated through two real examples.
在集群接收器操作特征(ROC)数据中,每个患者都有几个正常和异常观察结果。在同一个集群中,观测结果是自然相关的。文献中已经提出了几种非参数方法来处理聚类数据结构,但它们在模拟数据集和真实数据集上的性能尚未进行比较。最近,针对诊断准确性以外的领域的聚类数据,考虑了一种多重输出方法,以说明聚类内的相关性。多重输出方法为具有或不具有协变量的单样本聚类数据或两样本聚类数据中的假设检验提供了一种基于重采样的替代方法。该方法不需要特定的簇内相关性结构,并且在考虑簇内相关性的同时产生有效的估计器。本文通过将多重输出方法引入ROC设置,并实证比较这些聚类ROC曲线方法的性能,为文献做出了贡献。通过两个实例对这些方法的性能进行了评价。
{"title":"Comparative study of statistical methods for clustered ROC data: nonparametric methods and multiple outputation methods","authors":"Zhuang Miao, L. Tang, Ao Yuan","doi":"10.1080/24709360.2021.1880224","DOIUrl":"https://doi.org/10.1080/24709360.2021.1880224","url":null,"abstract":"In clustered receiver operating characteristic (ROC) data each patient has several normal and abnormal observations. Within the same cluster, observations are naturally correlated. Several nonparametric methods have been proposed in the literature to handle clustered data structure, but their performances on simulated and real datasets have not been compared. Recently, a multiple outputation method has been considered for clustered data in areas other than diagnostic accuracy to account for within-cluster correlation. The multiple outputation method offers a resampling-based alternative for one sample clustered data with or without covariates, or for hypothesis testing in two sample clustered data. The method does not require a specific within-cluster correlation structure and yields a valid estimator while accounting for the within-cluster correlations. This paper contributes to the literature by introducing the multiple outputation method to the ROC setting, and empirically comparing the performance of these clustered ROC curve methods. The performance of these methods is also evaluated through two real examples.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"169 - 188"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1880224","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48308687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
High-dimensional inference for the average treatment effect under model misspecification using penalized bias-reduced double-robust estimation 利用惩罚偏倚减少的双鲁棒估计对模型错误规范下的平均处理效果进行高维推断
Q3 Medicine Pub Date : 2021-03-17 DOI: 10.1080/24709360.2021.1898730
Vahe Avagyan, S. Vansteelandt
The presence of confounding by high-dimensional variables complicates the estimation of the average effect of a point treatment. On the one hand, it necessitates the use of variable selection strategies or more general high-dimensional statistical methods. On the other hand, the use of such techniques tends to result in biased estimators with a non-standard asymptotic behavior. Double-robust estimators are useful for offering a resolution because they possess a so-called small bias property. This property has been exploited to achieve valid (uniform) inference of the average causal effect when data-adaptive estimators of the propensity score and conditional outcome mean both converge to their respective truths at sufficiently fast rate. In this article, we extend this work in order to retain valid (uniform) inference when one of these estimators does not converge to the truth, regardless of which. This is done by generalizing prior work for low-dimensional settings by [Vermeulen K, Vansteelandt S. Bias-reduced doubly robust estimation. Am Stat Assoc. 2015;110(511):1024–1036.] to incorporate regularization. The proposed penalized bias-reduced double-robust estimation strategy exhibits promising performance in simulation studies and a data analysis, relative to competing proposals.
高维变量混杂的存在使积分治疗的平均效果的估计变得复杂。一方面,它需要使用变量选择策略或更通用的高维统计方法。另一方面,使用这种技术往往会导致具有非标准渐近行为的有偏估计量。双稳健估计量对于提供分辨率是有用的,因为它们具有所谓的小偏差性质。当倾向得分和条件结果均值的数据自适应估计量都以足够快的速度收敛到它们各自的真理时,这一特性已被用来实现平均因果效应的有效(一致)推断。在本文中,我们扩展了这项工作,以便在其中一个估计量不收敛于真值时保持有效(一致)推断,无论是哪一个。这是通过将[Vermeulen K,Vansteelandt S.Bias reduced double robust estimation.Am Stat Assoc.2015;110(511):1024–1036.]对低维设置的先前工作进行推广来实现的。相对于竞争方案,所提出的惩罚偏差减少双稳健估计策略在模拟研究和数据分析中表现出了良好的性能。
{"title":"High-dimensional inference for the average treatment effect under model misspecification using penalized bias-reduced double-robust estimation","authors":"Vahe Avagyan, S. Vansteelandt","doi":"10.1080/24709360.2021.1898730","DOIUrl":"https://doi.org/10.1080/24709360.2021.1898730","url":null,"abstract":"The presence of confounding by high-dimensional variables complicates the estimation of the average effect of a point treatment. On the one hand, it necessitates the use of variable selection strategies or more general high-dimensional statistical methods. On the other hand, the use of such techniques tends to result in biased estimators with a non-standard asymptotic behavior. Double-robust estimators are useful for offering a resolution because they possess a so-called small bias property. This property has been exploited to achieve valid (uniform) inference of the average causal effect when data-adaptive estimators of the propensity score and conditional outcome mean both converge to their respective truths at sufficiently fast rate. In this article, we extend this work in order to retain valid (uniform) inference when one of these estimators does not converge to the truth, regardless of which. This is done by generalizing prior work for low-dimensional settings by [Vermeulen K, Vansteelandt S. Bias-reduced doubly robust estimation. Am Stat Assoc. 2015;110(511):1024–1036.] to incorporate regularization. The proposed penalized bias-reduced double-robust estimation strategy exhibits promising performance in simulation studies and a data analysis, relative to competing proposals.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"6 1","pages":"221 - 238"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2021.1898730","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48653333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
Biostatistics and Epidemiology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1