首页 > 最新文献

Biostatistics and Epidemiology最新文献

英文 中文
Evaluating heterogeneity of treatment effects 评价治疗效果的异质性
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2020.1724003
S. Vijan
Evaluation of treatment effects in randomized clinical trials typically focuses on the average difference in outcomes between arms of a trial. While this approach is the gold standard for establishing a causal relationship between treatment and outcome, reporting of average effects can mask important differences in benefits across various subpopulations, a phenomenon known as heterogeneity of treatment effects (HTE). The presence of HTE has been demonstrated in many settings and lack of consideration of HTE can lead to inappropriate treatment (or lack of treatment) for many patients. This paper describes approaches to analyzing and reporting trials with explicit consideration of heterogeneity, in order to improve our ability to treat individual patients more effectively.
随机临床试验中治疗效果的评估通常侧重于试验组之间结果的平均差异。虽然这种方法是建立治疗和结果之间因果关系的黄金标准,但平均效果的报告可以掩盖不同亚群之间益处的重要差异,这种现象被称为治疗效果的异质性(HTE)。HTE的存在已在许多情况下得到证实,对许多患者来说,缺乏对HTE的考虑可能导致不适当的治疗(或缺乏治疗)。本文描述了在明确考虑异质性的情况下分析和报告试验的方法,以提高我们更有效地治疗个别患者的能力。
{"title":"Evaluating heterogeneity of treatment effects","authors":"S. Vijan","doi":"10.1080/24709360.2020.1724003","DOIUrl":"https://doi.org/10.1080/24709360.2020.1724003","url":null,"abstract":"Evaluation of treatment effects in randomized clinical trials typically focuses on the average difference in outcomes between arms of a trial. While this approach is the gold standard for establishing a causal relationship between treatment and outcome, reporting of average effects can mask important differences in benefits across various subpopulations, a phenomenon known as heterogeneity of treatment effects (HTE). The presence of HTE has been demonstrated in many settings and lack of consideration of HTE can lead to inappropriate treatment (or lack of treatment) for many patients. This paper describes approaches to analyzing and reporting trials with explicit consideration of heterogeneity, in order to improve our ability to treat individual patients more effectively.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"98 - 104"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2020.1724003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42115724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Applying statistical and analytical methods to U.S. Department of Veterans Affairs databases 将统计和分析方法应用于美国退伍军人事务部数据库
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2019.1708660
T. Kashner
{"title":"Applying statistical and analytical methods to U.S. Department of Veterans Affairs databases","authors":"T. Kashner","doi":"10.1080/24709360.2019.1708660","DOIUrl":"https://doi.org/10.1080/24709360.2019.1708660","url":null,"abstract":"","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"3 - 5"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1708660","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48180550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Making causal inferences about treatment effect sizes from observational datasets 根据观察数据集对治疗效果大小进行因果推断
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2019.1681211
T. Kashner, Steven S. Henley, R. Golden, Xiao‐Hua Zhou
In the era of big data and cloud computing, analysts need statistical models to go beyond predicting outcomes to forecasting how outcomes change when decision-makers intervene to change one or more causal factors. This paper reviews methods to estimate the causal effects of treatment choices on patient health outcomes using observational datasets. Methods are limited to those that model choice of treatment (propensity scoring) and treatment outcomes (instrumental variable, difference in differences, control function). A regression framework was developed to show how unobserved confounding covariates and heterogeneous outcomes can introduce biases to effect size estimates. In response to criticisms that outcome approaches are not systematic and subject to model misspecification error, we extend the control function approach of Lu and White by applying Best Approximating Model technology (BAM-CF). Results from simulation experiments are presented to compare biases between BAM-CF and propensity scoring in the presence of an unobserved confounder. We conclude no one strategy is ‘optimal’ for all datasets, and analyst should consider multiple approaches to assess robustness. For both observational and randomized datasets, researchers should assess how moderating covariates impact estimates of treatment effect sizes so that clinicians can understand what is best for each individual patient.
在大数据和云计算时代,分析师需要统计模型超越预测结果,预测决策者干预改变一个或多个因果因素时结果如何变化。本文回顾了使用观察数据集估计治疗选择对患者健康结果的因果效应的方法。方法仅限于对治疗选择(倾向评分)和治疗结果(工具变量、差异中的差异、控制函数)进行建模的方法。开发了一个回归框架,以显示未观察到的混杂协变量和异质结果如何在效应大小估计中引入偏差。针对结果方法不系统且容易产生模型错定性误差的批评,我们通过应用最佳逼近模型技术(BAM-CF)扩展了Lu和White的控制函数方法。模拟实验的结果提出了比较偏差之间的BAM-CF和倾向评分在一个未观察到的混杂因素的存在。我们得出结论,没有一种策略对所有数据集都是“最优”的,分析师应该考虑多种方法来评估稳健性。对于观察性数据集和随机数据集,研究人员应该评估调节协变量如何影响治疗效果大小的估计,以便临床医生能够了解对每个患者最好的治疗方法。
{"title":"Making causal inferences about treatment effect sizes from observational datasets","authors":"T. Kashner, Steven S. Henley, R. Golden, Xiao‐Hua Zhou","doi":"10.1080/24709360.2019.1681211","DOIUrl":"https://doi.org/10.1080/24709360.2019.1681211","url":null,"abstract":"In the era of big data and cloud computing, analysts need statistical models to go beyond predicting outcomes to forecasting how outcomes change when decision-makers intervene to change one or more causal factors. This paper reviews methods to estimate the causal effects of treatment choices on patient health outcomes using observational datasets. Methods are limited to those that model choice of treatment (propensity scoring) and treatment outcomes (instrumental variable, difference in differences, control function). A regression framework was developed to show how unobserved confounding covariates and heterogeneous outcomes can introduce biases to effect size estimates. In response to criticisms that outcome approaches are not systematic and subject to model misspecification error, we extend the control function approach of Lu and White by applying Best Approximating Model technology (BAM-CF). Results from simulation experiments are presented to compare biases between BAM-CF and propensity scoring in the presence of an unobserved confounder. We conclude no one strategy is ‘optimal’ for all datasets, and analyst should consider multiple approaches to assess robustness. For both observational and randomized datasets, researchers should assess how moderating covariates impact estimates of treatment effect sizes so that clinicians can understand what is best for each individual patient.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"48 - 83"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1681211","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44059431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Transforming data into actionable insights 将数据转化为可操作的见解
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2019.1704127
C. Clancy
{"title":"Transforming data into actionable insights","authors":"C. Clancy","doi":"10.1080/24709360.2019.1704127","DOIUrl":"https://doi.org/10.1080/24709360.2019.1704127","url":null,"abstract":"","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"1 - 2"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1704127","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45784969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Common errors of interpretation in biostatistics 生物统计学中常见的解释错误
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2020.1790085
Elsa Vazquez Arreola, Kyle M. Irimata, Jeffrey R. Wilson
What do we wish to investigate? While this may be a common question in research, it does not always come with straightforward answers. This article reviews data-driven methods of collection, questions asked and questions answered, and the myriad of different conclusions that may result. We examine differences in answers to questions based on independent versus correlated observations, bivariate versus conditional associations, relations versus extrapolation, and single membership versus multiple membership modeling. Regardless of the issue, these differences are usually not due to so-called bad data or due to bad models; they are usually due to the investigators misinterpreting the answers that were given. Most importantly, one cannot ask a question and obtain an answer without understanding the data structure, its size and its representativeness. Simply stated, the fact that I went to the store and bought an outfit does not mean the outfit is appropriate for the event. The answers obtained may not be answering the question of interest.
我们希望调查什么?虽然这可能是研究中的一个常见问题,但它并不总是有直接的答案。本文回顾了数据驱动的收集方法、提出的问题和回答的问题,以及可能得出的无数不同结论。我们根据独立与相关观察、双变量与条件关联、关系与外推、单一隶属关系与多重隶属关系建模来研究问题答案的差异。不管问题是什么,这些差异通常不是由于所谓的坏数据或坏模型;他们通常是由于调查人员误解了给出的答案。最重要的是,如果不了解数据结构、大小和代表性,就无法提出问题并获得答案。简单地说,我去商店买了一套衣服并不意味着这套衣服适合这个活动。获得的答案可能不是回答感兴趣的问题。
{"title":"Common errors of interpretation in biostatistics","authors":"Elsa Vazquez Arreola, Kyle M. Irimata, Jeffrey R. Wilson","doi":"10.1080/24709360.2020.1790085","DOIUrl":"https://doi.org/10.1080/24709360.2020.1790085","url":null,"abstract":"What do we wish to investigate? While this may be a common question in research, it does not always come with straightforward answers. This article reviews data-driven methods of collection, questions asked and questions answered, and the myriad of different conclusions that may result. We examine differences in answers to questions based on independent versus correlated observations, bivariate versus conditional associations, relations versus extrapolation, and single membership versus multiple membership modeling. Regardless of the issue, these differences are usually not due to so-called bad data or due to bad models; they are usually due to the investigators misinterpreting the answers that were given. Most importantly, one cannot ask a question and obtain an answer without understanding the data structure, its size and its representativeness. Simply stated, the fact that I went to the store and bought an outfit does not mean the outfit is appropriate for the event. The answers obtained may not be answering the question of interest.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"238 - 246"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2020.1790085","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43256200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Statistical modeling methods: challenges and strategies 统计建模方法:挑战和策略
Q3 Medicine Pub Date : 2020-01-01 DOI: 10.1080/24709360.2019.1618653
Steven S. Henley, R. Golden, T. Kashner
ABSTRACT Statistical modeling methods are widely used in clinical science, epidemiology, and health services research to analyze data that has been collected in clinical trials as well as observational studies of existing data sources, such as claims files and electronic health records. Diagnostic and prognostic inferences from statistical models are critical to researchers advancing science, clinical practitioners making patient care decisions, and administrators and policy makers impacting the health care system to improve quality and reduce costs. The veracity of such inferences relies not only on the quality and completeness of the collected data, but also statistical model validity. A key component of establishing model validity is determining when a model is not correctly specified and therefore incapable of adequately representing the Data Generating Process (DGP). In this article, model validity is first described and methods designed for assessing model fit, specification, and selection are reviewed. Second, data transformations that improve the model’s ability to represent the DGP are addressed. Third, model search and validation methods are discussed. Finally, methods for evaluating predictive and classification performance are presented. Together, these methods provide a practical framework with recommendations to guide the development and evaluation of statistical models that provide valid statistical inferences.
摘要统计建模方法广泛用于临床科学、流行病学和卫生服务研究,用于分析临床试验以及现有数据源(如索赔文件和电子健康记录)的观察性研究中收集的数据。统计模型的诊断和预后推断对于研究人员推进科学、临床从业者做出患者护理决策以及管理人员和政策制定者影响医疗保健系统以提高质量和降低成本至关重要。这种推断的准确性不仅取决于所收集数据的质量和完整性,还取决于统计模型的有效性。建立模型有效性的一个关键组成部分是确定模型何时没有正确指定,因此无法充分表示数据生成过程(DGP)。本文首先描述了模型的有效性,并回顾了评估模型拟合、规范和选择的方法。其次,讨论了提高模型表示DGP能力的数据转换。第三,讨论了模型搜索和验证方法。最后,提出了评估预测和分类性能的方法。这些方法共同提供了一个实用的框架和建议,以指导提供有效统计推断的统计模型的开发和评估。
{"title":"Statistical modeling methods: challenges and strategies","authors":"Steven S. Henley, R. Golden, T. Kashner","doi":"10.1080/24709360.2019.1618653","DOIUrl":"https://doi.org/10.1080/24709360.2019.1618653","url":null,"abstract":"ABSTRACT Statistical modeling methods are widely used in clinical science, epidemiology, and health services research to analyze data that has been collected in clinical trials as well as observational studies of existing data sources, such as claims files and electronic health records. Diagnostic and prognostic inferences from statistical models are critical to researchers advancing science, clinical practitioners making patient care decisions, and administrators and policy makers impacting the health care system to improve quality and reduce costs. The veracity of such inferences relies not only on the quality and completeness of the collected data, but also statistical model validity. A key component of establishing model validity is determining when a model is not correctly specified and therefore incapable of adequately representing the Data Generating Process (DGP). In this article, model validity is first described and methods designed for assessing model fit, specification, and selection are reviewed. Second, data transformations that improve the model’s ability to represent the DGP are addressed. Third, model search and validation methods are discussed. Finally, methods for evaluating predictive and classification performance are presented. Together, these methods provide a practical framework with recommendations to guide the development and evaluation of statistical models that provide valid statistical inferences.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"4 1","pages":"105 - 139"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1618653","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47377251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Developments and debates on latent variable modeling in diagnostic studies when there is no gold standard 在没有金标准的情况下,诊断研究中潜在变量模型的发展和争论
Q3 Medicine Pub Date : 2019-10-15 DOI: 10.1080/24709360.2019.1673623
Zheyu Wang
Latent variable modeling is often used in diagnostic studies where a gold standard reference test is not available. Its applications have become increasing popular with the fast discovery of novel biomarkers and the effort to improve healthcare for each individual. This paper attempt to provide a review on current developments and debates of these models with a focus in diagnostic studies and to discuss the value as well as cautionary considerations in the applications of these models.
潜变量模型常用于诊断研究,其中金标准参考测试是不可用的。随着新的生物标记物的快速发现和改善每个人的医疗保健的努力,它的应用越来越受欢迎。本文试图以诊断研究为重点,对这些模型的当前发展和争论进行回顾,并讨论这些模型在应用中的价值和注意事项。
{"title":"Developments and debates on latent variable modeling in diagnostic studies when there is no gold standard","authors":"Zheyu Wang","doi":"10.1080/24709360.2019.1673623","DOIUrl":"https://doi.org/10.1080/24709360.2019.1673623","url":null,"abstract":"Latent variable modeling is often used in diagnostic studies where a gold standard reference test is not available. Its applications have become increasing popular with the fast discovery of novel biomarkers and the effort to improve healthcare for each individual. This paper attempt to provide a review on current developments and debates of these models with a focus in diagnostic studies and to discuss the value as well as cautionary considerations in the applications of these models.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"5 1","pages":"100 - 117"},"PeriodicalIF":0.0,"publicationDate":"2019-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1673623","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45189867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How many clusters exist? Answer via maximum clustering similarity implemented in R 有多少集群存在?通过在R中实现的最大聚类相似性来回答
Q3 Medicine Pub Date : 2019-01-01 DOI: 10.1080/24709360.2019.1615770
A. Albatineh, M. Wilcox, B. Zogheib, M. Niewiadomska-Bugaj
Finding the number of clusters in a data set is considered as one of the fundamental problems in cluster analysis. This paper integrates maximum clustering similarity (MCS), for finding the optimal number of clusters, into R statistical software through the package MCSim. The similarity between the two clustering methods is calculated at the same number of clusters, using Rand [Objective criteria for the evaluation of clustering methods. J Am Stat Assoc. 1971;66:846–850.] and Jaccard [The distribution of the flora of the alpine zone. New Phytologist. 1912;11:37–50.] indices, corrected for chance agreement. The number of clusters at which the index attains its maximum with most frequency is a candidate for the optimal number of clusters. Unlike other criteria, MCS can be used with circular data. Seven clustering algorithms, existing in R, are implemented in MCSim. A graph of the number of clusters vs. clusters similarity using corrected similarity indices is produced. Values of the similarity indices and a clustering tree (dendrogram) are produced. Several examples including simulated, real, and circular data sets are presented to show how MCSim successfully works in practice.
找出数据集中的聚类数量被认为是聚类分析的基本问题之一。本文通过MCSim软件包将最大聚类相似性(MCS)集成到R统计软件中,以寻找最优聚类数。两种聚类方法之间的相似性是在相同数量的聚类下计算的,使用Rand[聚类方法评估的客观标准。J Am Stat Assoc.1971;66:846–850.]和Jaccard[高山区植物群的分布。新植物学家。1912;11:37–50.]指数,对偶然一致性进行校正。指数以最高频率达到最大值的聚类数量是最优聚类数量的候选者。与其他标准不同,MCS可用于循环数据。在MCSim中实现了R中存在的七种聚类算法。使用校正的相似性指数生成聚类数量与聚类相似性的关系图。生成相似性指数的值和聚类树(树状图)。给出了几个例子,包括模拟、真实和循环数据集,以展示MCSim是如何在实践中成功工作的。
{"title":"How many clusters exist? Answer via maximum clustering similarity implemented in R","authors":"A. Albatineh, M. Wilcox, B. Zogheib, M. Niewiadomska-Bugaj","doi":"10.1080/24709360.2019.1615770","DOIUrl":"https://doi.org/10.1080/24709360.2019.1615770","url":null,"abstract":"Finding the number of clusters in a data set is considered as one of the fundamental problems in cluster analysis. This paper integrates maximum clustering similarity (MCS), for finding the optimal number of clusters, into R statistical software through the package MCSim. The similarity between the two clustering methods is calculated at the same number of clusters, using Rand [Objective criteria for the evaluation of clustering methods. J Am Stat Assoc. 1971;66:846–850.] and Jaccard [The distribution of the flora of the alpine zone. New Phytologist. 1912;11:37–50.] indices, corrected for chance agreement. The number of clusters at which the index attains its maximum with most frequency is a candidate for the optimal number of clusters. Unlike other criteria, MCS can be used with circular data. Seven clustering algorithms, existing in R, are implemented in MCSim. A graph of the number of clusters vs. clusters similarity using corrected similarity indices is produced. Values of the similarity indices and a clustering tree (dendrogram) are produced. Several examples including simulated, real, and circular data sets are presented to show how MCSim successfully works in practice.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"3 1","pages":"62 - 79"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1615770","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42954294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cohort study design for illness-death processes with disease status under intermittent observation 间歇观察下疾病状态下疾病死亡过程的队列研究设计
Q3 Medicine Pub Date : 2019-01-01 DOI: 10.1080/24709360.2019.1699341
Nathalie C. Moon, Leilei Zeng, R. Cook
Cohort studies are routinely conducted to learn about the incidence or progression rates of chronic diseases. The illness-death model offers a natural framework for joint consideration of non-fatal events in the semi-competing risks setting. We consider the design of prospective cohort studies where the goal is to estimate the effect of a marker on the risk of a non-fatal event which is subject to interval-censoring due to an intermittent observation scheme. The sample size is shown to depend on the effect of interest, the number of assessments, and the duration of follow-up. Minimum-cost designs are also developed to account for the different costs of recruitment and follow-up examination. We also consider the setting where the event status of individuals is observed subject to misclassification; the consequent need to increase the sample size to account for this error is illustrated through asymptotic calculations.
定期进行队列研究以了解慢性病的发病率或进展率。疾病-死亡模型为在半竞争风险环境下联合考虑非致命事件提供了一个自然的框架。我们考虑了前瞻性队列研究的设计,其目标是估计标记物对非致命事件风险的影响,该事件由于间歇性观察方案而受到间隔审查。样本量取决于兴趣的影响、评估的次数和随访的持续时间。最低成本的设计也考虑到招聘和后续审查的不同费用。我们还考虑了个体的事件状态被观察到可能被错误分类的情况;因此需要增加样本量来解释这个误差是通过渐近计算来说明的。
{"title":"Cohort study design for illness-death processes with disease status under intermittent observation","authors":"Nathalie C. Moon, Leilei Zeng, R. Cook","doi":"10.1080/24709360.2019.1699341","DOIUrl":"https://doi.org/10.1080/24709360.2019.1699341","url":null,"abstract":"Cohort studies are routinely conducted to learn about the incidence or progression rates of chronic diseases. The illness-death model offers a natural framework for joint consideration of non-fatal events in the semi-competing risks setting. We consider the design of prospective cohort studies where the goal is to estimate the effect of a marker on the risk of a non-fatal event which is subject to interval-censoring due to an intermittent observation scheme. The sample size is shown to depend on the effect of interest, the number of assessments, and the duration of follow-up. Minimum-cost designs are also developed to account for the different costs of recruitment and follow-up examination. We also consider the setting where the event status of individuals is observed subject to misclassification; the consequent need to increase the sample size to account for this error is illustrated through asymptotic calculations.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"3 1","pages":"178 - 200"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1699341","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47561742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modified sparse functional principal component analysis for fMRI data process 改进的稀疏功能主成分分析在fMRI数据处理中的应用
Q3 Medicine Pub Date : 2019-01-01 DOI: 10.1080/24709360.2019.1591072
Zhengyang Fang, J. Y. Han, N. Simon, Xiaoping Zhou
Sparse and functional principal component analysis is a technique to extract sparse and smooth principal components from a matrix. In this paper, we propose a modified sparse and functional principal component analysis model for feature extraction. We measure the tuning parameters by their robustness against random perturbation, and select the tuning parameters by derivative-free optimization. We test our algorithm on the ADNI dataset to distinguish between the patients with Alzheimer's disease and the control group. By applying proper classification methods for sparse features, we get better result than classic singular value decomposition, support vector machine and logistic regression.
稀疏泛函主成分分析是一种从矩阵中提取稀疏光滑主成分的技术。本文提出了一种改进的稀疏功能主成分分析模型用于特征提取。我们通过对随机扰动的鲁棒性来衡量调谐参数,并通过无导数优化来选择调谐参数。我们在ADNI数据集上测试了我们的算法,以区分阿尔茨海默病患者和对照组。通过对稀疏特征采用合适的分类方法,得到了比经典的奇异值分解、支持向量机和逻辑回归更好的分类结果。
{"title":"Modified sparse functional principal component analysis for fMRI data process","authors":"Zhengyang Fang, J. Y. Han, N. Simon, Xiaoping Zhou","doi":"10.1080/24709360.2019.1591072","DOIUrl":"https://doi.org/10.1080/24709360.2019.1591072","url":null,"abstract":"Sparse and functional principal component analysis is a technique to extract sparse and smooth principal components from a matrix. In this paper, we propose a modified sparse and functional principal component analysis model for feature extraction. We measure the tuning parameters by their robustness against random perturbation, and select the tuning parameters by derivative-free optimization. We test our algorithm on the ADNI dataset to distinguish between the patients with Alzheimer's disease and the control group. By applying proper classification methods for sparse features, we get better result than classic singular value decomposition, support vector machine and logistic regression.","PeriodicalId":37240,"journal":{"name":"Biostatistics and Epidemiology","volume":"3 1","pages":"80 - 89"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24709360.2019.1591072","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46473035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Biostatistics and Epidemiology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1