Efficient risk-based collection of biospecimens in cohort studies: designing a prospective study of diagnostic performance for multicancer detection tests.

IF 5 2区医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH American journal of epidemiology Pub Date : 2025-01-08 DOI:10.1093/aje/kwae139

Mark Louie F Ramos, Anil K Chaturvedi, Barry I Graubard, Hormuzd A Katki

{"title":"Efficient risk-based collection of biospecimens in cohort studies: designing a prospective study of diagnostic performance for multicancer detection tests.","authors":"Mark Louie F Ramos, Anil K Chaturvedi, Barry I Graubard, Hormuzd A Katki","doi":"10.1093/aje/kwae139","DOIUrl":null,"url":null,"abstract":"<p><p>In cohort studies, it can be infeasible to collect specimens on an entire cohort. For example, to estimate sensitivity of multiple multi-cancer detection (MCD) assays, we desire an extra 80 mL of cell-free DNA (cfDNA) blood, but this much extra blood is too expensive for us to collect on everyone. We propose a novel epidemiologic study design that efficiently oversamples those at highest baseline disease risk from whom to collect specimens, to increase the number of future cases with cfDNA blood collection. The variance reduction ratio from our risk-based subsample versus a simple random (sub)sample (SRS) depends primarily on the ratio of risk model sensitivity to the fraction of the cohort selected for specimen collection subject to constraining the risk model specificity. In a simulation where we chose 34% of the Prostate, Lung, Colorectal, and Ovarian Screening Trial cohort at highest risk of lung cancer for cfDNA blood collection, we could enrich the number of lung cancers 2.42-fold. The standard deviation of lung-cancer MCD sensitivity was 31%-33% reduced versus SRS. Risk-based collection of specimens on a subsample of the cohort could be a feasible and efficient approach to collecting extra specimens for molecular epidemiology.</p>","PeriodicalId":7472,"journal":{"name":"American journal of epidemiology","volume":" ","pages":"243-253"},"PeriodicalIF":5.0000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/aje/kwae139","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

Abstract

In cohort studies, it can be infeasible to collect specimens on an entire cohort. For example, to estimate sensitivity of multiple multi-cancer detection (MCD) assays, we desire an extra 80 mL of cell-free DNA (cfDNA) blood, but this much extra blood is too expensive for us to collect on everyone. We propose a novel epidemiologic study design that efficiently oversamples those at highest baseline disease risk from whom to collect specimens, to increase the number of future cases with cfDNA blood collection. The variance reduction ratio from our risk-based subsample versus a simple random (sub)sample (SRS) depends primarily on the ratio of risk model sensitivity to the fraction of the cohort selected for specimen collection subject to constraining the risk model specificity. In a simulation where we chose 34% of the Prostate, Lung, Colorectal, and Ovarian Screening Trial cohort at highest risk of lung cancer for cfDNA blood collection, we could enrich the number of lung cancers 2.42-fold. The standard deviation of lung-cancer MCD sensitivity was 31%-33% reduced versus SRS. Risk-based collection of specimens on a subsample of the cohort could be a feasible and efficient approach to collecting extra specimens for molecular epidemiology.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在队列研究中有效收集基于风险的生物样本：设计多癌症检测试验诊断性能的前瞻性研究。

在队列研究中，收集整个队列的标本可能是不可行的。例如，为了估算多种癌症检测（MCD）方法的灵敏度，我们需要额外采集 80 毫升无细胞 DNA（cfDNA）血液，但这么多额外的血液对我们来说太昂贵了，不可能对每个人都采集。我们提出了一种新颖的流行病学研究设计，它能有效地对基线疾病风险最高的人群进行超量采样，以增加未来通过采集 cfDNA 血液获得病例的数量。我们基于风险的子样本与简单随机（子）样本（SRS）的方差缩小率主要取决于风险模型灵敏度与受限于风险模型特异性而被选中采集标本的人群比例。在一项模拟中，我们选择了前列腺、肺、结直肠和卵巢筛查试验队列中 34% 的肺癌高危人群进行 cfDNA 血液采集，与 SRS 相比，我们可以将肺癌的数量增加 2.42 倍，肺癌 MCD 灵敏度的标准偏差降低了 31%-33%。基于风险对队列中的子样本进行标本采集是一种可行且高效的方法，可为分子流行病学收集额外的标本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

American journal of epidemiology 医学-公共卫生、环境卫生与职业卫生

CiteScore

7.40

自引率

4.00%

发文量

221

审稿时长

3-6 weeks

期刊介绍： The American Journal of Epidemiology is the oldest and one of the premier epidemiologic journals devoted to the publication of empirical research findings, opinion pieces, and methodological developments in the field of epidemiologic research. It is a peer-reviewed journal aimed at both fellow epidemiologists and those who use epidemiologic data, including public health workers and clinicians.