{"title":"Sample size and performance estimation for biomarker combinations based on pilot studies with small sample sizes","authors":"A. Al-Mekhlafi, Tobias Becker, F. Klawonn","doi":"10.1080/03610926.2020.1843053","DOIUrl":null,"url":null,"abstract":"Abstract High throughput technologies like microarrays, next generation sequencing and mass spectrometry enable the measurement of tens of thousands of biomarker candidates in pilot studies. Biological systems are often too complex to be based on simple single cause-effect associations and from the medical practice point of view, a single biomarker may not possess the desired sensitivity and/or specificity for disease classification and outcome prediction. Therefore, the efforts of researchers currently aims at combining biomarkers. The intention of biomarker pilot studies with small sample sizes is often to explore the possibility of finding good biomarker combinations and not to find and evaluate a final combination of biomarkers with high predictive value. The aim of the pilot study is to answer the question whether it is worthwhile to extend the study to a larger study and to obtain information about the required sample size. In this paper, we propose a method to judge the potential in a small biomarker pilot study without the need to explicitly identifying and confirming a specific subset of biomarkers. In addition, we provide a method for sample size estimation for an extended study when the results of the pilot study look promising. Abbreviations: ROC: receiver operating characteristic curve; AUC: Area Under the ROC curve; HAUCA: high AUC abundance; ER: Estrogen receptor; BCs: Biomarker candidates; w: with; wt: without","PeriodicalId":10531,"journal":{"name":"Communications in Statistics - Theory and Methods","volume":"51 1","pages":"5534 - 5548"},"PeriodicalIF":0.8000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/03610926.2020.1843053","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Statistics - Theory and Methods","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/03610926.2020.1843053","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 12
Abstract
Abstract High throughput technologies like microarrays, next generation sequencing and mass spectrometry enable the measurement of tens of thousands of biomarker candidates in pilot studies. Biological systems are often too complex to be based on simple single cause-effect associations and from the medical practice point of view, a single biomarker may not possess the desired sensitivity and/or specificity for disease classification and outcome prediction. Therefore, the efforts of researchers currently aims at combining biomarkers. The intention of biomarker pilot studies with small sample sizes is often to explore the possibility of finding good biomarker combinations and not to find and evaluate a final combination of biomarkers with high predictive value. The aim of the pilot study is to answer the question whether it is worthwhile to extend the study to a larger study and to obtain information about the required sample size. In this paper, we propose a method to judge the potential in a small biomarker pilot study without the need to explicitly identifying and confirming a specific subset of biomarkers. In addition, we provide a method for sample size estimation for an extended study when the results of the pilot study look promising. Abbreviations: ROC: receiver operating characteristic curve; AUC: Area Under the ROC curve; HAUCA: high AUC abundance; ER: Estrogen receptor; BCs: Biomarker candidates; w: with; wt: without
期刊介绍:
The Theory and Methods series intends to publish papers that make theoretical and methodological advances in Probability and Statistics. New applications of statistical and probabilistic methods will also be considered for publication. In addition, special issues dedicated to a specific topic of current interest will also be published in this series periodically, providing an exhaustive and up-to-date review of that topic to the readership.