Pub Date : 2023-10-04DOI: 10.1007/s12561-023-09389-3
Xiaoyu Song
{"title":"Unlocking Cellular Insights Through Cell-Type Decomposition","authors":"Xiaoyu Song","doi":"10.1007/s12561-023-09389-3","DOIUrl":"https://doi.org/10.1007/s12561-023-09389-3","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135597032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-01DOI: 10.1007/s12561-023-09385-7
Phuc H. Nguyen, Amy H. Herring, Stephanie M. Engel
{"title":"Power Analysis of Exposure Mixture Studies Via Monte Carlo Simulations","authors":"Phuc H. Nguyen, Amy H. Herring, Stephanie M. Engel","doi":"10.1007/s12561-023-09385-7","DOIUrl":"https://doi.org/10.1007/s12561-023-09385-7","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135408814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-27DOI: 10.1007/s12561-023-09386-6
John Kidd, Dan-Yu Lin
{"title":"Improving the Power to Detect Indirect Effects in Mediation Analysis","authors":"John Kidd, Dan-Yu Lin","doi":"10.1007/s12561-023-09386-6","DOIUrl":"https://doi.org/10.1007/s12561-023-09386-6","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135537903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-16DOI: 10.1007/s12561-023-09387-5
Sesha Dassanayake, Joshua P. French
{"title":"Detecting Disease Outbreak Regions Using Multiple Data Streams","authors":"Sesha Dassanayake, Joshua P. French","doi":"10.1007/s12561-023-09387-5","DOIUrl":"https://doi.org/10.1007/s12561-023-09387-5","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135308009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-30DOI: 10.1007/s12561-023-09383-9
Jingru Zhang, M. Basner, Christopher W Jones, D. Dinges, H. Shou, Hongzhe Li
{"title":"Mediation Analysis with Random Distribution as Mediator with an Application to iCOMPARE Trial","authors":"Jingru Zhang, M. Basner, Christopher W Jones, D. Dinges, H. Shou, Hongzhe Li","doi":"10.1007/s12561-023-09383-9","DOIUrl":"https://doi.org/10.1007/s12561-023-09383-9","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44600654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-24DOI: 10.1007/s12561-023-09384-8
Xiwen Jiang, Weining Shen
{"title":"Simultaneous Denoising and Heterogeneity Learning for Time Series Data","authors":"Xiwen Jiang, Weining Shen","doi":"10.1007/s12561-023-09384-8","DOIUrl":"https://doi.org/10.1007/s12561-023-09384-8","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47622830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-11DOI: 10.1007/s12561-023-09382-w
Yasin Khadem Charvadeh, G. Yi
{"title":"Understanding Effective Virus Control Policies for Covid-19 with the Q-learning Method","authors":"Yasin Khadem Charvadeh, G. Yi","doi":"10.1007/s12561-023-09382-w","DOIUrl":"https://doi.org/10.1007/s12561-023-09382-w","url":null,"abstract":"","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44014981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Recurrent event data are frequently encountered in many longitudinal studies where each individual may experience more than one event. Wang and Chen (Biometrics 56(3):789-794, 2000) proposed a comparability constraint to estimate the time trend for the gap times, where the gap time pairs that satisfy the constraint have the same conditional distribution. However, the comparable paired gap times are also independent. Therefore, the comparable gap time pairs will be subject to a stronger constraint than needed for the estimation. Thus their procedure is subject to information loss. Under the accelerated failure time model, we propose a new comparability constraint that can overcome the drawback mentioned above. The gap time pairs being selected by the proposed comparability constraint will still have the same distribution, but they do not need to be independent of each other. We showed that the proposed comparability constraint will utilize more gap time data pairs than the strong comparability. And we showed via various simulation studies that the variance will be smaller than Wang and Chen's (2000) estimator. We apply the proposed method to the HIV Prevention Trial Network 052 study.
{"title":"Semiparametric Trend Analysis for Stratified Recurrent Gap Times Under Weak Comparability Constraint.","authors":"Peng Liu, Yijian Huang, Kwun Chuen Gary Chan, Ying Qing Chen","doi":"10.1007/s12561-023-09376-8","DOIUrl":"10.1007/s12561-023-09376-8","url":null,"abstract":"<p><p>Recurrent event data are frequently encountered in many longitudinal studies where each individual may experience more than one event. Wang and Chen (Biometrics 56(3):789-794, 2000) proposed a comparability constraint to estimate the time trend for the gap times, where the gap time pairs that satisfy the constraint have the same conditional distribution. However, the comparable paired gap times are also independent. Therefore, the comparable gap time pairs will be subject to a stronger constraint than needed for the estimation. Thus their procedure is subject to information loss. Under the accelerated failure time model, we propose a new comparability constraint that can overcome the drawback mentioned above. The gap time pairs being selected by the proposed comparability constraint will still have the same distribution, but they do not need to be independent of each other. We showed that the proposed comparability constraint will utilize more gap time data pairs than the strong comparability. And we showed via various simulation studies that the variance will be smaller than Wang and Chen's (2000) estimator. We apply the proposed method to the HIV Prevention Trial Network 052 study.</p>","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11542620/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45506409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-01DOI: 10.1007/s12561-023-09375-9
Anton Sugolov, Eric Emmenegger, Andrew D. Paterson, Lei Sun
Abstract Teaching statistics through engaging applications to contemporary large-scale datasets is essential to attracting students to the field. To this end, we developed a hands-on, week-long workshop for senior high-school or junior undergraduate students, without prior knowledge in statistical genetics but with some basic knowledge in data science, to conduct their own genome-wide association study (GWAS). The GWAS was performed for open source gene expression data, using publicly available human genetics data. Assisted by a detailed instruction manual, students were able to obtain $$sim$$ ∼ 1.4 million p-values from a real scientific study, within several days. This early motivation kept students engaged in learning the theories that support their results, including regression, data visualization, results interpretation, and large-scale multiple hypothesis testing. To further their learning motivation by emphasizing the personal connection to this type of data analysis, students were encouraged to make short presentations about how GWAS has provided insights into the genetic basis of diseases that are present in their friends or families. The appended open source, step-by-step instruction manual includes descriptions of the datasets used, the software needed, and results from the workshop. Additionally, scripts used in the workshop are archived on Github and Zenodo to further enhance reproducible research and training.
{"title":"Statistical Learning of Large-Scale Genetic Data: How to Run a Genome-Wide Association Study of Gene-Expression Data Using the 1000 Genomes Project Data","authors":"Anton Sugolov, Eric Emmenegger, Andrew D. Paterson, Lei Sun","doi":"10.1007/s12561-023-09375-9","DOIUrl":"https://doi.org/10.1007/s12561-023-09375-9","url":null,"abstract":"Abstract Teaching statistics through engaging applications to contemporary large-scale datasets is essential to attracting students to the field. To this end, we developed a hands-on, week-long workshop for senior high-school or junior undergraduate students, without prior knowledge in statistical genetics but with some basic knowledge in data science, to conduct their own genome-wide association study (GWAS). The GWAS was performed for open source gene expression data, using publicly available human genetics data. Assisted by a detailed instruction manual, students were able to obtain $$sim$$ <mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\"> <mml:mo>∼</mml:mo> </mml:math> 1.4 million p-values from a real scientific study, within several days. This early motivation kept students engaged in learning the theories that support their results, including regression, data visualization, results interpretation, and large-scale multiple hypothesis testing. To further their learning motivation by emphasizing the personal connection to this type of data analysis, students were encouraged to make short presentations about how GWAS has provided insights into the genetic basis of diseases that are present in their friends or families. The appended open source, step-by-step instruction manual includes descriptions of the datasets used, the software needed, and results from the workshop. Additionally, scripts used in the workshop are archived on Github and Zenodo to further enhance reproducible research and training.","PeriodicalId":45094,"journal":{"name":"Statistics in Biosciences","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135315171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}