首页 > 最新文献

Annual Review of Biomedical Data Science最新文献

英文 中文
Human Microbiomes and Disease for the Biomedical Data Scientist. 生物医学数据科学家的人类微生物组和疾病。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-043017
Jonathan L Golob

The human microbiome is complex, variable from person to person, essential for health, and related to both the risk for disease and the efficacy of our treatments. There are robust techniques to describe microbiota with high-throughput sequencing, and there are hundreds of thousands of already-sequenced specimens in public archives. The promise remains to use the microbiome both as a prognostic factor and as a target for precision medicine. However, when used as an input in biomedical data science modeling, the microbiome presents unique challenges. Here, we review the most common techniques used to describe microbial communities, explore these unique challenges, and discuss the more successful approaches for biomedical data scientists seeking to use the microbiome as an input in their studies.

人体微生物群是复杂的,因人而异,对健康至关重要,与疾病风险和治疗效果有关。有强大的技术可以用高通量测序来描述微生物群,并且在公共档案中有数十万个已经测序的标本。利用微生物组作为预测因素和精准医疗的目标仍然是有希望的。然而,当用作生物医学数据科学建模的输入时,微生物组呈现出独特的挑战。在这里,我们回顾了用于描述微生物群落的最常用技术,探索了这些独特的挑战,并讨论了生物医学数据科学家寻求将微生物组作为研究输入的更成功的方法。
{"title":"Human Microbiomes and Disease for the Biomedical Data Scientist.","authors":"Jonathan L Golob","doi":"10.1146/annurev-biodatasci-020722-043017","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-043017","url":null,"abstract":"<p><p>The human microbiome is complex, variable from person to person, essential for health, and related to both the risk for disease and the efficacy of our treatments. There are robust techniques to describe microbiota with high-throughput sequencing, and there are hundreds of thousands of already-sequenced specimens in public archives. The promise remains to use the microbiome both as a prognostic factor and as a target for precision medicine. However, when used as an input in biomedical data science modeling, the microbiome presents unique challenges. Here, we review the most common techniques used to describe microbial communities, explore these unique challenges, and discuss the more successful approaches for biomedical data scientists seeking to use the microbiome as an input in their studies.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"259-273"},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single-Cell RNA Sequencing for Studying Human Cancers. 单细胞RNA测序用于研究人类癌症。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-091857
Dvir Aran

Since the first publication a decade ago describing the use of single-cell RNA sequencing (scRNA-seq) in the context of cancer, over 200 datasets and thousands of scRNA-seq studies have been published in cancer biology. scRNA-seq technologies have been applied across dozens of cancer types and a diverse array of study designs to improve our understanding of tumor biology, the tumor microenvironment, and therapeutic responses, and scRNA-seq is on the verge of being used to improve decision-making in the clinic. Computational methodologies and analytical pipelines are key in facilitating scRNA-seq research. Numerous computational methods utilizing the most advanced tools in data science have been developed to extract meaningful insights. Here, we review the advancements in cancer biology gained by scRNA-seq and discuss the computational challenges of the technology that are specific to cancer research.

自十年前首次发表描述单细胞RNA测序(scRNA-seq)在癌症背景下的使用以来,已经在癌症生物学中发表了200多个数据集和数千个scRNA-seq研究。scRNA-seq技术已应用于数十种癌症类型和多种研究设计,以提高我们对肿瘤生物学、肿瘤微环境和治疗反应的理解,并且scRNA-seq即将用于改善临床决策。计算方法和分析管道是促进scRNA-seq研究的关键。利用数据科学中最先进的工具,已经开发了许多计算方法来提取有意义的见解。在这里,我们回顾了scRNA-seq在癌症生物学方面取得的进展,并讨论了该技术在癌症研究中所面临的计算挑战。
{"title":"Single-Cell RNA Sequencing for Studying Human Cancers.","authors":"Dvir Aran","doi":"10.1146/annurev-biodatasci-020722-091857","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-091857","url":null,"abstract":"<p><p>Since the first publication a decade ago describing the use of single-cell RNA sequencing (scRNA-seq) in the context of cancer, over 200 datasets and thousands of scRNA-seq studies have been published in cancer biology. scRNA-seq technologies have been applied across dozens of cancer types and a diverse array of study designs to improve our understanding of tumor biology, the tumor microenvironment, and therapeutic responses, and scRNA-seq is on the verge of being used to improve decision-making in the clinic. Computational methodologies and analytical pipelines are key in facilitating scRNA-seq research. Numerous computational methods utilizing the most advanced tools in data science have been developed to extract meaningful insights. Here, we review the advancements in cancer biology gained by scRNA-seq and discuss the computational challenges of the technology that are specific to cancer research.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"1-22"},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9967040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Challenges and Opportunities for Data Science in Women's Health. 妇女健康数据科学的挑战与机遇。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-04-11 DOI: 10.1146/annurev-biodatasci-020722-105958
Todd L Edwards, Catherine A Greene, Jacqueline A Piekos, Jacklyn N Hellwege, Gabrielle Hampton, Elizabeth A Jasper, Digna R Velez Edwards

The intersection of women's health and data science is a field of research that has historically trailed other fields, but more recently it has gained momentum. This growth is being driven not only by new investigators who are moving into this area but also by the significant opportunities that have emerged in new methodologies, resources, and technologies in data science. Here, we describe some of the resources and methods being used by women's health researchers today to meet challenges in biomedical data science. We also describe the opportunities and limitations of applying these approaches to advance women's health outcomes and the future of the field, with emphasis on repurposing existing methodologies for women's health.

女性健康与数据科学的交叉研究是一个历来落后于其他领域的研究领域,但最近却获得了强劲的发展势头。推动这一增长的原因不仅有新的研究人员进入这一领域,还有数据科学的新方法、新资源和新技术带来的巨大机遇。在此,我们将介绍当今妇女健康研究人员为应对生物医学数据科学挑战而使用的一些资源和方法。我们还介绍了应用这些方法促进妇女健康成果的机会和局限性,以及该领域的未来,重点是将现有方法重新用于妇女健康。
{"title":"Challenges and Opportunities for Data Science in Women's Health.","authors":"Todd L Edwards, Catherine A Greene, Jacqueline A Piekos, Jacklyn N Hellwege, Gabrielle Hampton, Elizabeth A Jasper, Digna R Velez Edwards","doi":"10.1146/annurev-biodatasci-020722-105958","DOIUrl":"10.1146/annurev-biodatasci-020722-105958","url":null,"abstract":"<p><p>The intersection of women's health and data science is a field of research that has historically trailed other fields, but more recently it has gained momentum. This growth is being driven not only by new investigators who are moving into this area but also by the significant opportunities that have emerged in new methodologies, resources, and technologies in data science. Here, we describe some of the resources and methods being used by women's health researchers today to meet challenges in biomedical data science. We also describe the opportunities and limitations of applying these approaches to advance women's health outcomes and the future of the field, with emphasis on repurposing existing methodologies for women's health.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"23-45"},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10877578/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9967041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Overview of Deep Generative Models in Functional and Evolutionary Genomics. 功能和进化基因组学中的深度生成模型概述。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-05-03 DOI: 10.1146/annurev-biodatasci-020722-115651
Burak Yelmen, Flora Jay

Following the widespread use of deep learning for genomics, deep generative modeling is also becoming a viable methodology for the broad field. Deep generative models (DGMs) can learn the complex structure of genomic data and allow researchers to generate novel genomic instances that retain the real characteristics of the original dataset. Aside from data generation, DGMs can also be used for dimensionality reduction by mapping the data space to a latent space, as well as for prediction tasks via exploitation of this learned mapping or supervised/semi-supervised DGM designs. In this review, we briefly introduce generative modeling and two currently prevailing architectures, we present conceptual applications along with notable examples in functional and evolutionary genomics, and we provide our perspective on potential challenges and future directions.

随着深度学习在基因组学领域的广泛应用,深度生成模型也正在成为这一广泛领域的可行方法。深度生成模型(DGM)可以学习基因组数据的复杂结构,使研究人员能够生成保留原始数据集真实特征的新型基因组实例。除了生成数据,DGM 还可以通过将数据空间映射到潜在空间来降低维度,以及通过利用学习到的映射或监督/半监督 DGM 设计来完成预测任务。在这篇综述中,我们简要介绍了生成建模和目前流行的两种架构,介绍了功能基因组学和进化基因组学中的概念应用和著名实例,并对潜在挑战和未来方向提出了自己的看法。
{"title":"An Overview of Deep Generative Models in Functional and Evolutionary Genomics.","authors":"Burak Yelmen, Flora Jay","doi":"10.1146/annurev-biodatasci-020722-115651","DOIUrl":"10.1146/annurev-biodatasci-020722-115651","url":null,"abstract":"<p><p>Following the widespread use of deep learning for genomics, deep generative modeling is also becoming a viable methodology for the broad field. Deep generative models (DGMs) can learn the complex structure of genomic data and allow researchers to generate novel genomic instances that retain the real characteristics of the original dataset. Aside from data generation, DGMs can also be used for dimensionality reduction by mapping the data space to a latent space, as well as for prediction tasks via exploitation of this learned mapping or supervised/semi-supervised DGM designs. In this review, we briefly introduce generative modeling and two currently prevailing architectures, we present conceptual applications along with notable examples in functional and evolutionary genomics, and we provide our perspective on potential challenges and future directions.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"173-189"},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9967062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human Genomics of COVID-19 Pneumonia: Contributions of Rare and Common Variants. COVID-19 肺炎的人类基因组学:罕见和常见变异的贡献
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-05-17 DOI: 10.1146/annurev-biodatasci-020222-021705
Aurélie Cobat, Qian Zhang, Laurent Abel, Jean-Laurent Casanova, Jacques Fellay

SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) infection is silent or benign in most infected individuals, but causes hypoxemic COVID-19 pneumonia in about 10% of cases. We review studies of the human genetics of life-threatening COVID-19 pneumonia, focusing on both rare and common variants. Large-scale genome-wide association studies have identified more than 20 common loci robustly associated with COVID-19 pneumonia with modest effect sizes, some implicating genes expressed in the lungs or leukocytes. The most robust association, on chromosome 3, concerns a haplotype inherited from Neanderthals. Sequencing studies focusing on rare variants with a strong effect have been particularly successful, identifying inborn errors of type I interferon (IFN) immunity in 1-5% of unvaccinated patients with critical pneumonia, and their autoimmune phenocopy, autoantibodies against type I IFN, in another 15-20% of cases. Our growing understanding of the impact of human genetic variation on immunity to SARS-CoV-2 is enabling health systems to improve protection for individuals and populations.

SARS-CoV-2(严重急性呼吸系统综合征冠状病毒 2)感染在大多数感染者中是无症状或良性的,但在约 10% 的病例中会引起低氧血症 COVID-19 肺炎。我们回顾了危及生命的 COVID-19 肺炎的人类遗传学研究,重点关注罕见变异和常见变异。大规模的全基因组关联研究发现了 20 多个与 COVID-19 肺炎密切相关的常见基因位点,其效应大小适中,其中一些与肺部或白细胞中表达的基因有关。3号染色体上的一个单倍型与COVID-19肺炎关系最为密切。在未接种疫苗的重症肺炎患者中,有1%-5%的患者存在I型干扰素(IFN)免疫先天性错误,另有15%-20%的患者存在自身免疫表型,即抗I型干扰素的自身抗体。我们对人类基因变异对 SARS-CoV-2 免疫力的影响有了越来越多的了解,这使卫生系统能够改善对个人和人群的保护。
{"title":"Human Genomics of COVID-19 Pneumonia: Contributions of Rare and Common Variants.","authors":"Aurélie Cobat, Qian Zhang, Laurent Abel, Jean-Laurent Casanova, Jacques Fellay","doi":"10.1146/annurev-biodatasci-020222-021705","DOIUrl":"10.1146/annurev-biodatasci-020222-021705","url":null,"abstract":"<p><p>SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) infection is silent or benign in most infected individuals, but causes hypoxemic COVID-19 pneumonia in about 10% of cases. We review studies of the human genetics of life-threatening COVID-19 pneumonia, focusing on both rare and common variants. Large-scale genome-wide association studies have identified more than 20 common loci robustly associated with COVID-19 pneumonia with modest effect sizes, some implicating genes expressed in the lungs or leukocytes. The most robust association, on chromosome 3, concerns a haplotype inherited from Neanderthals. Sequencing studies focusing on rare variants with a strong effect have been particularly successful, identifying inborn errors of type I interferon (IFN) immunity in 1-5% of unvaccinated patients with critical pneumonia, and their autoimmune phenocopy, autoantibodies against type I IFN, in another 15-20% of cases. Our growing understanding of the impact of human genetic variation on immunity to SARS-CoV-2 is enabling health systems to improve protection for individuals and populations.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"465-486"},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10879986/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining Molecular and Radiomic Features for Risk Assessment in Breast Cancer. 结合分子和放射学特征评估乳腺癌的风险。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-092748
Alex A Nguyen, Anne Marie McCarthy, Despina Kontos

Breast cancer risk is highly variable within the population and current research is leading the shift toward personalized medicine. By accurately assessing an individual woman's risk, we can reduce the risk of over/undertreatment by preventing unnecessary procedures or by elevating screening procedures. Breast density measured from conventional mammography has been established as one of the most dominant risk factors for breast cancer; however, it is currently limited by its ability to characterize more complex breast parenchymal patterns that have been shown to provide additional information to strengthen cancer risk models. Molecular factors ranging from high penetrance, or high likelihood that a mutation will show signs and symptoms of the disease, to combinations of gene mutations with low penetrance have shown promise for augmenting risk assessment. Although imaging biomarkers and molecular biomarkers have both individually demonstrated improved performance in risk assessment, few studies have evaluated them together. This review aims to highlight the current state of the art in breast cancer risk assessment using imaging and genetic biomarkers.

乳腺癌的风险在人群中是高度可变的,目前的研究正引领着向个性化医疗的转变。通过准确评估女性个体的风险,我们可以通过避免不必要的手术或提高筛查程序来减少过度/治疗不足的风险。传统乳房x光检查测量的乳腺密度已被确定为乳腺癌最主要的危险因素之一;然而,它目前的局限性在于其表征更复杂的乳腺实质模式的能力,这些模式已被证明为加强癌症风险模型提供了额外的信息。从高外显率(或突变极有可能表现出疾病的体征和症状)到低外显率的基因突变组合等分子因素显示出增加风险评估的希望。虽然成像生物标志物和分子生物标志物都单独证明了在风险评估方面的性能提高,但很少有研究将它们结合起来进行评估。本文综述了利用影像学和遗传生物标志物进行乳腺癌风险评估的最新进展。
{"title":"Combining Molecular and Radiomic Features for Risk Assessment in Breast Cancer.","authors":"Alex A Nguyen,&nbsp;Anne Marie McCarthy,&nbsp;Despina Kontos","doi":"10.1146/annurev-biodatasci-020722-092748","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-092748","url":null,"abstract":"<p><p>Breast cancer risk is highly variable within the population and current research is leading the shift toward personalized medicine. By accurately assessing an individual woman's risk, we can reduce the risk of over/undertreatment by preventing unnecessary procedures or by elevating screening procedures. Breast density measured from conventional mammography has been established as one of the most dominant risk factors for breast cancer; however, it is currently limited by its ability to characterize more complex breast parenchymal patterns that have been shown to provide additional information to strengthen cancer risk models. Molecular factors ranging from high penetrance, or high likelihood that a mutation will show signs and symptoms of the disease, to combinations of gene mutations with low penetrance have shown promise for augmenting risk assessment. Although imaging biomarkers and molecular biomarkers have both individually demonstrated improved performance in risk assessment, few studies have evaluated them together. This review aims to highlight the current state of the art in breast cancer risk assessment using imaging and genetic biomarkers.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"299-311"},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9967073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward Identification of Functional Sequences and Variants in Noncoding DNA. 非编码DNA功能序列和变异的鉴定。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-122120-110102
Remo Monti, Uwe Ohler

Understanding the noncoding part of the genome, which encodes gene regulation, is necessary to identify genetic mechanisms of disease and translate findings from genome-wide association studies into actionable results for treatments and personalized care. Here we provide an overview of the computational analysis of noncoding regions, starting from gene-regulatory mechanisms and their representation in data. Deep learning methods, when applied to these data, highlight important regulatory sequence elements and predict the functional effects of genetic variants. These and other algorithms are used to predict damaging sequence variants. Finally, we introduce rare-variant association tests that incorporate functional annotations and predictions in order to increase interpretability and statistical power.

了解基因组中编码基因调控的非编码部分,对于确定疾病的遗传机制和将全基因组关联研究的发现转化为治疗和个性化护理的可操作结果是必要的。在这里,我们从基因调控机制及其在数据中的表现开始,概述了非编码区域的计算分析。当将深度学习方法应用于这些数据时,可以突出重要的调控序列元素并预测遗传变异的功能影响。这些和其他算法用于预测破坏性序列变异。最后,我们引入了包含功能注释和预测的罕见变量关联测试,以提高可解释性和统计能力。
{"title":"Toward Identification of Functional Sequences and Variants in Noncoding DNA.","authors":"Remo Monti,&nbsp;Uwe Ohler","doi":"10.1146/annurev-biodatasci-122120-110102","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-110102","url":null,"abstract":"<p><p>Understanding the noncoding part of the genome, which encodes gene regulation, is necessary to identify genetic mechanisms of disease and translate findings from genome-wide association studies into actionable results for treatments and personalized care. Here we provide an overview of the computational analysis of noncoding regions, starting from gene-regulatory mechanisms and their representation in data. Deep learning methods, when applied to these data, highlight important regulatory sequence elements and predict the functional effects of genetic variants. These and other algorithms are used to predict damaging sequence variants. Finally, we introduce rare-variant association tests that incorporate functional annotations and predictions in order to increase interpretability and statistical power.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"191-210"},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10024242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Challenges and Progress in Designing Broad-Spectrum Vaccines Against Rapidly Mutating Viruses. 针对快速变异病毒设计广谱疫苗的挑战与进展。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-041304
Rishi Bedi, Nicholas L Bayless, Jacob Glanville

Viruses evolve to evade prior immunity, causing significant disease burden. Vaccine effectiveness deteriorates as pathogens mutate, requiring redesign. This is a problem that has grown worse due to population increase, global travel, and farming practices. Thus, there is significant interest in developing broad-spectrum vaccines that mitigate disease severity and ideally inhibit disease transmission without requiring frequent updates. Even in cases where vaccines against rapidly mutating pathogens have been somewhat effective, such as seasonal influenza and SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), designing vaccines that provide broad-spectrum immunity against routinely observed viral variation remains a desirable but not yet achieved goal. This review highlights the key theoretical advances in understanding the interplay between polymorphism and vaccine efficacy, challenges in designing broad-spectrum vaccines, and technology advances and possible avenues forward. We also discuss data-driven approaches for monitoring vaccine efficacy and predicting viral escape from vaccine-induced protection. In each case, we consider illustrative examples in vaccine development from influenza, SARS-CoV-2, and HIV (human immunodeficiency virus)-three examples of highly prevalent rapidly mutating viruses with distinct phylogenetics and unique histories of vaccine technology development.

病毒进化以逃避先前的免疫,造成重大的疾病负担。疫苗的有效性随着病原体的突变而恶化,需要重新设计。由于人口增长、全球旅行和农业实践,这个问题变得越来越严重。因此,人们对开发能够减轻疾病严重程度并理想地抑制疾病传播而无需频繁更新的广谱疫苗非常感兴趣。即使在针对快速变异病原体的疫苗有一定效果的情况下,如季节性流感和SARS-CoV-2(严重急性呼吸综合征冠状病毒2),设计针对常规观察到的病毒变异提供广谱免疫的疫苗仍然是一个理想的目标,但尚未实现。这篇综述强调了在理解多态性与疫苗效力之间相互作用方面的关键理论进展,设计广谱疫苗的挑战,以及技术进步和可能的发展途径。我们还讨论了监测疫苗效力和预测病毒从疫苗诱导的保护中逃逸的数据驱动方法。在每种情况下,我们都考虑了流感、SARS-CoV-2和HIV(人类免疫缺陷病毒)疫苗开发中的说明性例子,这三个例子都是高度流行的快速变异病毒,具有不同的系统发育和独特的疫苗技术开发历史。
{"title":"Challenges and Progress in Designing Broad-Spectrum Vaccines Against Rapidly Mutating Viruses.","authors":"Rishi Bedi,&nbsp;Nicholas L Bayless,&nbsp;Jacob Glanville","doi":"10.1146/annurev-biodatasci-020722-041304","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-041304","url":null,"abstract":"<p><p>Viruses evolve to evade prior immunity, causing significant disease burden. Vaccine effectiveness deteriorates as pathogens mutate, requiring redesign. This is a problem that has grown worse due to population increase, global travel, and farming practices. Thus, there is significant interest in developing broad-spectrum vaccines that mitigate disease severity and ideally inhibit disease transmission without requiring frequent updates. Even in cases where vaccines against rapidly mutating pathogens have been somewhat effective, such as seasonal influenza and SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), designing vaccines that provide broad-spectrum immunity against routinely observed viral variation remains a desirable but not yet achieved goal. This review highlights the key theoretical advances in understanding the interplay between polymorphism and vaccine efficacy, challenges in designing broad-spectrum vaccines, and technology advances and possible avenues forward. We also discuss data-driven approaches for monitoring vaccine efficacy and predicting viral escape from vaccine-induced protection. In each case, we consider illustrative examples in vaccine development from influenza, SARS-CoV-2, and HIV (human immunodeficiency virus)-three examples of highly prevalent rapidly mutating viruses with distinct phylogenetics and unique histories of vaccine technology development.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"419-441"},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism. 自闭症神经精神表型的数据科学和机器学习综述和路线图。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-05-03 DOI: 10.1146/annurev-biodatasci-020722-125454
Peter Washington, Dennis P Wall

Autism spectrum disorder (autism) is a neurodevelopmental delay that affects at least 1 in 44 children. Like many neurological disorder phenotypes, the diagnostic features are observable, can be tracked over time, and can be managed or even eliminated through proper therapy and treatments. However, there are major bottlenecks in the diagnostic, therapeutic, and longitudinal tracking pipelines for autism and related neurodevelopmental delays, creating an opportunity for novel data science solutions to augment and transform existing workflows and provide increased access to services for affected families. Several efforts previously conducted by a multitude of research labs have spawned great progress toward improved digital diagnostics and digital therapies for children with autism. We review the literature on digital health methods for autism behavior quantification and beneficial therapies using data science. We describe both case-control studies and classification systems for digital phenotyping. We then discuss digital diagnostics and therapeutics that integrate machine learning models of autism-related behaviors, including the factors that must be addressed for translational use. Finally, we describe ongoing challenges and potential opportunities for the field of autism data science. Given the heterogeneous nature of autism and the complexities of the relevant behaviors, this review contains insights that are relevant to neurological behavior analysis and digital psychiatry more broadly.

自闭症谱系障碍是一种神经发育迟缓,至少每44名儿童中就有1名患有自闭症。与许多神经系统疾病表型一样,诊断特征是可观察的,可以随着时间的推移进行跟踪,并且可以通过适当的治疗和治疗来控制甚至消除。然而,自闭症和相关神经发育迟缓的诊断、治疗和纵向跟踪管道存在重大瓶颈,这为新的数据科学解决方案提供了机会,以增强和改造现有的工作流程,并为受影响的家庭提供更多的服务。许多研究实验室此前进行的几项努力在改善自闭症儿童的数字诊断和数字治疗方面取得了巨大进展。我们回顾了有关自闭症行为量化的数字健康方法和使用数据科学的有益疗法的文献。我们描述了数字表型的病例对照研究和分类系统。然后,我们讨论了整合自闭症相关行为的机器学习模型的数字诊断和治疗方法,包括转化使用必须解决的因素。最后,我们描述了自闭症数据科学领域正在面临的挑战和潜在的机遇。鉴于自闭症的异质性和相关行为的复杂性,这篇综述包含了与更广泛的神经行为分析和数字精神病学相关的见解。
{"title":"A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism.","authors":"Peter Washington, Dennis P Wall","doi":"10.1146/annurev-biodatasci-020722-125454","DOIUrl":"10.1146/annurev-biodatasci-020722-125454","url":null,"abstract":"<p><p>Autism spectrum disorder (autism) is a neurodevelopmental delay that affects at least 1 in 44 children. Like many neurological disorder phenotypes, the diagnostic features are observable, can be tracked over time, and can be managed or even eliminated through proper therapy and treatments. However, there are major bottlenecks in the diagnostic, therapeutic, and longitudinal tracking pipelines for autism and related neurodevelopmental delays, creating an opportunity for novel data science solutions to augment and transform existing workflows and provide increased access to services for affected families. Several efforts previously conducted by a multitude of research labs have spawned great progress toward improved digital diagnostics and digital therapies for children with autism. We review the literature on digital health methods for autism behavior quantification and beneficial therapies using data science. We describe both case-control studies and classification systems for digital phenotyping. We then discuss digital diagnostics and therapeutics that integrate machine learning models of autism-related behaviors, including the factors that must be addressed for translational use. Finally, we describe ongoing challenges and potential opportunities for the field of autism data science. Given the heterogeneous nature of autism and the complexities of the relevant behaviors, this review contains insights that are relevant to neurological behavior analysis and digital psychiatry more broadly.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"211-228"},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11093217/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategies for the Genomic Analysis of Admixed Populations. 混血人群基因组分析策略。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-04-26 DOI: 10.1146/annurev-biodatasci-020722-014310
Taotao Tan, Elizabeth G Atkinson

Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.

混血人群占全球人类遗传多样性的很大一部分,但他们往往被排除在基因组学分析之外。这种排斥是有问题的,因为它导致了对不同人群遗传结构和历史的理解以及跨人群基因组医学表现的差异。混血人群尤其面临统计方面的挑战,因为他们继承了来自多个来源人群的基因组片段--这也是他们历来被排除在基因研究之外的主要原因。然而,近年来,越来越多的统计方法和软件工具被开发出来,以在基因组学分析中考虑和利用混杂性。在此,我们将对此类计算策略进行调查,以便在大规模基因组学研究中对混血人群进行充分校准。
{"title":"Strategies for the Genomic Analysis of Admixed Populations.","authors":"Taotao Tan, Elizabeth G Atkinson","doi":"10.1146/annurev-biodatasci-020722-014310","DOIUrl":"10.1146/annurev-biodatasci-020722-014310","url":null,"abstract":"<p><p>Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"6 ","pages":"105-127"},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10871708/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10023273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Review of Biomedical Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1