首页 > 最新文献

Annual Review of Biomedical Data Science最新文献

英文 中文
Challenges and Progress in Designing Broad-Spectrum Vaccines Against Rapidly Mutating Viruses. 针对快速变异病毒设计广谱疫苗的挑战与进展。
IF 6 Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-041304
Rishi Bedi, Nicholas L Bayless, Jacob Glanville

Viruses evolve to evade prior immunity, causing significant disease burden. Vaccine effectiveness deteriorates as pathogens mutate, requiring redesign. This is a problem that has grown worse due to population increase, global travel, and farming practices. Thus, there is significant interest in developing broad-spectrum vaccines that mitigate disease severity and ideally inhibit disease transmission without requiring frequent updates. Even in cases where vaccines against rapidly mutating pathogens have been somewhat effective, such as seasonal influenza and SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), designing vaccines that provide broad-spectrum immunity against routinely observed viral variation remains a desirable but not yet achieved goal. This review highlights the key theoretical advances in understanding the interplay between polymorphism and vaccine efficacy, challenges in designing broad-spectrum vaccines, and technology advances and possible avenues forward. We also discuss data-driven approaches for monitoring vaccine efficacy and predicting viral escape from vaccine-induced protection. In each case, we consider illustrative examples in vaccine development from influenza, SARS-CoV-2, and HIV (human immunodeficiency virus)-three examples of highly prevalent rapidly mutating viruses with distinct phylogenetics and unique histories of vaccine technology development.

病毒进化以逃避先前的免疫,造成重大的疾病负担。疫苗的有效性随着病原体的突变而恶化,需要重新设计。由于人口增长、全球旅行和农业实践,这个问题变得越来越严重。因此,人们对开发能够减轻疾病严重程度并理想地抑制疾病传播而无需频繁更新的广谱疫苗非常感兴趣。即使在针对快速变异病原体的疫苗有一定效果的情况下,如季节性流感和SARS-CoV-2(严重急性呼吸综合征冠状病毒2),设计针对常规观察到的病毒变异提供广谱免疫的疫苗仍然是一个理想的目标,但尚未实现。这篇综述强调了在理解多态性与疫苗效力之间相互作用方面的关键理论进展,设计广谱疫苗的挑战,以及技术进步和可能的发展途径。我们还讨论了监测疫苗效力和预测病毒从疫苗诱导的保护中逃逸的数据驱动方法。在每种情况下,我们都考虑了流感、SARS-CoV-2和HIV(人类免疫缺陷病毒)疫苗开发中的说明性例子,这三个例子都是高度流行的快速变异病毒,具有不同的系统发育和独特的疫苗技术开发历史。
{"title":"Challenges and Progress in Designing Broad-Spectrum Vaccines Against Rapidly Mutating Viruses.","authors":"Rishi Bedi,&nbsp;Nicholas L Bayless,&nbsp;Jacob Glanville","doi":"10.1146/annurev-biodatasci-020722-041304","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-041304","url":null,"abstract":"<p><p>Viruses evolve to evade prior immunity, causing significant disease burden. Vaccine effectiveness deteriorates as pathogens mutate, requiring redesign. This is a problem that has grown worse due to population increase, global travel, and farming practices. Thus, there is significant interest in developing broad-spectrum vaccines that mitigate disease severity and ideally inhibit disease transmission without requiring frequent updates. Even in cases where vaccines against rapidly mutating pathogens have been somewhat effective, such as seasonal influenza and SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), designing vaccines that provide broad-spectrum immunity against routinely observed viral variation remains a desirable but not yet achieved goal. This review highlights the key theoretical advances in understanding the interplay between polymorphism and vaccine efficacy, challenges in designing broad-spectrum vaccines, and technology advances and possible avenues forward. We also discuss data-driven approaches for monitoring vaccine efficacy and predicting viral escape from vaccine-induced protection. In each case, we consider illustrative examples in vaccine development from influenza, SARS-CoV-2, and HIV (human immunodeficiency virus)-three examples of highly prevalent rapidly mutating viruses with distinct phylogenetics and unique histories of vaccine technology development.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward Identification of Functional Sequences and Variants in Noncoding DNA. 非编码DNA功能序列和变异的鉴定。
IF 6 Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-122120-110102
Remo Monti, Uwe Ohler

Understanding the noncoding part of the genome, which encodes gene regulation, is necessary to identify genetic mechanisms of disease and translate findings from genome-wide association studies into actionable results for treatments and personalized care. Here we provide an overview of the computational analysis of noncoding regions, starting from gene-regulatory mechanisms and their representation in data. Deep learning methods, when applied to these data, highlight important regulatory sequence elements and predict the functional effects of genetic variants. These and other algorithms are used to predict damaging sequence variants. Finally, we introduce rare-variant association tests that incorporate functional annotations and predictions in order to increase interpretability and statistical power.

了解基因组中编码基因调控的非编码部分,对于确定疾病的遗传机制和将全基因组关联研究的发现转化为治疗和个性化护理的可操作结果是必要的。在这里,我们从基因调控机制及其在数据中的表现开始,概述了非编码区域的计算分析。当将深度学习方法应用于这些数据时,可以突出重要的调控序列元素并预测遗传变异的功能影响。这些和其他算法用于预测破坏性序列变异。最后,我们引入了包含功能注释和预测的罕见变量关联测试,以提高可解释性和统计能力。
{"title":"Toward Identification of Functional Sequences and Variants in Noncoding DNA.","authors":"Remo Monti,&nbsp;Uwe Ohler","doi":"10.1146/annurev-biodatasci-122120-110102","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-110102","url":null,"abstract":"<p><p>Understanding the noncoding part of the genome, which encodes gene regulation, is necessary to identify genetic mechanisms of disease and translate findings from genome-wide association studies into actionable results for treatments and personalized care. Here we provide an overview of the computational analysis of noncoding regions, starting from gene-regulatory mechanisms and their representation in data. Deep learning methods, when applied to these data, highlight important regulatory sequence elements and predict the functional effects of genetic variants. These and other algorithms are used to predict damaging sequence variants. Finally, we introduce rare-variant association tests that incorporate functional annotations and predictions in order to increase interpretability and statistical power.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10024242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism. 自闭症神经精神表型的数据科学和机器学习综述和路线图。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-05-03 DOI: 10.1146/annurev-biodatasci-020722-125454
Peter Washington, Dennis P Wall

Autism spectrum disorder (autism) is a neurodevelopmental delay that affects at least 1 in 44 children. Like many neurological disorder phenotypes, the diagnostic features are observable, can be tracked over time, and can be managed or even eliminated through proper therapy and treatments. However, there are major bottlenecks in the diagnostic, therapeutic, and longitudinal tracking pipelines for autism and related neurodevelopmental delays, creating an opportunity for novel data science solutions to augment and transform existing workflows and provide increased access to services for affected families. Several efforts previously conducted by a multitude of research labs have spawned great progress toward improved digital diagnostics and digital therapies for children with autism. We review the literature on digital health methods for autism behavior quantification and beneficial therapies using data science. We describe both case-control studies and classification systems for digital phenotyping. We then discuss digital diagnostics and therapeutics that integrate machine learning models of autism-related behaviors, including the factors that must be addressed for translational use. Finally, we describe ongoing challenges and potential opportunities for the field of autism data science. Given the heterogeneous nature of autism and the complexities of the relevant behaviors, this review contains insights that are relevant to neurological behavior analysis and digital psychiatry more broadly.

自闭症谱系障碍是一种神经发育迟缓,至少每44名儿童中就有1名患有自闭症。与许多神经系统疾病表型一样,诊断特征是可观察的,可以随着时间的推移进行跟踪,并且可以通过适当的治疗和治疗来控制甚至消除。然而,自闭症和相关神经发育迟缓的诊断、治疗和纵向跟踪管道存在重大瓶颈,这为新的数据科学解决方案提供了机会,以增强和改造现有的工作流程,并为受影响的家庭提供更多的服务。许多研究实验室此前进行的几项努力在改善自闭症儿童的数字诊断和数字治疗方面取得了巨大进展。我们回顾了有关自闭症行为量化的数字健康方法和使用数据科学的有益疗法的文献。我们描述了数字表型的病例对照研究和分类系统。然后,我们讨论了整合自闭症相关行为的机器学习模型的数字诊断和治疗方法,包括转化使用必须解决的因素。最后,我们描述了自闭症数据科学领域正在面临的挑战和潜在的机遇。鉴于自闭症的异质性和相关行为的复杂性,这篇综述包含了与更广泛的神经行为分析和数字精神病学相关的见解。
{"title":"A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism.","authors":"Peter Washington, Dennis P Wall","doi":"10.1146/annurev-biodatasci-020722-125454","DOIUrl":"10.1146/annurev-biodatasci-020722-125454","url":null,"abstract":"<p><p>Autism spectrum disorder (autism) is a neurodevelopmental delay that affects at least 1 in 44 children. Like many neurological disorder phenotypes, the diagnostic features are observable, can be tracked over time, and can be managed or even eliminated through proper therapy and treatments. However, there are major bottlenecks in the diagnostic, therapeutic, and longitudinal tracking pipelines for autism and related neurodevelopmental delays, creating an opportunity for novel data science solutions to augment and transform existing workflows and provide increased access to services for affected families. Several efforts previously conducted by a multitude of research labs have spawned great progress toward improved digital diagnostics and digital therapies for children with autism. We review the literature on digital health methods for autism behavior quantification and beneficial therapies using data science. We describe both case-control studies and classification systems for digital phenotyping. We then discuss digital diagnostics and therapeutics that integrate machine learning models of autism-related behaviors, including the factors that must be addressed for translational use. Finally, we describe ongoing challenges and potential opportunities for the field of autism data science. Given the heterogeneous nature of autism and the complexities of the relevant behaviors, this review contains insights that are relevant to neurological behavior analysis and digital psychiatry more broadly.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11093217/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategies for the Genomic Analysis of Admixed Populations. 混血人群基因组分析策略。
IF 6 Pub Date : 2023-08-10 Epub Date: 2023-04-26 DOI: 10.1146/annurev-biodatasci-020722-014310
Taotao Tan, Elizabeth G Atkinson

Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.

混血人群占全球人类遗传多样性的很大一部分,但他们往往被排除在基因组学分析之外。这种排斥是有问题的,因为它导致了对不同人群遗传结构和历史的理解以及跨人群基因组医学表现的差异。混血人群尤其面临统计方面的挑战,因为他们继承了来自多个来源人群的基因组片段--这也是他们历来被排除在基因研究之外的主要原因。然而,近年来,越来越多的统计方法和软件工具被开发出来,以在基因组学分析中考虑和利用混杂性。在此,我们将对此类计算策略进行调查,以便在大规模基因组学研究中对混血人群进行充分校准。
{"title":"Strategies for the Genomic Analysis of Admixed Populations.","authors":"Taotao Tan, Elizabeth G Atkinson","doi":"10.1146/annurev-biodatasci-020722-014310","DOIUrl":"10.1146/annurev-biodatasci-020722-014310","url":null,"abstract":"<p><p>Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10871708/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10023273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical Learning Methods for Neuroimaging Data Analysis with Applications. 神经影像数据分析的统计学习方法及其应用。
IF 6 Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-100353
Hongtu Zhu, Tengfei Li, Bingxin Zhao

The aim of this review is to provide a comprehensive survey of statistical challenges in neuroimaging data analysis, from neuroimaging techniques to large-scale neuroimaging studies and statistical learning methods. We briefly review eight popular neuroimaging techniques and their potential applications in neuroscience research and clinical translation. We delineate four themes of neuroimaging data and review major image processing analysis methods for processing neuroimaging data at the individual level. We briefly review four large-scale neuroimaging-related studies and a consortium on imaging genomics and discuss four themes of neuroimaging data analysis at the population level. We review nine major population-based statistical analysis methods and their associated statistical challenges and present recent progress in statistical methodology to address these challenges.

这篇综述的目的是对神经成像数据分析中的统计挑战进行全面的调查,从神经成像技术到大规模神经成像研究和统计学习方法。我们简要回顾了八种流行的神经成像技术及其在神经科学研究和临床翻译中的潜在应用。我们描述了神经成像数据的四个主题,并回顾了在个体水平上处理神经成像数据的主要图像处理分析方法。我们简要回顾了四项大型神经成像相关研究和一个成像基因组学联盟,并讨论了人群水平上神经成像数据分析的四个主题。我们回顾了九种主要的基于人口的统计分析方法及其相关的统计挑战,并介绍了统计方法的最新进展,以应对这些挑战。
{"title":"Statistical Learning Methods for Neuroimaging Data Analysis with Applications.","authors":"Hongtu Zhu,&nbsp;Tengfei Li,&nbsp;Bingxin Zhao","doi":"10.1146/annurev-biodatasci-020722-100353","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-100353","url":null,"abstract":"<p><p>The aim of this review is to provide a comprehensive survey of statistical challenges in neuroimaging data analysis, from neuroimaging techniques to large-scale neuroimaging studies and statistical learning methods. We briefly review eight popular neuroimaging techniques and their potential applications in neuroscience research and clinical translation. We delineate four themes of neuroimaging data and review major image processing analysis methods for processing neuroimaging data at the individual level. We briefly review four large-scale neuroimaging-related studies and a consortium on imaging genomics and discuss four themes of neuroimaging data analysis at the population level. We review nine major population-based statistical analysis methods and their associated statistical challenges and present recent progress in statistical methodology to address these challenges.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10023733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Decoding Aging Hallmarks at the Single-Cell Level. 在单细胞水平上解码衰老特征。
IF 6 Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-120642
Shuai Ma, Xu Chi, Yusheng Cai, Zhejun Ji, Si Wang, Jie Ren, Guang-Hui Liu

Organismal aging exhibits wide-ranging hallmarks in divergent cell types across tissues, organs, and systems. The advancement of single-cell technologies and generation of rich datasets have afforded the scientific community the opportunity to decode these hallmarks of aging at an unprecedented scope and resolution. In this review, we describe the technological advancements and bioinformatic methodologies enabling data interpretation at the cellular level. Then, we outline the application of such technologies for decoding aging hallmarks and potential intervention targets and summarize common themes and context-specific molecular features in representative organ systems across the body. Finally, we provide a brief summary of available databases relevant for aging research and present an outlook on the opportunities in this emerging field.

机体衰老在组织、器官和系统的不同细胞类型中表现出广泛的特征。单细胞技术的进步和丰富数据集的产生为科学界提供了以前所未有的范围和分辨率解码这些衰老特征的机会。在这篇综述中,我们描述了技术进步和生物信息学方法,使数据解释在细胞水平。然后,我们概述了这些技术在解码衰老标志和潜在干预目标方面的应用,并总结了全身代表性器官系统的共同主题和特定环境的分子特征。最后,我们简要总结了与老龄化研究相关的现有数据库,并对这一新兴领域的机会进行了展望。
{"title":"Decoding Aging Hallmarks at the Single-Cell Level.","authors":"Shuai Ma,&nbsp;Xu Chi,&nbsp;Yusheng Cai,&nbsp;Zhejun Ji,&nbsp;Si Wang,&nbsp;Jie Ren,&nbsp;Guang-Hui Liu","doi":"10.1146/annurev-biodatasci-020722-120642","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020722-120642","url":null,"abstract":"<p><p>Organismal aging exhibits wide-ranging hallmarks in divergent cell types across tissues, organs, and systems. The advancement of single-cell technologies and generation of rich datasets have afforded the scientific community the opportunity to decode these hallmarks of aging at an unprecedented scope and resolution. In this review, we describe the technological advancements and bioinformatic methodologies enabling data interpretation at the cellular level. Then, we outline the application of such technologies for decoding aging hallmarks and potential intervention targets and summarize common themes and context-specific molecular features in representative organ systems across the body. Finally, we provide a brief summary of available databases relevant for aging research and present an outlook on the opportunities in this emerging field.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10023274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Recent Developments in Ultralarge and Structure-Based Virtual Screening Approaches. 超大型和基于结构的虚拟筛选方法的最新进展。
IF 6 Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020222-025013
Christoph Gorgulla

Drug development is a wide scientific field that faces many challenges these days. Among them are extremely high development costs, long development times, and a small number of new drugs that are approved each year. New and innovative technologies are needed to solve these problems that make the drug discovery process of small molecules more time and cost efficient, and that allow previously undruggable receptor classes to be targeted, such as protein-protein interactions. Structure-based virtual screenings (SBVSs) have become a leading contender in this context. In this review, we give an introduction to the foundations of SBVSs and survey their progress in the past few years with a focus on ultralarge virtual screenings (ULVSs). We outline key principles of SBVSs, recent success stories, new screening techniques, available deep learning-based docking methods, and promising future research directions. ULVSs have an enormous potential for the development of new small-molecule drugs and are already starting to transform early-stage drug discovery.

药物开发是一个广泛的科学领域,目前面临着许多挑战。其中包括开发成本极高,开发时间长,每年获批的新药数量少。需要新的创新技术来解决这些问题,使小分子药物发现过程更省时、成本更低,并使以前不可药物的受体类别成为靶标,例如蛋白质-蛋白质相互作用。基于结构的虚拟筛选(SBVSs)已成为这方面的主要竞争者。在本文中,我们介绍了超大虚拟筛检的基础,并对近年来的研究进展进行了综述,重点介绍了超大虚拟筛检(ULVSs)。我们概述了SBVSs的关键原理、最近的成功案例、新的筛选技术、可用的基于深度学习的对接方法以及未来的研究方向。ulvs在开发新的小分子药物方面具有巨大的潜力,并且已经开始改变早期药物发现。
{"title":"Recent Developments in Ultralarge and Structure-Based Virtual Screening Approaches.","authors":"Christoph Gorgulla","doi":"10.1146/annurev-biodatasci-020222-025013","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020222-025013","url":null,"abstract":"<p><p>Drug development is a wide scientific field that faces many challenges these days. Among them are extremely high development costs, long development times, and a small number of new drugs that are approved each year. New and innovative technologies are needed to solve these problems that make the drug discovery process of small molecules more time and cost efficient, and that allow previously undruggable receptor classes to be targeted, such as protein-protein interactions. Structure-based virtual screenings (SBVSs) have become a leading contender in this context. In this review, we give an introduction to the foundations of SBVSs and survey their progress in the past few years with a focus on ultralarge virtual screenings (ULVSs). We outline key principles of SBVSs, recent success stories, new screening techniques, available deep learning-based docking methods, and promising future research directions. ULVSs have an enormous potential for the development of new small-molecule drugs and are already starting to transform early-stage drug discovery.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9960543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Noninvasive Prenatal Testing Using Circulating DNA and RNA: Advances, Challenges, and Possibilities. 使用循环DNA和RNA进行无创产前检测:进展、挑战和可能性。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 Epub Date: 2023-05-17 DOI: 10.1146/annurev-biodatasci-020722-094144
Mira N Moufarrej, Diana W Bianchi, Gary M Shaw, David K Stevenson, Stephen R Quake

Prenatal screening using sequencing of circulating cell-free DNA has transformed obstetric care over the past decade and significantly reduced the number of invasive diagnostic procedures like amniocentesis for genetic disorders. Nonetheless, emergency care remains the only option for complications like preeclampsia and preterm birth, two of the most prevalent obstetrical syndromes. Advances in noninvasive prenatal testing expand the scope of precision medicine in obstetric care. In this review, we discuss advances, challenges, and possibilities toward the goal of providing proactive, personalized prenatal care. The highlighted advances focus mainly on cell-free nucleic acids; however, we also review research that uses signals from metabolomics, proteomics, intact cells, and the microbiome. We discuss ethical challenges in providing care. Finally, we look to future possibilities, including redefining disease taxonomy and moving from biomarker correlation to biological causation.

在过去的十年里,使用循环无细胞DNA测序的产前筛查改变了产科护理,并显著减少了羊水穿刺等遗传疾病侵入性诊断程序的数量。尽管如此,紧急护理仍然是先兆子痫和早产等并发症的唯一选择,这两种最常见的产科综合征。无创产前检测的进展扩大了产科护理中精准医学的范围。在这篇综述中,我们讨论了提供主动、个性化产前护理的进展、挑战和可能性。突出的进展主要集中在无细胞核酸上;然而,我们也回顾了使用来自代谢组学、蛋白质组学、完整细胞和微生物组的信号的研究。我们讨论了提供护理的道德挑战。最后,我们展望了未来的可能性,包括重新定义疾病分类学,从生物标志物相关性转向生物因果关系。
{"title":"Noninvasive Prenatal Testing Using Circulating DNA and RNA: Advances, Challenges, and Possibilities.","authors":"Mira N Moufarrej, Diana W Bianchi, Gary M Shaw, David K Stevenson, Stephen R Quake","doi":"10.1146/annurev-biodatasci-020722-094144","DOIUrl":"10.1146/annurev-biodatasci-020722-094144","url":null,"abstract":"<p><p>Prenatal screening using sequencing of circulating cell-free DNA has transformed obstetric care over the past decade and significantly reduced the number of invasive diagnostic procedures like amniocentesis for genetic disorders. Nonetheless, emergency care remains the only option for complications like preeclampsia and preterm birth, two of the most prevalent obstetrical syndromes. Advances in noninvasive prenatal testing expand the scope of precision medicine in obstetric care. In this review, we discuss advances, challenges, and possibilities toward the goal of providing proactive, personalized prenatal care. The highlighted advances focus mainly on cell-free nucleic acids; however, we also review research that uses signals from metabolomics, proteomics, intact cells, and the microbiome. We discuss ethical challenges in providing care. Finally, we look to future possibilities, including redefining disease taxonomy and moving from biomarker correlation to biological causation.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10528197/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9969611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Splice Variants and Isoforms in Transcriptomics and Proteomics. 转录组学和蛋白质组学中剪接变异体和异构体的鉴定。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-020722-044021
Taojunfeng Su, Michael A R Hollas, Ryan T Fellers, Neil L Kelleher

Alternative splicing is pivotal to the regulation of gene expression and protein diversity in eukaryotic cells. The detection of alternative splicing events requires specific omics technologies. Although short-read RNA sequencing has successfully supported a plethora of investigations on alternative splicing, the emerging technologies of long-read RNA sequencing and top-down mass spectrometry open new opportunities to identify alternative splicing and protein isoforms with less ambiguity. Here, we summarize improvements in short-read RNA sequencing for alternative splicing analysis, including percent splicing index estimation and differential analysis. We also review the computational methods used in top-down proteomics analysis regarding proteoform identification, including the construction of databases of protein isoforms and statistical analyses of search results. While many improvements in sequencing and computational methods will result from emerging technologies, there should be future endeavors to increase the effectiveness, integration, and proteome coverage of alternative splicing events.

选择性剪接对真核细胞中基因表达和蛋白质多样性的调节至关重要。选择性剪接事件的检测需要特定的组学技术。尽管短读RNA测序已经成功地支持了对替代剪接的大量研究,但新出现的长读RNA测序和自上而下的质谱技术为识别替代剪接和蛋白质异构体提供了新的机会,而不那么模糊。在这里,我们总结了用于选择性剪接分析的短读RNA测序的改进,包括百分比剪接指数估计和差异分析。我们还回顾了自上而下蛋白质组学分析中使用的蛋白质形态鉴定的计算方法,包括蛋白质异构体数据库的构建和搜索结果的统计分析。虽然测序和计算方法的许多改进将来自新兴技术,但未来应该努力提高替代剪接事件的有效性、整合性和蛋白质组覆盖率。
{"title":"Identification of Splice Variants and Isoforms in Transcriptomics and Proteomics.","authors":"Taojunfeng Su, Michael A R Hollas, Ryan T Fellers, Neil L Kelleher","doi":"10.1146/annurev-biodatasci-020722-044021","DOIUrl":"10.1146/annurev-biodatasci-020722-044021","url":null,"abstract":"<p><p>Alternative splicing is pivotal to the regulation of gene expression and protein diversity in eukaryotic cells. The detection of alternative splicing events requires specific omics technologies. Although short-read RNA sequencing has successfully supported a plethora of investigations on alternative splicing, the emerging technologies of long-read RNA sequencing and top-down mass spectrometry open new opportunities to identify alternative splicing and protein isoforms with less ambiguity. Here, we summarize improvements in short-read RNA sequencing for alternative splicing analysis, including percent splicing index estimation and differential analysis. We also review the computational methods used in top-down proteomics analysis regarding proteoform identification, including the construction of databases of protein isoforms and statistical analyses of search results. While many improvements in sequencing and computational methods will result from emerging technologies, there should be future endeavors to increase the effectiveness, integration, and proteome coverage of alternative splicing events.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10840079/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10339608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The All of Us Data and Research Center: Creating a Secure, Scalable, and Sustainable Ecosystem for Biomedical Research. 我们所有人的数据和研究中心:为生物医学研究创建一个安全、可扩展和可持续的生态系统。
IF 6 Pub Date : 2023-08-10 DOI: 10.1146/annurev-biodatasci-122120-104825
Kelsey R Mayo, Melissa A Basford, Robert J Carroll, Moira Dillon, Heather Fullen, Jesse Leung, Hiral Master, Shimon Rura, Lina Sulieman, Nan Kennedy, Eric Banks, David Bernick, Asmita Gauchan, Lee Lichtenstein, Brandy M Mapes, Kayla Marginean, Steve L Nyemba, Andrea Ramirez, Charissa Rotundo, Keri Wolfe, Weiyi Xia, Romuladus E Azuine, Robert M Cronin, Joshua C Denny, Abel Kho, Christopher Lunt, Bradley Malin, Karthik Natarajan, Consuelo H Wilkins, Hua Xu, George Hripcsak, Dan M Roden, Anthony A Philippakis, David Glazer, Paul A Harris

The All of Us Research Program's Data and Research Center (DRC) was established to help acquire, curate, and provide access to one of the world's largest and most diverse datasets for precision medicine research. Already, over 500,000 participants are enrolled in All of Us, 80% of whom are underrepresented in biomedical research, and data are being analyzed by a community of over 2,300 researchers. The DRC created this thriving data ecosystem by collaborating with engaged participants, innovative program partners, and empowered researchers. In this review, we first describe how the DRC is organized to meet the needs of this broad group of stakeholders. We then outline guiding principles, common challenges, and innovative approaches used to build the All of Us data ecosystem. Finally, we share lessons learned to help others navigate important decisions and trade-offs in building a modern biomedical data platform.

我们所有人研究计划的数据和研究中心(DRC)成立的目的是帮助获取、策划和访问世界上最大、最多样化的精准医学研究数据集之一。已经有超过50万名参与者参加了All of Us,其中80%在生物医学研究中的代表性不足,2300多名研究人员正在分析数据。DRC通过与参与者、创新项目合作伙伴和有能力的研究人员合作,创建了这个蓬勃发展的数据生态系统。在这篇综述中,我们首先描述了刚果民主共和国是如何组织起来以满足这一广泛利益相关者群体的需求的。然后,我们概述了用于构建All of Us数据生态系统的指导原则、共同挑战和创新方法。最后,我们分享经验教训,帮助其他人在构建现代生物医学数据平台时做出重要决策和权衡。
{"title":"The <i>All of Us</i> Data and Research Center: Creating a Secure, Scalable, and Sustainable Ecosystem for Biomedical Research.","authors":"Kelsey R Mayo, Melissa A Basford, Robert J Carroll, Moira Dillon, Heather Fullen, Jesse Leung, Hiral Master, Shimon Rura, Lina Sulieman, Nan Kennedy, Eric Banks, David Bernick, Asmita Gauchan, Lee Lichtenstein, Brandy M Mapes, Kayla Marginean, Steve L Nyemba, Andrea Ramirez, Charissa Rotundo, Keri Wolfe, Weiyi Xia, Romuladus E Azuine, Robert M Cronin, Joshua C Denny, Abel Kho, Christopher Lunt, Bradley Malin, Karthik Natarajan, Consuelo H Wilkins, Hua Xu, George Hripcsak, Dan M Roden, Anthony A Philippakis, David Glazer, Paul A Harris","doi":"10.1146/annurev-biodatasci-122120-104825","DOIUrl":"10.1146/annurev-biodatasci-122120-104825","url":null,"abstract":"<p><p>The <i>All of Us</i> Research Program's Data and Research Center (DRC) was established to help acquire, curate, and provide access to one of the world's largest and most diverse datasets for precision medicine research. Already, over 500,000 participants are enrolled in <i>All of Us</i>, 80% of whom are underrepresented in biomedical research, and data are being analyzed by a community of over 2,300 researchers. The DRC created this thriving data ecosystem by collaborating with engaged participants, innovative program partners, and empowered researchers. In this review, we first describe how the DRC is organized to meet the needs of this broad group of stakeholders. We then outline guiding principles, common challenges, and innovative approaches used to build the <i>All of Us</i> data ecosystem. Finally, we share lessons learned to help others navigate important decisions and trade-offs in building a modern biomedical data platform.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11157478/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10040579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Review of Biomedical Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1