首页 > 最新文献

Annual Review of Biomedical Data Science最新文献

英文 中文
Discovering Biological Conflict Systems Through Genome Analysis: Evolutionary Principles and Biochemical Novelty. 通过基因组分析发现生物冲突系统:进化原理和生化新颖性。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-24 DOI: 10.1146/annurev-biodatasci-122220-101119
L. Aravind, L. Iyer, A. M. Burroughs
Biological replicators, from genes within a genome to whole organisms, are locked in conflicts. Comparative genomics has revealed a staggering diversity of molecular armaments and mechanisms regulating their deployment, collectively termed biological conflict systems. These encompass toxins used in inter- and intraspecific interactions, self/nonself discrimination, antiviral immune mechanisms, and counter-host effectors deployed by viruses and intragenomic selfish elements. These systems possess shared syntactical features in their organizational logic and a set of effectors targeting genetic information flow through the Central Dogma, certain membranes, and key molecules like NAD+. These principles can be exploited to discover new conflict systems through sensitive computational analyses. This has led to significant advances in our understanding of the biology of these systems and furnished new biotechnological reagents for genome editing, sequencing, and beyond. We discuss these advances using specific examples of toxins, restriction-modification, apoptosis, CRISPR/second messenger-regulated systems, and other enigmatic nucleic acid-targeting systems. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
生物复制因子,从基因组内的基因到整个生物体,都处于冲突之中。比较基因组学揭示了分子武器的惊人多样性和调节其部署的机制,统称为生物冲突系统。这些包括在种间和种内相互作用中使用的毒素,自我/非自我歧视,抗病毒免疫机制,病毒和基因组内自私元件部署的反宿主效应物。这些系统在组织逻辑上具有共同的语法特征,并具有一组针对遗传信息流的效应器,这些遗传信息流通过中央教条、某些膜和关键分子如NAD+。这些原理可以通过敏感的计算分析来发现新的冲突系统。这使得我们对这些系统的生物学理解取得了重大进展,并为基因组编辑、测序等提供了新的生物技术试剂。我们用毒素、限制性修饰、细胞凋亡、CRISPR/第二信使调节系统和其他神秘的核酸靶向系统的具体例子来讨论这些进展。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Discovering Biological Conflict Systems Through Genome Analysis: Evolutionary Principles and Biochemical Novelty.","authors":"L. Aravind, L. Iyer, A. M. Burroughs","doi":"10.1146/annurev-biodatasci-122220-101119","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122220-101119","url":null,"abstract":"Biological replicators, from genes within a genome to whole organisms, are locked in conflicts. Comparative genomics has revealed a staggering diversity of molecular armaments and mechanisms regulating their deployment, collectively termed biological conflict systems. These encompass toxins used in inter- and intraspecific interactions, self/nonself discrimination, antiviral immune mechanisms, and counter-host effectors deployed by viruses and intragenomic selfish elements. These systems possess shared syntactical features in their organizational logic and a set of effectors targeting genetic information flow through the Central Dogma, certain membranes, and key molecules like NAD+. These principles can be exploited to discover new conflict systems through sensitive computational analyses. This has led to significant advances in our understanding of the biology of these systems and furnished new biotechnological reagents for genome editing, sequencing, and beyond. We discuss these advances using specific examples of toxins, restriction-modification, apoptosis, CRISPR/second messenger-regulated systems, and other enigmatic nucleic acid-targeting systems. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42599953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Developing and Implementing Predictive Models in a Learning Healthcare System: Traditional and Artificial Intelligence Approaches in the Veterans Health Administration. 在学习医疗保健系统中开发和实施预测模型:退伍军人健康管理中的传统和人工智能方法。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-24 DOI: 10.1146/annurev-biodatasci-122220-110053
D. Atkins, C. A. Makridis, G. Alterovitz, R. Ramoni, C. Clancy
Predicting clinical risk is an important part of healthcare and can inform decisions about treatments, preventive interventions, and provision of extra services. The field of predictive models has been revolutionized over the past two decades by electronic health record data; the ability to link such data with other demographic, socioeconomic, and geographic information; the availability of high-capacity computing; and new machine learning and artificial intelligence methods for extracting insights from complex datasets. These advances have produced a new generation of computerized predictive models, but debate continues about their development, reporting, validation, evaluation, and implementation. In this review we reflect on more than 10 years of experience at the Veterans Health Administration, the largest integrated healthcare system in the United States, in developing, testing, and implementing such models at scale. We report lessons from the implementation of national risk prediction models and suggest an agenda for research. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
预测临床风险是医疗保健的重要组成部分,可以为有关治疗、预防性干预和提供额外服务的决策提供信息。在过去的二十年里,电子健康记录数据使预测模型领域发生了革命性的变化;将此类数据与其他人口、社会经济和地理信息联系起来的能力;高容量计算的可用性;以及从复杂数据集中提取见解的新机器学习和人工智能方法。这些进步已经产生了新一代的计算机化预测模型,但是关于它们的开发、报告、验证、评估和实现的争论仍在继续。在这篇综述中,我们回顾了退伍军人健康管理局(美国最大的综合医疗保健系统)10多年来在大规模开发、测试和实施这些模型方面的经验。我们报告了实施国家风险预测模型的经验教训,并提出了研究议程。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Developing and Implementing Predictive Models in a Learning Healthcare System: Traditional and Artificial Intelligence Approaches in the Veterans Health Administration.","authors":"D. Atkins, C. A. Makridis, G. Alterovitz, R. Ramoni, C. Clancy","doi":"10.1146/annurev-biodatasci-122220-110053","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122220-110053","url":null,"abstract":"Predicting clinical risk is an important part of healthcare and can inform decisions about treatments, preventive interventions, and provision of extra services. The field of predictive models has been revolutionized over the past two decades by electronic health record data; the ability to link such data with other demographic, socioeconomic, and geographic information; the availability of high-capacity computing; and new machine learning and artificial intelligence methods for extracting insights from complex datasets. These advances have produced a new generation of computerized predictive models, but debate continues about their development, reporting, validation, evaluation, and implementation. In this review we reflect on more than 10 years of experience at the Veterans Health Administration, the largest integrated healthcare system in the United States, in developing, testing, and implementing such models at scale. We report lessons from the implementation of national risk prediction models and suggest an agenda for research. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47617284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Extracellular Vesicle-Based Multianalyte Liquid Biopsy as a Diagnostic for Cancer. 基于细胞外囊泡的多分析物液体活检诊断癌症。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-13 DOI: 10.1146/annurev-biodatasci-122120-113218
Andrew A. Lin, V. Nimgaonkar, D. Issadore, E. Carpenter
Liquid biopsy is the analysis of materials shed by tumors into circulation, such as circulating tumor cells, nucleic acids, and extracellular vesicles (EVs), for the diagnosis and management of cancer. These assays have rapidly evolved with recent FDA approvals of single biomarkers in patients with advanced metastatic disease. However, they have lacked sensitivity or specificity as a diagnostic in early-stage cancer, primarily due to low concentrations in circulating plasma. EVs, membrane-enclosed nanoscale vesicles shed by tumor and other cells into circulation, are a promising liquid biopsy analyte owing to their protein and nucleic acid cargoes carried from their mother cells, their surface proteins specific to their cells of origin, and their higher concentrations over other noninvasive biomarkers across disease stages. Recently, the combination of EVs with non-EV biomarkers has driven improvements in sensitivity and accuracy; this has been fueled by the use of machine learning (ML) to algorithmically identify and combine multiple biomarkers into a composite biomarker for clinical prediction. This review presents an analysis of EV isolation methods, surveys approaches for and issues with using ML in multianalyte EV datasets, and describes best practices for bringing multianalyte liquid biopsy to clinical implementation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
液体活检是分析肿瘤进入循环的物质,如循环肿瘤细胞、核酸和细胞外囊泡(EVs),用于癌症的诊断和管理。随着美国食品药品监督管理局最近批准晚期转移性疾病患者使用单一生物标志物,这些检测方法迅速发展。然而,它们作为早期癌症的诊断缺乏敏感性或特异性,主要是由于循环血浆中的低浓度。EVs是一种由肿瘤和其他细胞脱落到循环中的膜包裹的纳米级囊泡,是一种很有前途的液体活检分析物,因为它们的蛋白质和核酸货物是从其母细胞携带的,它们的表面蛋白是其来源细胞特异性的,并且在整个疾病阶段的浓度高于其他非侵入性生物标志物。最近,电动汽车与非电动汽车生物标志物的结合推动了灵敏度和准确性的提高;这是由使用机器学习(ML)以算法识别多种生物标志物并将其组合成用于临床预测的复合生物标志物所推动的。这篇综述分析了EV分离方法,调查了在多分析细胞EV数据集中使用ML的方法和问题,并描述了将多分析细胞液体活检应用于临床的最佳实践。《生物医学数据科学年度评论》第5卷预计最终在线出版日期为2022年8月。请参阅http://www.annualreviews.org/page/journal/pubdates用于修订估算。
{"title":"Extracellular Vesicle-Based Multianalyte Liquid Biopsy as a Diagnostic for Cancer.","authors":"Andrew A. Lin, V. Nimgaonkar, D. Issadore, E. Carpenter","doi":"10.1146/annurev-biodatasci-122120-113218","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-113218","url":null,"abstract":"Liquid biopsy is the analysis of materials shed by tumors into circulation, such as circulating tumor cells, nucleic acids, and extracellular vesicles (EVs), for the diagnosis and management of cancer. These assays have rapidly evolved with recent FDA approvals of single biomarkers in patients with advanced metastatic disease. However, they have lacked sensitivity or specificity as a diagnostic in early-stage cancer, primarily due to low concentrations in circulating plasma. EVs, membrane-enclosed nanoscale vesicles shed by tumor and other cells into circulation, are a promising liquid biopsy analyte owing to their protein and nucleic acid cargoes carried from their mother cells, their surface proteins specific to their cells of origin, and their higher concentrations over other noninvasive biomarkers across disease stages. Recently, the combination of EVs with non-EV biomarkers has driven improvements in sensitivity and accuracy; this has been fueled by the use of machine learning (ML) to algorithmically identify and combine multiple biomarkers into a composite biomarker for clinical prediction. This review presents an analysis of EV isolation methods, surveys approaches for and issues with using ML in multianalyte EV datasets, and describes best practices for bringing multianalyte liquid biopsy to clinical implementation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49480054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Exchange of Human Data Across International Boundaries. 跨越国际边界的人类数据交换。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-10 DOI: 10.1146/annurev-biodatasci-122220-110811
H. Bentzen
There is a need to share personal data across jurisdictional boundaries. However, the laws regulating such transfers are not harmonized, and sometimes even conflict, causing challenges and occasional data stalls. This review describes the legal landscape for transfer of human data across international boundaries. The European Union's data protection legislation is used as the starting point for illustrating the legislation of countries across the world, how these diverge, and one's options for exchanging human data internationally in a legally compliant manner. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
有必要跨司法管辖区共享个人数据。然而,规范此类传输的法律并不统一,有时甚至会发生冲突,造成挑战和偶尔的数据停滞。这篇综述描述了跨越国际边界转移人类数据的法律环境。欧盟的数据保护立法被用作说明世界各国立法的起点,这些立法是如何分歧的,以及以合法的方式在国际上交换人类数据的选择。《生物医学数据科学年度评论》第5卷预计最终在线出版日期为2022年8月。请参阅http://www.annualreviews.org/page/journal/pubdates用于修订估算。
{"title":"Exchange of Human Data Across International Boundaries.","authors":"H. Bentzen","doi":"10.1146/annurev-biodatasci-122220-110811","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122220-110811","url":null,"abstract":"There is a need to share personal data across jurisdictional boundaries. However, the laws regulating such transfers are not harmonized, and sometimes even conflict, causing challenges and occasional data stalls. This review describes the legal landscape for transfer of human data across international boundaries. The European Union's data protection legislation is used as the starting point for illustrating the legislation of countries across the world, how these diverge, and one's options for exchanging human data internationally in a legally compliant manner. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45740463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Computational Approaches for Understanding Sequence Variation Effects on the 3D Genome Architecture. 理解序列变异对三维基因组结构影响的计算方法。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-10 DOI: 10.1146/annurev-biodatasci-102521-012018
P. Avdeyev, Jian Zhou
Decoding how genomic sequence and its variations affect 3D genome architecture is indispensable for understanding the genetic architecture of various traits and diseases. The 3D genome organization can be significantly altered by genome variations and in turn impact the function of the genomic sequence. Techniques for measuring the 3D genome architecture across spatial scales have opened up new possibilities for understanding how the 3D genome depends upon the genomic sequence and how it can be altered by sequence variations. Computational methods have become instrumental in analyzing and modeling the sequence effects on 3D genome architecture, and recent development in deep learning sequence models have opened up new opportunities for studying the interplay between sequence variations and the 3D genome. In this review, we focus on computational approaches for both the detection and modeling of sequence variation effects on the 3D genome, and we discuss the opportunities presented by these approaches. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
解码基因组序列及其变异如何影响三维基因组结构对于理解各种性状和疾病的遗传结构是必不可少的。三维基因组组织可以通过基因组变异显著改变,进而影响基因组序列的功能。跨空间尺度测量三维基因组结构的技术为理解三维基因组如何依赖于基因组序列以及序列变化如何改变基因组结构开辟了新的可能性。计算方法已经成为分析和建模序列对三维基因组结构影响的工具,而深度学习序列模型的最新发展为研究序列变异与三维基因组之间的相互作用开辟了新的机会。在这篇综述中,我们重点介绍了序列变异对三维基因组影响的检测和建模的计算方法,并讨论了这些方法所带来的机会。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Computational Approaches for Understanding Sequence Variation Effects on the 3D Genome Architecture.","authors":"P. Avdeyev, Jian Zhou","doi":"10.1146/annurev-biodatasci-102521-012018","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-102521-012018","url":null,"abstract":"Decoding how genomic sequence and its variations affect 3D genome architecture is indispensable for understanding the genetic architecture of various traits and diseases. The 3D genome organization can be significantly altered by genome variations and in turn impact the function of the genomic sequence. Techniques for measuring the 3D genome architecture across spatial scales have opened up new possibilities for understanding how the 3D genome depends upon the genomic sequence and how it can be altered by sequence variations. Computational methods have become instrumental in analyzing and modeling the sequence effects on 3D genome architecture, and recent development in deep learning sequence models have opened up new opportunities for studying the interplay between sequence variations and the 3D genome. In this review, we focus on computational approaches for both the detection and modeling of sequence variation effects on the 3D genome, and we discuss the opportunities presented by these approaches. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48686416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bioinformatics of Corals: Investigating Heterogeneous Omics Data from Coral Holobionts for Insight into Reef Health and Resilience. 珊瑚礁生物信息学:研究珊瑚礁中的异构奥密克戎数据,深入了解珊瑚礁的健康和复原力。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-10 DOI: 10.1146/annurev-biodatasci-122120-030732
L. Cowen, H. Putnam
Coral reefs are home to over two million species and provide habitat for roughly 25% of all marine animals, but they are being severely threatened by pollution and climate change. A large amount of genomic, transcriptomic, and other omics data is becoming increasingly available from different species of reef-building corals, the unicellular dinoflagellates, and the coral microbiome (bacteria, archaea, viruses, fungi, etc.). Such new data present an opportunity for bioinformatics researchers and computational biologists to contribute to a timely, compelling, and urgent investigation of critical factors that influence reef health and resilience. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
珊瑚礁是200多万种物种的家园,为大约25%的海洋动物提供栖息地,但它们正受到污染和气候变化的严重威胁。从不同种类的造礁珊瑚、单细胞鞭毛藻和珊瑚微生物组(细菌、古细菌、病毒、真菌等)中获得的大量基因组学、转录组学和其他组学数据越来越多。这些新数据为生物信息学研究人员和计算生物学家提供了一个机会,可以对影响珊瑚礁健康和恢复力的关键因素进行及时、有说服力和紧急的调查。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Bioinformatics of Corals: Investigating Heterogeneous Omics Data from Coral Holobionts for Insight into Reef Health and Resilience.","authors":"L. Cowen, H. Putnam","doi":"10.1146/annurev-biodatasci-122120-030732","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-030732","url":null,"abstract":"Coral reefs are home to over two million species and provide habitat for roughly 25% of all marine animals, but they are being severely threatened by pollution and climate change. A large amount of genomic, transcriptomic, and other omics data is becoming increasingly available from different species of reef-building corals, the unicellular dinoflagellates, and the coral microbiome (bacteria, archaea, viruses, fungi, etc.). Such new data present an opportunity for bioinformatics researchers and computational biologists to contribute to a timely, compelling, and urgent investigation of critical factors that influence reef health and resilience. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43372901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Integration of Protein Structure and Population-Scale DNA Sequence Data for Disease Gene Discovery and Variant Interpretation. 整合蛋白质结构和群体规模的DNA序列数据,用于疾病基因发现和变异解释。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-04 DOI: 10.1146/annurev-biodatasci-122220-112147
Bian Li, Bowen Jin, J. Capra, W. Bush
The experimental and computational techniques for capturing information about protein structures and genetic variation within the human genome have advanced dramatically in the past 20 years, generating extensive new data resources. In this review, we discuss these advances, along with new approaches for determining the impact a genetic variant has on protein function. We focus on the potential of new methods that integrate human genetic variation into protein structures to discover relationships to disease, including the discovery of mutational hotspots in cancer-related proteins, the localization of protein-altering variants within protein regions for common complex diseases, and the assessment of variants of unknown significance for Mendelian traits. We expect that approaches that integrate these data sources will play increasingly important roles in disease gene discovery and variant interpretation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
在过去的20年里,用于获取人类基因组中蛋白质结构和遗传变异信息的实验和计算技术取得了巨大进步,产生了大量新的数据资源。在这篇综述中,我们讨论了这些进展,以及确定遗传变异对蛋白质功能影响的新方法。我们专注于将人类遗传变异整合到蛋白质结构中以发现疾病关系的新方法的潜力,包括发现癌症相关蛋白质的突变热点,在常见复杂疾病的蛋白质区域内定位蛋白质改变变异,以及评估孟德尔性状的未知意义变异。我们期望整合这些数据源的方法将在疾病基因发现和变异解释中发挥越来越重要的作用。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Integration of Protein Structure and Population-Scale DNA Sequence Data for Disease Gene Discovery and Variant Interpretation.","authors":"Bian Li, Bowen Jin, J. Capra, W. Bush","doi":"10.1146/annurev-biodatasci-122220-112147","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122220-112147","url":null,"abstract":"The experimental and computational techniques for capturing information about protein structures and genetic variation within the human genome have advanced dramatically in the past 20 years, generating extensive new data resources. In this review, we discuss these advances, along with new approaches for determining the impact a genetic variant has on protein function. We focus on the potential of new methods that integrate human genetic variation into protein structures to discover relationships to disease, including the discovery of mutational hotspots in cancer-related proteins, the localization of protein-altering variants within protein regions for common complex diseases, and the assessment of variants of unknown significance for Mendelian traits. We expect that approaches that integrate these data sources will play increasingly important roles in disease gene discovery and variant interpretation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46663334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional Characterization of Genetic Variant Effects on Expression. 基因变异对表达影响的功能表征。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-04-28 DOI: 10.1146/annurev-biodatasci-122120-010010
Elise D. Flynn, T. Lappalainen
Thousands of common genetic variants in the human population have been associated with disease risk and phenotypic variation by genome-wide association studies (GWAS). However, the majority of GWAS variants fall into noncoding regions of the genome, complicating our understanding of their regulatory functions, and few molecular mechanisms of GWAS variant effects have been clearly elucidated. Here, we set out to review genetic variant effects, focusing on expression quantitative trait loci (eQTLs), including their utility in interpreting GWAS variant mechanisms. We discuss the interrelated challenges and opportunities for eQTL analysis, covering determining causal variants, elucidating molecular mechanisms of action, and understanding context variability. Addressing these questions can enable better functional characterization of disease-associated loci and provide insights into fundamental biological questions of the noncoding genetic regulatory code and its control of gene expression. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
通过全基因组关联研究(GWAS),人类中数千种常见的遗传变异与疾病风险和表型变异有关。然而,大多数GWAS变体属于基因组的非编码区,这使我们对其调控功能的理解变得复杂,而且很少有GWAS变体效应的分子机制得到明确阐明。在这里,我们开始综述遗传变异效应,重点关注表达数量性状基因座(eQTL),包括它们在解释GWAS变异机制中的作用。我们讨论了eQTL分析的相关挑战和机遇,包括确定因果变异、阐明分子作用机制和理解上下文变异。解决这些问题可以更好地表征疾病相关基因座的功能,并深入了解非编码遗传调控密码及其对基因表达的控制的基本生物学问题。《生物医学数据科学年度评论》第5卷预计最终在线出版日期为2022年8月。请参阅http://www.annualreviews.org/page/journal/pubdates用于修订估算。
{"title":"Functional Characterization of Genetic Variant Effects on Expression.","authors":"Elise D. Flynn, T. Lappalainen","doi":"10.1146/annurev-biodatasci-122120-010010","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-010010","url":null,"abstract":"Thousands of common genetic variants in the human population have been associated with disease risk and phenotypic variation by genome-wide association studies (GWAS). However, the majority of GWAS variants fall into noncoding regions of the genome, complicating our understanding of their regulatory functions, and few molecular mechanisms of GWAS variant effects have been clearly elucidated. Here, we set out to review genetic variant effects, focusing on expression quantitative trait loci (eQTLs), including their utility in interpreting GWAS variant mechanisms. We discuss the interrelated challenges and opportunities for eQTL analysis, covering determining causal variants, elucidating molecular mechanisms of action, and understanding context variability. Addressing these questions can enable better functional characterization of disease-associated loci and provide insights into fundamental biological questions of the noncoding genetic regulatory code and its control of gene expression. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47338850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Open Structural Data in Precision Medicine. 精准医学中的开放结构数据。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-04-28 DOI: 10.1146/annurev-biodatasci-122220-012951
R. Nussinov, Hyunbum Jang, G. Nir, Chung-Jung Tsai, F. Cheng
Three-dimensional protein structural data at the molecular level are pivotal for successful precision medicine. Such data are crucial not only for discovering drugs that act to block the active site of the target mutant protein but also for clarifying to the patient and the clinician how the mutations harbored by the patient work. The relative paucity of structural data reflects their cost, challenges in their interpretation, and lack of clinical guidelines for their utilization. Rapid technological advancements in experimental high-resolution structural determination increasingly generate structures. Computationally, modeling algorithms, including molecular dynamics simulations, are becoming more powerful, as are compute-intensive hardware, particularly graphics processing units, overlapping with the inception of the exascale era. Accessible, freely available, and detailed structural and dynamical data can be merged with big data to powerfully transform personalized pharmacology. Here we review protein and emerging genome high-resolution data, along with means, applications, and examples underscoring their usefulness in precision medicine. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
在分子水平上的三维蛋白质结构数据是成功的精准医疗的关键。这些数据不仅对于发现阻断目标突变蛋白活性位点的药物至关重要,而且对于向患者和临床医生阐明患者所携带的突变如何起作用至关重要。结构数据的相对缺乏反映了它们的成本、解释上的挑战以及缺乏临床应用指南。实验高分辨率结构测定技术的快速发展越来越多地产生结构。在计算方面,包括分子动力学模拟在内的建模算法正变得越来越强大,计算密集型硬件,特别是图形处理单元,也随着百亿亿次时代的开始而变得越来越强大。可访问的、免费的、详细的结构和动态数据可以与大数据相结合,有力地改变个性化药理学。在这里,我们回顾了蛋白质和新兴的基因组高分辨率数据,以及方法、应用和例子,强调了它们在精准医学中的有用性。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Open Structural Data in Precision Medicine.","authors":"R. Nussinov, Hyunbum Jang, G. Nir, Chung-Jung Tsai, F. Cheng","doi":"10.1146/annurev-biodatasci-122220-012951","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122220-012951","url":null,"abstract":"Three-dimensional protein structural data at the molecular level are pivotal for successful precision medicine. Such data are crucial not only for discovering drugs that act to block the active site of the target mutant protein but also for clarifying to the patient and the clinician how the mutations harbored by the patient work. The relative paucity of structural data reflects their cost, challenges in their interpretation, and lack of clinical guidelines for their utilization. Rapid technological advancements in experimental high-resolution structural determination increasingly generate structures. Computationally, modeling algorithms, including molecular dynamics simulations, are becoming more powerful, as are compute-intensive hardware, particularly graphics processing units, overlapping with the inception of the exascale era. Accessible, freely available, and detailed structural and dynamical data can be merged with big data to powerfully transform personalized pharmacology. Here we review protein and emerging genome high-resolution data, along with means, applications, and examples underscoring their usefulness in precision medicine. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45448847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Machine Learning in Chemoinformatics and Medicinal Chemistry. 化学信息学和药物化学中的机器学习。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-04-19 DOI: 10.1146/annurev-biodatasci-122120-124216
Raquel Rodríguez-Pérez, Filip Miljković, J. Bajorath
In chemoinformatics and medicinal chemistry, machine learning has evolved into an important approach. In recent years, increasing computational resources and new deep learning algorithms have put machine learning onto a new level, addressing previously unmet challenges in pharmaceutical research. In silico approaches for compound activity predictions, de novo design, and reaction modeling have been further advanced by new algorithmic developments and the emergence of big data in the field. Herein, novel applications of machine learning and deep learning in chemoinformatics and medicinal chemistry are reviewed. Opportunities and challenges for new methods and applications are discussed, placing emphasis on proper baseline comparisons, robust validation methodologies, and new applicability domains. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
在化学信息学和药物化学中,机器学习已经发展成为一种重要的方法。近年来,越来越多的计算资源和新的深度学习算法将机器学习提升到一个新的水平,解决了制药研究中以前未遇到的挑战。随着新算法的发展和大数据的出现,化合物活性预测、从头设计和反应建模的计算机方法得到了进一步的发展。本文综述了机器学习和深度学习在化学信息学和药物化学中的新应用。讨论了新方法和应用程序的机遇和挑战,重点放在适当的基线比较,稳健的验证方法和新的适用领域。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Machine Learning in Chemoinformatics and Medicinal Chemistry.","authors":"Raquel Rodríguez-Pérez, Filip Miljković, J. Bajorath","doi":"10.1146/annurev-biodatasci-122120-124216","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-124216","url":null,"abstract":"In chemoinformatics and medicinal chemistry, machine learning has evolved into an important approach. In recent years, increasing computational resources and new deep learning algorithms have put machine learning onto a new level, addressing previously unmet challenges in pharmaceutical research. In silico approaches for compound activity predictions, de novo design, and reaction modeling have been further advanced by new algorithmic developments and the emergence of big data in the field. Herein, novel applications of machine learning and deep learning in chemoinformatics and medicinal chemistry are reviewed. Opportunities and challenges for new methods and applications are discussed, placing emphasis on proper baseline comparisons, robust validation methodologies, and new applicability domains. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48421704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Annual Review of Biomedical Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1