首页 > 最新文献

Annual Review of Biomedical Data Science最新文献

英文 中文
Static and Motion Facial Analysis for Craniofacial Assessment and Diagnosing Diseases. 静态和运动面部分析用于颅面评估和疾病诊断。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-04-19 DOI: 10.1146/annurev-biodatasci-122120-111413
H. Matthews, G. de Jong, T. Maal, P. Claes
Deviation from a normal facial shape and symmetry can arise from numerous sources, including physical injury and congenital birth defects. Such abnormalities can have important aesthetic and functional consequences. Furthermore, in clinical genetics distinctive facial appearances are often associated with clinical or genetic diagnoses; the recognition of a characteristic facial appearance can substantially narrow the search space of potential diagnoses for the clinician. Unusual patterns of facial movement and expression can indicate disturbances to normal mechanical functioning or emotional affect. Computational analyses of static and moving 2D and 3D images can serve clinicians and researchers by detecting and describing facial structural, mechanical, and affective abnormalities objectively. In this review we survey traditional and emerging methods of facial analysis, including statistical shape modeling, syndrome classification, modeling clinical face phenotype spaces, and analysis of facial motion and affect. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
与正常面部形状和对称性的偏差可能来自多种原因,包括身体损伤和先天性出生缺陷。这种异常可能会产生重要的美学和功能后果。此外,在临床遗传学中,独特的面部外观通常与临床或遗传诊断有关;特征面部外观的识别可以显著地缩小临床医生潜在诊断的搜索空间。面部运动和表情的异常模式可能表明正常的机械功能或情绪受到干扰。静态和运动2D和3D图像的计算分析可以通过客观地检测和描述面部结构、机械和情感异常来为临床医生和研究人员提供服务。在这篇综述中,我们综述了传统和新兴的面部分析方法,包括统计形状建模、综合征分类、临床面部表型空间建模以及面部运动和情感分析。《生物医学数据科学年度评论》第5卷预计最终在线出版日期为2022年8月。请参阅http://www.annualreviews.org/page/journal/pubdates用于修订估算。
{"title":"Static and Motion Facial Analysis for Craniofacial Assessment and Diagnosing Diseases.","authors":"H. Matthews, G. de Jong, T. Maal, P. Claes","doi":"10.1146/annurev-biodatasci-122120-111413","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-122120-111413","url":null,"abstract":"Deviation from a normal facial shape and symmetry can arise from numerous sources, including physical injury and congenital birth defects. Such abnormalities can have important aesthetic and functional consequences. Furthermore, in clinical genetics distinctive facial appearances are often associated with clinical or genetic diagnoses; the recognition of a characteristic facial appearance can substantially narrow the search space of potential diagnoses for the clinician. Unusual patterns of facial movement and expression can indicate disturbances to normal mechanical functioning or emotional affect. Computational analyses of static and moving 2D and 3D images can serve clinicians and researchers by detecting and describing facial structural, mechanical, and affective abnormalities objectively. In this review we survey traditional and emerging methods of facial analysis, including statistical shape modeling, syndrome classification, modeling clinical face phenotype spaces, and analysis of facial motion and affect. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"1 1","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41479900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The Cell Physiome: What Do We Need in a Computational Physiology Framework for Predicting Single-Cell Biology? 细胞重组:我们在预测单细胞生物学的计算生理学框架中需要什么?
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-02-27 DOI: 10.1146/annurev-biodatasci-072018-021246
V. Rajagopal, S. Arumugam, Peter J. Hunter, A. Khadangi, Joshua Chung, Michael Pan
Modern biology and biomedicine are undergoing a big data explosion, needing advanced computational algorithms to extract mechanistic insights on the physiological state of living cells. We present the motivation for the Cell Physiome project: a framework and approach for creating, sharing, and using biophysics-based computational models of single-cell physiology. Using examples in calcium signaling, bioenergetics, and endosomal trafficking, we highlight the need for spatially detailed, biophysics-based computational models to uncover new mechanisms underlying cell biology. We review progress and challenges to date toward creating cell physiome models. We then introduce bond graphs as an efficient way to create cell physiome models that integrate chemical, mechanical, electromagnetic, and thermal processes while maintaining mass and energy balance. Bond graphs enhance modularization and reusability of computational models of cells at scale. We conclude with a look forward at steps that will help fully realize this exciting new field of mechanistic biomedical data science. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
现代生物学和生物医学正在经历大数据爆炸,需要先进的计算算法来提取活细胞生理状态的机制见解。我们提出了细胞生理组项目的动机:创建、共享和使用基于生物物理学的单细胞生理学计算模型的框架和方法。以钙信号传导、生物能量学和内体运输为例,我们强调需要空间详细的、基于生物物理学的计算模型来揭示细胞生物学的新机制。我们回顾了迄今为止在创建细胞生理组模型方面的进展和挑战。然后,我们引入键合图作为创建细胞生理组模型的有效方法,该模型集成了化学、机械、电磁和热过程,同时保持质量和能量平衡。键合图增强了细胞计算模型的模块化和可重用性。最后,我们展望了将有助于充分实现这一令人兴奋的机械生物医学数据科学新领域的步骤。预计《生物医学数据科学年度评论》第5卷的最终在线出版日期为2022年8月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"The Cell Physiome: What Do We Need in a Computational Physiology Framework for Predicting Single-Cell Biology?","authors":"V. Rajagopal, S. Arumugam, Peter J. Hunter, A. Khadangi, Joshua Chung, Michael Pan","doi":"10.1146/annurev-biodatasci-072018-021246","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-072018-021246","url":null,"abstract":"Modern biology and biomedicine are undergoing a big data explosion, needing advanced computational algorithms to extract mechanistic insights on the physiological state of living cells. We present the motivation for the Cell Physiome project: a framework and approach for creating, sharing, and using biophysics-based computational models of single-cell physiology. Using examples in calcium signaling, bioenergetics, and endosomal trafficking, we highlight the need for spatially detailed, biophysics-based computational models to uncover new mechanisms underlying cell biology. We review progress and challenges to date toward creating cell physiome models. We then introduce bond graphs as an efficient way to create cell physiome models that integrate chemical, mechanical, electromagnetic, and thermal processes while maintaining mass and energy balance. Bond graphs enhance modularization and reusability of computational models of cells at scale. We conclude with a look forward at steps that will help fully realize this exciting new field of mechanistic biomedical data science. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44647308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Best Practices on Big Data Analytics to Address Sex-Specific Biases in our Understanding of the Etiology, Diagnosis and Prognosis of Diseases 大数据分析的最佳实践,以解决我们对疾病病因、诊断和预后的理解中的性别特异性偏差
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-02-06 DOI: 10.1101/2022.01.31.22270183
S. Golder, K. O’Connor, Yunwen Wang, R. Stevens, G. Gonzalez-Hernandez
A bias in health research to favor understanding of diseases as they present in men can have a grave impact on the health of women. This paper reports on a conceptual review of the literature that used machine learning or NLP techniques to interrogate big data for identifying sex-specific health disparities. We searched Ovid MEDLINE, Embase, and PsycINFO in October 2021 using synonyms and indexing terms for (1) "women" or "men" or "sex," (2) "big data" or "artificial intelligence" or "NLP", and (3) "disparities" or "differences." From 902 records, 22 studies met the inclusion criteria and were analyzed. Results demonstrate that the inclusion by sex is inconsistent and often unreported, although the inclusion of men in the included studies is disproportionately less than women. Even though AI and NLP techniques are widely applied in health research, few studies use them to take advatage of unstructured text to investigate sex-related differences or disparities. Researchers are increasingly aware of sex-based data bias, but the process to- wards correction is slow. We reflected on what would be the best practices on using big data analytics to address sex-specific biases in understanding the etiology, diagnosis, and prognosis of diseases.
健康研究中倾向于理解男性疾病的偏见可能会对女性健康产生严重影响。本文报告了对使用机器学习或NLP技术询问大数据以识别性别特定健康差异的文献的概念性综述。2021年10月,我们使用同义词和索引词搜索了Ovid MEDLINE、Embase和PsycINFO,分别为(1)“女性”或“男性”或“性别”,(2)“大数据”或“人工智能”或“NLP”,以及(3)“差异”或“差异”。从902份记录中,有22项研究符合纳入标准并进行了分析。结果表明,按性别划分的纳入情况是不一致的,而且往往没有报告,尽管纳入研究的男性比例远远低于女性。尽管人工智能和NLP技术在健康研究中得到了广泛应用,但很少有研究使用它们来支持非结构化文本来调查与性别相关的差异或差异。研究人员越来越意识到基于性别的数据偏见,但纠正过程很慢。我们思考了使用大数据分析来解决在理解疾病病因、诊断和预后方面存在的性别偏见的最佳做法。
{"title":"Best Practices on Big Data Analytics to Address Sex-Specific Biases in our Understanding of the Etiology, Diagnosis and Prognosis of Diseases","authors":"S. Golder, K. O’Connor, Yunwen Wang, R. Stevens, G. Gonzalez-Hernandez","doi":"10.1101/2022.01.31.22270183","DOIUrl":"https://doi.org/10.1101/2022.01.31.22270183","url":null,"abstract":"A bias in health research to favor understanding of diseases as they present in men can have a grave impact on the health of women. This paper reports on a conceptual review of the literature that used machine learning or NLP techniques to interrogate big data for identifying sex-specific health disparities. We searched Ovid MEDLINE, Embase, and PsycINFO in October 2021 using synonyms and indexing terms for (1) \"women\" or \"men\" or \"sex,\" (2) \"big data\" or \"artificial intelligence\" or \"NLP\", and (3) \"disparities\" or \"differences.\" From 902 records, 22 studies met the inclusion criteria and were analyzed. Results demonstrate that the inclusion by sex is inconsistent and often unreported, although the inclusion of men in the included studies is disproportionately less than women. Even though AI and NLP techniques are widely applied in health research, few studies use them to take advatage of unstructured text to investigate sex-related differences or disparities. Researchers are increasingly aware of sex-based data bias, but the process to- wards correction is slow. We reflected on what would be the best practices on using big data analytics to address sex-specific biases in understanding the etiology, diagnosis, and prognosis of diseases.","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":""},"PeriodicalIF":6.0,"publicationDate":"2022-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44284431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Single-Cell Analysis for Whole-Organism Datasets. 全生物数据集的单细胞分析。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-11 DOI: 10.1146/annurev-biodatasci-092820-031008
Angela Oliveira Pisco, Bruno Tojo, Aaron McGeever

Cell atlases are essential companions to the genome as they elucidate how genes are used in a cell type-specific manner or how the usage of genes changes over the lifetime of an organism. This review explores recent advances in whole-organism single-cell atlases, which enable understanding of cell heterogeneity and tissue and cell fate, both in health and disease. Here we provide an overview of recent efforts to build cell atlases across species and discuss the challenges that the field is currently facing. Moreover, we propose the concept of having a knowledgebase that can scale with the number of experiments and computational approaches and a new feedback loop for development and benchmarking of computational methods that includes contributions from the users. These two aspects are key for community efforts in single-cell biology that will help produce a comprehensive annotated map of cell types and states with unparalleled resolution.

细胞图谱是基因组的重要伙伴,因为它们阐明了基因如何以特定细胞类型的方式使用,或者基因的使用如何在生物体的一生中发生变化。这篇综述探讨了生物体单细胞图谱的最新进展,使我们能够理解健康和疾病中的细胞异质性、组织和细胞命运。在这里,我们概述了最近建立跨物种细胞图谱的努力,并讨论了该领域目前面临的挑战。此外,我们提出了一个概念,即拥有一个可以随着实验和计算方法的数量而扩展的知识库,以及一个新的反馈回路,用于包括用户贡献的计算方法的开发和基准测试。这两个方面是单细胞生物学社区努力的关键,这将有助于以无与伦比的分辨率产生细胞类型和状态的综合注释图。
{"title":"Single-Cell Analysis for Whole-Organism Datasets.","authors":"Angela Oliveira Pisco,&nbsp;Bruno Tojo,&nbsp;Aaron McGeever","doi":"10.1146/annurev-biodatasci-092820-031008","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-092820-031008","url":null,"abstract":"<p><p>Cell atlases are essential companions to the genome as they elucidate how genes are used in a cell type-specific manner or how the usage of genes changes over the lifetime of an organism. This review explores recent advances in whole-organism single-cell atlases, which enable understanding of cell heterogeneity and tissue and cell fate, both in health and disease. Here we provide an overview of recent efforts to build cell atlases across species and discuss the challenges that the field is currently facing. Moreover, we propose the concept of having a knowledgebase that can scale with the number of experiments and computational approaches and a new feedback loop for development and benchmarking of computational methods that includes contributions from the users. These two aspects are key for community efforts in single-cell biology that will help produce a comprehensive annotated map of cell types and states with unparalleled resolution.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"207-226"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39370511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
The 3D Genome Structure of Single Cells. 单细胞的三维基因组结构。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-04-23 DOI: 10.1146/annurev-biodatasci-020121-084709
Tianming Zhou, Ruochi Zhang, Jian Ma

The spatial organization of the genome in the cell nucleus is pivotal to cell function. However, how the 3D genome organization and its dynamics influence cellular phenotypes remains poorly understood. The very recent development of single-cell technologies for probing the 3D genome, especially single-cell Hi-C (scHi-C), has ushered in a new era of unveiling cell-to-cell variability of 3D genome features at an unprecedented resolution. Here, we review recent developments in computational approaches to the analysis of scHi-C, including data processing, dimensionality reduction, imputation for enhancing data quality, and the revealing of 3D genome features at single-cell resolution. While much progress has been made in computational method development to analyze single-cell 3D genomes, substantial future work is needed to improve data interpretation and multimodal data integration, which are critical to reveal fundamental connections between genome structure and function among heterogeneous cell populations in various biological contexts.

基因组在细胞核中的空间组织对细胞功能至关重要。然而,三维基因组组织及其动力学如何影响细胞表型仍然知之甚少。用于探测3D基因组的单细胞技术的最新发展,特别是单细胞Hi-C (scHi-C),以前所未有的分辨率开启了揭示3D基因组特征的细胞间变异性的新时代。在这里,我们回顾了scHi-C分析的计算方法的最新进展,包括数据处理、降维、提高数据质量的imputation以及单细胞分辨率下3D基因组特征的揭示。虽然在分析单细胞三维基因组的计算方法开发方面取得了很大进展,但需要大量的未来工作来改进数据解释和多模态数据集成,这对于揭示不同生物学背景下异质细胞群体中基因组结构和功能之间的基本联系至关重要。
{"title":"The 3D Genome Structure of Single Cells.","authors":"Tianming Zhou,&nbsp;Ruochi Zhang,&nbsp;Jian Ma","doi":"10.1146/annurev-biodatasci-020121-084709","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020121-084709","url":null,"abstract":"<p><p>The spatial organization of the genome in the cell nucleus is pivotal to cell function. However, how the 3D genome organization and its dynamics influence cellular phenotypes remains poorly understood. The very recent development of single-cell technologies for probing the 3D genome, especially single-cell Hi-C (scHi-C), has ushered in a new era of unveiling cell-to-cell variability of 3D genome features at an unprecedented resolution. Here, we review recent developments in computational approaches to the analysis of scHi-C, including data processing, dimensionality reduction, imputation for enhancing data quality, and the revealing of 3D genome features at single-cell resolution. While much progress has been made in computational method development to analyze single-cell 3D genomes, substantial future work is needed to improve data interpretation and multimodal data integration, which are critical to reveal fundamental connections between genome structure and function among heterogeneous cell populations in various biological contexts.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"21-41"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Integration of Multimodal Data for Deciphering Brain Disorders. 多模态数据集成用于脑部疾病的破译。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-04-23 DOI: 10.1146/annurev-biodatasci-092820-020354
Jingqi Chen, Guiying Dong, Liting Song, Xingzhong Zhao, Jixin Cao, Xiaohui Luo, Jianfeng Feng, Xing-Ming Zhao

The accumulation of vast amounts of multimodal data for the human brain, in both normal and disease conditions, has provided unprecedented opportunities for understanding why and how brain disorders arise. Compared with traditional analyses of single datasets, the integration of multimodal datasets covering different types of data (i.e., genomics, transcriptomics, imaging, etc.) has shed light on the mechanisms underlying brain disorders in greater detail across both the microscopic and macroscopic levels. In this review, we first briefly introduce the popular large datasets for the brain. Then, we discuss in detail how integration of multimodal human brain datasets can reveal the genetic predispositions and the abnormal molecular pathways of brain disorders. Finally, we present an outlook on how future data integration efforts may advance the diagnosis and treatment of brain disorders.

人类大脑在正常和疾病条件下的大量多模态数据的积累,为理解大脑疾病产生的原因和方式提供了前所未有的机会。与传统的单一数据集分析相比,涵盖不同类型数据(即基因组学、转录组学、成像等)的多模态数据集的整合,在微观和宏观层面上更详细地揭示了大脑疾病的机制。在这篇综述中,我们首先简要介绍了流行的大脑大数据集。然后,我们详细讨论了多模态人脑数据集的整合如何揭示大脑疾病的遗传易感性和异常分子途径。最后,我们展望了未来数据整合工作将如何促进脑部疾病的诊断和治疗。
{"title":"Integration of Multimodal Data for Deciphering Brain Disorders.","authors":"Jingqi Chen,&nbsp;Guiying Dong,&nbsp;Liting Song,&nbsp;Xingzhong Zhao,&nbsp;Jixin Cao,&nbsp;Xiaohui Luo,&nbsp;Jianfeng Feng,&nbsp;Xing-Ming Zhao","doi":"10.1146/annurev-biodatasci-092820-020354","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-092820-020354","url":null,"abstract":"<p><p>The accumulation of vast amounts of multimodal data for the human brain, in both normal and disease conditions, has provided unprecedented opportunities for understanding why and how brain disorders arise. Compared with traditional analyses of single datasets, the integration of multimodal datasets covering different types of data (i.e., genomics, transcriptomics, imaging, etc.) has shed light on the mechanisms underlying brain disorders in greater detail across both the microscopic and macroscopic levels. In this review, we first briefly introduce the popular large datasets for the brain. Then, we discuss in detail how integration of multimodal human brain datasets can reveal the genetic predispositions and the abnormal molecular pathways of brain disorders. Finally, we present an outlook on how future data integration efforts may advance the diagnosis and treatment of brain disorders.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"43-56"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39370514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Modern Clinical Text Mining: A Guide and Review. 现代临床文本挖掘:指南与综述。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-26 DOI: 10.1146/annurev-biodatasci-030421-030931
Bethany Percha

Electronic health records (EHRs) are becoming a vital source of data for healthcare quality improvement, research, and operations. However, much of the most valuable information contained in EHRs remains buried in unstructured text. The field of clinical text mining has advanced rapidly in recent years, transitioning from rule-based approaches to machine learning and, more recently, deep learning. With new methods come new challenges, however, especially for those new to the field. This review provides an overview of clinical text mining for those who are encountering it for the first time (e.g., physician researchers, operational analytics teams, machine learning scientists from other domains). While not a comprehensive survey, this review describes the state of the art, with a particular focus on new tasks and methods developed over the past few years. It also identifies key barriers between these remarkable technical advances and the practical realities of implementation in health systems and in industry.

电子健康记录(EHRs)正在成为医疗保健质量改进、研究和运营的重要数据来源。然而,电子病历中包含的许多最有价值的信息仍然隐藏在非结构化的文本中。近年来,临床文本挖掘领域发展迅速,从基于规则的方法过渡到机器学习,以及最近的深度学习。然而,新方法带来了新的挑战,特别是对那些刚进入该领域的人来说。这篇综述为那些第一次遇到临床文本挖掘的人(例如,医生研究人员,操作分析团队,来自其他领域的机器学习科学家)提供了临床文本挖掘的概述。虽然不是一个全面的调查,但这篇综述描述了最新的技术状况,特别关注了过去几年开发的新任务和方法。它还确定了这些显著的技术进步与卫生系统和工业实施的实际现实之间的主要障碍。
{"title":"Modern Clinical Text Mining: A Guide and Review.","authors":"Bethany Percha","doi":"10.1146/annurev-biodatasci-030421-030931","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-030421-030931","url":null,"abstract":"<p><p>Electronic health records (EHRs) are becoming a vital source of data for healthcare quality improvement, research, and operations. However, much of the most valuable information contained in EHRs remains buried in unstructured text. The field of clinical text mining has advanced rapidly in recent years, transitioning from rule-based approaches to machine learning and, more recently, deep learning. With new methods come new challenges, however, especially for those new to the field. This review provides an overview of clinical text mining for those who are encountering it for the first time (e.g., physician researchers, operational analytics teams, machine learning scientists from other domains). While not a comprehensive survey, this review describes the state of the art, with a particular focus on new tasks and methods developed over the past few years. It also identifies key barriers between these remarkable technical advances and the practical realities of implementation in health systems and in industry.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"165-187"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39370515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neoantigen Controversies. 新抗原争议。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-11 DOI: 10.1146/annurev-biodatasci-092820-112713
Andrea Castro, Maurizio Zanetti, Hannah Carter

Next-generation sequencing technologies have revolutionized our ability to catalog the landscape of somatic mutations in tumor genomes. These mutations can sometimes create so-called neoantigens, which allow the immune system to detect and eliminate tumor cells. However, efforts that stimulate the immune system to eliminate tumors based on their molecular differences have had less success than has been hoped for, and there are conflicting reports about the role of neoantigens in the success of this approach. Here we review some of the conflicting evidence in the literature and highlight key aspects of the tumor-immune interface that are emerging as major determinants of whether mutation-derived neoantigens will contribute to an immunotherapy response. Accounting for these factors is expected to improve success rates of future immunotherapy approaches.

下一代测序技术彻底改变了我们对肿瘤基因组中体细胞突变的编目能力。这些突变有时会产生所谓的新抗原,使免疫系统能够检测并消灭肿瘤细胞。然而,根据肿瘤的分子差异来刺激免疫系统消灭肿瘤的努力并没有取得预期的成功,关于新抗原在这种方法的成功中所起的作用,也有相互矛盾的报道。在此,我们回顾了文献中一些相互矛盾的证据,并强调了肿瘤免疫界面的一些关键方面,这些方面正在成为突变衍生的新抗原是否会促进免疫疗法反应的主要决定因素。考虑到这些因素有望提高未来免疫疗法的成功率。
{"title":"Neoantigen Controversies.","authors":"Andrea Castro, Maurizio Zanetti, Hannah Carter","doi":"10.1146/annurev-biodatasci-092820-112713","DOIUrl":"10.1146/annurev-biodatasci-092820-112713","url":null,"abstract":"<p><p>Next-generation sequencing technologies have revolutionized our ability to catalog the landscape of somatic mutations in tumor genomes. These mutations can sometimes create so-called neoantigens, which allow the immune system to detect and eliminate tumor cells. However, efforts that stimulate the immune system to eliminate tumors based on their molecular differences have had less success than has been hoped for, and there are conflicting reports about the role of neoantigens in the success of this approach. Here we review some of the conflicting evidence in the literature and highlight key aspects of the tumor-immune interface that are emerging as major determinants of whether mutation-derived neoantigens will contribute to an immunotherapy response. Accounting for these factors is expected to improve success rates of future immunotherapy approaches.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"4 ","pages":"227-253"},"PeriodicalIF":7.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10146390/pdf/nihms-1877401.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9746249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
African Global Representation in Biomedical Sciences. 非洲在生物医学科学领域的全球代表性。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 DOI: 10.1146/annurev-biodatasci-102920-112550
Nicola Mulder, Lyndon Zass, Yosr Hamdi, Houcemeddine Othman, Sumir Panji, Imane Allali, Yasmina Jaufeerally Fakim

African populations are diverse in their ethnicity, language, culture, and genetics. Although plagued by high disease burdens, until recently the continent has largely been excluded from biomedical studies. Along with limitations in research and clinical infrastructure, human capacity, and funding, this omission has resulted in an underrepresentation of African data and disadvantaged African scientists. This review interrogates the relative abundance of biomedical data from Africa, primarily in genomics and other omics. The visibility of African science through publications is also discussed. A challenge encountered in this review is the relative lack of annotation of data on their geographical or population origin, with African countries represented as a single group. In addition to the abovementioned limitations,the global representation of African data may also be attributed to the hesitation to deposit data in public repositories. Whatever the reason, the disparity should be addressed, as African data have enormous value for scientists in Africa and globally.

非洲人口在种族、语言、文化和基因上都是多样化的。尽管疾病负担沉重,但直到最近,非洲大陆在很大程度上一直被排除在生物医学研究之外。加上研究和临床基础设施、人员能力和资金方面的限制,这种遗漏导致了非洲数据的代表性不足,并使非洲科学家处于不利地位。这篇综述询问了来自非洲的相对丰富的生物医学数据,主要是基因组学和其他组学。还讨论了通过出版物提高非洲科学的知名度。本审查遇到的一个挑战是相对缺乏对其地理或人口来源的数据的注释,非洲国家作为一个单一的群体。除了上述限制之外,非洲数据的全球代表性也可能归因于对将数据存入公共存储库的犹豫。不管是什么原因,这种差异应该得到解决,因为非洲的数据对非洲和全球的科学家都有巨大的价值。
{"title":"African Global Representation in Biomedical Sciences.","authors":"Nicola Mulder,&nbsp;Lyndon Zass,&nbsp;Yosr Hamdi,&nbsp;Houcemeddine Othman,&nbsp;Sumir Panji,&nbsp;Imane Allali,&nbsp;Yasmina Jaufeerally Fakim","doi":"10.1146/annurev-biodatasci-102920-112550","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-102920-112550","url":null,"abstract":"<p><p>African populations are diverse in their ethnicity, language, culture, and genetics. Although plagued by high disease burdens, until recently the continent has largely been excluded from biomedical studies. Along with limitations in research and clinical infrastructure, human capacity, and funding, this omission has resulted in an underrepresentation of African data and disadvantaged African scientists. This review interrogates the relative abundance of biomedical data from Africa, primarily in genomics and other omics. The visibility of African science through publications is also discussed. A challenge encountered in this review is the relative lack of annotation of data on their geographical or population origin, with African countries represented as a single group. In addition to the abovementioned limitations,the global representation of African data may also be attributed to the hesitation to deposit data in public repositories. Whatever the reason, the disparity should be addressed, as African data have enormous value for scientists in Africa and globally.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"57-81"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39373761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Satellite Monitoring for Air Quality and Health. 空气质量和健康卫星监测。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-06-01 DOI: 10.1146/annurev-biodatasci-110920-093120
Tracey Holloway, Daegan Miller, Susan Anenberg, Minghui Diao, Bryan Duncan, Arlene M Fiore, Daven K Henze, Jeremy Hess, Patrick L Kinney, Yang Liu, Jessica L Neu, Susan M O'Neill, M Talat Odman, R Bradley Pierce, Armistead G Russell, Daniel Tong, J Jason West, Mark A Zondlo

Data from satellite instruments provide estimates of gas and particle levels relevant to human health, even pollutants invisible to the human eye. However, the successful interpretation of satellite data requires an understanding of how satellites relate to other data sources, as well as factors affecting their application to health challenges. Drawing from the expertise and experience of the 2016-2020 NASA HAQAST (Health and Air Quality Applied Sciences Team), we present a review of satellite data for air quality and health applications. We include a discussion of satellite data for epidemiological studies and health impact assessments, as well as the use of satellite data to evaluate air quality trends, support air quality regulation, characterize smoke from wildfires, and quantify emission sources. The primary advantage of satellite data compared to in situ measurements, e.g., from air quality monitoring stations, is their spatial coverage. Satellite data can reveal where pollution levels are highest around the world, how levels have changed over daily to decadal periods, and where pollutants are transported from urban to global scales. To date, air quality and health applications have primarily utilized satellite observations and satellite-derived products relevant to near-surface particulate matter <2.5 μm in diameter (PM2.5) and nitrogen dioxide (NO2). Health and air quality communities have grown increasingly engaged in the use of satellite data, and this trend is expected to continue. From health researchers to air quality managers, and from global applications to community impacts, satellite data are transforming the way air pollution exposure is evaluated.

来自卫星仪器的数据提供了对与人类健康有关的气体和颗粒水平的估计,甚至是人眼看不见的污染物。然而,要成功地解释卫星数据,就需要了解卫星与其他数据源的关系,以及影响其应用于卫生挑战的因素。根据2016-2020年NASA健康和空气质量应用科学小组的专业知识和经验,我们对空气质量和健康应用的卫星数据进行了审查。我们讨论了用于流行病学研究和健康影响评估的卫星数据,以及利用卫星数据评估空气质量趋势、支持空气质量监管、描述野火烟雾特征和量化排放源。与空气质量监测站等现场测量数据相比,卫星数据的主要优势在于其空间覆盖范围。卫星数据可以揭示世界上污染水平最高的地方,污染水平在每天到十年的时间内是如何变化的,以及污染物从城市到全球范围内的运输位置。迄今为止,空气质量和健康应用主要利用卫星观测和与近地表颗粒物(2.5)和二氧化氮(NO2)相关的卫星衍生产品。卫生和空气质量领域越来越多地使用卫星数据,预计这一趋势将继续下去。从卫生研究人员到空气质量管理人员,从全球应用到社区影响,卫星数据正在改变评估空气污染暴露的方式。
{"title":"Satellite Monitoring for Air Quality and Health.","authors":"Tracey Holloway,&nbsp;Daegan Miller,&nbsp;Susan Anenberg,&nbsp;Minghui Diao,&nbsp;Bryan Duncan,&nbsp;Arlene M Fiore,&nbsp;Daven K Henze,&nbsp;Jeremy Hess,&nbsp;Patrick L Kinney,&nbsp;Yang Liu,&nbsp;Jessica L Neu,&nbsp;Susan M O'Neill,&nbsp;M Talat Odman,&nbsp;R Bradley Pierce,&nbsp;Armistead G Russell,&nbsp;Daniel Tong,&nbsp;J Jason West,&nbsp;Mark A Zondlo","doi":"10.1146/annurev-biodatasci-110920-093120","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-110920-093120","url":null,"abstract":"<p><p>Data from satellite instruments provide estimates of gas and particle levels relevant to human health, even pollutants invisible to the human eye. However, the successful interpretation of satellite data requires an understanding of how satellites relate to other data sources, as well as factors affecting their application to health challenges. Drawing from the expertise and experience of the 2016-2020 NASA HAQAST (Health and Air Quality Applied Sciences Team), we present a review of satellite data for air quality and health applications. We include a discussion of satellite data for epidemiological studies and health impact assessments, as well as the use of satellite data to evaluate air quality trends, support air quality regulation, characterize smoke from wildfires, and quantify emission sources. The primary advantage of satellite data compared to in situ measurements, e.g., from air quality monitoring stations, is their spatial coverage. Satellite data can reveal where pollution levels are highest around the world, how levels have changed over daily to decadal periods, and where pollutants are transported from urban to global scales. To date, air quality and health applications have primarily utilized satellite observations and satellite-derived products relevant to near-surface particulate matter <2.5 μm in diameter (PM<sub>2.5</sub>) and nitrogen dioxide (NO<sub>2</sub>). Health and air quality communities have grown increasingly engaged in the use of satellite data, and this trend is expected to continue. From health researchers to air quality managers, and from global applications to community impacts, satellite data are transforming the way air pollution exposure is evaluated.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"417-447"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39373763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
期刊
Annual Review of Biomedical Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1