首页 > 最新文献

Annual Review of Biomedical Data Science最新文献

英文 中文
African Global Representation in Biomedical Sciences. 非洲在生物医学科学领域的全球代表性。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 DOI: 10.1146/annurev-biodatasci-102920-112550
Nicola Mulder, Lyndon Zass, Yosr Hamdi, Houcemeddine Othman, Sumir Panji, Imane Allali, Yasmina Jaufeerally Fakim

African populations are diverse in their ethnicity, language, culture, and genetics. Although plagued by high disease burdens, until recently the continent has largely been excluded from biomedical studies. Along with limitations in research and clinical infrastructure, human capacity, and funding, this omission has resulted in an underrepresentation of African data and disadvantaged African scientists. This review interrogates the relative abundance of biomedical data from Africa, primarily in genomics and other omics. The visibility of African science through publications is also discussed. A challenge encountered in this review is the relative lack of annotation of data on their geographical or population origin, with African countries represented as a single group. In addition to the abovementioned limitations,the global representation of African data may also be attributed to the hesitation to deposit data in public repositories. Whatever the reason, the disparity should be addressed, as African data have enormous value for scientists in Africa and globally.

非洲人口在种族、语言、文化和基因上都是多样化的。尽管疾病负担沉重,但直到最近,非洲大陆在很大程度上一直被排除在生物医学研究之外。加上研究和临床基础设施、人员能力和资金方面的限制,这种遗漏导致了非洲数据的代表性不足,并使非洲科学家处于不利地位。这篇综述询问了来自非洲的相对丰富的生物医学数据,主要是基因组学和其他组学。还讨论了通过出版物提高非洲科学的知名度。本审查遇到的一个挑战是相对缺乏对其地理或人口来源的数据的注释,非洲国家作为一个单一的群体。除了上述限制之外,非洲数据的全球代表性也可能归因于对将数据存入公共存储库的犹豫。不管是什么原因,这种差异应该得到解决,因为非洲的数据对非洲和全球的科学家都有巨大的价值。
{"title":"African Global Representation in Biomedical Sciences.","authors":"Nicola Mulder,&nbsp;Lyndon Zass,&nbsp;Yosr Hamdi,&nbsp;Houcemeddine Othman,&nbsp;Sumir Panji,&nbsp;Imane Allali,&nbsp;Yasmina Jaufeerally Fakim","doi":"10.1146/annurev-biodatasci-102920-112550","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-102920-112550","url":null,"abstract":"<p><p>African populations are diverse in their ethnicity, language, culture, and genetics. Although plagued by high disease burdens, until recently the continent has largely been excluded from biomedical studies. Along with limitations in research and clinical infrastructure, human capacity, and funding, this omission has resulted in an underrepresentation of African data and disadvantaged African scientists. This review interrogates the relative abundance of biomedical data from Africa, primarily in genomics and other omics. The visibility of African science through publications is also discussed. A challenge encountered in this review is the relative lack of annotation of data on their geographical or population origin, with African countries represented as a single group. In addition to the abovementioned limitations,the global representation of African data may also be attributed to the hesitation to deposit data in public repositories. Whatever the reason, the disparity should be addressed, as African data have enormous value for scientists in Africa and globally.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"57-81"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39373761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Satellite Monitoring for Air Quality and Health. 空气质量和健康卫星监测。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-06-01 DOI: 10.1146/annurev-biodatasci-110920-093120
Tracey Holloway, Daegan Miller, Susan Anenberg, Minghui Diao, Bryan Duncan, Arlene M Fiore, Daven K Henze, Jeremy Hess, Patrick L Kinney, Yang Liu, Jessica L Neu, Susan M O'Neill, M Talat Odman, R Bradley Pierce, Armistead G Russell, Daniel Tong, J Jason West, Mark A Zondlo

Data from satellite instruments provide estimates of gas and particle levels relevant to human health, even pollutants invisible to the human eye. However, the successful interpretation of satellite data requires an understanding of how satellites relate to other data sources, as well as factors affecting their application to health challenges. Drawing from the expertise and experience of the 2016-2020 NASA HAQAST (Health and Air Quality Applied Sciences Team), we present a review of satellite data for air quality and health applications. We include a discussion of satellite data for epidemiological studies and health impact assessments, as well as the use of satellite data to evaluate air quality trends, support air quality regulation, characterize smoke from wildfires, and quantify emission sources. The primary advantage of satellite data compared to in situ measurements, e.g., from air quality monitoring stations, is their spatial coverage. Satellite data can reveal where pollution levels are highest around the world, how levels have changed over daily to decadal periods, and where pollutants are transported from urban to global scales. To date, air quality and health applications have primarily utilized satellite observations and satellite-derived products relevant to near-surface particulate matter <2.5 μm in diameter (PM2.5) and nitrogen dioxide (NO2). Health and air quality communities have grown increasingly engaged in the use of satellite data, and this trend is expected to continue. From health researchers to air quality managers, and from global applications to community impacts, satellite data are transforming the way air pollution exposure is evaluated.

来自卫星仪器的数据提供了对与人类健康有关的气体和颗粒水平的估计,甚至是人眼看不见的污染物。然而,要成功地解释卫星数据,就需要了解卫星与其他数据源的关系,以及影响其应用于卫生挑战的因素。根据2016-2020年NASA健康和空气质量应用科学小组的专业知识和经验,我们对空气质量和健康应用的卫星数据进行了审查。我们讨论了用于流行病学研究和健康影响评估的卫星数据,以及利用卫星数据评估空气质量趋势、支持空气质量监管、描述野火烟雾特征和量化排放源。与空气质量监测站等现场测量数据相比,卫星数据的主要优势在于其空间覆盖范围。卫星数据可以揭示世界上污染水平最高的地方,污染水平在每天到十年的时间内是如何变化的,以及污染物从城市到全球范围内的运输位置。迄今为止,空气质量和健康应用主要利用卫星观测和与近地表颗粒物(2.5)和二氧化氮(NO2)相关的卫星衍生产品。卫生和空气质量领域越来越多地使用卫星数据,预计这一趋势将继续下去。从卫生研究人员到空气质量管理人员,从全球应用到社区影响,卫星数据正在改变评估空气污染暴露的方式。
{"title":"Satellite Monitoring for Air Quality and Health.","authors":"Tracey Holloway,&nbsp;Daegan Miller,&nbsp;Susan Anenberg,&nbsp;Minghui Diao,&nbsp;Bryan Duncan,&nbsp;Arlene M Fiore,&nbsp;Daven K Henze,&nbsp;Jeremy Hess,&nbsp;Patrick L Kinney,&nbsp;Yang Liu,&nbsp;Jessica L Neu,&nbsp;Susan M O'Neill,&nbsp;M Talat Odman,&nbsp;R Bradley Pierce,&nbsp;Armistead G Russell,&nbsp;Daniel Tong,&nbsp;J Jason West,&nbsp;Mark A Zondlo","doi":"10.1146/annurev-biodatasci-110920-093120","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-110920-093120","url":null,"abstract":"<p><p>Data from satellite instruments provide estimates of gas and particle levels relevant to human health, even pollutants invisible to the human eye. However, the successful interpretation of satellite data requires an understanding of how satellites relate to other data sources, as well as factors affecting their application to health challenges. Drawing from the expertise and experience of the 2016-2020 NASA HAQAST (Health and Air Quality Applied Sciences Team), we present a review of satellite data for air quality and health applications. We include a discussion of satellite data for epidemiological studies and health impact assessments, as well as the use of satellite data to evaluate air quality trends, support air quality regulation, characterize smoke from wildfires, and quantify emission sources. The primary advantage of satellite data compared to in situ measurements, e.g., from air quality monitoring stations, is their spatial coverage. Satellite data can reveal where pollution levels are highest around the world, how levels have changed over daily to decadal periods, and where pollutants are transported from urban to global scales. To date, air quality and health applications have primarily utilized satellite observations and satellite-derived products relevant to near-surface particulate matter <2.5 μm in diameter (PM<sub>2.5</sub>) and nitrogen dioxide (NO<sub>2</sub>). Health and air quality communities have grown increasingly engaged in the use of satellite data, and this trend is expected to continue. From health researchers to air quality managers, and from global applications to community impacts, satellite data are transforming the way air pollution exposure is evaluated.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"417-447"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39373763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
The Ethics of Consent in a Shifting Genomic Ecosystem. 在不断变化的基因组生态系统中的同意伦理。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 DOI: 10.1146/annurev-biodatasci-030221-125715
Sandra Soo-Jin Lee

The collection and use of human genetic data raise important ethical questions about how to balance individual autonomy and privacy with the potential for public good. The proliferation of local, national, and international efforts to collect genetic data and create linkages to support large-scale initiatives in precision medicine and the learning health system creates new demands for broad data sharing that involve managing competing interests and careful consideration of what constitutes appropriate ethical trade-offs. This review describes these emerging ethical issues with a focus on approaches to consent and issues related to justice in the shifting genomic research ecosystem.

人类基因数据的收集和使用引发了重要的伦理问题,即如何在个人自主权和隐私与潜在的公共利益之间取得平衡。地方、国家和国际上收集遗传数据和建立联系以支持精准医学和学习型卫生系统的大规模倡议的努力的扩散,对广泛的数据共享产生了新的需求,这涉及管理相互竞争的利益和仔细考虑什么构成适当的伦理权衡。这篇综述描述了这些新兴的伦理问题,重点是在不断变化的基因组研究生态系统中,同意的方法和与正义相关的问题。
{"title":"The Ethics of Consent in a Shifting Genomic Ecosystem.","authors":"Sandra Soo-Jin Lee","doi":"10.1146/annurev-biodatasci-030221-125715","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-030221-125715","url":null,"abstract":"<p><p>The collection and use of human genetic data raise important ethical questions about how to balance individual autonomy and privacy with the potential for public good. The proliferation of local, national, and international efforts to collect genetic data and create linkages to support large-scale initiatives in precision medicine and the learning health system creates new demands for broad data sharing that involve managing competing interests and careful consideration of what constitutes appropriate ethical trade-offs. This review describes these emerging ethical issues with a focus on approaches to consent and issues related to justice in the shifting genomic research ecosystem.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"145-164"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683157/pdf/nihms-1760354.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing. 人工智能在行动:用自然语言处理应对COVID-19大流行。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-14 DOI: 10.1146/annurev-biodatasci-021821-061045
Qingyu Chen, Robert Leaman, Alexis Allot, Ling Luo, Chih-Hsuan Wei, Shankai Yan, Zhiyong Lu

The COVID-19 (coronavirus disease 2019) pandemic has had a significant impact on society, both because of the serious health effects of COVID-19 and because of public health measures implemented to slow its spread. Many of these difficulties are fundamentally information needs; attempts to address these needs have caused an information overload for both researchers and the public. Natural language processing (NLP)-the branch of artificial intelligence that interprets human language-can be applied to address many of the information needs made urgent by the COVID-19 pandemic. This review surveys approximately 150 NLP studies and more than 50 systems and datasets addressing the COVID-19 pandemic. We detail work on four core NLP tasks: information retrieval, named entity recognition, literature-based discovery, and question answering. We also describe work that directly addresses aspects of the pandemic through four additional tasks: topic modeling, sentiment and emotion analysis, caseload forecasting, and misinformation detection. We conclude by discussing observable trends and remaining challenges.

COVID-19(2019冠状病毒病)大流行对社会产生了重大影响,这既是因为COVID-19对健康造成严重影响,也是因为为减缓其传播而采取的公共卫生措施。其中许多困难从根本上说是信息需求;试图解决这些需求已经造成了研究人员和公众的信息过载。自然语言处理(NLP)是解释人类语言的人工智能分支,可用于解决因COVID-19大流行而迫切需要的许多信息需求。本综述调查了关于COVID-19大流行的约150项NLP研究和50多个系统和数据集。我们详细介绍了四个核心NLP任务:信息检索、命名实体识别、基于文献的发现和问题回答。我们还描述了通过四项额外任务直接解决大流行方面的工作:主题建模、情绪和情绪分析、病例量预测和错误信息检测。最后,我们讨论了可观察到的趋势和仍然存在的挑战。
{"title":"Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing.","authors":"Qingyu Chen,&nbsp;Robert Leaman,&nbsp;Alexis Allot,&nbsp;Ling Luo,&nbsp;Chih-Hsuan Wei,&nbsp;Shankai Yan,&nbsp;Zhiyong Lu","doi":"10.1146/annurev-biodatasci-021821-061045","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-021821-061045","url":null,"abstract":"<p><p>The COVID-19 (coronavirus disease 2019) pandemic has had a significant impact on society, both because of the serious health effects of COVID-19 and because of public health measures implemented to slow its spread. Many of these difficulties are fundamentally information needs; attempts to address these needs have caused an information overload for both researchers and the public. Natural language processing (NLP)-the branch of artificial intelligence that interprets human language-can be applied to address many of the information needs made urgent by the COVID-19 pandemic. This review surveys approximately 150 NLP studies and more than 50 systems and datasets addressing the COVID-19 pandemic. We detail work on four core NLP tasks: information retrieval, named entity recognition, literature-based discovery, and question answering. We also describe work that directly addresses aspects of the pandemic through four additional tasks: topic modeling, sentiment and emotion analysis, caseload forecasting, and misinformation detection. We conclude by discussing observable trends and remaining challenges.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"313-339"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Data Science in the Food Industry. 食品工业中的数据科学。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-13 DOI: 10.1146/annurev-biodatasci-020221-123602
George-John Nychas, Emma Sims, Panagiotis Tsakanikas, Fady Mohareb

Food safety is one of the main challenges of the agri-food industry that is expected to be addressed in the current environment of tremendous technological progress, where consumers' lifestyles and preferences are in a constant state of flux. Food chain transparency and trust are drivers for food integrity control and for improvements in efficiency and economic growth. Similarly, the circular economy has great potential to reduce wastage and improve the efficiency of operations in multi-stakeholder ecosystems. Throughout the food chain cycle, all food commodities are exposed to multiple hazards, resulting in a high likelihood of contamination. Such biological or chemical hazards may be naturally present at any stage of food production, whether accidentally introduced or fraudulently imposed, risking consumers' health and their faith in the food industry. Nowadays, a massive amount of data is generated, not only from the next generation of food safety monitoring systems and along the entire food chain (primary production included) but also from the Internet of things, media, and other devices. These data should be used for the benefit of society, and the scientific field of data science should be a vital player in helping to make this possible.

食品安全是农业食品行业面临的主要挑战之一,预计将在当前技术进步巨大的环境中得到解决,消费者的生活方式和偏好处于不断变化的状态。食品链的透明度和信任是食品完整性控制和提高效率和经济增长的驱动力。同样,循环经济在减少浪费和提高多方利益相关者生态系统的运营效率方面具有巨大潜力。在整个食品链周期中,所有食品商品都面临多重危害,导致污染的可能性很高。这种生物或化学危害可能自然存在于食品生产的任何阶段,无论是偶然引入还是欺诈强加,都可能危及消费者的健康和他们对食品工业的信心。如今,大量的数据不仅来自下一代食品安全监控系统和整个食品链(包括初级生产),还来自物联网、媒体和其他设备。这些数据应该用于造福社会,数据科学的科学领域应该在帮助实现这一目标方面发挥重要作用。
{"title":"Data Science in the Food Industry.","authors":"George-John Nychas,&nbsp;Emma Sims,&nbsp;Panagiotis Tsakanikas,&nbsp;Fady Mohareb","doi":"10.1146/annurev-biodatasci-020221-123602","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-020221-123602","url":null,"abstract":"<p><p>Food safety is one of the main challenges of the agri-food industry that is expected to be addressed in the current environment of tremendous technological progress, where consumers' lifestyles and preferences are in a constant state of flux. Food chain transparency and trust are drivers for food integrity control and for improvements in efficiency and economic growth. Similarly, the circular economy has great potential to reduce wastage and improve the efficiency of operations in multi-stakeholder ecosystems. Throughout the food chain cycle, all food commodities are exposed to multiple hazards, resulting in a high likelihood of contamination. Such biological or chemical hazards may be naturally present at any stage of food production, whether accidentally introduced or fraudulently imposed, risking consumers' health and their faith in the food industry. Nowadays, a massive amount of data is generated, not only from the next generation of food safety monitoring systems and along the entire food chain (primary production included) but also from the Internet of things, media, and other devices. These data should be used for the benefit of society, and the scientific field of data science should be a vital player in helping to make this possible.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"341-367"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Metatranscriptomics for the Human Microbiome and Microbial Community Functional Profiling. 人类微生物组和微生物群落功能分析的超转录组学。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-13 DOI: 10.1146/annurev-biodatasci-031121-103035
Yancong Zhang, Kelsey N Thompson, Tobyn Branck, Yan Yan, Long H Nguyen, Eric A Franzosa, Curtis Huttenhower

Shotgun metatranscriptomics (MTX) is an increasingly practical way to survey microbial community gene function and regulation at scale. This review begins by summarizing the motivations for community transcriptomics and the history of the field. We then explore the principles, best practices, and challenges of contemporary MTX workflows: beginning with laboratory methods for isolation and sequencing of community RNA, followed by informatics methods for quantifying RNA features, and finally statistical methods for detecting differential expression in a community context. In thesecond half of the review, we survey important biological findings from the MTX literature, drawing examples from the human microbiome, other (nonhuman) host-associated microbiomes, and the environment. Across these examples, MTX methods prove invaluable for probing microbe-microbe and host-microbe interactions, the dynamics of energy harvest and chemical cycling, and responses to environmental stresses. We conclude with a review of open challenges in the MTX field, including making assays and analyses more robust, accessible, and adaptable to new technologies; deciphering roles for millions of uncharacterized microbial transcripts; and solving applied problems such as biomarker discovery and development of microbial therapeutics.

散弹枪亚转录组学(MTX)是一种越来越实用的大规模调查微生物群落基因功能和调控的方法。本文首先概述了社区转录组学研究的动机和该领域的历史。然后,我们探讨了当代MTX工作流程的原则、最佳实践和挑战:从分离和测序社区RNA的实验室方法开始,接着是量化RNA特征的信息学方法,最后是检测社区背景下差异表达的统计方法。在这篇综述的后半部分,我们调查了MTX文献中重要的生物学发现,从人类微生物组、其他(非人类)宿主相关微生物组和环境中提取了例子。在这些例子中,MTX方法在探测微生物-微生物和宿主-微生物相互作用、能量收集和化学循环的动力学以及对环境胁迫的反应方面证明是无价的。最后,我们回顾了MTX领域面临的挑战,包括使检测和分析更强大、更容易获取和适应新技术;解读数百万未表征的微生物转录物的作用;并解决应用问题,如生物标志物的发现和微生物疗法的发展。
{"title":"Metatranscriptomics for the Human Microbiome and Microbial Community Functional Profiling.","authors":"Yancong Zhang,&nbsp;Kelsey N Thompson,&nbsp;Tobyn Branck,&nbsp;Yan Yan,&nbsp;Long H Nguyen,&nbsp;Eric A Franzosa,&nbsp;Curtis Huttenhower","doi":"10.1146/annurev-biodatasci-031121-103035","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-031121-103035","url":null,"abstract":"<p><p>Shotgun metatranscriptomics (MTX) is an increasingly practical way to survey microbial community gene function and regulation at scale. This review begins by summarizing the motivations for community transcriptomics and the history of the field. We then explore the principles, best practices, and challenges of contemporary MTX workflows: beginning with laboratory methods for isolation and sequencing of community RNA, followed by informatics methods for quantifying RNA features, and finally statistical methods for detecting differential expression in a community context. In thesecond half of the review, we survey important biological findings from the MTX literature, drawing examples from the human microbiome, other (nonhuman) host-associated microbiomes, and the environment. Across these examples, MTX methods prove invaluable for probing microbe-microbe and host-microbe interactions, the dynamics of energy harvest and chemical cycling, and responses to environmental stresses. We conclude with a review of open challenges in the MTX field, including making assays and analyses more robust, accessible, and adaptable to new technologies; deciphering roles for millions of uncharacterized microbial transcripts; and solving applied problems such as biomarker discovery and development of microbial therapeutics.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"279-311"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39370513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Phenotyping Neurodegeneration in Human iPSCs. 人类ips细胞神经变性的表型分析。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-04-23 DOI: 10.1146/annurev-biodatasci-092820-025214
Jonathan Li, Ernest Fraenkel

Induced pluripotent stem cell (iPSC) technology holds promise for modeling neurodegenerative diseases. Traditional approaches for disease modeling using animal and cellular models require knowledge of disease mutations. However, many patients with neurodegenerative diseases do not have a known genetic cause. iPSCs offer a way to generate patient-specific models and study pathways of dysfunction in an in vitro setting in order to understand the causes and subtypes of neurodegeneration. Furthermore, iPSC-based models can be used to search for candidate therapeutics using high-throughput screening. Here we review how iPSC-based models are currently being used to further our understanding of neurodegenerative diseases, as well as discuss their challenges and future directions.

诱导多能干细胞(iPSC)技术有望模拟神经退行性疾病。使用动物和细胞模型进行疾病建模的传统方法需要了解疾病突变。然而,许多患有神经退行性疾病的患者并没有已知的遗传原因。iPSCs提供了一种在体外环境中生成患者特异性模型和研究功能障碍途径的方法,以便了解神经退行性变的原因和亚型。此外,基于ipsc的模型可以通过高通量筛选来搜索候选治疗方法。在这里,我们回顾了基于ipsc的模型目前如何被用于进一步我们对神经退行性疾病的理解,并讨论了它们的挑战和未来的方向。
{"title":"Phenotyping Neurodegeneration in Human iPSCs.","authors":"Jonathan Li,&nbsp;Ernest Fraenkel","doi":"10.1146/annurev-biodatasci-092820-025214","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-092820-025214","url":null,"abstract":"<p><p>Induced pluripotent stem cell (iPSC) technology holds promise for modeling neurodegenerative diseases. Traditional approaches for disease modeling using animal and cellular models require knowledge of disease mutations. However, many patients with neurodegenerative diseases do not have a known genetic cause. iPSCs offer a way to generate patient-specific models and study pathways of dysfunction in an in vitro setting in order to understand the causes and subtypes of neurodegeneration. Furthermore, iPSC-based models can be used to search for candidate therapeutics using high-throughput screening. Here we review how iPSC-based models are currently being used to further our understanding of neurodegenerative diseases, as well as discuss their challenges and future directions.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"83-100"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237961/pdf/nihms-1816934.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The Exposome in the Era of the Quantified Self. 自我量化时代的暴露。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 DOI: 10.1146/annurev-biodatasci-012721-122807
Xinyue Zhang, Peng Gao, Michael P Snyder

Human health is regulated by complex interactions among the genome, the microbiome, and the environment. While extensive research has been conducted on the human genome and microbiome, little is known about the human exposome. The exposome comprises the totality of chemical, biological, and physical exposures that individuals encounter over their lifetimes. Traditional environmental and biological monitoring only targets specific substances, whereas exposomic approaches identify and quantify thousands of substances simultaneously using nontargeted high-throughput and high-resolution analyses. The quantified self (QS) aims at enhancing our understanding of human health and disease through self-tracking. QS measurements are critical in exposome research, as external exposures impact an individual's health, behavior, and biology. This review discusses both the achievements and the shortcomings of current research and methodologies on the QS and the exposome and proposes future research directions.

人类健康是由基因组、微生物组和环境之间复杂的相互作用调节的。虽然对人类基因组和微生物组进行了广泛的研究,但对人类暴露体知之甚少。暴露包括个人一生中所接触的化学、生物和物理的全部暴露。传统的环境和生物监测仅针对特定物质,而暴露学方法使用非靶向高通量和高分辨率分析同时识别和量化数千种物质。量化自我(QS)旨在通过自我跟踪来增强我们对人类健康和疾病的了解。QS测量在暴露研究中至关重要,因为外部暴露会影响个人的健康、行为和生物学。本文综述了目前QS和暴露点的研究成果和方法的不足,并提出了未来的研究方向。
{"title":"The Exposome in the Era of the Quantified Self.","authors":"Xinyue Zhang,&nbsp;Peng Gao,&nbsp;Michael P Snyder","doi":"10.1146/annurev-biodatasci-012721-122807","DOIUrl":"https://doi.org/10.1146/annurev-biodatasci-012721-122807","url":null,"abstract":"<p><p>Human health is regulated by complex interactions among the genome, the microbiome, and the environment. While extensive research has been conducted on the human genome and microbiome, little is known about the human exposome. The exposome comprises the totality of chemical, biological, and physical exposures that individuals encounter over their lifetimes. Traditional environmental and biological monitoring only targets specific substances, whereas exposomic approaches identify and quantify thousands of substances simultaneously using nontargeted high-throughput and high-resolution analyses. The quantified self (QS) aims at enhancing our understanding of human health and disease through self-tracking. QS measurements are critical in exposome research, as external exposures impact an individual's health, behavior, and biology. This review discusses both the achievements and the shortcomings of current research and methodologies on the QS and the exposome and proposes future research directions.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"255-277"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Probabilistic Machine Learning for Healthcare. 医疗保健领域的概率机器学习。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-06-01 DOI: 10.1146/annurev-biodatasci-092820-033938
Irene Y Chen, Shalmali Joshi, Marzyeh Ghassemi, Rajesh Ranganath

Machine learning can be used to make sense of healthcare data. Probabilistic machine learning models help provide a complete picture of observed data in healthcare. In this review, we examine how probabilistic machine learning can advance healthcare. We consider challenges in the predictive model building pipeline where probabilistic models can be beneficial, including calibration and missing data. Beyond predictive models, we also investigate the utility of probabilistic machine learning models in phenotyping, in generative models for clinical use cases, and in reinforcement learning.

机器学习可以用来理解医疗数据。概率机器学习模型有助于提供医疗保健中观察数据的完整图像。在这篇综述中,我们研究了概率机器学习如何推进医疗保健。我们考虑了预测模型构建管道中的挑战,其中概率模型可能是有益的,包括校准和丢失数据。除了预测模型,我们还研究了概率机器学习模型在表型、临床用例生成模型和强化学习中的效用。
{"title":"Probabilistic Machine Learning for Healthcare.","authors":"Irene Y Chen, Shalmali Joshi, Marzyeh Ghassemi, Rajesh Ranganath","doi":"10.1146/annurev-biodatasci-092820-033938","DOIUrl":"10.1146/annurev-biodatasci-092820-033938","url":null,"abstract":"<p><p>Machine learning can be used to make sense of healthcare data. Probabilistic machine learning models help provide a complete picture of observed data in healthcare. In this review, we examine how probabilistic machine learning can advance healthcare. We consider challenges in the predictive model building pipeline where probabilistic models can be beneficial, including calibration and missing data. Beyond predictive models, we also investigate the utility of probabilistic machine learning models in phenotyping, in generative models for clinical use cases, and in reinforcement learning.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"393-415"},"PeriodicalIF":6.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39370517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mutational Signatures: From Methods to Mechanisms. 突变签名:从方法到机制。
IF 7 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-07-20 Epub Date: 2021-05-11 DOI: 10.1146/annurev-biodatasci-122320-120920
Yoo-Ah Kim, Mark D M Leiserson, Priya Moorjani, Roded Sharan, Damian Wojtowicz, Teresa M Przytycka

Mutations are the driving force of evolution, yet they underlie many diseases, in particular, cancer. They are thought to arise from a combination of stochastic errors in DNA processing, naturally occurring DNA damage (e.g., the spontaneous deamination of methylated CpG sites), replication errors, and dysregulation of DNA repair mechanisms. High-throughput sequencing has made it possible to generate large datasets to study mutational processes in health and disease. Since the emergence of the first mutational process studies in 2012, this field is gaining increasing attention and has already accumulated a host of computational approaches and biomedical applications.

突变是进化的驱动力,但它们是许多疾病,尤其是癌症的基础。它们被认为是由DNA加工中的随机错误、自然发生的DNA损伤(例如,甲基化CpG位点的自发脱氨)、复制错误和DNA修复机制失调的组合引起的。高通量测序使得产生大型数据集来研究健康和疾病的突变过程成为可能。自2012年首次出现突变过程研究以来,该领域受到越来越多的关注,并且已经积累了大量的计算方法和生物医学应用。
{"title":"Mutational Signatures: From Methods to Mechanisms.","authors":"Yoo-Ah Kim, Mark D M Leiserson, Priya Moorjani, Roded Sharan, Damian Wojtowicz, Teresa M Przytycka","doi":"10.1146/annurev-biodatasci-122320-120920","DOIUrl":"10.1146/annurev-biodatasci-122320-120920","url":null,"abstract":"<p><p>Mutations are the driving force of evolution, yet they underlie many diseases, in particular, cancer. They are thought to arise from a combination of stochastic errors in DNA processing, naturally occurring DNA damage (e.g., the spontaneous deamination of methylated CpG sites), replication errors, and dysregulation of DNA repair mechanisms. High-throughput sequencing has made it possible to generate large datasets to study mutational processes in health and disease. Since the emergence of the first mutational process studies in 2012, this field is gaining increasing attention and has already accumulated a host of computational approaches and biomedical applications.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"189-206"},"PeriodicalIF":7.0,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12232997/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39370516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Review of Biomedical Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1