首页 > 最新文献

Annual Review of Biomedical Data Science最新文献

英文 中文
Artificial Intelligence in Pathology: Advancing Large Models for Scalable Applications. 病理学中的人工智能:推进可扩展应用的大型模型。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 DOI: 10.1146/annurev-biodatasci-103123-095814
Zhiping Xiao, Bin Feng, Junwei Yang, Gongbo Sun, Yuxi Shen, Shengyuan Xu, Lina Yang, Hanwen Xu, Ming Zhang, Sheng Wang

The rapid development of artificial intelligence (AI) has had a significant impact on medical research, introducing new possibilities for pathology studies. There is a recent trend of applying large-scale AI models to many fields, and this trend has given rise to the pathology foundation models and pathology ensemble models. Large models in pathology are not standalone innovations; they build upon a legacy where AI has consistently played a vital role in pathology studies long before their advent. Numerous pathology datasets and AI models have been developed to support advancements in the field, with these combined efforts paving the way for the emergence of large models in pathology. AI greatly enhances pathology studies, yet its widespread use in sensitive applications also raises significant ethical concerns, including privacy risks. In this review, we summarize the datasets and models that are useful to pathology studies, with a particular focus on how they illuminate the path toward large-scale applications.

人工智能(AI)的快速发展对医学研究产生了重大影响,为病理学研究带来了新的可能性。近年来出现了将大规模人工智能模型应用于许多领域的趋势,这一趋势催生了病理基础模型和病理集合模型。病理学中的大型模型并不是独立的创新;早在人工智能出现之前,它们就一直在病理学研究中发挥着至关重要的作用。为了支持该领域的进步,已经开发了许多病理学数据集和人工智能模型,这些共同努力为病理学中大型模型的出现铺平了道路。人工智能极大地增强了病理学研究,但它在敏感应用中的广泛使用也引发了重大的伦理问题,包括隐私风险。在这篇综述中,我们总结了对病理学研究有用的数据集和模型,特别关注它们如何照亮大规模应用的道路。
{"title":"Artificial Intelligence in Pathology: Advancing Large Models for Scalable Applications.","authors":"Zhiping Xiao, Bin Feng, Junwei Yang, Gongbo Sun, Yuxi Shen, Shengyuan Xu, Lina Yang, Hanwen Xu, Ming Zhang, Sheng Wang","doi":"10.1146/annurev-biodatasci-103123-095814","DOIUrl":"10.1146/annurev-biodatasci-103123-095814","url":null,"abstract":"<p><p>The rapid development of artificial intelligence (AI) has had a significant impact on medical research, introducing new possibilities for pathology studies. There is a recent trend of applying large-scale AI models to many fields, and this trend has given rise to the pathology foundation models and pathology ensemble models. Large models in pathology are not standalone innovations; they build upon a legacy where AI has consistently played a vital role in pathology studies long before their advent. Numerous pathology datasets and AI models have been developed to support advancements in the field, with these combined efforts paving the way for the emergence of large models in pathology. AI greatly enhances pathology studies, yet its widespread use in sensitive applications also raises significant ethical concerns, including privacy risks. In this review, we summarize the datasets and models that are useful to pathology studies, with a particular focus on how they illuminate the path toward large-scale applications.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"8 1","pages":"149-171"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144822750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Unstructured Data in Electronic Health Records to Detect Adverse Events from Pediatric Drug Use: A Scoping Review. 利用电子健康记录中的非结构化数据来检测儿童药物使用的不良事件:范围审查。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 DOI: 10.1146/annurev-biodatasci-111224-124530
Su Golder, Karen O'Connor, Guillermo Lopez-Garcia, Nicholas P Tatonetti, Graciela Gonzalez-Hernandez

Adverse drug events (ADEs) in pediatric populations pose significant public health challenges, yet research on their detection and monitoring remains limited. This scoping review evaluates the use of unstructured data from electronic health records (EHRs) to identify ADEs in children. We searched six databases, including MEDLINE, Embase, and IEEE Xplore, in September 2024. From 984 records, only nine studies met our inclusion criteria, indicating a significant gap in research toward identifying ADEs in children. We found that unstructured data in EHRs can indeed be of value and enhance pediatric pharmacovigilance, although their use has been so far very limited. Traditional natural language processing methods have been employed to extract ADEs, but the approaches utilized face challenges in generalizability and context interpretation. These challenges could be addressed with recent advances in transformer-based models and large language models, unlocking the use of EHR data at scale for pediatric pharmacovigilance.

儿科人群的药物不良事件(ADEs)构成了重大的公共卫生挑战,但对其检测和监测的研究仍然有限。本综述评估了使用电子健康记录(EHRs)中的非结构化数据来识别儿童ade的情况。我们在2024年9月检索了六个数据库,包括MEDLINE、Embase和IEEE explore。从984项记录中,只有9项研究符合我们的纳入标准,这表明在识别儿童ADEs方面的研究存在显著差距。我们发现电子病历中的非结构化数据确实有价值,可以提高儿科药物警惕性,尽管迄今为止它们的使用非常有限。传统的自然语言处理方法被用来提取ade,但所使用的方法在泛化和上下文解释方面面临挑战。这些挑战可以通过基于转换器的模型和大型语言模型的最新进展来解决,从而实现大规模使用电子病历数据用于儿科药物警戒。
{"title":"Leveraging Unstructured Data in Electronic Health Records to Detect Adverse Events from Pediatric Drug Use: A Scoping Review.","authors":"Su Golder, Karen O'Connor, Guillermo Lopez-Garcia, Nicholas P Tatonetti, Graciela Gonzalez-Hernandez","doi":"10.1146/annurev-biodatasci-111224-124530","DOIUrl":"10.1146/annurev-biodatasci-111224-124530","url":null,"abstract":"<p><p>Adverse drug events (ADEs) in pediatric populations pose significant public health challenges, yet research on their detection and monitoring remains limited. This scoping review evaluates the use of unstructured data from electronic health records (EHRs) to identify ADEs in children. We searched six databases, including MEDLINE, Embase, and IEEE Xplore, in September 2024. From 984 records, only nine studies met our inclusion criteria, indicating a significant gap in research toward identifying ADEs in children. We found that unstructured data in EHRs can indeed be of value and enhance pediatric pharmacovigilance, although their use has been so far very limited. Traditional natural language processing methods have been employed to extract ADEs, but the approaches utilized face challenges in generalizability and context interpretation. These challenges could be addressed with recent advances in transformer-based models and large language models, unlocking the use of EHR data at scale for pediatric pharmacovigilance.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"8 1","pages":"227-250"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144822751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Prediction to Prescription: Machine Learning and Causal Inference for the Heterogeneous Treatment Effect. 从预测到处方:异质治疗效果的机器学习和因果推理。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-04-09 DOI: 10.1146/annurev-biodatasci-103123-095750
Judith Abécassis, Élise Dumas, Julie Alberge, Gaël Varoquaux

The increasing accumulation of medical data brings the hope of data-driven medical decision-making, but data's increasing complexity-as text or images in electronic health records-calls for complex models, such as machine learning. Here, we review how machine learning can be used to inform decisions for individualized interventions, a causal question. Going from prediction to causal effects is challenging, as no individual is seen as both treated and not. We detail how some data can support some causal claims and how to build causal estimators with machine learning. Beyond variable selection to adjust for confounding bias, we cover the broader notions of study design that make or break causal inference. As the problems span across diverse scientific communities, we use didactic yet statistically precise formulations to bridge machine learning to epidemiology.

医疗数据的不断积累为数据驱动的医疗决策带来了希望,但数据日益复杂——如电子健康记录中的文本或图像——需要复杂的模型,如机器学习。在这里,我们回顾了机器学习如何用于为个性化干预提供决策信息,这是一个因果问题。从预测到因果关系是具有挑战性的,因为没有一个人被视为既治疗又没有治疗。我们详细介绍了一些数据如何支持一些因果断言,以及如何使用机器学习构建因果估计器。除了调整混杂偏差的变量选择之外,我们还涵盖了研究设计的更广泛概念,这些概念可以建立或破坏因果推理。由于问题跨越不同的科学社区,我们使用教学但统计精确的公式将机器学习与流行病学联系起来。
{"title":"From Prediction to Prescription: Machine Learning and Causal Inference for the Heterogeneous Treatment Effect.","authors":"Judith Abécassis, Élise Dumas, Julie Alberge, Gaël Varoquaux","doi":"10.1146/annurev-biodatasci-103123-095750","DOIUrl":"10.1146/annurev-biodatasci-103123-095750","url":null,"abstract":"<p><p>The increasing accumulation of medical data brings the hope of data-driven medical decision-making, but data's increasing complexity-as text or images in electronic health records-calls for complex models, such as machine learning. Here, we review how machine learning can be used to inform decisions for individualized interventions, a causal question. Going from prediction to causal effects is challenging, as no individual is seen as both treated and not. We detail how some data can support some causal claims and how to build causal estimators with machine learning. Beyond variable selection to adjust for confounding bias, we cover the broader notions of study design that make or break causal inference. As the problems span across diverse scientific communities, we use didactic yet statistically precise formulations to bridge machine learning to epidemiology.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"381-404"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144003572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education. 生成式人工智能:对生物医学和卫生专业教育的影响。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-04-09 DOI: 10.1146/annurev-biodatasci-103123-094756
William Hersh

Generative artificial intelligence (AI) has had a profound impact on biomedicine and health, both in professional work and in education. Based on large language models (LLMs), generative AI has been found to perform as well as humans in simulated situations taking medical board exams, answering clinical questions, solving clinical cases, applying clinical reasoning, and summarizing information. Generative AI is also being used widely in education, performing well in academic courses and their assessments. This review summarizes the successes of LLMs and highlights some of their challenges in the context of education, most notably aspects that may undermine the acquisition of knowledge and skills for professional work. It then provides recommendations for best practices to overcome the shortcomings of LLM use in education. Although there are challenges for the use of generative AI in education, all students and faculty, in biomedicine and health and beyond, must have understanding of it and be competent in its use.

生成式人工智能(AI)对生物医学和健康产生了深远的影响,无论是在专业工作还是在教育方面。基于大型语言模型(llm),生成式人工智能已经被发现在模拟情况下表现得和人类一样好,比如参加医学委员会考试、回答临床问题、解决临床病例、应用临床推理和总结信息。生成式人工智能也被广泛应用于教育领域,在学术课程及其评估中表现出色。这篇综述总结了法学硕士的成功,并强调了他们在教育背景下面临的一些挑战,最明显的是可能会破坏专业工作知识和技能的获取。然后,它提供了最佳实践建议,以克服法学硕士在教育中使用的缺点。尽管在教育中使用生成式人工智能存在挑战,但生物医学和卫生等领域的所有学生和教师都必须理解并能够熟练使用人工智能。
{"title":"Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education.","authors":"William Hersh","doi":"10.1146/annurev-biodatasci-103123-094756","DOIUrl":"10.1146/annurev-biodatasci-103123-094756","url":null,"abstract":"<p><p>Generative artificial intelligence (AI) has had a profound impact on biomedicine and health, both in professional work and in education. Based on large language models (LLMs), generative AI has been found to perform as well as humans in simulated situations taking medical board exams, answering clinical questions, solving clinical cases, applying clinical reasoning, and summarizing information. Generative AI is also being used widely in education, performing well in academic courses and their assessments. This review summarizes the successes of LLMs and highlights some of their challenges in the context of education, most notably aspects that may undermine the acquisition of knowledge and skills for professional work. It then provides recommendations for best practices to overcome the shortcomings of LLM use in education. Although there are challenges for the use of generative AI in education, all students and faculty, in biomedicine and health and beyond, must have understanding of it and be competent in its use.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"355-380"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144052507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methylation Data Analysis and Interpretation. 甲基化数据分析与解释。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 DOI: 10.1146/annurev-biodatasci-120924-091033
Yuehua Zhu, Weiguang Mao, Rezwan Hosseini, Maria Chikina

DNA methylation, a covalent modification, fundamentally shapes mammalian gene regulation and cellular identity. This review examines methylation's biochemical underpinnings, genomic distribution patterns, and analytical approaches. We highlight three distinctive aspects that separate methylation from other epigenetic marks: its remarkable stability as a silencing mechanism, its capacity to maintain distinct states independently of DNA sequence, and its effectiveness as a quantitative trait linking genotype to disease risk. We also explore the phenomenon of methylation clocks and their biological significance. The review addresses technical considerations across major assay types-both array-based technologies and sequencing approaches-with emphasis on data normalization, quality control, cell proportion inference, and the specialized statistical models required for next-generation sequencing analysis.

DNA甲基化是一种共价修饰,从根本上塑造了哺乳动物的基因调控和细胞身份。本文综述了甲基化的生化基础、基因组分布模式和分析方法。我们强调了将甲基化与其他表观遗传标记区分开来的三个独特方面:其作为沉默机制的显著稳定性,其独立于DNA序列维持不同状态的能力,以及其作为将基因型与疾病风险联系起来的数量性状的有效性。我们还探讨了甲基化时钟现象及其生物学意义。这篇综述讨论了主要分析类型的技术考虑,包括基于阵列的技术和测序方法,重点是数据归一化、质量控制、细胞比例推断和下一代测序分析所需的专门统计模型。
{"title":"Methylation Data Analysis and Interpretation.","authors":"Yuehua Zhu, Weiguang Mao, Rezwan Hosseini, Maria Chikina","doi":"10.1146/annurev-biodatasci-120924-091033","DOIUrl":"10.1146/annurev-biodatasci-120924-091033","url":null,"abstract":"<p><p>DNA methylation, a covalent modification, fundamentally shapes mammalian gene regulation and cellular identity. This review examines methylation's biochemical underpinnings, genomic distribution patterns, and analytical approaches. We highlight three distinctive aspects that separate methylation from other epigenetic marks: its remarkable stability as a silencing mechanism, its capacity to maintain distinct states independently of DNA sequence, and its effectiveness as a quantitative trait linking genotype to disease risk. We also explore the phenomenon of methylation clocks and their biological significance. The review addresses technical considerations across major assay types-both array-based technologies and sequencing approaches-with emphasis on data normalization, quality control, cell proportion inference, and the specialized statistical models required for next-generation sequencing analysis.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":"8 1","pages":"605-632"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144822753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Development Landscape of Large Language Models for Biomedical Applications. 生物医学应用的大型语言模型的发展前景。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-04-01 DOI: 10.1146/annurev-biodatasci-102224-074736
Zhiyuan Cao, Vipina K Keloth, Qianqian Xie, Lingfei Qian, Yuntian Liu, Yan Wang, Rui Shi, Weipeng Zhou, Gui Yang, Jeffrey Zhang, Xueqing Peng, Ethan Zhen, Ruey-Ling Weng, Qingyu Chen, Hua Xu

Large language models (LLMs) have become powerful tools for biomedical applications, offering potential to transform healthcare and medical research. Since the release of ChatGPT in 2022, there has been a surge in LLMs for diverse biomedical applications. This review examines the landscape of text-based biomedical LLM development, analyzing model characteristics (e.g., architecture), development processes (e.g., training strategy), and applications (e.g., chatbots). Following PRISMA guidelines, 82 articles were selected out of 5,512 articles since 2022 that met our rigorous criteria, including the requirement of using biomedical data when training LLMs. Findings highlight the predominant use of decoder-only architectures such as Llama 7B, prevalence of task-specific fine-tuning, and reliance on biomedical literature for training. Challenges persist in balancing data openness with privacy concerns and detailing model development, including computational resources used. Future efforts would benefit from multimodal integration, LLMs for specialized medical applications, and improved data sharing and model accessibility.

大型语言模型(llm)已经成为生物医学应用的强大工具,提供了改变医疗保健和医学研究的潜力。自2022年ChatGPT发布以来,各种生物医学应用的法学硕士数量激增。这篇综述考察了基于文本的生物医学法学硕士发展的前景,分析了模型特征(例如,架构)、开发过程(例如,培训策略)和应用(例如,聊天机器人)。遵循PRISMA指南,从2022年以来的5512篇文章中选择了82篇符合我们严格标准的文章,包括在培训法学硕士时使用生物医学数据的要求。研究结果强调了仅解码器架构(如Llama 7B)的主要使用,特定任务微调的流行,以及对生物医学文献的依赖。在平衡数据开放与隐私问题以及详细描述模型开发(包括使用的计算资源)之间的关系方面,挑战依然存在。未来的努力将受益于多模式集成、专门医疗应用的法学硕士以及改进的数据共享和模型可访问性。
{"title":"The Development Landscape of Large Language Models for Biomedical Applications.","authors":"Zhiyuan Cao, Vipina K Keloth, Qianqian Xie, Lingfei Qian, Yuntian Liu, Yan Wang, Rui Shi, Weipeng Zhou, Gui Yang, Jeffrey Zhang, Xueqing Peng, Ethan Zhen, Ruey-Ling Weng, Qingyu Chen, Hua Xu","doi":"10.1146/annurev-biodatasci-102224-074736","DOIUrl":"10.1146/annurev-biodatasci-102224-074736","url":null,"abstract":"<p><p>Large language models (LLMs) have become powerful tools for biomedical applications, offering potential to transform healthcare and medical research. Since the release of ChatGPT in 2022, there has been a surge in LLMs for diverse biomedical applications. This review examines the landscape of text-based biomedical LLM development, analyzing model characteristics (e.g., architecture), development processes (e.g., training strategy), and applications (e.g., chatbots). Following PRISMA guidelines, 82 articles were selected out of 5,512 articles since 2022 that met our rigorous criteria, including the requirement of using biomedical data when training LLMs. Findings highlight the predominant use of decoder-only architectures such as Llama 7B, prevalence of task-specific fine-tuning, and reliance on biomedical literature for training. Challenges persist in balancing data openness with privacy concerns and detailing model development, including computational resources used. Future efforts would benefit from multimodal integration, LLMs for specialized medical applications, and improved data sharing and model accessibility.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"251-274"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372014/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143765290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Curriculum Design in an Evolving Field: Perspectives on Biomedical Data Science from Stanford. 不断发展的领域中的课程设计:斯坦福大学生物医学数据科学的视角。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-04-09 DOI: 10.1146/annurev-biodatasci-090624-022951
Christine Y Yeh, Dennis P Wall, Karen Matthys, Chiara Sabatti, Julia A Palacios

In recent decades, there has been an explosion of data streams spanning the entire spectrum of biomedicine, opening novel opportunities to tackle biological and medical research questions, increasing our ability to provide effective and efficient health care. In parallel, augmented computational power has allowed the development and deployment of quantitative approaches at unprecedented scales. To effectively take advantage of this progress, it is important to invest in the training of a new generation of biomedical data scientists. Designing a graduate curriculum in the backdrop of a rapidly changing landscape of data, methods, and computing power demands flexibility and openness to adaptation. At the same time, we strive to ensure that the students acquire foundational competencies that might fuel productive and evolving careers, without being constrained to and defined by a niche trendy topic. We offer here a view of graduate training in biomedical data science from the standpoint of our experience at Stanford University. We conclude with a series of open challenges, the answers to which we believe will shape training in biomedical data science.

近几十年来,跨越整个生物医学领域的数据流呈爆炸式增长,为解决生物和医学研究问题提供了新的机会,提高了我们提供有效和高效医疗保健的能力。与此同时,计算能力的增强使得定量方法的开发和部署达到了前所未有的规模。为了有效地利用这一进展,重要的是投资培训新一代生物医学数据科学家。在数据、方法和计算能力快速变化的背景下设计研究生课程需要灵活性和开放性。与此同时,我们努力确保学生获得可能促进生产和不断发展的职业生涯的基础能力,而不受限于和定义一个小众的时尚话题。从我们在斯坦福大学的经验来看,我们在这里提供了生物医学数据科学研究生培训的观点。我们总结了一系列公开的挑战,我们相信这些挑战的答案将塑造生物医学数据科学的培训。
{"title":"Curriculum Design in an Evolving Field: Perspectives on Biomedical Data Science from Stanford.","authors":"Christine Y Yeh, Dennis P Wall, Karen Matthys, Chiara Sabatti, Julia A Palacios","doi":"10.1146/annurev-biodatasci-090624-022951","DOIUrl":"10.1146/annurev-biodatasci-090624-022951","url":null,"abstract":"<p><p>In recent decades, there has been an explosion of data streams spanning the entire spectrum of biomedicine, opening novel opportunities to tackle biological and medical research questions, increasing our ability to provide effective and efficient health care. In parallel, augmented computational power has allowed the development and deployment of quantitative approaches at unprecedented scales. To effectively take advantage of this progress, it is important to invest in the training of a new generation of biomedical data scientists. Designing a graduate curriculum in the backdrop of a rapidly changing landscape of data, methods, and computing power demands flexibility and openness to adaptation. At the same time, we strive to ensure that the students acquire foundational competencies that might fuel productive and evolving careers, without being constrained to and defined by a niche trendy topic. We offer here a view of graduate training in biomedical data science from the standpoint of our experience at Stanford University. We conclude with a series of open challenges, the answers to which we believe will shape training in biomedical data science.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"341-354"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144027038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation and Regulation of Artificial Intelligence Medical Devices for Clinical Decision Support. 临床决策支持人工智能医疗器械的评价与规范
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-02-19 DOI: 10.1146/annurev-biodatasci-103123-095824
Gary E Weissman

Artificial intelligence (AI) methods were first developed nearly seven decades ago. Only in recent years have they demonstrated their potential to improve clinical care at the bedside. AI systems are now capable of interpreting, predicting, and even generating important medical information. AI medical devices share many similarities with traditional medical devices but also diverge from them in important ways. Despite widespread optimism and enthusiasm surrounding the use of such devices to improve care processes, patient outcomes, and the healthcare experience for patients, caregivers, and clinicians alike, little evidence exists so far for their effectiveness in practice. Even less is known about the safety or equity of AI medical devices. As with any new technology, this exciting time is accompanied by appropriate questions regarding if, how much, when, and who such AI systems really help. Different stakeholders, ranging from patients to clinicians to industry device developers, may have divergent preferences or assessments of risk and benefits, warranting an informed public discussion to guide emerging regulatory efforts. This review summarizes the rapidly evolving recent efforts and evidence related to the regulation and evaluation of AI medical devices and highlights opportunities for future work to ensure their effectiveness, safety, and equity.

人工智能(AI)方法最早是在近70年前开发出来的。直到最近几年,它们才显示出改善床边临床护理的潜力。人工智能系统现在能够解释、预测,甚至生成重要的医疗信息。人工智能医疗设备与传统医疗设备有许多相似之处,但在重要方面也有所不同。尽管人们普遍对使用此类设备来改善护理流程、患者预后以及患者、护理人员和临床医生的医疗保健体验感到乐观和热情,但迄今为止,几乎没有证据表明它们在实践中的有效性。人们对人工智能医疗设备的安全性或公平性了解得更少。与任何新技术一样,这一激动人心的时刻也伴随着一些适当的问题,如人工智能系统是否、多大程度、何时以及谁真正有帮助。不同的利益相关者,从患者到临床医生再到行业设备开发商,可能有不同的偏好或风险和收益评估,因此需要进行知情的公众讨论,以指导新兴的监管工作。本综述总结了近期与人工智能医疗器械监管和评估相关的快速发展的努力和证据,并强调了未来工作的机会,以确保其有效性、安全性和公平性。
{"title":"Evaluation and Regulation of Artificial Intelligence Medical Devices for Clinical Decision Support.","authors":"Gary E Weissman","doi":"10.1146/annurev-biodatasci-103123-095824","DOIUrl":"10.1146/annurev-biodatasci-103123-095824","url":null,"abstract":"<p><p>Artificial intelligence (AI) methods were first developed nearly seven decades ago. Only in recent years have they demonstrated their potential to improve clinical care at the bedside. AI systems are now capable of interpreting, predicting, and even generating important medical information. AI medical devices share many similarities with traditional medical devices but also diverge from them in important ways. Despite widespread optimism and enthusiasm surrounding the use of such devices to improve care processes, patient outcomes, and the healthcare experience for patients, caregivers, and clinicians alike, little evidence exists so far for their effectiveness in practice. Even less is known about the safety or equity of AI medical devices. As with any new technology, this exciting time is accompanied by appropriate questions regarding if, how much, when, and who such AI systems really help. Different stakeholders, ranging from patients to clinicians to industry device developers, may have divergent preferences or assessments of risk and benefits, warranting an informed public discussion to guide emerging regulatory efforts. This review summarizes the rapidly evolving recent efforts and evidence related to the regulation and evaluation of AI medical devices and highlights opportunities for future work to ensure their effectiveness, safety, and equity.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"81-99"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12339208/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143459781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial Transcriptomics Brings New Challenges and Opportunities for Trajectory Inference. 空间转录组学为轨迹推断带来新的挑战和机遇
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2024-11-14 DOI: 10.1146/annurev-biodatasci-040324-030052
Matthieu Heitz, Yujia Ma, Sharvaj Kubal, Geoffrey Schiebinger

Spatial transcriptomics (ST) brings new dimensions to the analysis of single-cell data. While some methods for data analysis can be ported over without major modifications, they are the exception rather than the rule. Trajectory inference (TI) methods in particular can suffer from significant challenges due to spatial batch effects in ST data. These can add independent sources of noise to each time point. Pioneering methods for TI on ST data have focused primarily on addressing the batch effects in physical arrangement, i.e., where tissues are deformed in different ways at different time points. However, other challenges arise due to the measurement granularity of ST technologies, as well as a bias from slicing. In this review, we examine the sources of these challenges, and we explore how they are addressed with current state-of-the-art STTI methods. We conclude by highlighting some opportunities for future method development.

空间转录组学(ST)为单细胞数据分析带来了新的维度。虽然有些数据分析方法无需进行重大修改即可移植,但它们只是例外,而不是常规。特别是轨迹推断(TI)方法,由于 ST 数据的空间批次效应,可能会面临巨大的挑战。这可能会给每个时间点增加独立的噪声源。ST 数据轨迹推断的开创性方法主要侧重于解决物理排列中的批次效应,即组织在不同时间点以不同方式变形。然而,由于 ST 技术的测量粒度以及切片产生的偏差,也带来了其他挑战。在本综述中,我们研究了这些挑战的来源,并探讨了当前最先进的 STTI 方法如何应对这些挑战。最后,我们强调了未来方法发展的一些机遇。
{"title":"Spatial Transcriptomics Brings New Challenges and Opportunities for Trajectory Inference.","authors":"Matthieu Heitz, Yujia Ma, Sharvaj Kubal, Geoffrey Schiebinger","doi":"10.1146/annurev-biodatasci-040324-030052","DOIUrl":"10.1146/annurev-biodatasci-040324-030052","url":null,"abstract":"<p><p>Spatial transcriptomics (ST) brings new dimensions to the analysis of single-cell data. While some methods for data analysis can be ported over without major modifications, they are the exception rather than the rule. Trajectory inference (TI) methods in particular can suffer from significant challenges due to spatial batch effects in ST data. These can add independent sources of noise to each time point. Pioneering methods for TI on ST data have focused primarily on addressing the batch effects in physical arrangement, i.e., where tissues are deformed in different ways at different time points. However, other challenges arise due to the measurement granularity of ST technologies, as well as a bias from slicing. In this review, we examine the sources of these challenges, and we explore how they are addressed with current state-of-the-art STTI methods. We conclude by highlighting some opportunities for future method development.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"1-19"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142628467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative Artificial Intelligence in Medicine. 医学中的生成式人工智能。
IF 6 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-03-18 DOI: 10.1146/annurev-biodatasci-103123-095332
Divya Shanmugam, Monica Agrawal, Rajiv Movva, Irene Y Chen, Marzyeh Ghassemi, Maia Jacobs, Emma Pierson

The increased capabilities of generative artificial intelligence (AI) have dramatically expanded its possible use cases in medicine. We provide a comprehensive overview of generative AI use cases for clinicians, patients, clinical trial organizers, researchers, and trainees. We then discuss the many challenges-including maintaining privacy and security, improving transparency and interpretability, upholding equity, and rigorously evaluating models-that must be overcome to realize this potential, as well as the open research directions they give rise to.

生成式人工智能(AI)的能力不断增强,极大地扩展了其在医学领域的可能用例。我们为临床医生、患者、临床试验组织者、研究人员和受训者提供生成式人工智能用例的全面概述。然后我们讨论了许多挑战,包括维护隐私和安全,提高透明度和可解释性,维护公平,严格评估模型,必须克服这些挑战才能实现这一潜力,以及它们所带来的开放研究方向。
{"title":"Generative Artificial Intelligence in Medicine.","authors":"Divya Shanmugam, Monica Agrawal, Rajiv Movva, Irene Y Chen, Marzyeh Ghassemi, Maia Jacobs, Emma Pierson","doi":"10.1146/annurev-biodatasci-103123-095332","DOIUrl":"10.1146/annurev-biodatasci-103123-095332","url":null,"abstract":"<p><p>The increased capabilities of generative artificial intelligence (AI) have dramatically expanded its possible use cases in medicine. We provide a comprehensive overview of generative AI use cases for clinicians, patients, clinical trial organizers, researchers, and trainees. We then discuss the many challenges-including maintaining privacy and security, improving transparency and interpretability, upholding equity, and rigorously evaluating models-that must be overcome to realize this potential, as well as the open research directions they give rise to.</p>","PeriodicalId":29775,"journal":{"name":"Annual Review of Biomedical Data Science","volume":" ","pages":"199-226"},"PeriodicalIF":6.0,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143658878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Review of Biomedical Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1