人工智能时代的药物发现与开发:从机器学习到大型语言模型

Shenghui Guan , Guanyu Wang
{"title":"人工智能时代的药物发现与开发:从机器学习到大型语言模型","authors":"Shenghui Guan ,&nbsp;Guanyu Wang","doi":"10.1016/j.aichem.2024.100070","DOIUrl":null,"url":null,"abstract":"<div><p>Drug Research and Development (R&amp;D) is a complex and difficult process, and current drug R&amp;D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&amp;D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000289/pdfft?md5=3f360afc13a60f15f7dab8b6a1dd740b&pid=1-s2.0-S2949747724000289-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Drug discovery and development in the era of artificial intelligence: From machine learning to large language models\",\"authors\":\"Shenghui Guan ,&nbsp;Guanyu Wang\",\"doi\":\"10.1016/j.aichem.2024.100070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Drug Research and Development (R&amp;D) is a complex and difficult process, and current drug R&amp;D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&amp;D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.</p></div>\",\"PeriodicalId\":72302,\"journal\":{\"name\":\"Artificial intelligence chemistry\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2949747724000289/pdfft?md5=3f360afc13a60f15f7dab8b6a1dd740b&pid=1-s2.0-S2949747724000289-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence chemistry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949747724000289\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence chemistry","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949747724000289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

药物研发(R&D)是一个复杂而艰难的过程,目前的药物研发面临着时间跨度长、投资大、失败率高等挑战。机器学习以其对大数据和复杂网络的强大学习能力,在提高药物研发的效率和成功率方面发挥着越来越大的作用。在此,我们回顾了机器学习方法在疾病基因预测、虚拟筛选、药物分子生成、分子属性预测和药物组合协同性预测等六个领域的最新应用实例。我们还讨论了整合学习在多属性预测中的优势。基于不同维度数据构建的基础学习器的整合模型,一方面充分利用了这些数据所包含的信息,另一方面提高了平均预测性能。最后,我们设想了一种新的药物发现和开发范式:以大型语言模型为中心枢纽,将公共资源组织成知识库,通过计算软件和小型预测模型以及基于有机体技术的高通量自动筛选平台来验证知识,从而加快开发速度,缩小疾病模型与人体之间的药效差异,提高药物的成功率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Drug discovery and development in the era of artificial intelligence: From machine learning to large language models

Drug Research and Development (R&D) is a complex and difficult process, and current drug R&D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Artificial intelligence chemistry
Artificial intelligence chemistry Chemistry (General)
自引率
0.00%
发文量
0
审稿时长
21 days
期刊最新文献
Molecular similarity: Theory, applications, and perspectives Large-language models: The game-changers for materials science research Conf-GEM: A geometric information-assisted direct conformation generation model Top 20 influential AI-based technologies in chemistry User-friendly and industry-integrated AI for medicinal chemists and pharmaceuticals
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1