人工智能时代的药物发现与开发：从机器学习到大型语言模型

Artificial intelligence chemistry Pub Date : 2024-06-01 Epub Date: 2024-05-09 DOI:10.1016/j.aichem.2024.100070

Shenghui Guan , Guanyu Wang

{"title":"人工智能时代的药物发现与开发：从机器学习到大型语言模型","authors":"Shenghui Guan , Guanyu Wang","doi":"10.1016/j.aichem.2024.100070","DOIUrl":null,"url":null,"abstract":"<div><p>Drug Research and Development (R&D) is a complex and difficult process, and current drug R&D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100070"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000289/pdfft?md5=3f360afc13a60f15f7dab8b6a1dd740b&pid=1-s2.0-S2949747724000289-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Drug discovery and development in the era of artificial intelligence: From machine learning to large language models\",\"authors\":\"Shenghui Guan , Guanyu Wang\",\"doi\":\"10.1016/j.aichem.2024.100070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Drug Research and Development (R&D) is a complex and difficult process, and current drug R&D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.</p></div>\",\"PeriodicalId\":72302,\"journal\":{\"name\":\"Artificial intelligence chemistry\",\"volume\":\"2 1\",\"pages\":\"Article 100070\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2949747724000289/pdfft?md5=3f360afc13a60f15f7dab8b6a1dd740b&pid=1-s2.0-S2949747724000289-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence chemistry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949747724000289\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/5/9 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence chemistry","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949747724000289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/9 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

药物研发（R&D）是一个复杂而艰难的过程，目前的药物研发面临着时间跨度长、投资大、失败率高等挑战。机器学习以其对大数据和复杂网络的强大学习能力，在提高药物研发的效率和成功率方面发挥着越来越大的作用。在此，我们回顾了机器学习方法在疾病基因预测、虚拟筛选、药物分子生成、分子属性预测和药物组合协同性预测等六个领域的最新应用实例。我们还讨论了整合学习在多属性预测中的优势。基于不同维度数据构建的基础学习器的整合模型，一方面充分利用了这些数据所包含的信息，另一方面提高了平均预测性能。最后，我们设想了一种新的药物发现和开发范式：以大型语言模型为中心枢纽，将公共资源组织成知识库，通过计算软件和小型预测模型以及基于有机体技术的高通量自动筛选平台来验证知识，从而加快开发速度，缩小疾病模型与人体之间的药效差异，提高药物的成功率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Drug discovery and development in the era of artificial intelligence: From machine learning to large language models

Drug Research and Development (R&D) is a complex and difficult process, and current drug R&D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Artificial intelligence chemistry Chemistry (General)

自引率

0.00%

发文量

审稿时长

21 days