首页 > 最新文献

Artificial intelligence chemistry最新文献

英文 中文
Molecular similarity: Theory, applications, and perspectives 分子相似性:理论、应用与展望
Pub Date : 2024-08-31 DOI: 10.1016/j.aichem.2024.100077

Molecular similarity pervades much of our understanding and rationalization of chemistry. This has become particularly evident in the current data-intensive era of chemical research, with similarity measures serving as the backbone of many Machine Learning (ML) supervised and unsupervised procedures. Here, we present a discussion on the role of molecular similarity in drug design, chemical space exploration, chemical “art” generation, molecular representations, and many more. We also discuss more recent topics in molecular similarity, like the ability to efficiently compare large molecular libraries.

分子相似性贯穿了我们对化学的大部分理解和合理化。在当前数据密集型的化学研究时代,这一点变得尤为明显,相似性度量成为许多机器学习(ML)监督和非监督程序的支柱。在此,我们将讨论分子相似性在药物设计、化学空间探索、化学 "艺术 "生成、分子表征等方面的作用。我们还讨论了分子相似性的最新话题,如高效比较大型分子库的能力。
{"title":"Molecular similarity: Theory, applications, and perspectives","authors":"","doi":"10.1016/j.aichem.2024.100077","DOIUrl":"10.1016/j.aichem.2024.100077","url":null,"abstract":"<div><p>Molecular similarity pervades much of our understanding and rationalization of chemistry. This has become particularly evident in the current data-intensive era of chemical research, with similarity measures serving as the backbone of many Machine Learning (ML) supervised and unsupervised procedures. Here, we present a discussion on the role of molecular similarity in drug design, chemical space exploration, chemical “art” generation, molecular representations, and many more. We also discuss more recent topics in molecular similarity, like the ability to efficiently compare large molecular libraries.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000356/pdfft?md5=7238a1972b367d1732b52f425b046ba9&pid=1-s2.0-S2949747724000356-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142150935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large-language models: The game-changers for materials science research 大型语言模型:改变材料科学研究的游戏规则
Pub Date : 2024-08-24 DOI: 10.1016/j.aichem.2024.100076

Large Language Models (LLMs), such as GPT-4, are precipitating a new "industrial revolution" by significantly enhancing productivity across various domains. These models encode an extensive corpus of scientific knowledge from vast textual datasets, functioning as near-universal generalists with the ability to engage in natural language communication and exhibit advanced reasoning capabilities. Notably, agents derived from LLMs can comprehend user intent and autonomously design, plan, and utilize tools to execute intricate tasks. These attributes are particularly advantageous for materials science research, an interdisciplinary field characterized by numerous complex and time-intensive activities. The integration of LLMs into materials science research holds the potential to fundamentally transform the research paradigm in this field.

大型语言模型(LLM),如 GPT-4,通过显著提高各领域的生产力,正在催生一场新的 "工业革命"。这些模型从庞大的文本数据集中编码了大量科学知识,可作为近乎万能的通才发挥作用,能够进行自然语言交流并展现高级推理能力。值得注意的是,由 LLM 衍生出的代理可以理解用户意图,并自主设计、规划和使用工具来执行复杂的任务。这些特性对于材料科学研究尤为有利,因为材料科学研究是一个跨学科领域,涉及众多复杂且耗时的活动。将 LLM 融入材料科学研究,有可能从根本上改变该领域的研究模式。
{"title":"Large-language models: The game-changers for materials science research","authors":"","doi":"10.1016/j.aichem.2024.100076","DOIUrl":"10.1016/j.aichem.2024.100076","url":null,"abstract":"<div><p>Large Language Models (LLMs), such as GPT-4, are precipitating a new \"industrial revolution\" by significantly enhancing productivity across various domains. These models encode an extensive corpus of scientific knowledge from vast textual datasets, functioning as near-universal generalists with the ability to engage in natural language communication and exhibit advanced reasoning capabilities. Notably, agents derived from LLMs can comprehend user intent and autonomously design, plan, and utilize tools to execute intricate tasks. These attributes are particularly advantageous for materials science research, an interdisciplinary field characterized by numerous complex and time-intensive activities. The integration of LLMs into materials science research holds the potential to fundamentally transform the research paradigm in this field.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000344/pdfft?md5=e80906f3aecc3736b5e0dcac5da9017c&pid=1-s2.0-S2949747724000344-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142095479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conf-GEM: A geometric information-assisted direct conformation generation model Conf-GEM:几何信息辅助直接构象生成模型
Pub Date : 2024-07-27 DOI: 10.1016/j.aichem.2024.100074

Molecular conformations generation (MCG) aims to efficiently obtain reasonable and stable three-dimensional (3D) atomic coordinates of the atoms in the molecule from scratch, providing a structural foundation for molecular representation learning models and advanced downstream molecular design tasks such as molecular property prediction, molecular generation, and molecular docking. Existing MCG methods mostly rely on indirect distance-based strategies, which which can result in geometrically unrealistic conformations, or direct coordinate-based methods, which have larger search spaces and are prone to overfitting. Therefore, this study introduces Conf-GEM, a novel geometric information-assisted direct conformation generation model based on E-GeoGNN, a geometrically augmented 3D graph neural network with multiple scales. Pre-training and divide-and-conquer strategies, are integrated into the proposed model. Conf-GEM outperforms RDKit and nine deep-learning-based MCG models on the GEOM-QM9 and GEOM-Drugs datasets, achieving conformational coverage of 96.69% and 96.07%, respectively, without force field optimization. It also excels on the X-ray diffraction crystal structure dataset with up to 97.04% conformational coverage. In conclusion, Conf-GEM provides a novel solution for stabilizing 3D conformations generation. We provide an online prediction service (https://confgem.cmdrg.com) with a user-friendly interface for researchers.

分子构象生成(MCG)旨在从零开始有效地获得分子中原子的合理而稳定的三维(3D)原子坐标,为分子表征学习模型和高级下游分子设计任务(如分子性质预测、分子生成和分子对接)提供结构基础。现有的 MCG 方法大多依赖于基于间接距离的策略,这可能导致几何构象不切实际;或基于直接坐标的方法,其搜索空间较大,容易出现过拟合。因此,本研究介绍了一种基于 E-GeoGNN 的新型几何信息辅助直接构象生成模型 Conf-GEM,E-GeoGNN 是一种几何增强的多尺度三维图神经网络。预训练和分而治之策略被集成到了所提出的模型中。在 GEOM-QM9 和 GEOM-Drugs 数据集上,Conf-GEM 的表现优于 RDKit 和九种基于深度学习的 MCG 模型,在不进行力场优化的情况下,构象覆盖率分别达到 96.69% 和 96.07%。它在 X 射线衍射晶体结构数据集上也表现出色,构象覆盖率高达 97.04%。总之,Conf-GEM 为稳定三维构象的生成提供了一种新的解决方案。我们为研究人员提供了具有友好用户界面的在线预测服务(https://confgem.cmdrg.com)。
{"title":"Conf-GEM: A geometric information-assisted direct conformation generation model","authors":"","doi":"10.1016/j.aichem.2024.100074","DOIUrl":"10.1016/j.aichem.2024.100074","url":null,"abstract":"<div><p>Molecular conformations generation (MCG) aims to efficiently obtain reasonable and stable three-dimensional (3D) atomic coordinates of the atoms in the molecule from scratch, providing a structural foundation for molecular representation learning models and advanced downstream molecular design tasks such as molecular property prediction, molecular generation, and molecular docking. Existing MCG methods mostly rely on indirect distance-based strategies, which which can result in geometrically unrealistic conformations, or direct coordinate-based methods, which have larger search spaces and are prone to overfitting. Therefore, this study introduces Conf-GEM, a novel geometric information-assisted direct conformation generation model based on E-GeoGNN, a geometrically augmented 3D graph neural network with multiple scales. Pre-training and divide-and-conquer strategies, are integrated into the proposed model. Conf-GEM outperforms RDKit and nine deep-learning-based MCG models on the GEOM-QM9 and GEOM-Drugs datasets, achieving conformational coverage of 96.69% and 96.07%, respectively, without force field optimization. It also excels on the X-ray diffraction crystal structure dataset with up to 97.04% conformational coverage. In conclusion, Conf-GEM provides a novel solution for stabilizing 3D conformations generation. We provide an online prediction service (<span><span>https://confgem.cmdrg.com</span><svg><path></path></svg></span>) with a user-friendly interface for researchers.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000320/pdfft?md5=48affbdd2252ef50c6eb12dedcdeacc7&pid=1-s2.0-S2949747724000320-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141845096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Top 20 influential AI-based technologies in chemistry 化学领域最具影响力的 20 项人工智能技术
Pub Date : 2024-07-27 DOI: 10.1016/j.aichem.2024.100075

The beginning and ripening of digital chemistry is analyzed focusing on the role of artificial intelligence (AI) in an expected leap in chemical sciences to bring this area to the next evolutionary level. The analytic description selects and highlights the top 20 AI-based technologies and 7 broader themes that are reshaping the field. It underscores the integration of digital tools such as machine learning, big data, digital twins, the Internet of Things (IoT), robotic platforms, smart control of chemical processes, virtual reality and blockchain, among many others, in enhancing research methods, educational approaches, and industrial practices in chemistry. The significance of this study lies in its focused overview of how these digital innovations foster a more efficient, sustainable, and innovative future in chemical sciences. This article not only illustrates the transformative impact of these technologies but also draws new pathways in chemistry, offering a broad appeal to researchers, educators, and industry professionals to embrace these advancements for addressing contemporary challenges in the field.

本报告分析了数字化学的起步与成熟,重点关注人工智能(AI)在化学科学领域预期飞跃中的作用,从而将这一领域带入下一个发展阶段。分析报告选择并强调了基于人工智能的 20 大技术和 7 大主题,这些技术和主题正在重塑化学领域。它强调了机器学习、大数据、数字双胞胎、物联网(IoT)、机器人平台、化学过程智能控制、虚拟现实和区块链等数字工具在加强化学研究方法、教育方法和工业实践方面的整合。本研究的意义在于,它重点概述了这些数字创新如何促进化学科学更高效、可持续和创新的未来。这篇文章不仅说明了这些技术的变革性影响,还为化学领域开辟了新的道路,广泛呼吁研究人员、教育工作者和行业专业人士拥抱这些进步,以应对该领域的当代挑战。
{"title":"Top 20 influential AI-based technologies in chemistry","authors":"","doi":"10.1016/j.aichem.2024.100075","DOIUrl":"10.1016/j.aichem.2024.100075","url":null,"abstract":"<div><p>The beginning and ripening of digital chemistry is analyzed focusing on the role of artificial intelligence (AI) in an expected leap in chemical sciences to bring this area to the next evolutionary level. The analytic description selects and highlights the top 20 AI-based technologies and 7 broader themes that are reshaping the field. It underscores the integration of digital tools such as machine learning, big data, digital twins, the Internet of Things (IoT), robotic platforms, smart control of chemical processes, virtual reality and blockchain, among many others, in enhancing research methods, educational approaches, and industrial practices in chemistry. The significance of this study lies in its focused overview of how these digital innovations foster a more efficient, sustainable, and innovative future in chemical sciences. This article not only illustrates the transformative impact of these technologies but also draws new pathways in chemistry, offering a broad appeal to researchers, educators, and industry professionals to embrace these advancements for addressing contemporary challenges in the field.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000332/pdfft?md5=a101cdd9b75aa2e13939289fee50e2d5&pid=1-s2.0-S2949747724000332-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141849804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User-friendly and industry-integrated AI for medicinal chemists and pharmaceuticals 面向药物化学家和制药业的用户友好型工业集成人工智能
Pub Date : 2024-07-14 DOI: 10.1016/j.aichem.2024.100072

Artificial intelligence has brought crucial changes to the whole field of natural sciences. Myriads of machine learning algorithms have been developed to facilitate the work of experimental scientists. Molecular property prediction and drug synthesis planning become routine tasks. Moreover, inverse design of compounds with tunable properties as well as on-the-fly autonomous process optimization and chemical space exploration became possible in silico. Affordable robotic platforms exist able to perform thousands of experiments every day, analyzing the results and tuning the protocols. Despite this, most of these developments get trapped at the stage of code or overlooked, limiting their use by experimental scientists. Meanwhile, visibility and the number of user-friendly tools and technologies available to date is too low to compensate for this fact, rendering the development of novel therapeutic compounds inefficient. In this Review, we set the goal to bridge the gap between modern technologies and experimental scientists to improve drug development efficacy. Here we survey advanced and easy-to-use technologies able to help medical chemists at every stage of their research, including those integrated in technological processes during COVID-19 pandemic motivated by the need for fast yet precise solutions. Moreover, we review how these technologies are integrated by industry and clinics to streamline drug development and production. These technologies already transform the current paradigm of scientific thinking and revolutionize not only medicinal chemistry, but the whole field of natural sciences.

人工智能给整个自然科学领域带来了至关重要的变化。无数机器学习算法应运而生,为实验科学家的工作提供了便利。分子性质预测和药物合成规划已成为常规任务。此外,反向设计具有可调特性的化合物,以及实时自主优化工艺和探索化学空间,都已成为可能。经济实惠的机器人平台能够每天进行数千次实验,分析实验结果并调整实验方案。尽管如此,这些开发成果大多停留在代码阶段或被忽视,限制了实验科学家对它们的使用。与此同时,迄今为止可用的用户友好型工具和技术的知名度和数量太低,无法弥补这一事实,导致新型治疗化合物的开发效率低下。在这篇综述中,我们的目标是弥合现代技术与实验科学家之间的差距,提高药物开发的效率。在此,我们调查了能够在研究的每个阶段为医学化学家提供帮助的先进且易于使用的技术,包括在 COVID-19 大流行期间因需要快速而精确的解决方案而整合到技术流程中的技术。此外,我们还回顾了工业和诊所如何将这些技术整合到一起,以简化药物开发和生产过程。这些技术已经改变了当前的科学思维模式,不仅彻底改变了药物化学,而且改变了整个自然科学领域。
{"title":"User-friendly and industry-integrated AI for medicinal chemists and pharmaceuticals","authors":"","doi":"10.1016/j.aichem.2024.100072","DOIUrl":"10.1016/j.aichem.2024.100072","url":null,"abstract":"<div><p>Artificial intelligence has brought crucial changes to the whole field of natural sciences. Myriads of machine learning algorithms have been developed to facilitate the work of experimental scientists. Molecular property prediction and drug synthesis planning become routine tasks. Moreover, inverse design of compounds with tunable properties as well as on-the-fly autonomous process optimization and chemical space exploration became possible <em>in silico</em>. Affordable robotic platforms exist able to perform thousands of experiments every day, analyzing the results and tuning the protocols. Despite this, most of these developments get trapped at the stage of code or overlooked, limiting their use by experimental scientists. Meanwhile, visibility and the number of user-friendly tools and technologies available to date is too low to compensate for this fact, rendering the development of novel therapeutic compounds inefficient. In this Review, we set the goal to bridge the gap between modern technologies and experimental scientists to improve drug development efficacy. Here we survey advanced and easy-to-use technologies able to help medical chemists at every stage of their research, including those integrated in technological processes during COVID-19 pandemic motivated by the need for fast yet precise solutions. Moreover, we review how these technologies are integrated by industry and clinics to streamline drug development and production. These technologies already transform the current paradigm of scientific thinking and revolutionize not only medicinal chemistry, but the whole field of natural sciences.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000307/pdfft?md5=6e0dd6833ec368337f7792d55171a0b8&pid=1-s2.0-S2949747724000307-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141691742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive approach utilizing quantum machine learning in the study of corrosion inhibition on quinoxaline compounds 利用量子机器学习研究喹喔啉化合物缓蚀作用的综合方法
Pub Date : 2024-07-10 DOI: 10.1016/j.aichem.2024.100073

In this investigation, a quantitative structure-property relationship (QSPR) model coupled with a quantum neural network (QNN) was used to explore the corrosion inhibition efficiency (CIE) of quinoxaline compounds. Integrating quantum chemical properties (QCP) features reduced computational burden by strategically reducing the features from 11 to 4 while maintaining prediction accuracy. QNN models outperform traditional methods like artificial neural networks (ANN) and multilayer perceptron neural networks (MLPNN), with a coefficient of determination (R2) value of 0.987, coupled with diminished root mean square error (RMSE), mean absolute error (MAE), and mean absolute deviation (MAD) values of 0.97, 0.92, and 1.10, respectively. Predictions for six newly synthesized quinoxaline derivatives: quinoxaline-6-carboxylic acid (Q1), methyl quinoxaline-6-carboxylate (Q2), (2E,3E)-2,3-dihydrazono-1,2,3,4-tetrahydroquinoxaline (Q3), (2E,3E) 2,3-dihydrazono-6-methyl-1,2,3,4-tetrahydroquinoxaline (Q4), (E)-3-(4-methoxyethyl)-7-methylquinoxalin-2(1 H)-one (Q5), and 2-(4-methoxyphenyl)-7-methylthieno[3,2-b] quinoxaline (Q6), show remarkable CIE values of 95.12, 96.72, 91.02, 92.43, 89.58, and 93.63 %, respectively. This breakthrough technique simplifies testing and production procedures for new anti-corrosion materials.

在这项研究中,定量结构-性质关系(QSPR)模型与量子神经网络(QNN)相结合,用于探索喹喔啉化合物的缓蚀效率(CIE)。在保持预测准确性的同时,将量子化学特性(QCP)特征从 11 个战略性地减少到 4 个,从而减轻了计算负担。QNN 模型优于人工神经网络(ANN)和多层感知器神经网络(MLPNN)等传统方法,其决定系数(R2)为 0.987,均方根误差(RMSE)、平均绝对误差(MAE)和平均绝对偏差(MAD)分别为 0.97、0.92 和 1.10。六种新合成的喹喔啉衍生物的预测值:4-四氢喹喔啉 (Q4)、(E)-3-(4-甲氧基乙基)-7-甲基喹喔啉-2(1H)-酮 (Q5) 和 2-(4-甲氧基苯基)-7-甲基噻吩并[3,2-b] 喹喔啉 (Q6),显示出显著的 CIE 值 95.12、96.72、91.02、92.43、89.58 和 93.63 %。这一突破性技术简化了新型防腐材料的测试和生产程序。
{"title":"A comprehensive approach utilizing quantum machine learning in the study of corrosion inhibition on quinoxaline compounds","authors":"","doi":"10.1016/j.aichem.2024.100073","DOIUrl":"10.1016/j.aichem.2024.100073","url":null,"abstract":"<div><p>In this investigation, a quantitative structure-property relationship (QSPR) model coupled with a quantum neural network (QNN) was used to explore the corrosion inhibition efficiency (CIE) of quinoxaline compounds. Integrating quantum chemical properties (QCP) features reduced computational burden by strategically reducing the features from 11 to 4 while maintaining prediction accuracy. QNN models outperform traditional methods like artificial neural networks (ANN) and multilayer perceptron neural networks (MLPNN), with a coefficient of determination (R<sup>2</sup>) value of 0.987, coupled with diminished root mean square error (RMSE), mean absolute error (MAE), and mean absolute deviation (MAD) values of 0.97, 0.92, and 1.10, respectively. Predictions for six newly synthesized quinoxaline derivatives: quinoxaline-6-carboxylic acid <strong>(Q1)</strong>, methyl quinoxaline-6-carboxylate <strong>(Q2)</strong>, (2<em>E</em>,3<em>E</em>)-2,3-dihydrazono-1,2,3,4-tetrahydroquinoxaline <strong>(Q3)</strong>, (2<em>E</em>,3<em>E</em>) 2,3-dihydrazono-6-methyl-1,2,3,4-tetrahydroquinoxaline <strong>(Q4)</strong>, (<em>E</em>)-3-(4-methoxyethyl)-7-methylquinoxalin-2(1 H)-one <strong>(Q5)</strong>, and 2-(4-methoxyphenyl)-7-methylthieno[3,2-<em>b</em>] quinoxaline <strong>(Q6)</strong>, show remarkable CIE values of 95.12, 96.72, 91.02, 92.43, 89.58, and 93.63 %, respectively. This breakthrough technique simplifies testing and production procedures for new anti-corrosion materials.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000319/pdfft?md5=a7bac287a83d3748d5afc1de5b5f817c&pid=1-s2.0-S2949747724000319-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence for drug repurposing against infectious diseases 人工智能为防治传染病重新设计药物用途
Pub Date : 2024-06-12 DOI: 10.1016/j.aichem.2024.100071
Anuradha Singh

Traditional drug discovery struggles to keep pace with the ever-evolving threat of infectious diseases. New viruses and antibiotic-resistant bacteria, all demand rapid solutions. Artificial Intelligence (AI) offers a promising path forward through accelerated drug repurposing. AI allows researchers to analyze massive datasets, revealing hidden connections between existing drugs, disease targets, and potential treatments. This approach boasts several advantages. First, repurposing existing drugs leverages established safety data and reduces development time and costs. Second, AI can broaden the search for effective therapies by identifying unexpected connections between drugs and potential new targets. Finally, AI can help mitigate limitations by predicting and minimizing side effects, optimizing drugs for repurposing, and navigating intellectual property hurdles. The article explores specific AI strategies like virtual screening, target identification, structure base drug design and natural language processing. Real-world examples highlight the potential of AI-driven drug repurposing in discovering new treatments for infectious diseases.

传统的药物研发难以跟上不断发展的传染病威胁的步伐。新型病毒和抗生素耐药细菌都需要快速的解决方案。人工智能(AI)通过加速药物再利用,提供了一条充满希望的前进之路。人工智能使研究人员能够分析海量数据集,揭示现有药物、疾病靶点和潜在治疗方法之间隐藏的联系。这种方法有几个优势。首先,对现有药物进行再利用可以利用已有的安全性数据,并减少开发时间和成本。其次,人工智能可以发现药物与潜在新靶点之间意想不到的联系,从而扩大有效疗法的搜索范围。最后,人工智能可以通过预测和尽量减少副作用、优化药物的再利用以及克服知识产权障碍来帮助减少局限性。文章探讨了虚拟筛选、靶点识别、结构基础药物设计和自然语言处理等具体的人工智能策略。真实世界的例子突出了人工智能驱动的药物再利用在发现传染病新疗法方面的潜力。
{"title":"Artificial intelligence for drug repurposing against infectious diseases","authors":"Anuradha Singh","doi":"10.1016/j.aichem.2024.100071","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100071","url":null,"abstract":"<div><p>Traditional drug discovery struggles to keep pace with the ever-evolving threat of infectious diseases. New viruses and antibiotic-resistant bacteria, all demand rapid solutions. Artificial Intelligence (AI) offers a promising path forward through accelerated drug repurposing. AI allows researchers to analyze massive datasets, revealing hidden connections between existing drugs, disease targets, and potential treatments. This approach boasts several advantages. First, repurposing existing drugs leverages established safety data and reduces development time and costs. Second, AI can broaden the search for effective therapies by identifying unexpected connections between drugs and potential new targets. Finally, AI can help mitigate limitations by predicting and minimizing side effects, optimizing drugs for repurposing, and navigating intellectual property hurdles. The article explores specific AI strategies like virtual screening, target identification, structure base drug design and natural language processing. Real-world examples highlight the potential of AI-driven drug repurposing in discovering new treatments for infectious diseases.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000290/pdfft?md5=061eb4d8766a2284afb22976a6d28b51&pid=1-s2.0-S2949747724000290-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141322997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drug discovery and development in the era of artificial intelligence: From machine learning to large language models 人工智能时代的药物发现与开发:从机器学习到大型语言模型
Pub Date : 2024-05-09 DOI: 10.1016/j.aichem.2024.100070
Shenghui Guan , Guanyu Wang

Drug Research and Development (R&D) is a complex and difficult process, and current drug R&D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.

药物研发(R&D)是一个复杂而艰难的过程,目前的药物研发面临着时间跨度长、投资大、失败率高等挑战。机器学习以其对大数据和复杂网络的强大学习能力,在提高药物研发的效率和成功率方面发挥着越来越大的作用。在此,我们回顾了机器学习方法在疾病基因预测、虚拟筛选、药物分子生成、分子属性预测和药物组合协同性预测等六个领域的最新应用实例。我们还讨论了整合学习在多属性预测中的优势。基于不同维度数据构建的基础学习器的整合模型,一方面充分利用了这些数据所包含的信息,另一方面提高了平均预测性能。最后,我们设想了一种新的药物发现和开发范式:以大型语言模型为中心枢纽,将公共资源组织成知识库,通过计算软件和小型预测模型以及基于有机体技术的高通量自动筛选平台来验证知识,从而加快开发速度,缩小疾病模型与人体之间的药效差异,提高药物的成功率。
{"title":"Drug discovery and development in the era of artificial intelligence: From machine learning to large language models","authors":"Shenghui Guan ,&nbsp;Guanyu Wang","doi":"10.1016/j.aichem.2024.100070","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100070","url":null,"abstract":"<div><p>Drug Research and Development (R&amp;D) is a complex and difficult process, and current drug R&amp;D faces the challenges of long time span, high investment, and high failure rate. Machine learning, with its powerful learning ability to characterize big data and complex networks, is increasingly effective to improve the efficiency and success rate of drug R&amp;D. Here we review some recent examples of the application of machine learning methods in six areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models based on base learners constructed from data of different dimensions on the one hand fully utilize the information contained in these data, and on the other hand improve the average prediction performance. Finally, we envision a new paradigm for drug discovery and development: a large language model acts as a central hub to organize public resources into a knowledge base, validating the knowledge with computational software and smaller predictive models, as well as high-throughput automated screening platforms based on organoidal technologies, to speed up development and reduce the differences in efficacy between disease models and humans to improve the success rate of a drug.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000289/pdfft?md5=3f360afc13a60f15f7dab8b6a1dd740b&pid=1-s2.0-S2949747724000289-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140906077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial-temporal self-attention network based on bayesian optimization for light olefins yields prediction in methanol-to-olefins process 基于贝叶斯优化的时空自我关注网络,用于预测甲醇制烯烃过程中的轻质烯烃产量
Pub Date : 2024-04-30 DOI: 10.1016/j.aichem.2024.100067
Jibin Zhou , Duiping Liu , Mao Ye , Zhongmin Liu

Methanol-to-olefins (MTO), as an alternative pathway for the synthesis of light olefins (ethylene and propylene), has gained extensive attention. Accurate prediction of light olefins yields can effectively facilitate process monitoring and optimization, as they are significant economic indexes and stable operation indicators of the industrial MTO process. However, the nonlinearity and dynamic interactions among process variables pose challenges for the prediction using traditional statistical methods. Additionally, physical-based methods relying on first-principle theory are always limited by an insufficient understanding of reaction mechanisms. In contrast, data-driven methods offer a viable solution for the prediction based solely on process data without requiring extensive process knowledge. Therefore, in this work, a data-driven approach that integrates spatial and temporal self-attention modules is proposed to capture complex interactions. Furthermore, Bayesian optimization is employed to determine the optimum hyperparameters and enhance the accuracy of the model. Studies on an actual MTO process demonstrate the superior prediction performance of the proposed model compared to baseline models. Specifically, 24 process variables are selected as the high-dimensional inputs, and yields of ethylene and propylene, as the low-dimensional outputs, are successfully predicted at various prediction horizons ranging from 2 to 8 h.

甲醇制烯烃(MTO)作为合成轻质烯烃(乙烯和丙烯)的替代途径,已受到广泛关注。轻烯烃产率是工业 MTO 工艺的重要经济指标和稳定运行指标,准确预测轻烯烃产率可有效促进工艺监控和优化。然而,工艺变量之间的非线性和动态相互作用给使用传统统计方法进行预测带来了挑战。此外,基于第一原理理论的物理方法总是受到对反应机理理解不足的限制。与此相反,数据驱动方法提供了一种可行的解决方案,即仅根据工艺数据进行预测,而无需大量的工艺知识。因此,在这项工作中,提出了一种整合了空间和时间自我关注模块的数据驱动方法,以捕捉复杂的相互作用。此外,还采用了贝叶斯优化法来确定最佳超参数,提高模型的准确性。对实际 MTO 过程的研究表明,与基线模型相比,所提出的模型具有更优越的预测性能。具体来说,选择 24 个过程变量作为高维输入,乙烯和丙烯的产量作为低维输出,在 2 到 8 小时的不同预测时间范围内都能成功预测。
{"title":"Spatial-temporal self-attention network based on bayesian optimization for light olefins yields prediction in methanol-to-olefins process","authors":"Jibin Zhou ,&nbsp;Duiping Liu ,&nbsp;Mao Ye ,&nbsp;Zhongmin Liu","doi":"10.1016/j.aichem.2024.100067","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100067","url":null,"abstract":"<div><p>Methanol-to-olefins (MTO), as an alternative pathway for the synthesis of light olefins (ethylene and propylene), has gained extensive attention. Accurate prediction of light olefins yields can effectively facilitate process monitoring and optimization, as they are significant economic indexes and stable operation indicators of the industrial MTO process. However, the nonlinearity and dynamic interactions among process variables pose challenges for the prediction using traditional statistical methods. Additionally, physical-based methods relying on first-principle theory are always limited by an insufficient understanding of reaction mechanisms. In contrast, data-driven methods offer a viable solution for the prediction based solely on process data without requiring extensive process knowledge. Therefore, in this work, a data-driven approach that integrates spatial and temporal self-attention modules is proposed to capture complex interactions. Furthermore, Bayesian optimization is employed to determine the optimum hyperparameters and enhance the accuracy of the model. Studies on an actual MTO process demonstrate the superior prediction performance of the proposed model compared to baseline models. Specifically, 24 process variables are selected as the high-dimensional inputs, and yields of ethylene and propylene, as the low-dimensional outputs, are successfully predicted at various prediction horizons ranging from 2 to 8 h.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000253/pdfft?md5=23aede3f145af7617d071f10a10c1e3f&pid=1-s2.0-S2949747724000253-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140880534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning for determination of activity of water and activity coefficients of electrolytes in binary solutions 通过机器学习确定二元溶液中水的活度和电解质的活度系数
Pub Date : 2024-04-27 DOI: 10.1016/j.aichem.2024.100069
Guillaume Zante

Activity of water and electrolytes in aqueous solutions is of utmost importance for multiple industrial applications. However, experimental determination of such values is time-consuming, while calculation of activity coefficients using numerical methods is challenging. By training neural networks models on literature data, one could predict activity of water and electrolytes easily, without requiring any experiment. In this paper, multiple descriptors (or features) are compared to predict activity coefficients of electrolytes and activity of water in electrolyte solutions. A neural network based on the Levenberg-Marquardt algorithm (LM-NN) showed high accuracy to calculate values, despite the small size of the training datasets. Both activity coefficients of electrolytes and activity of water in electrolyte solutions can be predicted accurately even on unseen data, using simple descriptors such as electrolyte concentration, ion sizes and charges. However, some discrepancies were observed due to the lack of representativeness of the training dataset. This could be solved by selecting training data sets that are similar (e.g. same group of the periodic table) to the unknown values, or by including available experimental data for the salt considered. The ability of the LM-NN to solve non-linear least square curve fitting problems makes it a good candidate to fit experimental activity coefficient data, with the advantage of simplicity as compared to e-NRTL or UNIQUAC methods. This method paves the way for accurate and quick determination of thermodynamic data for electrolyte solutions (and beyond) using machine learning, without necessitating large training datasets.

水和电解质在水溶液中的活度对多种工业应用至关重要。然而,通过实验确定这些值非常耗时,而使用数值方法计算活度系数又极具挑战性。通过对文献数据进行神经网络模型训练,人们可以轻松预测水和电解质的活性,而无需进行任何实验。本文比较了多种描述符(或特征)来预测电解质的活度系数和电解质溶液中水的活度。基于 Levenberg-Marquardt 算法的神经网络(LM-NN)显示,尽管训练数据集的规模较小,但计算值的准确性很高。使用简单的描述符(如电解质浓度、离子大小和电荷),电解质的活度系数和电解质溶液中水的活度即使在未见过的数据上也能准确预测。不过,由于训练数据集缺乏代表性,也出现了一些差异。要解决这个问题,可以选择与未知值相似(如元素周期表中的同一组)的训练数据集,或加入所考虑盐类的可用实验数据。LM-NN 解决非线性最小平方曲线拟合问题的能力使其成为拟合实验活性系数数据的良好候选方法,与 e-NRTL 或 UNIQUAC 方法相比,它具有简单的优势。这种方法为利用机器学习准确、快速地确定电解质溶液(及其他)的热力学数据铺平了道路,而无需大量的训练数据集。
{"title":"Machine learning for determination of activity of water and activity coefficients of electrolytes in binary solutions","authors":"Guillaume Zante","doi":"10.1016/j.aichem.2024.100069","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100069","url":null,"abstract":"<div><p>Activity of water and electrolytes in aqueous solutions is of utmost importance for multiple industrial applications. However, experimental determination of such values is time-consuming, while calculation of activity coefficients using numerical methods is challenging. By training neural networks models on literature data, one could predict activity of water and electrolytes easily, without requiring any experiment. In this paper, multiple descriptors (or features) are compared to predict activity coefficients of electrolytes and activity of water in electrolyte solutions. A neural network based on the Levenberg-Marquardt algorithm (LM-NN) showed high accuracy to calculate values, despite the small size of the training datasets. Both activity coefficients of electrolytes and activity of water in electrolyte solutions can be predicted accurately even on unseen data, using simple descriptors such as electrolyte concentration, ion sizes and charges. However, some discrepancies were observed due to the lack of representativeness of the training dataset. This could be solved by selecting training data sets that are similar (<em>e.g.</em> same group of the periodic table) to the unknown values, or by including available experimental data for the salt considered. The ability of the LM-NN to solve non-linear least square curve fitting problems makes it a good candidate to fit experimental activity coefficient data, with the advantage of simplicity as compared to e-NRTL or UNIQUAC methods. This method paves the way for accurate and quick determination of thermodynamic data for electrolyte solutions (and beyond) using machine learning, without necessitating large training datasets.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000277/pdfft?md5=e459d6b44dc65480007e2791c28b76ea&pid=1-s2.0-S2949747724000277-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140825119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial intelligence chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1