Artificial intelligence in the life sciences最新文献

英文中文

Pharmaceutical patent landscaping: A novel approach to understand patents from the drug discovery perspective 药物专利景观:一种从药物发现角度理解专利的新方法

Artificial intelligence in the life sciences

Pub Date : 2023-12-01 Epub Date: 2023-03-31 DOI: 10.1016/j.ailsci.2023.100069

Yojana Gadiya , Philip Gribbon , Martin Hofmann-Apitius , Andrea Zaliani

Patents play a crucial role in the drug discovery process by providing legal protection for discoveries and incentivising investments in research and development. By identifying patterns within patent data resources, researchers can gain insight into the market trends and priorities of the pharmaceutical and biotechnology industries, as well as provide additional perspectives on more fundamental aspects such as the emergence of potential new drug targets. In this paper, we used the patent enrichment tool, PEMT, to extract, integrate, and analyse patent literature for rare diseases (RD) and Alzheimer's disease (AD). This is followed by a systematic review of the underlying patent landscape to decipher trends and applications in patents for these diseases. To do so, we discuss prominent organisations involved in drug discovery research in AD and RD. This allows us to gain an understanding of the importance of AD and RD from specific organisational (pharmaceutical or university) perspectives. Next, we analyse the historical focus of patents in relation to individual therapeutic targets and correlate them with market scenarios allowing the identification of prominent targets for a disease. Lastly, we identified drug repurposing activities within the two diseases with the help of patents. This resulted in identifying existing repurposed drugs and novel potential therapeutic approaches applicable to the indication areas. The study demonstrates the expanded applicability of patent documents from legal to drug discovery, design, and research, thus, providing a valuable resource for future drug discovery efforts. Moreover, this study is an attempt towards understanding the importance of data underlying patent documents and raising the need for preparing the data for machine learning-based applications.

专利通过为发现提供法律保护和激励研发投资，在药物发现过程中发挥着至关重要的作用。通过识别专利数据资源中的模式，研究人员可以深入了解制药和生物技术行业的市场趋势和优先事项，并对潜在新药靶点的出现等更基本的方面提供更多的视角。在本文中，我们使用专利富集工具PEMT来提取、整合和分析罕见病（RD）和阿尔茨海默病（AD）的专利文献。接下来是对潜在专利前景的系统审查，以解读这些疾病专利的趋势和应用。为此，我们讨论了参与AD和RD药物发现研究的知名组织。这使我们能够从特定的组织（制药或大学）角度了解AD和RD的重要性。接下来，我们分析了专利与个体治疗靶点相关的历史焦点，并将其与市场情景相关联，从而确定疾病的突出靶点。最后，我们在专利的帮助下确定了这两种疾病中的药物再利用活动。这导致确定了适用于适应症领域的现有再利用药物和新的潜在治疗方法。该研究表明，专利文件的适用性从法律扩展到药物发现、设计和研究，从而为未来的药物发现工作提供了宝贵的资源。此外，这项研究试图理解专利文件中数据的重要性，并提出为基于机器学习的应用准备数据的必要性。

{"title":"Pharmaceutical patent landscaping: A novel approach to understand patents from the drug discovery perspective","authors":"Yojana Gadiya , Philip Gribbon , Martin Hofmann-Apitius , Andrea Zaliani","doi":"10.1016/j.ailsci.2023.100069","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100069","url":null,"abstract":"<div><p>Patents play a crucial role in the drug discovery process by providing legal protection for discoveries and incentivising investments in research and development. By identifying patterns within patent data resources, researchers can gain insight into the market trends and priorities of the pharmaceutical and biotechnology industries, as well as provide additional perspectives on more fundamental aspects such as the emergence of potential new drug targets. In this paper, we used the patent enrichment tool, PEMT, to extract, integrate, and analyse patent literature for rare diseases (RD) and Alzheimer's disease (AD). This is followed by a systematic review of the underlying patent landscape to decipher trends and applications in patents for these diseases. To do so, we discuss prominent organisations involved in drug discovery research in AD and RD. This allows us to gain an understanding of the importance of AD and RD from specific organisational (pharmaceutical or university) perspectives. Next, we analyse the historical focus of patents in relation to individual therapeutic targets and correlate them with market scenarios allowing the identification of prominent targets for a disease. Lastly, we identified drug repurposing activities within the two diseases with the help of patents. This resulted in identifying existing repurposed drugs and novel potential therapeutic approaches applicable to the indication areas. The study demonstrates the expanded applicability of patent documents from legal to drug discovery, design, and research, thus, providing a valuable resource for future drug discovery efforts. Moreover, this study is an attempt towards understanding the importance of data underlying patent documents and raising the need for preparing the data for machine learning-based applications.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100069"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49774974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial intelligence systems for the design of magic shotgun drugs 人工智能系统的神奇猎枪药物设计

Artificial intelligence in the life sciences

Pub Date : 2023-12-01 Epub Date: 2022-12-22 DOI: 10.1016/j.ailsci.2022.100055

José Teófilo Moreira-Filho , Meryck Felipe Brito da Silva , Joyce Villa Verde Bastos Borba , Arlindo Rodrigues Galvão Filho , Eugene N Muratov , Carolina Horta Andrade , Rodolpho de Campos Braga , Bruno Junior Neves

Designing magic shotgun compounds, i.e., compounds hitting multiple targets using artificial intelligence (AI) systems based on machine learning (ML) and deep learning (DL) approaches, has a huge potential to revolutionize drug discovery. Such intelligent systems enable computers to create new chemical structures and predict their multi-target properties at a low cost and in a time-efficient manner. Most examples of AI applied to drug discovery are single-target oriented and there is still a lack of concise information regarding the application of this technology for the discovery of multi-target drugs or drugs with broad-spectrum action. In this review, we focus on current developments in AI systems for the next generation of automated design of multi-target drugs. We discuss how classical ML methods, cutting-edge generative models, and multi-task deep neural networks can help de novo design and hit-to-lead optimization of multi-target drugs. Moreover, we present state-of-the-art workflows and highlight some studies demonstrating encouraging experimental results, which pave the way for de novo drug design and multi-target drug discovery.

设计神奇的霰弹枪化合物，即使用基于机器学习(ML)和深度学习(DL)方法的人工智能(AI)系统击中多个目标的化合物，具有彻底改变药物发现的巨大潜力。这种智能系统使计算机能够以低成本和高效率的方式创造新的化学结构并预测其多目标特性。人工智能应用于药物发现的大多数例子都是单靶点导向的，关于将该技术应用于发现多靶点药物或具有广谱作用的药物方面，仍然缺乏简明的信息。在这篇综述中，我们重点介绍了用于下一代多靶点药物自动化设计的人工智能系统的最新发展。我们讨论了经典的机器学习方法、尖端的生成模型和多任务深度神经网络如何帮助多靶点药物的从头设计和hit-to-lead优化。此外，我们还介绍了最先进的工作流程，并重点介绍了一些展示令人鼓舞的实验结果的研究，这些实验结果为新药物设计和多靶点药物发现铺平了道路。

{"title":"Artificial intelligence systems for the design of magic shotgun drugs","authors":"José Teófilo Moreira-Filho , Meryck Felipe Brito da Silva , Joyce Villa Verde Bastos Borba , Arlindo Rodrigues Galvão Filho , Eugene N Muratov , Carolina Horta Andrade , Rodolpho de Campos Braga , Bruno Junior Neves","doi":"10.1016/j.ailsci.2022.100055","DOIUrl":"10.1016/j.ailsci.2022.100055","url":null,"abstract":"<div><p>Designing magic shotgun compounds, i.e., compounds hitting multiple targets using artificial intelligence (AI) systems based on machine learning (ML) and deep learning (DL) approaches, has a huge potential to revolutionize drug discovery. Such intelligent systems enable computers to create new chemical structures and predict their multi-target properties at a low cost and in a time-efficient manner. Most examples of AI applied to drug discovery are single-target oriented and there is still a lack of concise information regarding the application of this technology for the discovery of multi-target drugs or drugs with broad-spectrum action. In this review, we focus on current developments in AI systems for the next generation of automated design of multi-target drugs. We discuss how classical ML methods, cutting-edge generative models, and multi-task deep neural networks can help <em>de novo</em> design and hit-to-lead optimization of multi-target drugs. Moreover, we present state-of-the-art workflows and highlight some studies demonstrating encouraging experimental results, which pave the way for <em>de novo</em> drug design and multi-target drug discovery.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100055"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43297571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Exploring new horizons: Empowering computer-assisted drug design with few-shot learning 探索新视野：通过少量的注射学习实现计算机辅助药物设计

Artificial intelligence in the life sciences

Pub Date : 2023-09-09 DOI: 10.1016/j.ailsci.2023.100086

Sabrina Silva-Mendonça , Arthur Ricardo de Sousa Vitória , Telma Woerle de Lima , Arlindo Rodrigues Galvão-Filho , Carolina Horta Andrade

Computational approaches have revolutionized the field of drug discovery, collectively known as Computer-Assisted Drug Design (CADD). Advancements in computing power, data generation, digitalization, and artificial intelligence (AI) techniques have played a crucial role in the rise of CADD. These approaches offer numerous benefits, enabling the analysis and interpretation of vast amounts of data from diverse sources, such as genomics, structural information, and clinical trials data. By integrating and analyzing these multiple data sources, researchers can efficiently identify potential drug targets and develop new drug candidates. Among the AI techniques, machine learning (ML) and deep learning (DL) have shown tremendous promise in drug discovery. ML and DL models can effectively utilize experimental data to accurately predict the efficacy and safety of drug candidates. However, despite these advancements, certain areas in drug discovery face data scarcity, particularly in neglected, rare, and emerging viral diseases. Few-shot learning (FSL) is an emerging approach that addresses the challenge of limited data in drug discovery. FSL enables ML models to learn from a small number of examples of a new task, achieving commendable performance by leveraging knowledge learned from related datasets or prior information. It often involves meta-learning, which trains a model to learn how to learn from few data. This ability to quickly adapt to new tasks with low data circumvents the need for extensive training on large datasets. By enabling efficient learning from a small amount of data, few-shot learning has the potential to accelerate the drug discovery process and enhance the success rate of drug development. In this review, we introduce the concept of few-shot learning and its application in drug discovery. Furthermore, we demonstrate the valuable application of few-shot learning in the identification of new drug targets, accurate prediction of drug efficacy, and the design of novel compounds possessing desired biological properties. This comprehensive review draws upon numerous papers from the literature to provide extensive insights into the effectiveness and potential of few-shot learning in these critical areas of drug discovery and development.

计算方法彻底改变了药物发现领域，统称为计算机辅助药物设计（CADD）。计算能力、数据生成、数字化和人工智能（AI）技术的进步在CADD的兴起中发挥了至关重要的作用。这些方法提供了许多好处，能够分析和解释来自不同来源的大量数据，如基因组学、结构信息和临床试验数据。通过整合和分析这些多个数据源，研究人员可以有效地识别潜在的药物靶点并开发新的候选药物。在人工智能技术中，机器学习（ML）和深度学习（DL）在药物发现方面显示出巨大的前景。ML和DL模型可以有效地利用实验数据来准确预测候选药物的疗效和安全性。然而，尽管取得了这些进展，药物发现的某些领域仍面临数据短缺，尤其是在被忽视、罕见和新出现的病毒性疾病方面。少量注射学习（FSL）是一种新兴的方法，可以解决药物发现中数据有限的挑战。FSL使ML模型能够从新任务的少量示例中学习，通过利用从相关数据集或先前信息中学习的知识实现了值得称赞的性能。它通常涉及元学习，它训练模型学习如何从少量数据中学习。这种快速适应低数据新任务的能力避免了在大型数据集上进行广泛训练的需要。通过从少量数据中实现高效学习，少镜头学习有可能加速药物发现过程，提高药物开发的成功率。在这篇综述中，我们介绍了少镜头学习的概念及其在药物发现中的应用。此外，我们展示了少镜头学习在识别新药靶点、准确预测药效以及设计具有所需生物特性的新型化合物方面的宝贵应用。这篇全面的综述借鉴了文献中的大量论文，对少针学习在药物发现和开发的这些关键领域的有效性和潜力提供了广泛的见解。

{"title":"Exploring new horizons: Empowering computer-assisted drug design with few-shot learning","authors":"Sabrina Silva-Mendonça , Arthur Ricardo de Sousa Vitória , Telma Woerle de Lima , Arlindo Rodrigues Galvão-Filho , Carolina Horta Andrade","doi":"10.1016/j.ailsci.2023.100086","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100086","url":null,"abstract":"<div><p>Computational approaches have revolutionized the field of drug discovery, collectively known as Computer-Assisted Drug Design (CADD). Advancements in computing power, data generation, digitalization, and artificial intelligence (AI) techniques have played a crucial role in the rise of CADD. These approaches offer numerous benefits, enabling the analysis and interpretation of vast amounts of data from diverse sources, such as genomics, structural information, and clinical trials data. By integrating and analyzing these multiple data sources, researchers can efficiently identify potential drug targets and develop new drug candidates. Among the AI techniques, machine learning (ML) and deep learning (DL) have shown tremendous promise in drug discovery. ML and DL models can effectively utilize experimental data to accurately predict the efficacy and safety of drug candidates. However, despite these advancements, certain areas in drug discovery face data scarcity, particularly in neglected, rare, and emerging viral diseases. Few-shot learning (FSL) is an emerging approach that addresses the challenge of limited data in drug discovery. FSL enables ML models to learn from a small number of examples of a new task, achieving commendable performance by leveraging knowledge learned from related datasets or prior information. It often involves meta-learning, which trains a model to learn how to learn from few data. This ability to quickly adapt to new tasks with low data circumvents the need for extensive training on large datasets. By enabling efficient learning from a small amount of data, few-shot learning has the potential to accelerate the drug discovery process and enhance the success rate of drug development. In this review, we introduce the concept of few-shot learning and its application in drug discovery. Furthermore, we demonstrate the valuable application of few-shot learning in the identification of new drug targets, accurate prediction of drug efficacy, and the design of novel compounds possessing desired biological properties. This comprehensive review draws upon numerous papers from the literature to provide extensive insights into the effectiveness and potential of few-shot learning in these critical areas of drug discovery and development.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"4 ","pages":"Article 100086"},"PeriodicalIF":0.0,"publicationDate":"2023-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A natural language processing system for the efficient updating of highly curated pathophysiology mechanism knowledge graphs 一个用于高效更新高度策划的病理生理机制知识图的自然语言处理系统

Artificial intelligence in the life sciences

Pub Date : 2023-01-01 DOI: 10.1016/j.ailsci.2023.100078

Negin Sadat Babaiha , Hassan Elsayed , Bide Zhang , Abish Kaladharan , Priya Sethumadhavan , Bruce Schultz , Jürgen Klein , Bruno Freudensprung , Vanessa Lage-Rupprecht , Alpha Tom Kodamullil , Marc Jacobs , Stefan Geissler , Sumit Madan , Martin Hofmann-Apitius

引用次数: 0

An industrial evaluation of proteochemometric modelling: Predicting drug-target affinities for kinases 蛋白化学计量建模的工业评估：预测激酶的药物靶点亲和力

Artificial intelligence in the life sciences

Pub Date : 2023-01-01 DOI: 10.1016/j.ailsci.2023.100079

Astrid Stroobants , Lewis H. Mervin , Ola Engkvist , Graeme R. Robb

引用次数: 0

Coupled encoding methods for antimicrobial peptide prediction: How sensitive is a highly accurate model? 抗菌肽预测的耦合编码方法：高度准确的模型有多敏感？

Artificial intelligence in the life sciences

Pub Date : 2022-12-01 Epub Date: 2022-04-22 DOI: 10.1016/j.ailsci.2022.100034

Ivan Erjavac , Daniela Kalafatovic , Goran Mauša

Current application of machine learning in the process of antimicrobial peptide discovery call for the reduction of the false positive predictions that are produced by the classification models. Considering that the positive predictions of high confidence drive modern experimental design, the model’s sensitivity is crucial to reduce the number of unnecessary in vitro tests. Furthermore, taking into account the expert-based design approaches that employ random mutations on confirmed sequences, the machine learning models are required to distinguish between subtle differences among shuffled sequences. With the goal of reducing the false positive rate and improving sensitivity, we propose a hybrid approach to antimicrobial peptide prediction that utilizes combined encoding models. To this end, we implement models that employ both the physico-chemical features and sequence ordering information to stress the importance of using both representations. We also investigate the usage of binary encoding for peptide representation purposes, a method that is insufficiently represented in related research, which proved to act as a viable low dimensional alternative to the one-hot encoding. Our results, supported by Cochran and McNemar statistical tests and Spearman correlation analysis, indicate that the sequence-based encodings complement the physico-chemical features and their synergic effect yields improvement in terms of every evaluation metric. Finally, the proposed hybrid approach that combines physico-chemical features and binary encoding using logical conjunction was shown to be superior to other single models by a factor of 2.96 in terms of fall-out and up to 6.1% in terms of precision.

当前机器学习在抗菌肽发现过程中的应用要求减少由分类模型产生的假阳性预测。考虑到高置信度的积极预测驱动着现代实验设计，该模型的灵敏度对于减少不必要的体外试验数量至关重要。此外，考虑到基于专家的设计方法在已确认的序列上采用随机突变，机器学习模型需要区分洗牌序列之间的细微差异。为了降低假阳性率和提高敏感性，我们提出了一种利用组合编码模型进行抗菌肽预测的混合方法。为此，我们实现了同时使用物理化学特征和序列排序信息的模型，以强调使用这两种表示的重要性。我们还研究了用于肽表示目的的二进制编码的使用，这是一种在相关研究中没有充分代表的方法，它被证明是一种可行的低维替代单热编码。我们的研究结果得到了Cochran和McNemar统计测试和Spearman相关分析的支持，表明基于序列的编码补充了物理化学特征，它们的协同效应在每个评价指标方面都有所改善。最后，所提出的混合方法结合了物理化学特征和使用逻辑连接的二进制编码，在辐射系数方面优于其他单一模型2.96，在精度方面优于6.1%。

{"title":"Coupled encoding methods for antimicrobial peptide prediction: How sensitive is a highly accurate model?","authors":"Ivan Erjavac , Daniela Kalafatovic , Goran Mauša","doi":"10.1016/j.ailsci.2022.100034","DOIUrl":"10.1016/j.ailsci.2022.100034","url":null,"abstract":"<div><p>Current application of machine learning in the process of antimicrobial peptide discovery call for the reduction of the false positive predictions that are produced by the classification models. Considering that the positive predictions of high confidence drive modern experimental design, the model’s sensitivity is crucial to reduce the number of unnecessary <em>in vitro</em> tests. Furthermore, taking into account the expert-based design approaches that employ random mutations on confirmed sequences, the machine learning models are required to distinguish between subtle differences among shuffled sequences. With the goal of reducing the false positive rate and improving sensitivity, we propose a hybrid approach to antimicrobial peptide prediction that utilizes combined encoding models. To this end, we implement models that employ both the physico-chemical features and sequence ordering information to stress the importance of using both representations. We also investigate the usage of binary encoding for peptide representation purposes, a method that is insufficiently represented in related research, which proved to act as a viable low dimensional alternative to the one-hot encoding. Our results, supported by Cochran and McNemar statistical tests and Spearman correlation analysis, indicate that the sequence-based encodings complement the physico-chemical features and their synergic effect yields improvement in terms of every evaluation metric. Finally, the proposed hybrid approach that combines physico-chemical features and binary encoding using logical conjunction was shown to be superior to other single models by a factor of 2.96 in terms of fall-out and up to 6.1% in terms of precision.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"2 ","pages":"Article 100034"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667318522000058/pdfft?md5=3f5cf3ee0ab97ece8587283b98a0d00f&pid=1-s2.0-S2667318522000058-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49610236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Classification of JAK1 Inhibitors and SAR Research by Machine Learning Methods 基于机器学习方法的JAK1抑制剂分类及SAR研究

Artificial intelligence in the life sciences

Pub Date : 2022-12-01 Epub Date: 2022-06-08 DOI: 10.1016/j.ailsci.2022.100039

Zhenwu Yang , Yujia Tian , Yue Kong , Yushan Zhu , Aixia Yan

Janus kinase 1 (JAK1) is a key regulator of gene transcription, inhibition of JAK1 is an intervention for many diseases including rheumatoid arthritis and Crohn's disease. In this study, we collected a dataset containing 2982 JAK1 inhibitors, characterized molecules by MACCS fingerprints and Morgan fingerprints. We used support vector machine (SVM), decision tree (DT), random forest (RF) and extreme gradient boosting tree (XGBoost) algorithms to build 16 traditional machine learning classification models. Additionally, we utilized deep neural networks (DNN) to develop four deep learning models. The best model (Model 3B) built by RF and Morgan fingerprints achieved the accuracy (ACC) of 93.6% and Mathews correlation coefficient (MCC) of 0.87 on the test set. Furthermore, we made structure–activity relationship (SAR) analyses for JAK1 inhibitors, based on the output from the random forest models. After analyzing the important keys of two types of fingerprints, it was observed that some substructures such as pyrazole, pyrrolotriazolopyrimidine and pyrazolopyrimidine appeared frequently in highly active JAK1 inhibitors.

Janus kinase 1 (JAK1)是基因转录的关键调控因子，抑制JAK1可以干预包括类风湿关节炎和克罗恩病在内的许多疾病。在这项研究中，我们收集了包含2982个JAK1抑制剂的数据集，用MACCS指纹和Morgan指纹对分子进行了表征。采用支持向量机(SVM)、决策树(DT)、随机森林(RF)和极端梯度增强树(XGBoost)算法构建了16个传统的机器学习分类模型。此外，我们利用深度神经网络(DNN)开发了四个深度学习模型。采用RF指纹和Morgan指纹构建的最佳模型(model 3B)在测试集上的准确率(ACC)为93.6%，Mathews相关系数(MCC)为0.87。此外，基于随机森林模型的输出，我们对JAK1抑制剂进行了结构-活性关系(SAR)分析。通过分析两类指纹图谱的重要键，发现高活性JAK1抑制剂中频繁出现吡唑、吡咯三唑嘧啶和吡唑嘧啶等亚结构。

{"title":"Classification of JAK1 Inhibitors and SAR Research by Machine Learning Methods","authors":"Zhenwu Yang , Yujia Tian , Yue Kong , Yushan Zhu , Aixia Yan","doi":"10.1016/j.ailsci.2022.100039","DOIUrl":"https://doi.org/10.1016/j.ailsci.2022.100039","url":null,"abstract":"<div><p>Janus kinase 1 (JAK1) is a key regulator of gene transcription, inhibition of JAK1 is an intervention for many diseases including rheumatoid arthritis and Crohn's disease. In this study, we collected a dataset containing 2982 JAK1 inhibitors, characterized molecules by MACCS fingerprints and Morgan fingerprints. We used support vector machine (SVM), decision tree (DT), random forest (RF) and extreme gradient boosting tree (XGBoost) algorithms to build 16 traditional machine learning classification models. Additionally, we utilized deep neural networks (DNN) to develop four deep learning models. The best model (Model 3B) built by RF and Morgan fingerprints achieved the accuracy (ACC) of 93.6% and Mathews correlation coefficient (MCC) of 0.87 on the test set. Furthermore, we made structure–activity relationship (SAR) analyses for JAK1 inhibitors, based on the output from the random forest models. After analyzing the important keys of two types of fingerprints, it was observed that some substructures such as pyrazole, pyrrolotriazolopyrimidine and pyrazolopyrimidine appeared frequently in highly active JAK1 inhibitors.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"2 ","pages":"Article 100039"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667318522000101/pdfft?md5=2754446c7965603153a27ece060160a4&pid=1-s2.0-S2667318522000101-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91728648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SyntaLinker-Hybrid: A deep learning approach for target specific drug design syntalink - hybrid:一种针对特定靶标药物设计的深度学习方法

Artificial intelligence in the life sciences

Pub Date : 2022-12-01 Epub Date: 2022-04-25 DOI: 10.1016/j.ailsci.2022.100035

Yu Feng , Yuyao Yang , Wenbin Deng , Hongming Chen , Ting Ran

Target specific drug design has attracted much attention in drug discovery. But, it is a great challenge to efficiently explore the target-focused chemical space. Fragment-based drug design (FBDD) has shown its potential to do this thing. In this study, we introduced a deep learning-based fragment linking method, namely SyntaLinker-Hybrid, for target specific molecular generation. By carrying out transfer learning and fragment hybridization, this method allows to generate a great number of linker fragments to assemble given terminal fragments into the molecules with target specificity. This work demonstrates that the method has the capacity to generate target specific structures for various targets. We believe that its application could be extended to a broader target scope.

靶向性药物设计在药物发现领域受到广泛关注。但是，如何有效地探索靶向化学领域是一个巨大的挑战。基于片段的药物设计(FBDD)已经显示出它在这方面的潜力。在这项研究中，我们引入了一种基于深度学习的片段连接方法，即SyntaLinker-Hybrid，用于目标特定分子的生成。该方法通过迁移学习和片段杂交，可以产生大量的连接子片段，将给定的末端片段组装成具有目标特异性的分子。这项工作表明，该方法具有为各种目标生成目标特定结构的能力。我们认为，它的适用可以扩大到更广泛的目标范围。

引用次数: 0

An unsupervised computational pipeline identifies potential repurposable drugs to treat Huntington's disease and multiple sclerosis 一个无监督的计算管道识别潜在的可重复利用的药物治疗亨廷顿氏病和多发性硬化症

Artificial intelligence in the life sciences

Pub Date : 2022-12-01 Epub Date: 2022-07-27 DOI: 10.1016/j.ailsci.2022.100042

Luca Menestrina, Maurizio Recanatini

Drug repurposing consists in identifying additional uses for known drugs and, since these new findings are built on previous knowledge, it reduces both the length and the costs of the drug development. In this work, we assembled an automated computational pipeline for drug repurposing, integrating also a network-based analysis for screening the possible drug combinations. The selection of drugs relies both on their proximity to the disease on the protein-protein interactome and on their influence on the expression of disease-related genes. Combined therapies are then prioritized on the basis of the drugs’ separation on the human interactome and the known drug-drug interactions. We eventually collected a number of molecules, and their plausible combinations, that could be proposed for the treatment of Huntington's disease and multiple sclerosis. Finally, this pipeline could potentially provide new suggestions also for other complex disorders.

药物再利用包括确定已知药物的额外用途，由于这些新发现是建立在以前的知识基础上的，它减少了药物开发的时间和成本。在这项工作中，我们组装了一个用于药物再利用的自动计算管道，并集成了一个基于网络的分析来筛选可能的药物组合。药物的选择既取决于它们与疾病的接近程度，也取决于蛋白质-蛋白质相互作用组，以及它们对疾病相关基因表达的影响。然后根据药物在人体相互作用组上的分离和已知的药物-药物相互作用来优先考虑联合治疗。我们最终收集了一些分子，以及它们的合理组合，这些分子可以用于治疗亨廷顿舞蹈症和多发性硬化症。最后，这个管道也可能为其他复杂疾病提供新的建议。

{"title":"An unsupervised computational pipeline identifies potential repurposable drugs to treat Huntington's disease and multiple sclerosis","authors":"Luca Menestrina, Maurizio Recanatini","doi":"10.1016/j.ailsci.2022.100042","DOIUrl":"10.1016/j.ailsci.2022.100042","url":null,"abstract":"<div><p>Drug repurposing consists in identifying additional uses for known drugs and, since these new findings are built on previous knowledge, it reduces both the length and the costs of the drug development. In this work, we assembled an automated computational pipeline for drug repurposing, integrating also a network-based analysis for screening the possible drug combinations. The selection of drugs relies both on their proximity to the disease on the protein-protein interactome and on their influence on the expression of disease-related genes. Combined therapies are then prioritized on the basis of the drugs’ separation on the human interactome and the known drug-drug interactions. We eventually collected a number of molecules, and their plausible combinations, that could be proposed for the treatment of Huntington's disease and multiple sclerosis. Finally, this pipeline could potentially provide new suggestions also for other complex disorders.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"2 ","pages":"Article 100042"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667318522000125/pdfft?md5=02a08224e3d5097be5747fc8a22c3572&pid=1-s2.0-S2667318522000125-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42636492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

LIDeB Tools: A Latin American resource of freely available, open-source cheminformatics apps LIDeB Tools:拉丁美洲免费提供的开源化学信息学应用程序资源

Artificial intelligence in the life sciences

Pub Date : 2022-12-01 Epub Date: 2022-12-02 DOI: 10.1016/j.ailsci.2022.100049

Denis N. Prada Gori, Lucas N. Alberca, Santiago Rodriguez, Juan I. Alice, Manuel A. Llanos, Carolina L. Bellera, Alan Talevi

Cheminformatics is the chemical field that deals with the storage, retrieval, analysis and manipulation of an increasing volume of available chemical data, and it plays a fundamental role in the fields of drug discovery, biology, chemistry, and biochemistry. Open source and freely available cheminformatics tools not only contribute to the generation of public knowledge, but also to reduce the technological gap between high- and low- to middle-income countries. Here, we describe a series of in-house cheminformatics applications developed by our academic drug discovery team, which are freely available on our website (https://lideb.biol.unlp.edu.ar/) as Web Apps and stand-alone versions. These apps include tools for clustering small molecules, decoy generation, druggability assessment, classificatory model evaluation, and data standardization and visualization.

化学信息学是化学领域的一门学科，它处理日益增多的可用化学数据的存储、检索、分析和操作，它在药物发现、生物学、化学和生物化学等领域起着重要作用。开源和免费提供的化学信息学工具不仅有助于公共知识的产生，而且还有助于缩小高、中低收入国家之间的技术差距。在这里，我们描述了一系列由我们的学术药物发现团队开发的内部化学信息学应用程序，这些应用程序可以在我们的网站(https://lideb.biol.unlp.edu.ar/)上作为Web应用程序和独立版本免费获得。这些应用程序包括小分子聚类、诱饵生成、药物评估、分类模型评估以及数据标准化和可视化的工具。

{"title":"LIDeB Tools: A Latin American resource of freely available, open-source cheminformatics apps","authors":"Denis N. Prada Gori, Lucas N. Alberca, Santiago Rodriguez, Juan I. Alice, Manuel A. Llanos, Carolina L. Bellera, Alan Talevi","doi":"10.1016/j.ailsci.2022.100049","DOIUrl":"10.1016/j.ailsci.2022.100049","url":null,"abstract":"<div><p>Cheminformatics is the chemical field that deals with the storage, retrieval, analysis and manipulation of an increasing volume of available chemical data, and it plays a fundamental role in the fields of drug discovery, biology, chemistry, and biochemistry. Open source and freely available cheminformatics tools not only contribute to the generation of public knowledge, but also to reduce the technological gap between high- and low- to middle-income countries. Here, we describe a series of in-house cheminformatics applications developed by our academic drug discovery team, which are freely available on our website (<span>https://lideb.biol.unlp.edu.ar/</span><svg><path></path></svg>) as Web Apps and stand-alone versions. These apps include tools for clustering small molecules, decoy generation, druggability assessment, classificatory model evaluation, and data standardization and visualization.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"2 ","pages":"Article 100049"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667318522000198/pdfft?md5=022e4e88e07795a9a57aee98fede7162&pid=1-s2.0-S2667318522000198-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47979278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Artificial intelligence in the life sciences

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀