首页 > 最新文献

Artificial intelligence in the life sciences最新文献

英文 中文
Discovery of novel A2AR antagonists through deep learning-based virtual screening 通过基于深度学习的虚拟筛选发现新型A2AR拮抗剂
Pub Date : 2023-01-20 DOI: 10.1016/j.ailsci.2023.100058
Miru Tang , Chang Wen , Jie Lin , Hongming Chen , Ting Ran

The A2A adenosine receptor (A2AR) is emerging as a promising drug target for cancer immunotherapy. Novel A2AR antagonists are highly demanded due to few candidates entering clinic trials specific for cancer treatment. Structure-based virtual screening has made a great contribution to discover novel A2AR antagonists, but most depended on inefficient molecular docking on relatively small molecular databases. In this work, a deep learning strategy was applied to accelerate docking-based virtual screening, through which new structural types of A2AR antagonists for an extremely large molecular library were found successfully.

A2A腺苷受体(A2AR)正在成为癌症免疫治疗的一个有前途的药物靶点。由于进入癌症治疗临床试验的候选药物很少,因此对新型A2AR拮抗剂的需求量很大。基于结构的虚拟筛选为发现新型A2AR拮抗剂做出了巨大贡献,但大多依赖于相对较小的分子数据库进行低效的分子对接。在这项工作中,应用深度学习策略来加速基于对接的虚拟筛选,通过该筛选,成功发现了用于极大分子库的新结构类型的A2AR拮抗剂。
{"title":"Discovery of novel A2AR antagonists through deep learning-based virtual screening","authors":"Miru Tang ,&nbsp;Chang Wen ,&nbsp;Jie Lin ,&nbsp;Hongming Chen ,&nbsp;Ting Ran","doi":"10.1016/j.ailsci.2023.100058","DOIUrl":"10.1016/j.ailsci.2023.100058","url":null,"abstract":"<div><p>The A<sub>2A</sub> adenosine receptor (A<sub>2A</sub>R) is emerging as a promising drug target for cancer immunotherapy. Novel A<sub>2A</sub>R antagonists are highly demanded due to few candidates entering clinic trials specific for cancer treatment. Structure-based virtual screening has made a great contribution to discover novel A<sub>2A</sub>R antagonists, but most depended on inefficient molecular docking on relatively small molecular databases. In this work, a deep learning strategy was applied to accelerate docking-based virtual screening, through which new structural types of A<sub>2A</sub>R antagonists for an extremely large molecular library were found successfully.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100058"},"PeriodicalIF":0.0,"publicationDate":"2023-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46817392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Application of AI techniques and robotics in agriculture: A review 人工智能技术和机器人技术在农业中的应用综述
Pub Date : 2023-01-06 DOI: 10.1016/j.ailsci.2023.100057
Manas Wakchaure , B.K. Patle , A.K. Mahindrakar

The aim of the proposed work is to review the various AI techniques (fuzzy logic (FL), artificial neural network (ANN), genetic algorithm (GA), particle swarm optimization (PSO), artificial potential field (APF), simulated annealing (SA), ant colony optimization (ACO), artificial bee colony algorithm (ABC), harmony search algorithm (HS), bat algorithm (BA), cell decomposition (CD) and firefly algorithm (FA)) in agriculture, focusing on expert systems, robots developed for agriculture, sensors technology for collecting and transmitting data, in an attempt to reveal their potential impact in the field of agriculture. None of the literature highlights the application of AI techniques and robots in (Cultivation, Monitoring, and Harvesting) to understand their contribution to the agriculture sector and the simultaneous comparison of each based on its usefulness and popularity. This work investigates the comparative analysis of three essential phases of agriculture: Cultivation, Monitoring, and Harvesting, by knowing the depth of AI involved and the robots utilized. The current study presents a systematic review of more than 150 papers based on the existing automation application in agriculture from 1960 to 2021. It highlights the future research gap in making intelligent autonomous systems in agriculture. The paper concludes with tabular data and charts comparing the frequency of individual AI approaches for specific applications in the agriculture field.

本文的目的是综述农业领域的各种人工智能技术(模糊逻辑(FL)、人工神经网络(ANN)、遗传算法(GA)、粒子群优化(PSO)、人工势场(APF)、模拟退火(SA)、蚁群优化(ACO)、人工蜂群算法(ABC)、和谐搜索算法(HS)、蝙蝠算法(BA)、细胞分解(CD)和萤火虫算法(FA)),重点介绍专家系统、农业机器人、用于收集和传输数据的传感器技术,试图揭示其在农业领域的潜在影响。没有一篇文献强调人工智能技术和机器人在(种植、监测和收获)中的应用,以了解它们对农业部门的贡献,并根据其实用性和受欢迎程度同时对每种技术和机器人进行比较。通过了解人工智能的深度和所使用的机器人,这项工作调查了农业的三个基本阶段:种植、监测和收获的比较分析。本研究系统回顾了从1960年到2021年150多篇基于现有自动化在农业中的应用的论文。它突出了未来在农业智能自主系统方面的研究差距。论文最后用表格数据和图表比较了农业领域特定应用中各个人工智能方法的频率。
{"title":"Application of AI techniques and robotics in agriculture: A review","authors":"Manas Wakchaure ,&nbsp;B.K. Patle ,&nbsp;A.K. Mahindrakar","doi":"10.1016/j.ailsci.2023.100057","DOIUrl":"10.1016/j.ailsci.2023.100057","url":null,"abstract":"<div><p>The aim of the proposed work is to review the various AI techniques (fuzzy logic (FL), artificial neural network (ANN), genetic algorithm (GA), particle swarm optimization (PSO), artificial potential field (APF), simulated annealing (SA), ant colony optimization (ACO), artificial bee colony algorithm (ABC), harmony search algorithm (HS), bat algorithm (BA), cell decomposition (CD) and firefly algorithm (FA)) in agriculture, focusing on expert systems, robots developed for agriculture, sensors technology for collecting and transmitting data, in an attempt to reveal their potential impact in the field of agriculture. None of the literature highlights the application of AI techniques and robots in (Cultivation, Monitoring, and Harvesting) to understand their contribution to the agriculture sector and the simultaneous comparison of each based on its usefulness and popularity. This work investigates the comparative analysis of three essential phases of agriculture: Cultivation, Monitoring, and Harvesting, by knowing the depth of AI involved and the robots utilized. The current study presents a systematic review of more than 150 papers based on the existing automation application in agriculture from 1960 to 2021. It highlights the future research gap in making intelligent autonomous systems in agriculture. The paper concludes with tabular data and charts comparing the frequency of individual AI approaches for specific applications in the agriculture field.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100057"},"PeriodicalIF":0.0,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44719076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Machine learning for small molecule drug discovery in academia and industry 学术界和工业界用于小分子药物发现的机器学习
Pub Date : 2023-01-05 DOI: 10.1016/j.ailsci.2022.100056
Andrea Volkamer , Sereina Riniker , Eva Nittinger , Jessica Lanini , Francesca Grisoni , Emma Evertsson , Raquel Rodríguez-Pérez , Nadine Schneider

Academic and pharmaceutical industry research are both key for progresses in the field of molecular machine learning. Despite common open research questions and long-term goals, the nature and scope of investigations typically differ between academia and industry. Herein, we highlight the opportunities that machine learning models offer to accelerate and improve compound selection. All parts of the model life cycle are discussed, including data preparation, model building, validation, and deployment. Main challenges in molecular machine learning as well as differences between academia and industry are highlighted. Furthermore, application aspects in the design-make-test-analyze cycle are discussed. We close with strategies that could improve collaboration between academic and industrial institutions and will advance the field even further.

学术研究和制药工业研究都是分子机器学习领域取得进展的关键。尽管有共同的开放研究问题和长期目标,但研究的性质和范围在学术界和工业界之间通常是不同的。在此,我们强调了机器学习模型提供的加速和改进化合物选择的机会。讨论了模型生命周期的所有部分,包括数据准备、模型构建、验证和部署。强调了分子机器学习的主要挑战以及学术界和工业界之间的差异。此外,还讨论了在设计-制造-测试-分析周期中的应用。我们的战略可以改善学术和工业机构之间的合作,并将进一步推动该领域的发展。
{"title":"Machine learning for small molecule drug discovery in academia and industry","authors":"Andrea Volkamer ,&nbsp;Sereina Riniker ,&nbsp;Eva Nittinger ,&nbsp;Jessica Lanini ,&nbsp;Francesca Grisoni ,&nbsp;Emma Evertsson ,&nbsp;Raquel Rodríguez-Pérez ,&nbsp;Nadine Schneider","doi":"10.1016/j.ailsci.2022.100056","DOIUrl":"10.1016/j.ailsci.2022.100056","url":null,"abstract":"<div><p>Academic and pharmaceutical industry research are both key for progresses in the field of molecular machine learning. Despite common open research questions and long-term goals, the nature and scope of investigations typically differ between academia and industry. Herein, we highlight the opportunities that machine learning models offer to accelerate and improve compound selection. All parts of the model life cycle are discussed, including data preparation, model building, validation, and deployment. Main challenges in molecular machine learning as well as differences between academia and industry are highlighted. Furthermore, application aspects in the design-make-test-analyze cycle are discussed. We close with strategies that could improve collaboration between academic and industrial institutions and will advance the field even further.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100056"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47508788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A natural language processing system for the efficient updating of highly curated pathophysiology mechanism knowledge graphs 一个用于高效更新高度策划的病理生理机制知识图的自然语言处理系统
Pub Date : 2023-01-01 DOI: 10.1016/j.ailsci.2023.100078
Negin Sadat Babaiha , Hassan Elsayed , Bide Zhang , Abish Kaladharan , Priya Sethumadhavan , Bruce Schultz , Jürgen Klein , Bruno Freudensprung , Vanessa Lage-Rupprecht , Alpha Tom Kodamullil , Marc Jacobs , Stefan Geissler , Sumit Madan , Martin Hofmann-Apitius
{"title":"A natural language processing system for the efficient updating of highly curated pathophysiology mechanism knowledge graphs","authors":"Negin Sadat Babaiha ,&nbsp;Hassan Elsayed ,&nbsp;Bide Zhang ,&nbsp;Abish Kaladharan ,&nbsp;Priya Sethumadhavan ,&nbsp;Bruce Schultz ,&nbsp;Jürgen Klein ,&nbsp;Bruno Freudensprung ,&nbsp;Vanessa Lage-Rupprecht ,&nbsp;Alpha Tom Kodamullil ,&nbsp;Marc Jacobs ,&nbsp;Stefan Geissler ,&nbsp;Sumit Madan ,&nbsp;Martin Hofmann-Apitius","doi":"10.1016/j.ailsci.2023.100078","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100078","url":null,"abstract":"","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"4 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corrigendum to “Optimizing active learning for free energy Calculations” [Artificial Intelligence in the Life Sciences, 2 (2022) 100050] “优化自由能计算的主动学习”的勘误表[生命科学中的人工智能,2 (2022)100050]
Pub Date : 2023-01-01 DOI: 10.1016/j.ailsci.2023.100074
James Thompson , W Patrick Walters , Jianwen A Feng , Nicolas A Pabon , Hongcheng Xu , Brian B Goldman , Demetri Moustakas , Molly Schmidt , Forrest York
{"title":"Corrigendum to “Optimizing active learning for free energy Calculations” [Artificial Intelligence in the Life Sciences, 2 (2022) 100050]","authors":"James Thompson ,&nbsp;W Patrick Walters ,&nbsp;Jianwen A Feng ,&nbsp;Nicolas A Pabon ,&nbsp;Hongcheng Xu ,&nbsp;Brian B Goldman ,&nbsp;Demetri Moustakas ,&nbsp;Molly Schmidt ,&nbsp;Forrest York","doi":"10.1016/j.ailsci.2023.100074","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100074","url":null,"abstract":"","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49775001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum regarding missing Conflict of Interest Statement & Ethical Statement in previously published articles 关于先前发表的文章中缺少利益冲突声明和道德声明的勘误表
Pub Date : 2023-01-01 DOI: 10.1016/j.ailsci.2023.100076
{"title":"Erratum regarding missing Conflict of Interest Statement & Ethical Statement in previously published articles","authors":"","doi":"10.1016/j.ailsci.2023.100076","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100076","url":null,"abstract":"","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49775002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An industrial evaluation of proteochemometric modelling: Predicting drug-target affinities for kinases 蛋白化学计量建模的工业评估:预测激酶的药物靶点亲和力
Pub Date : 2023-01-01 DOI: 10.1016/j.ailsci.2023.100079
Astrid Stroobants , Lewis H. Mervin , Ola Engkvist , Graeme R. Robb
{"title":"An industrial evaluation of proteochemometric modelling: Predicting drug-target affinities for kinases","authors":"Astrid Stroobants ,&nbsp;Lewis H. Mervin ,&nbsp;Ola Engkvist ,&nbsp;Graeme R. Robb","doi":"10.1016/j.ailsci.2023.100079","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100079","url":null,"abstract":"","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"4 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence systems for the design of magic shotgun drugs 人工智能系统的神奇猎枪药物设计
Pub Date : 2022-12-22 DOI: 10.1016/j.ailsci.2022.100055
José Teófilo Moreira-Filho , Meryck Felipe Brito da Silva , Joyce Villa Verde Bastos Borba , Arlindo Rodrigues Galvão Filho , Eugene N Muratov , Carolina Horta Andrade , Rodolpho de Campos Braga , Bruno Junior Neves

Designing magic shotgun compounds, i.e., compounds hitting multiple targets using artificial intelligence (AI) systems based on machine learning (ML) and deep learning (DL) approaches, has a huge potential to revolutionize drug discovery. Such intelligent systems enable computers to create new chemical structures and predict their multi-target properties at a low cost and in a time-efficient manner. Most examples of AI applied to drug discovery are single-target oriented and there is still a lack of concise information regarding the application of this technology for the discovery of multi-target drugs or drugs with broad-spectrum action. In this review, we focus on current developments in AI systems for the next generation of automated design of multi-target drugs. We discuss how classical ML methods, cutting-edge generative models, and multi-task deep neural networks can help de novo design and hit-to-lead optimization of multi-target drugs. Moreover, we present state-of-the-art workflows and highlight some studies demonstrating encouraging experimental results, which pave the way for de novo drug design and multi-target drug discovery.

设计神奇的霰弹枪化合物,即使用基于机器学习(ML)和深度学习(DL)方法的人工智能(AI)系统击中多个目标的化合物,具有彻底改变药物发现的巨大潜力。这种智能系统使计算机能够以低成本和高效率的方式创造新的化学结构并预测其多目标特性。人工智能应用于药物发现的大多数例子都是单靶点导向的,关于将该技术应用于发现多靶点药物或具有广谱作用的药物方面,仍然缺乏简明的信息。在这篇综述中,我们重点介绍了用于下一代多靶点药物自动化设计的人工智能系统的最新发展。我们讨论了经典的机器学习方法、尖端的生成模型和多任务深度神经网络如何帮助多靶点药物的从头设计和hit-to-lead优化。此外,我们还介绍了最先进的工作流程,并重点介绍了一些展示令人鼓舞的实验结果的研究,这些实验结果为新药物设计和多靶点药物发现铺平了道路。
{"title":"Artificial intelligence systems for the design of magic shotgun drugs","authors":"José Teófilo Moreira-Filho ,&nbsp;Meryck Felipe Brito da Silva ,&nbsp;Joyce Villa Verde Bastos Borba ,&nbsp;Arlindo Rodrigues Galvão Filho ,&nbsp;Eugene N Muratov ,&nbsp;Carolina Horta Andrade ,&nbsp;Rodolpho de Campos Braga ,&nbsp;Bruno Junior Neves","doi":"10.1016/j.ailsci.2022.100055","DOIUrl":"10.1016/j.ailsci.2022.100055","url":null,"abstract":"<div><p>Designing magic shotgun compounds, i.e., compounds hitting multiple targets using artificial intelligence (AI) systems based on machine learning (ML) and deep learning (DL) approaches, has a huge potential to revolutionize drug discovery. Such intelligent systems enable computers to create new chemical structures and predict their multi-target properties at a low cost and in a time-efficient manner. Most examples of AI applied to drug discovery are single-target oriented and there is still a lack of concise information regarding the application of this technology for the discovery of multi-target drugs or drugs with broad-spectrum action. In this review, we focus on current developments in AI systems for the next generation of automated design of multi-target drugs. We discuss how classical ML methods, cutting-edge generative models, and multi-task deep neural networks can help <em>de novo</em> design and hit-to-lead optimization of multi-target drugs. Moreover, we present state-of-the-art workflows and highlight some studies demonstrating encouraging experimental results, which pave the way for <em>de novo</em> drug design and multi-target drug discovery.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100055"},"PeriodicalIF":0.0,"publicationDate":"2022-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43297571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Specific contributions of artificial intelligence to interdisciplinary life science research – exploring and communicating new opportunities 人工智能对跨学科生命科学研究的具体贡献——探索和交流新机遇
Pub Date : 2022-12-11 DOI: 10.1016/j.ailsci.2022.100052
Jürgen Bajorath
{"title":"Specific contributions of artificial intelligence to interdisciplinary life science research – exploring and communicating new opportunities","authors":"Jürgen Bajorath","doi":"10.1016/j.ailsci.2022.100052","DOIUrl":"10.1016/j.ailsci.2022.100052","url":null,"abstract":"","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100052"},"PeriodicalIF":0.0,"publicationDate":"2022-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41990897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coupled encoding methods for antimicrobial peptide prediction: How sensitive is a highly accurate model? 抗菌肽预测的耦合编码方法:高度准确的模型有多敏感?
Pub Date : 2022-12-01 DOI: 10.1016/j.ailsci.2022.100034
Ivan Erjavac , Daniela Kalafatovic , Goran Mauša

Current application of machine learning in the process of antimicrobial peptide discovery call for the reduction of the false positive predictions that are produced by the classification models. Considering that the positive predictions of high confidence drive modern experimental design, the model’s sensitivity is crucial to reduce the number of unnecessary in vitro tests. Furthermore, taking into account the expert-based design approaches that employ random mutations on confirmed sequences, the machine learning models are required to distinguish between subtle differences among shuffled sequences. With the goal of reducing the false positive rate and improving sensitivity, we propose a hybrid approach to antimicrobial peptide prediction that utilizes combined encoding models. To this end, we implement models that employ both the physico-chemical features and sequence ordering information to stress the importance of using both representations. We also investigate the usage of binary encoding for peptide representation purposes, a method that is insufficiently represented in related research, which proved to act as a viable low dimensional alternative to the one-hot encoding. Our results, supported by Cochran and McNemar statistical tests and Spearman correlation analysis, indicate that the sequence-based encodings complement the physico-chemical features and their synergic effect yields improvement in terms of every evaluation metric. Finally, the proposed hybrid approach that combines physico-chemical features and binary encoding using logical conjunction was shown to be superior to other single models by a factor of 2.96 in terms of fall-out and up to 6.1% in terms of precision.

当前机器学习在抗菌肽发现过程中的应用要求减少由分类模型产生的假阳性预测。考虑到高置信度的积极预测驱动着现代实验设计,该模型的灵敏度对于减少不必要的体外试验数量至关重要。此外,考虑到基于专家的设计方法在已确认的序列上采用随机突变,机器学习模型需要区分洗牌序列之间的细微差异。为了降低假阳性率和提高敏感性,我们提出了一种利用组合编码模型进行抗菌肽预测的混合方法。为此,我们实现了同时使用物理化学特征和序列排序信息的模型,以强调使用这两种表示的重要性。我们还研究了用于肽表示目的的二进制编码的使用,这是一种在相关研究中没有充分代表的方法,它被证明是一种可行的低维替代单热编码。我们的研究结果得到了Cochran和McNemar统计测试和Spearman相关分析的支持,表明基于序列的编码补充了物理化学特征,它们的协同效应在每个评价指标方面都有所改善。最后,所提出的混合方法结合了物理化学特征和使用逻辑连接的二进制编码,在辐射系数方面优于其他单一模型2.96,在精度方面优于6.1%。
{"title":"Coupled encoding methods for antimicrobial peptide prediction: How sensitive is a highly accurate model?","authors":"Ivan Erjavac ,&nbsp;Daniela Kalafatovic ,&nbsp;Goran Mauša","doi":"10.1016/j.ailsci.2022.100034","DOIUrl":"10.1016/j.ailsci.2022.100034","url":null,"abstract":"<div><p>Current application of machine learning in the process of antimicrobial peptide discovery call for the reduction of the false positive predictions that are produced by the classification models. Considering that the positive predictions of high confidence drive modern experimental design, the model’s sensitivity is crucial to reduce the number of unnecessary <em>in vitro</em> tests. Furthermore, taking into account the expert-based design approaches that employ random mutations on confirmed sequences, the machine learning models are required to distinguish between subtle differences among shuffled sequences. With the goal of reducing the false positive rate and improving sensitivity, we propose a hybrid approach to antimicrobial peptide prediction that utilizes combined encoding models. To this end, we implement models that employ both the physico-chemical features and sequence ordering information to stress the importance of using both representations. We also investigate the usage of binary encoding for peptide representation purposes, a method that is insufficiently represented in related research, which proved to act as a viable low dimensional alternative to the one-hot encoding. Our results, supported by Cochran and McNemar statistical tests and Spearman correlation analysis, indicate that the sequence-based encodings complement the physico-chemical features and their synergic effect yields improvement in terms of every evaluation metric. Finally, the proposed hybrid approach that combines physico-chemical features and binary encoding using logical conjunction was shown to be superior to other single models by a factor of 2.96 in terms of fall-out and up to 6.1% in terms of precision.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"2 ","pages":"Article 100034"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667318522000058/pdfft?md5=3f5cf3ee0ab97ece8587283b98a0d00f&pid=1-s2.0-S2667318522000058-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49610236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Artificial intelligence in the life sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1