用KNIME预测COX-2抑制剂:基于无代码自动机器学习的虚拟筛选工作流程

IF 3.4 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY Journal of Computational Chemistry Pub Date : 2025-01-11 DOI:10.1002/jcc.70030
Powsali Ghosh, Ashok Kumar, Sushil Kumar Singh
{"title":"用KNIME预测COX-2抑制剂:基于无代码自动机器学习的虚拟筛选工作流程","authors":"Powsali Ghosh,&nbsp;Ashok Kumar,&nbsp;Sushil Kumar Singh","doi":"10.1002/jcc.70030","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Cyclooxygenase-2 (COX-2) is an enzyme that plays a crucial role in inflammation by converting arachidonic acid into prostaglandins. The overexpression of enzyme is associated with conditions such as cancer, arthritis, and Alzheimer's disease (AD), where it contributes to neuroinflammation. In silico virtual screening is pivotal in early-stage drug discovery; however, the absence of coding or machine learning expertise can impede the development of reliable computational models capable of accurately predicting inhibitor compounds based on their chemical structure. In this study, we developed an automated KNIME workflow for predicting the COX-2 inhibitory potential of novel molecules by building a multi-level ensemble model constructed with five machine learning algorithms (i.e., Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, and Extreme Gradient Boosting) and various molecular and fingerprint descriptors (i.e., AtomPair, Avalon, MACCS, Morgan, RDKit, and Pattern). Post-applicability domain filtering, the final majority voting-based ensemble model achieved 90.0% balanced accuracy, 87.7% precision, and 86.4% recall on the external validation set. The freely accessible workflow empowers users to swiftly and effortlessly predict COX-2 inhibitors, eliminating the need for any prior knowledge in machine learning, coding, or statistical modeling, significantly broadening its accessibility. While beginners can seamlessly use the tool as is, experienced KNIME users can leverage it as a foundation to build advanced workflows, driving further research and innovation.</p>\n </div>","PeriodicalId":188,"journal":{"name":"Journal of Computational Chemistry","volume":"46 2","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"COX-2 Inhibitor Prediction With KNIME: A Codeless Automated Machine Learning-Based Virtual Screening Workflow\",\"authors\":\"Powsali Ghosh,&nbsp;Ashok Kumar,&nbsp;Sushil Kumar Singh\",\"doi\":\"10.1002/jcc.70030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Cyclooxygenase-2 (COX-2) is an enzyme that plays a crucial role in inflammation by converting arachidonic acid into prostaglandins. The overexpression of enzyme is associated with conditions such as cancer, arthritis, and Alzheimer's disease (AD), where it contributes to neuroinflammation. In silico virtual screening is pivotal in early-stage drug discovery; however, the absence of coding or machine learning expertise can impede the development of reliable computational models capable of accurately predicting inhibitor compounds based on their chemical structure. In this study, we developed an automated KNIME workflow for predicting the COX-2 inhibitory potential of novel molecules by building a multi-level ensemble model constructed with five machine learning algorithms (i.e., Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, and Extreme Gradient Boosting) and various molecular and fingerprint descriptors (i.e., AtomPair, Avalon, MACCS, Morgan, RDKit, and Pattern). Post-applicability domain filtering, the final majority voting-based ensemble model achieved 90.0% balanced accuracy, 87.7% precision, and 86.4% recall on the external validation set. The freely accessible workflow empowers users to swiftly and effortlessly predict COX-2 inhibitors, eliminating the need for any prior knowledge in machine learning, coding, or statistical modeling, significantly broadening its accessibility. While beginners can seamlessly use the tool as is, experienced KNIME users can leverage it as a foundation to build advanced workflows, driving further research and innovation.</p>\\n </div>\",\"PeriodicalId\":188,\"journal\":{\"name\":\"Journal of Computational Chemistry\",\"volume\":\"46 2\",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-01-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70030\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70030","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

环氧合酶-2 (COX-2)是一种通过将花生四烯酸转化为前列腺素在炎症中起关键作用的酶。酶的过度表达与癌症、关节炎和阿尔茨海默病(AD)等疾病有关,在这些疾病中,它会导致神经炎症。计算机虚拟筛选是早期药物发现的关键;然而,缺乏编码或机器学习专业知识可能会阻碍可靠的计算模型的发展,这些计算模型能够根据其化学结构准确预测抑制剂化合物。在这项研究中,我们开发了一个自动化的KNIME工作流,用于预测新分子的COX-2抑制电位,通过构建一个多层集成模型,该模型由五种机器学习算法(即逻辑回归、k近邻、决策树、随机森林和极端梯度增强)和各种分子和指纹描述符(即AtomPair、Avalon、MACCS、Morgan、RDKit和Pattern)构建。经过适用性域过滤,最终基于多数投票的集成模型在外部验证集上达到90.0%的平衡准确率、87.7%的精度和86.4%的召回率。自由访问的工作流程使用户能够快速,毫不费力地预测COX-2抑制剂,消除了对机器学习,编码或统计建模方面的任何先验知识的需求,显着扩大了其可访问性。虽然初学者可以无缝地使用该工具,但经验丰富的KNIME用户可以利用它作为构建高级工作流程的基础,从而推动进一步的研究和创新。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
COX-2 Inhibitor Prediction With KNIME: A Codeless Automated Machine Learning-Based Virtual Screening Workflow

Cyclooxygenase-2 (COX-2) is an enzyme that plays a crucial role in inflammation by converting arachidonic acid into prostaglandins. The overexpression of enzyme is associated with conditions such as cancer, arthritis, and Alzheimer's disease (AD), where it contributes to neuroinflammation. In silico virtual screening is pivotal in early-stage drug discovery; however, the absence of coding or machine learning expertise can impede the development of reliable computational models capable of accurately predicting inhibitor compounds based on their chemical structure. In this study, we developed an automated KNIME workflow for predicting the COX-2 inhibitory potential of novel molecules by building a multi-level ensemble model constructed with five machine learning algorithms (i.e., Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, and Extreme Gradient Boosting) and various molecular and fingerprint descriptors (i.e., AtomPair, Avalon, MACCS, Morgan, RDKit, and Pattern). Post-applicability domain filtering, the final majority voting-based ensemble model achieved 90.0% balanced accuracy, 87.7% precision, and 86.4% recall on the external validation set. The freely accessible workflow empowers users to swiftly and effortlessly predict COX-2 inhibitors, eliminating the need for any prior knowledge in machine learning, coding, or statistical modeling, significantly broadening its accessibility. While beginners can seamlessly use the tool as is, experienced KNIME users can leverage it as a foundation to build advanced workflows, driving further research and innovation.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.60
自引率
3.30%
发文量
247
审稿时长
1.7 months
期刊介绍: This distinguished journal publishes articles concerned with all aspects of computational chemistry: analytical, biological, inorganic, organic, physical, and materials. The Journal of Computational Chemistry presents original research, contemporary developments in theory and methodology, and state-of-the-art applications. Computational areas that are featured in the journal include ab initio and semiempirical quantum mechanics, density functional theory, molecular mechanics, molecular dynamics, statistical mechanics, cheminformatics, biomolecular structure prediction, molecular design, and bioinformatics.
期刊最新文献
Issue Information Parallelized Tools for the Preparation and Curation of Large Libraries for Virtual Screening Predicting Molecular Energies of Small Organic Molecules With Multi-Fidelity Methods Not Just Another Crystal Field Software Optical Properties and Tautomerism of 2-Carbamido-1,3-Indandione in Ground and Excited States
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1