Jinwoo Jung, Jeon-Ok Moon, Song Ih Ahn, Haeseung Lee
{"title":"利用机器学习方法根据化学结构预测化合物的抗氧化活性。","authors":"Jinwoo Jung, Jeon-Ok Moon, Song Ih Ahn, Haeseung Lee","doi":"10.4196/kjpp.2024.28.6.527","DOIUrl":null,"url":null,"abstract":"<p><p>Oxidative stress is a well-established risk factor for numerous chronic diseases, emphasizing the need for efficient identification of potent antioxidants. Conventional methods for assessing antioxidant properties are often time-consuming and resource-intensive, typically relying on laborious biochemical assays. In this study, we investigated the applicability of machine learning (ML) algorithms for predicting the antioxidant activity of compounds based solely on their molecular structure. We evaluated the performance of five ML algorithms, Support Vector Machine (SVM), Logistic Regression (LR), XGBoost, Random Forest (RF), and Deep Neural Network (DNN), using a dataset of over 1,900 compounds with experimentally determined antioxidant activity. Both RF and SVM achieved the best overall performance, exhibiting high accuracy (> 0.9) and effectively distinguishing active and inactive compounds with high structural similarity. External validation using natural product data from the BATMAN database confirmed the generalizability of the RF and SVM models. Our results suggest that ML models serve as powerful tools to expedite the discovery of novel antioxidant candidates, potentially streamlining the development of future therapeutic interventions.</p>","PeriodicalId":54746,"journal":{"name":"Korean Journal of Physiology & Pharmacology","volume":"28 6","pages":"527-537"},"PeriodicalIF":1.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11519722/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting antioxidant activity of compounds based on chemical structure using machine learning methods.\",\"authors\":\"Jinwoo Jung, Jeon-Ok Moon, Song Ih Ahn, Haeseung Lee\",\"doi\":\"10.4196/kjpp.2024.28.6.527\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Oxidative stress is a well-established risk factor for numerous chronic diseases, emphasizing the need for efficient identification of potent antioxidants. Conventional methods for assessing antioxidant properties are often time-consuming and resource-intensive, typically relying on laborious biochemical assays. In this study, we investigated the applicability of machine learning (ML) algorithms for predicting the antioxidant activity of compounds based solely on their molecular structure. We evaluated the performance of five ML algorithms, Support Vector Machine (SVM), Logistic Regression (LR), XGBoost, Random Forest (RF), and Deep Neural Network (DNN), using a dataset of over 1,900 compounds with experimentally determined antioxidant activity. Both RF and SVM achieved the best overall performance, exhibiting high accuracy (> 0.9) and effectively distinguishing active and inactive compounds with high structural similarity. External validation using natural product data from the BATMAN database confirmed the generalizability of the RF and SVM models. Our results suggest that ML models serve as powerful tools to expedite the discovery of novel antioxidant candidates, potentially streamlining the development of future therapeutic interventions.</p>\",\"PeriodicalId\":54746,\"journal\":{\"name\":\"Korean Journal of Physiology & Pharmacology\",\"volume\":\"28 6\",\"pages\":\"527-537\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11519722/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Korean Journal of Physiology & Pharmacology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.4196/kjpp.2024.28.6.527\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Korean Journal of Physiology & Pharmacology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.4196/kjpp.2024.28.6.527","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
摘要
氧化应激是众多慢性疾病的既定风险因素,因此需要有效地识别强效抗氧化剂。评估抗氧化剂特性的传统方法往往耗费时间和资源,通常依赖于费力的生化试验。在本研究中,我们研究了机器学习(ML)算法在仅根据化合物分子结构预测其抗氧化活性方面的适用性。我们使用由 1900 多种经实验确定具有抗氧化活性的化合物组成的数据集,评估了支持向量机(SVM)、逻辑回归(LR)、XGBoost、随机森林(RF)和深度神经网络(DNN)这五种 ML 算法的性能。RF 和 SVM 的总体性能最佳,表现出较高的准确性(> 0.9),并能有效区分结构相似度较高的活性和非活性化合物。利用 BATMAN 数据库中的天然产品数据进行的外部验证证实了 RF 和 SVM 模型的通用性。我们的研究结果表明,ML 模型是加快发现新型抗氧化候选化合物的有力工具,有可能简化未来治疗干预措施的开发。
Predicting antioxidant activity of compounds based on chemical structure using machine learning methods.
Oxidative stress is a well-established risk factor for numerous chronic diseases, emphasizing the need for efficient identification of potent antioxidants. Conventional methods for assessing antioxidant properties are often time-consuming and resource-intensive, typically relying on laborious biochemical assays. In this study, we investigated the applicability of machine learning (ML) algorithms for predicting the antioxidant activity of compounds based solely on their molecular structure. We evaluated the performance of five ML algorithms, Support Vector Machine (SVM), Logistic Regression (LR), XGBoost, Random Forest (RF), and Deep Neural Network (DNN), using a dataset of over 1,900 compounds with experimentally determined antioxidant activity. Both RF and SVM achieved the best overall performance, exhibiting high accuracy (> 0.9) and effectively distinguishing active and inactive compounds with high structural similarity. External validation using natural product data from the BATMAN database confirmed the generalizability of the RF and SVM models. Our results suggest that ML models serve as powerful tools to expedite the discovery of novel antioxidant candidates, potentially streamlining the development of future therapeutic interventions.
期刊介绍:
The Korean Journal of Physiology & Pharmacology (Korean J. Physiol. Pharmacol., KJPP) is the official journal of both the Korean Physiological Society (KPS) and the Korean Society of Pharmacology (KSP). The journal launched in 1997 and is published bi-monthly in English. KJPP publishes original, peer-reviewed, scientific research-based articles that report successful advances in physiology and pharmacology. KJPP welcomes the submission of all original research articles in the field of physiology and pharmacology, especially the new and innovative findings. The scope of researches includes the action mechanism, pharmacological effect, utilization, and interaction of chemicals with biological system as well as the development of new drug targets. Theoretical articles that use computational models for further understanding of the physiological or pharmacological processes are also welcomed. Investigative translational research articles on human disease with an emphasis on physiology or pharmacology are also invited. KJPP does not publish work on the actions of crude biological extracts of either unknown chemical composition (e.g. unpurified and unvalidated) or unknown concentration. Reviews are normally commissioned, but consideration will be given to unsolicited contributions. All papers accepted for publication in KJPP will appear simultaneously in the printed Journal and online.