Comparative Analysis of Recurrent Neural Networks with Conjoint Fingerprints for Skin Corrosion Prediction.

IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Journal of Chemical Information and Modeling Pub Date : 2025-01-21 DOI:10.1021/acs.jcim.4c02062
Huynh Anh Duy,Tarapong Srisongkram
{"title":"Comparative Analysis of Recurrent Neural Networks with Conjoint Fingerprints for Skin Corrosion Prediction.","authors":"Huynh Anh Duy,Tarapong Srisongkram","doi":"10.1021/acs.jcim.4c02062","DOIUrl":null,"url":null,"abstract":"Skin corrosion assessment is an essential toxicity end point that addresses safety concerns for topical dosage forms and cosmetic products. Previously, skin corrosion assessments required animal testing; however, differences in skin architecture and ethical concerns regarding animal models have fostered the advancement of alternative methods such as in silico and in vitro models. This study aimed to develop deep learning (DL) models based on recurrent neural networks (RNNs) for classifying skin corrosion of chemical compounds based on chemical language notation, molecular substructure, physicochemical properties, and a combination of these three properties called conjoint fingerprints. Simple RNN, long short-term memory, bidirectional long short-term memory (BiLSTM), gated recurrent units, and bidirectional gated recurrent units models, along with 11 molecular features, were employed to generate 55 RNN-based models. Applicability domain and permutation importance analysis were exploited for additional trustable prediction and explanation ability of the models, respectively. Our findings indicate that BiLSTM with conjoint features of MACCS keys and physicochemical descriptors is the most effective model with 84.3% accuracy, 89.8% area under the curve, and 57.6% Matthews correlation coefficient for the external test performance. Furthermore, our model accurately predicted the skin corrosion toxicity of all new and unseen compounds beyond our test set, highlighting prominent classification performance compared to existing skin corrosion models. This finding will contribute to the utilization of DL and conjoint characteristics of molecular structure to enhance the model's predictive capability for skin toxicity assessment.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"19 1","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c02062","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

Skin corrosion assessment is an essential toxicity end point that addresses safety concerns for topical dosage forms and cosmetic products. Previously, skin corrosion assessments required animal testing; however, differences in skin architecture and ethical concerns regarding animal models have fostered the advancement of alternative methods such as in silico and in vitro models. This study aimed to develop deep learning (DL) models based on recurrent neural networks (RNNs) for classifying skin corrosion of chemical compounds based on chemical language notation, molecular substructure, physicochemical properties, and a combination of these three properties called conjoint fingerprints. Simple RNN, long short-term memory, bidirectional long short-term memory (BiLSTM), gated recurrent units, and bidirectional gated recurrent units models, along with 11 molecular features, were employed to generate 55 RNN-based models. Applicability domain and permutation importance analysis were exploited for additional trustable prediction and explanation ability of the models, respectively. Our findings indicate that BiLSTM with conjoint features of MACCS keys and physicochemical descriptors is the most effective model with 84.3% accuracy, 89.8% area under the curve, and 57.6% Matthews correlation coefficient for the external test performance. Furthermore, our model accurately predicted the skin corrosion toxicity of all new and unseen compounds beyond our test set, highlighting prominent classification performance compared to existing skin corrosion models. This finding will contribute to the utilization of DL and conjoint characteristics of molecular structure to enhance the model's predictive capability for skin toxicity assessment.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
递归神经网络与联合指纹在皮肤腐蚀预测中的对比分析。
皮肤腐蚀评估是解决局部剂型和化妆品安全问题的基本毒性终点。以前,皮肤腐蚀评估需要动物试验;然而,皮肤结构的差异和动物模型的伦理问题促进了替代方法的发展,如硅和体外模型。本研究旨在开发基于递归神经网络(rnn)的深度学习(DL)模型,用于基于化学语言符号、分子亚结构、物理化学性质以及这三种性质的组合(称为联合指纹)对化合物的皮肤腐蚀进行分类。采用简单RNN、长短期记忆、双向长短期记忆(BiLSTM)、门控循环单元和双向门控循环单元模型,以及11个分子特征,生成了55个基于RNN的模型。利用适用域和排列重要性分析分别增加模型的可信预测和解释能力。结果表明,结合MACCS键和理化描述符特征的BiLSTM是最有效的外部测试模型,准确率为84.3%,曲线下面积为89.8%,马修斯相关系数为57.6%。此外,我们的模型准确地预测了我们测试集之外所有新的和未见过的化合物的皮肤腐蚀毒性,与现有的皮肤腐蚀模型相比,突出了突出的分类性能。这一发现将有助于利用DL和分子结构的联合特性来提高模型对皮肤毒性评估的预测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
期刊最新文献
Modeling Heterogeneous Catalysis Using Quantum Computers: An Academic and Industry Perspective. ComNet: A Multiview Deep Learning Model for Predicting Drug Combination Side Effects. Quick-and-Easy Validation of Protein-Ligand Binding Models Using Fragment-Based Semiempirical Quantum Chemistry. End-Point Affinity Estimation of Galectin Ligands by Classical and Semiempirical Quantum Mechanical Potentials. Development of Receptor Desolvation Scoring and Covalent Sampling in DOCK 6: Methods Evaluated on a RAS Test Set.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1