A comparative study of deep learning models and classification algorithms for chemical compound identification and Tox21 prediction

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2024-07-11 DOI:10.1016/j.compchemeng.2024.108805
{"title":"A comparative study of deep learning models and classification algorithms for chemical compound identification and Tox21 prediction","authors":"","doi":"10.1016/j.compchemeng.2024.108805","DOIUrl":null,"url":null,"abstract":"<div><p>Chemical compound classification, toxicity prediction, and environmental risk assessments are critically important in various applications within the field of chemistry. Deep learning models provide highly effective tools for extracting features from complex large datasets and performing classification tasks. Four different deep learning models, namely ResNet50V2, VGG19, InceptionV3, and MobileNetV2, have been compared with the random forest (RF) and k-nearest neighbors (KNN) algorithms. The results obtained from experiments conducted using QRCODE images of the Tox21SMILES dataset demonstrate the effectiveness of deep learning models for classifying chemical compounds and showcase the performance of different classification algorithms. The findings of the study thoroughly evaluate the performance of deep learning models and classification algorithms in the task of chemical classification. While ResNet50V2 and VGG19 models achieve high accuracy and precision, InceptionV3 and MobileNetV2 models provide more balanced results. Additionally, in terms of classification algorithms, the k-nearest neighbors (KNN) algorithm generally outperforms the Random Forest (RF) algorithm. Although the RF algorithm achieves good accuracy, the KNN algorithm proves to be more effective in terms of sensitivity and F1 score. These results emphasize the factors to consider when choosing which deep learning model or classification algorithm to use in chemical classification tasks. In conclusion, this study presents a comprehensive analysis comparing the performance of deep learning models and classification algorithms in chemical classification tasks. The selection of the most suitable model and algorithm for a specific task supports achieving better results in the classification of chemical compounds and related applications.</p></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":null,"pages":null},"PeriodicalIF":3.9000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424002230","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Chemical compound classification, toxicity prediction, and environmental risk assessments are critically important in various applications within the field of chemistry. Deep learning models provide highly effective tools for extracting features from complex large datasets and performing classification tasks. Four different deep learning models, namely ResNet50V2, VGG19, InceptionV3, and MobileNetV2, have been compared with the random forest (RF) and k-nearest neighbors (KNN) algorithms. The results obtained from experiments conducted using QRCODE images of the Tox21SMILES dataset demonstrate the effectiveness of deep learning models for classifying chemical compounds and showcase the performance of different classification algorithms. The findings of the study thoroughly evaluate the performance of deep learning models and classification algorithms in the task of chemical classification. While ResNet50V2 and VGG19 models achieve high accuracy and precision, InceptionV3 and MobileNetV2 models provide more balanced results. Additionally, in terms of classification algorithms, the k-nearest neighbors (KNN) algorithm generally outperforms the Random Forest (RF) algorithm. Although the RF algorithm achieves good accuracy, the KNN algorithm proves to be more effective in terms of sensitivity and F1 score. These results emphasize the factors to consider when choosing which deep learning model or classification algorithm to use in chemical classification tasks. In conclusion, this study presents a comprehensive analysis comparing the performance of deep learning models and classification algorithms in chemical classification tasks. The selection of the most suitable model and algorithm for a specific task supports achieving better results in the classification of chemical compounds and related applications.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于化合物识别和 Tox21 预测的深度学习模型和分类算法比较研究
化合物分类、毒性预测和环境风险评估在化学领域的各种应用中至关重要。深度学习模型为从复杂的大型数据集中提取特征和执行分类任务提供了高效的工具。我们将四种不同的深度学习模型,即 ResNet50V2、VGG19、InceptionV3 和 MobileNetV2,与随机森林(RF)和 k-nearest neighbors(KNN)算法进行了比较。使用 Tox21SMILES 数据集的 QRCODE 图像进行的实验结果证明了深度学习模型在化合物分类方面的有效性,并展示了不同分类算法的性能。研究结果全面评估了深度学习模型和分类算法在化学分类任务中的性能。ResNet50V2 和 VGG19 模型实现了较高的准确度和精确度,而 InceptionV3 和 MobileNetV2 模型则提供了更均衡的结果。此外,在分类算法方面,k-近邻(KNN)算法普遍优于随机森林(RF)算法。虽然 RF 算法取得了不错的准确率,但事实证明 KNN 算法在灵敏度和 F1 分数方面更为有效。这些结果强调了在化学分类任务中选择使用哪种深度学习模型或分类算法时需要考虑的因素。总之,本研究全面分析比较了深度学习模型和分类算法在化学分类任务中的表现。为特定任务选择最合适的模型和算法有助于在化合物分类和相关应用中取得更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
期刊最新文献
Integrating smart manufacturing techniques into undergraduate education: A case study with heat exchanger Semi-supervised regression based on Representation Learning for fermentation processes On speeding-up modifier-adaptation schemes for real-time optimization Machine learning-based input-augmented Koopman modeling and predictive control of nonlinear processes Resilience-based explainable reinforcement learning in chemical process safety
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1