Applying local interpretable model-agnostic explanations to identify substructures that are responsible for mutagenicity of chemical compounds

IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Molecular Systems Design & Engineering Pub Date : 2024-06-05 DOI:10.1039/d4me00038b
Lucca Caiaffa Santos Rosa, Andre Silva Pimentel
{"title":"Applying local interpretable model-agnostic explanations to identify substructures that are responsible for mutagenicity of chemical compounds","authors":"Lucca Caiaffa Santos Rosa, Andre Silva Pimentel","doi":"10.1039/d4me00038b","DOIUrl":null,"url":null,"abstract":"The local interpretable model-agnostic explanations method was applied to identify substructures that represent the mutagenicity of chemical compounds using machine learning models. Random forest and extremely randomized trees were used to build models to be explained using the Hansen and Bursi Ames mutagenicity datasets. The models were analyzed using precision, recall, F1, and accuracy metrics. The aim of this study is to address the challenge of identifying substructures that indicate the mutagenicity of chemical compounds. The goal is to provide stable and consistent explanations for the mutagenicity of chemical compounds, which is crucial for trust and acceptance of the findings, especially in the sensitive field of computational toxicology. This approach is significant as it contributes to the interpretability and explainability of machine learning models, particularly in the context of identifying substructures associated with mutagenicity, thereby advancing the field of computational toxicology. Identifying substructures that represent the mutagenicity of chemical compounds is important because it can help predict the potential toxicity of new chemical compounds. This is particularly relevant in fields such as drug development and environmental toxicology, where the potential risks of exposure to new compounds need to be carefully evaluated. Some examples of chemical compounds that have been identified as mutagenic include epoxides, <em>N</em>-aryl compounds, nitro compounds, aromatic amines, <em>N</em>-oxides, nitro-containing compounds, and polycyclic aromatic hydrocarbons with a bay-region. These examples demonstrate the importance of identifying and studying mutagenic chemical compounds to better understand their potential risks and adverse effects on human health and the environment.","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Systems Design & Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1039/d4me00038b","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The local interpretable model-agnostic explanations method was applied to identify substructures that represent the mutagenicity of chemical compounds using machine learning models. Random forest and extremely randomized trees were used to build models to be explained using the Hansen and Bursi Ames mutagenicity datasets. The models were analyzed using precision, recall, F1, and accuracy metrics. The aim of this study is to address the challenge of identifying substructures that indicate the mutagenicity of chemical compounds. The goal is to provide stable and consistent explanations for the mutagenicity of chemical compounds, which is crucial for trust and acceptance of the findings, especially in the sensitive field of computational toxicology. This approach is significant as it contributes to the interpretability and explainability of machine learning models, particularly in the context of identifying substructures associated with mutagenicity, thereby advancing the field of computational toxicology. Identifying substructures that represent the mutagenicity of chemical compounds is important because it can help predict the potential toxicity of new chemical compounds. This is particularly relevant in fields such as drug development and environmental toxicology, where the potential risks of exposure to new compounds need to be carefully evaluated. Some examples of chemical compounds that have been identified as mutagenic include epoxides, N-aryl compounds, nitro compounds, aromatic amines, N-oxides, nitro-containing compounds, and polycyclic aromatic hydrocarbons with a bay-region. These examples demonstrate the importance of identifying and studying mutagenic chemical compounds to better understand their potential risks and adverse effects on human health and the environment.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
应用局部可解释的模型--不可知论解释,确定导致化合物致突变性的亚结构
利用机器学习模型,采用局部可解释模型-不可知论解释方法来识别代表化合物致突变性的子结构。使用随机森林和极端随机树建立模型,并利用汉森和布尔西-艾姆斯诱变数据集进行解释。使用精确度、召回率、F1 和准确度指标对模型进行了分析。本研究的目的是应对识别表明化合物致突变性的亚结构这一挑战。其目的是为化合物的致突变性提供稳定一致的解释,这对研究结果的信任度和接受度至关重要,尤其是在敏感的计算毒理学领域。这种方法意义重大,因为它有助于提高机器学习模型的可解释性和可解释性,特别是在识别与致突变性相关的子结构方面,从而推动计算毒理学领域的发展。识别代表化合物致突变性的子结构非常重要,因为这有助于预测新化合物的潜在毒性。这与药物开发和环境毒理学等领域尤其相关,因为这些领域需要仔细评估接触新化合物的潜在风险。已确定为诱变化合物的一些例子包括环氧化物、N-芳基化合物、硝基化合物、芳香胺、N-氧化物、含硝基化合物以及带有畦区的多环芳烃。这些例子说明了识别和研究诱变化合物的重要性,以便更好地了解它们对人类健康和环境的潜在风险和不利影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Molecular Systems Design & Engineering
Molecular Systems Design & Engineering Engineering-Biomedical Engineering
CiteScore
6.40
自引率
2.80%
发文量
144
期刊介绍: Molecular Systems Design & Engineering provides a hub for cutting-edge research into how understanding of molecular properties, behaviour and interactions can be used to design and assemble better materials, systems, and processes to achieve specific functions. These may have applications of technological significance and help address global challenges.
期刊最新文献
Back cover Applying local interpretable model-agnostic explanations to identify substructures that are responsible for mutagenicity of chemical compounds Back cover GREEN SYNTHESIS OF THERMO/PHOTOCHROMIC DOPED CELLULOSE POLYMER: A BIOCOMPATIBLE FILM FOR POTENTIAL APPLICATION IN COLD CHAIN VISUAL TRACKING A Zn(ii) pillared-layer ultramicroporous metal–organic framework with matching molecular pockets for C2H2/CO2 separation†
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1