Applying local interpretable model-agnostic explanations to identify substructures that are responsible for mutagenicity of chemical compounds†

IF 3.2 3区工程技术 Q2 CHEMISTRY, PHYSICAL Molecular Systems Design & Engineering Pub Date : 2024-06-05 DOI:10.1039/D4ME00038B

Lucca Caiaffa Santos Rosa and Andre Silva Pimentel

{"title":"Applying local interpretable model-agnostic explanations to identify substructures that are responsible for mutagenicity of chemical compounds†","authors":"Lucca Caiaffa Santos Rosa and Andre Silva Pimentel","doi":"10.1039/D4ME00038B","DOIUrl":null,"url":null,"abstract":"The local interpretable model-agnostic explanations method was applied to identify substructures that represent the mutagenicity of chemical compounds using machine learning models. Random forest and extremely randomized trees were used to build models to be explained using the Hansen and Bursi Ames mutagenicity datasets. The models were analyzed using precision, recall, F1, and accuracy metrics. The aim of this study is to address the challenge of identifying substructures that indicate the mutagenicity of chemical compounds. The goal is to provide stable and consistent explanations for the mutagenicity of chemical compounds, which is crucial for trust and acceptance of the findings, especially in the sensitive field of computational toxicology. This approach is significant as it contributes to the interpretability and explainability of machine learning models, particularly in the context of identifying substructures associated with mutagenicity, thereby advancing the field of computational toxicology. Identifying substructures that represent the mutagenicity of chemical compounds is important because it can help predict the potential toxicity of new chemical compounds. This is particularly relevant in fields such as drug development and environmental toxicology, where the potential risks of exposure to new compounds need to be carefully evaluated. Some examples of chemical compounds that have been identified as mutagenic include epoxides, N-aryl compounds, nitro compounds, aromatic amines, N-oxides, nitro-containing compounds, and polycyclic aromatic hydrocarbons with a bay-region. These examples demonstrate the importance of identifying and studying mutagenic chemical compounds to better understand their potential risks and adverse effects on human health and the environment.","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 9","pages":" 920-936"},"PeriodicalIF":3.2000,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Systems Design & Engineering","FirstCategoryId":"5","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2024/me/d4me00038b","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}

引用次数: 0

Abstract

The local interpretable model-agnostic explanations method was applied to identify substructures that represent the mutagenicity of chemical compounds using machine learning models. Random forest and extremely randomized trees were used to build models to be explained using the Hansen and Bursi Ames mutagenicity datasets. The models were analyzed using precision, recall, F1, and accuracy metrics. The aim of this study is to address the challenge of identifying substructures that indicate the mutagenicity of chemical compounds. The goal is to provide stable and consistent explanations for the mutagenicity of chemical compounds, which is crucial for trust and acceptance of the findings, especially in the sensitive field of computational toxicology. This approach is significant as it contributes to the interpretability and explainability of machine learning models, particularly in the context of identifying substructures associated with mutagenicity, thereby advancing the field of computational toxicology. Identifying substructures that represent the mutagenicity of chemical compounds is important because it can help predict the potential toxicity of new chemical compounds. This is particularly relevant in fields such as drug development and environmental toxicology, where the potential risks of exposure to new compounds need to be carefully evaluated. Some examples of chemical compounds that have been identified as mutagenic include epoxides, N-aryl compounds, nitro compounds, aromatic amines, N-oxides, nitro-containing compounds, and polycyclic aromatic hydrocarbons with a bay-region. These examples demonstrate the importance of identifying and studying mutagenic chemical compounds to better understand their potential risks and adverse effects on human health and the environment.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

应用局部可解释的模型--不可知论解释，确定导致化合物致突变性的亚结构

利用机器学习模型，采用局部可解释模型-不可知论解释方法来识别代表化合物致突变性的子结构。使用随机森林和极端随机树建立模型，并利用汉森和布尔西-艾姆斯诱变数据集进行解释。使用精确度、召回率、F1 和准确度指标对模型进行了分析。本研究的目的是应对识别表明化合物致突变性的亚结构这一挑战。其目的是为化合物的致突变性提供稳定一致的解释，这对研究结果的信任度和接受度至关重要，尤其是在敏感的计算毒理学领域。这种方法意义重大，因为它有助于提高机器学习模型的可解释性和可解释性，特别是在识别与致突变性相关的子结构方面，从而推动计算毒理学领域的发展。识别代表化合物致突变性的子结构非常重要，因为这有助于预测新化合物的潜在毒性。这与药物开发和环境毒理学等领域尤其相关，因为这些领域需要仔细评估接触新化合物的潜在风险。已确定为诱变化合物的一些例子包括环氧化物、N-芳基化合物、硝基化合物、芳香胺、N-氧化物、含硝基化合物以及带有畦区的多环芳烃。这些例子说明了识别和研究诱变化合物的重要性，以便更好地了解它们对人类健康和环境的潜在风险和不利影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Molecular Systems Design & Engineering Engineering-Biomedical Engineering

CiteScore

6.40

自引率

2.80%

发文量

144

期刊介绍： Molecular Systems Design & Engineering provides a hub for cutting-edge research into how understanding of molecular properties, behaviour and interactions can be used to design and assemble better materials, systems, and processes to achieve specific functions. These may have applications of technological significance and help address global challenges.

期刊最新文献

Back cover Back cover Dual responsive fluorescence switching of organohydrogel towards base/acid† Back cover Graph-based networks for accurate prediction of ground and excited state molecular properties from minimal features†