Hierarchical Graph Attention Network with Positive and Negative Attentions for Improved Interpretability: ISA-PN.

IF 5.3; Zone 2 (Chemistry); JCR Q1 (Chemistry, Medicinal); Journal of Chemical Information and Modeling; Pub Date: 2025-02-10; Epub Date: 2024-12-09; DOI: 10.1021/acs.jcim.4c01035
Jinyong Park, Minhi Han, Kiwoong Lee, Sungnam Park
Pages: 1115-1127. Publication type: Journal Article.

Abstract

With the advancement of deep learning (DL) methods in chemistry and materials science, the interpretability of DL models has become a critical issue in elucidating quantitative (molecular) structure-property relationships. Although attention mechanisms have been generally employed to explain the importance of molecular substructures that contribute to molecular properties, their interpretability remains limited. In this work, we introduce a versatile segmentation method and develop an interpretable subgraph attention (ISA) network with positive and negative streams (ISA-PN) to enhance the understanding of molecular structure-property relationships. The predictive performance of the ISA models was validated using data sets for aqueous solubility, lipophilicity, and melting temperature, with a particular focus on evaluating interpretability for the aqueous solubility data set. The ISA-PN model enables the quantification of the contributions of molecular substructures through positive and negative attention scores. Comparative analyses of the ISA, ISA-PN, and GC-Net (group contribution network) models demonstrate that the ISA-PN model significantly improves interpretability while maintaining similar accuracy levels. This study highlights the efficacy of the ISA-PN model in providing meaningful insights into the contributions of molecular substructures to molecular properties, thereby enhancing the interpretability of DL models in chemical applications.
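
The abstract does not spell out the ISA-PN architecture, so the following is only a minimal sketch of the general idea of scoring substructures with separate positive and negative attention streams. Everything in it is an assumption made for illustration: the function names, the softmax attention over subgraph embeddings, the softplus value heads, and the rule that a substructure's signed contribution is its positive-stream term minus its negative-stream term. It is not the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def softplus(x):
    """Stable softplus; keeps each stream's magnitude non-negative."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def signed_contributions(embeddings, w_pos, w_neg, v_pos, v_neg):
    """Hypothetical two-stream readout over subgraph embeddings.

    w_pos / w_neg score each subgraph for the positive / negative stream;
    v_pos / v_neg map each subgraph to a non-negative magnitude.
    Returns both attention vectors and one signed contribution per subgraph.
    """
    a_pos = softmax([dot(h, w_pos) for h in embeddings])
    a_neg = softmax([dot(h, w_neg) for h in embeddings])
    contribs = []
    for h, ap, an in zip(embeddings, a_pos, a_neg):
        pos_term = ap * softplus(dot(h, v_pos))  # pushes the property up
        neg_term = an * softplus(dot(h, v_neg))  # pushes the property down
        contribs.append(pos_term - neg_term)
    return a_pos, a_neg, contribs


def predict(embeddings, w_pos, w_neg, v_pos, v_neg, bias=0.0):
    """Predicted property = bias + sum of signed subgraph contributions."""
    _, _, contribs = signed_contributions(embeddings, w_pos, w_neg, v_pos, v_neg)
    return bias + sum(contribs)


if __name__ == "__main__":
    # Three toy subgraph embeddings (made-up numbers, not real molecular data).
    emb = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
    a_pos, a_neg, contribs = signed_contributions(
        emb, [1.0, -1.0], [-1.0, 1.0], [0.5, 0.0], [0.0, 0.5]
    )
    for i, c in enumerate(contribs):
        print(f"subgraph {i}: a_pos={a_pos[i]:.3f} a_neg={a_neg[i]:.3f} contribution={c:+.3f}")
```

In this toy formulation the prediction decomposes exactly into per-substructure terms, so the sign of each term can be read directly as raising or lowering the predicted property. That additive readout is a design choice of the sketch, not something stated in the abstract.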

Source journal
CiteScore: 9.80
Self-citation rate: 10.70%
Articles published: 529
Review time: 1.4 months
About the journal: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.