3DSGIMD: An accurate and interpretable molecular property prediction method using 3D spatial graph focusing network and structure-based feature fusion

IF 6.2 2区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Future Generation Computer Systems-The International Journal of Escience Pub Date : 2024-07-08 DOI:10.1016/j.future.2024.07.004
{"title":"3DSGIMD: An accurate and interpretable molecular property prediction method using 3D spatial graph focusing network and structure-based feature fusion","authors":"","doi":"10.1016/j.future.2024.07.004","DOIUrl":null,"url":null,"abstract":"<div><p>A comprehensive representation of molecular structure is essential for establishing accurate and reliable molecular property prediction models. However, fully extracting and learning intrinsic molecular structure information, especially spatial structure features, remains a challenging task, leading that many molecular property prediction models still have no enough accuracy for the real application. In this study, we developed an innovative and interpretable deep learning method, termed 3DSGIMD, which predicted the molecular properties by integrating and learning the spatial structure and substructure information of molecules at multiple levels, and generated the focusing weights by aggregating spatial and adjacency information of molecules to improve understanding of prediction results. We evaluated the model on 10 public datasets and 14 cell-based phenotypic screening datasets. Extensive experimental results indicated that 3DSGIMD achieved superior or comparable predictive performance compared with some existing models, and the individually designed components contributed significantly to the advanced performance of the model. In addition, we also provided insight into the interpretability of our model via visualizing the focusing weights and perturbation analysis, and the results showed that 3DSGIMD can pinpoint crucial local structures and bits of molecular descriptors associated with the predicted properties. In summary, 3DSGIMD is a competitive molecular property prediction method that holds the potential to aid drug design and optimization.</p></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":null,"pages":null},"PeriodicalIF":6.2000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X24003662","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

A comprehensive representation of molecular structure is essential for establishing accurate and reliable molecular property prediction models. However, fully extracting and learning intrinsic molecular structure information, especially spatial structure features, remains a challenging task, leading that many molecular property prediction models still have no enough accuracy for the real application. In this study, we developed an innovative and interpretable deep learning method, termed 3DSGIMD, which predicted the molecular properties by integrating and learning the spatial structure and substructure information of molecules at multiple levels, and generated the focusing weights by aggregating spatial and adjacency information of molecules to improve understanding of prediction results. We evaluated the model on 10 public datasets and 14 cell-based phenotypic screening datasets. Extensive experimental results indicated that 3DSGIMD achieved superior or comparable predictive performance compared with some existing models, and the individually designed components contributed significantly to the advanced performance of the model. In addition, we also provided insight into the interpretability of our model via visualizing the focusing weights and perturbation analysis, and the results showed that 3DSGIMD can pinpoint crucial local structures and bits of molecular descriptors associated with the predicted properties. In summary, 3DSGIMD is a competitive molecular property prediction method that holds the potential to aid drug design and optimization.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
3DSGIMD:利用三维空间图聚焦网络和基于结构的特征融合的精确且可解释的分子特性预测方法
分子结构的全面表征对于建立准确可靠的分子特性预测模型至关重要。然而,充分提取和学习分子结构的内在信息,尤其是空间结构特征,仍然是一项具有挑战性的任务,导致许多分子性质预测模型的准确性仍然不足以满足实际应用的需要。在本研究中,我们开发了一种创新的、可解释的深度学习方法,称为 3DSGIMD,该方法通过多层次整合和学习分子的空间结构和亚结构信息来预测分子性质,并通过聚合分子的空间信息和邻接信息来生成聚焦权重,以提高对预测结果的理解。我们在 10 个公共数据集和 14 个基于细胞的表型筛选数据集上对该模型进行了评估。广泛的实验结果表明,与现有的一些模型相比,3DSGIMD 的预测性能更胜一筹,甚至不相上下。此外,我们还通过可视化聚焦权重和扰动分析深入了解了模型的可解释性,结果表明 3DSGIMD 能够精确定位与预测性质相关的关键局部结构和分子描述符位点。总之,3DSGIMD 是一种有竞争力的分子性质预测方法,有望帮助药物设计和优化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
19.90
自引率
2.70%
发文量
376
审稿时长
10.6 months
期刊介绍: Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.
期刊最新文献
Analyzing inference workloads for spatiotemporal modeling An efficient federated learning solution for the artificial intelligence of things Generative adversarial networks to detect intrusion and anomaly in IP flow-based networks Blockchain-based conditional privacy-preserving authentication scheme using PUF for vehicular ad hoc networks UAV-IRS-assisted energy harvesting for edge computing based on deep reinforcement learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1