MPCD:一种整合公共知识和领域知识的分子性质预测多任务图转换器

IF 6.8 1区 医学 Q1 CHEMISTRY, MEDICINAL Journal of Medicinal Chemistry Pub Date : 2024-12-02 DOI:10.1021/acs.jmedchem.4c02193
Xixi Yang, Yanjing Duan, Zhixiang Cheng, Kun Li, Yuansheng Liu, Xiangxiang Zeng, Dongsheng Cao
{"title":"MPCD:一种整合公共知识和领域知识的分子性质预测多任务图转换器","authors":"Xixi Yang, Yanjing Duan, Zhixiang Cheng, Kun Li, Yuansheng Liu, Xiangxiang Zeng, Dongsheng Cao","doi":"10.1021/acs.jmedchem.4c02193","DOIUrl":null,"url":null,"abstract":"Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.","PeriodicalId":46,"journal":{"name":"Journal of Medicinal Chemistry","volume":"27 1","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge\",\"authors\":\"Xixi Yang, Yanjing Duan, Zhixiang Cheng, Kun Li, Yuansheng Liu, Xiangxiang Zeng, Dongsheng Cao\",\"doi\":\"10.1021/acs.jmedchem.4c02193\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.\",\"PeriodicalId\":46,\"journal\":{\"name\":\"Journal of Medicinal Chemistry\",\"volume\":\"27 1\",\"pages\":\"\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2024-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Medicinal Chemistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jmedchem.4c02193\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medicinal Chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.jmedchem.4c02193","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

摘要

基于深度学习的分子性质预测通常采用自监督学习技术,通过掩模原子预测学习共性知识。然而,通过屏蔽原子预测获得的共同知识与下游任务的图级优化目标存在显著差异,从而导致次优问题。特别是对于数据有限的属性,不考虑领域知识会导致在一个巨大的公共空间中直接搜索,从而无法识别全局最优。为了解决这个问题,我们提出了MPCD,它通过将预训练和微调之间的优化目标与领域知识对齐来增强预训练的可转移性。MPCD还利用多任务学习来提高数据利用率和模型鲁棒性。从技术上讲,MPCD采用了一种关系感知的自关注机制来全面捕获分子的局部和全局结构。广泛的验证表明,MPCD在各种数据规模的吸收、分布、代谢、排泄和毒性(ADMET)和物理化学预测方面优于最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge
Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Medicinal Chemistry
Journal of Medicinal Chemistry 医学-医药化学
CiteScore
4.00
自引率
11.00%
发文量
804
审稿时长
1.9 months
期刊介绍: The Journal of Medicinal Chemistry is a prestigious biweekly peer-reviewed publication that focuses on the multifaceted field of medicinal chemistry. Since its inception in 1959 as the Journal of Medicinal and Pharmaceutical Chemistry, it has evolved to become a cornerstone in the dissemination of research findings related to the design, synthesis, and development of therapeutic agents. The Journal of Medicinal Chemistry is recognized for its significant impact in the scientific community, as evidenced by its 2022 impact factor of 7.3. This metric reflects the journal's influence and the importance of its content in shaping the future of drug discovery and development. The journal serves as a vital resource for chemists, pharmacologists, and other researchers interested in the molecular mechanisms of drug action and the optimization of therapeutic compounds.
期刊最新文献
DHODH Inhibitors Based on the Vidofludimus Scaffold Containing Carboxylic Acid Bioisosteres Exert a Superior Broad-Spectrum Antiviral Activity. A Hypoxia-Activated Photomolecular Glue with Synergistic Cyclin K Degradation and Phototherapy for Cancer Treatment. An Activator of Tailless' Repressor Function Stimulates Anti-Neurodegenerative Gene Expression. Linker-Driven Sampling of PROTAC-Induced Ternary Complexes. Lead Optimization of TgCDPK1 Inhibitors for the Treatment of Toxoplasmosis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1