MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge

IF 6.8 1区 医学 Q1 CHEMISTRY, MEDICINAL Journal of Medicinal Chemistry Pub Date : 2024-12-02 DOI:10.1021/acs.jmedchem.4c0219310.1021/acs.jmedchem.4c02193
Xixi Yang, Yanjing Duan, Zhixiang Cheng, Kun Li, Yuansheng Liu*, Xiangxiang Zeng and Dongsheng Cao*, 
{"title":"MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge","authors":"Xixi Yang,&nbsp;Yanjing Duan,&nbsp;Zhixiang Cheng,&nbsp;Kun Li,&nbsp;Yuansheng Liu*,&nbsp;Xiangxiang Zeng and Dongsheng Cao*,&nbsp;","doi":"10.1021/acs.jmedchem.4c0219310.1021/acs.jmedchem.4c02193","DOIUrl":null,"url":null,"abstract":"<p >Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.</p>","PeriodicalId":46,"journal":{"name":"Journal of Medicinal Chemistry","volume":"67 23","pages":"21303–21316 21303–21316"},"PeriodicalIF":6.8000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medicinal Chemistry","FirstCategoryId":"3","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jmedchem.4c02193","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Medicinal Chemistry
Journal of Medicinal Chemistry 医学-医药化学
CiteScore
4.00
自引率
11.00%
发文量
804
审稿时长
1.9 months
期刊介绍: The Journal of Medicinal Chemistry is a prestigious biweekly peer-reviewed publication that focuses on the multifaceted field of medicinal chemistry. Since its inception in 1959 as the Journal of Medicinal and Pharmaceutical Chemistry, it has evolved to become a cornerstone in the dissemination of research findings related to the design, synthesis, and development of therapeutic agents. The Journal of Medicinal Chemistry is recognized for its significant impact in the scientific community, as evidenced by its 2022 impact factor of 7.3. This metric reflects the journal's influence and the importance of its content in shaping the future of drug discovery and development. The journal serves as a vital resource for chemists, pharmacologists, and other researchers interested in the molecular mechanisms of drug action and the optimization of therapeutic compounds.
期刊最新文献
Minimalist Natural ORPphilin Macarangin B Delineates OSBP Biological Function MoA Studies of the TEAD P-Site Binding Ligand MSC-4106 and Its Optimization to TEAD1-Selective Amide M3686 Identification of Novel Organo-Se BTSA-Based Derivatives as Potent, Reversible, and Selective PPARγ Covalent Modulators for Antidiabetic Drug Discovery F-CPI: A Multimodal Deep Learning Approach for Predicting Compound Bioactivity Changes Induced by Fluorine Substitution Fluorinated Coumarin Derivatives as Selective PET Tracer for MAO-B Imaging
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1