Multimodal Fusion-Based Lightweight Model for Enhanced Generalization in Drug-Target Interaction Prediction.

IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Journal of Chemical Information and Modeling Pub Date : 2024-12-23 Epub Date: 2024-12-03 DOI:10.1021/acs.jcim.4c01397
Jonghyun Lee, Dokyoon Kim, Dae Won Jun, Yun Kim
{"title":"Multimodal Fusion-Based Lightweight Model for Enhanced Generalization in Drug-Target Interaction Prediction.","authors":"Jonghyun Lee, Dokyoon Kim, Dae Won Jun, Yun Kim","doi":"10.1021/acs.jcim.4c01397","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting drug-target interactions (DTIs) with precision is a crucial challenge in the quest for efficient and cost-effective drug discovery. Existing DTI prediction models often require significant computational resources because of the intricate and exceptionally lengthy protein target sequences. This study introduces MMF-DTI, a lightweight model that uses multimodal fusion, to improve the generalizability of DTI predictions without sacrificing computational efficiency. The MMF-DTI model combines four distinct modalities: molecular sequence, molecular properties, target sequence, and target function description. This approach is noteworthy because it is the first to use natural language-based target function descriptions in predicting DTIs. The effectiveness of MMF-DTI has been confirmed through benchmark data sets, demonstrating its comparable performance in terms of generalizability, especially in scenarios with limited information about the drug or target. Remarkably, MMF-DTI accomplishes this using only half of the parameters and 17% of the VRAM compared with previous state-of-the-art models. This allows it to function even in constrained computational environments. The combination of performance and efficiency highlights the potential of multimodal data fusion in improving the overall applicability of models, providing promising opportunities for future drug discovery endeavors.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"9215-9226"},"PeriodicalIF":5.3000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01397","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting drug-target interactions (DTIs) with precision is a crucial challenge in the quest for efficient and cost-effective drug discovery. Existing DTI prediction models often require significant computational resources because of the intricate and exceptionally lengthy protein target sequences. This study introduces MMF-DTI, a lightweight model that uses multimodal fusion, to improve the generalizability of DTI predictions without sacrificing computational efficiency. The MMF-DTI model combines four distinct modalities: molecular sequence, molecular properties, target sequence, and target function description. This approach is noteworthy because it is the first to use natural language-based target function descriptions in predicting DTIs. The effectiveness of MMF-DTI has been confirmed through benchmark data sets, demonstrating its comparable performance in terms of generalizability, especially in scenarios with limited information about the drug or target. Remarkably, MMF-DTI accomplishes this using only half of the parameters and 17% of the VRAM compared with previous state-of-the-art models. This allows it to function even in constrained computational environments. The combination of performance and efficiency highlights the potential of multimodal data fusion in improving the overall applicability of models, providing promising opportunities for future drug discovery endeavors.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多模态融合的药物-靶标相互作用预测增强泛化轻量级模型。
准确预测药物-靶标相互作用(DTIs)是寻求高效和具有成本效益的药物发现的关键挑战。现有的DTI预测模型往往需要大量的计算资源,因为复杂和异常长的蛋白质靶序列。本研究引入了MMF-DTI,一种使用多模态融合的轻量级模型,在不牺牲计算效率的情况下提高了DTI预测的泛化性。MMF-DTI模型结合了四种不同的模式:分子序列、分子特性、目标序列和目标函数描述。这种方法值得注意,因为它是第一个使用基于自然语言的目标函数描述来预测dti的方法。MMF-DTI的有效性已经通过基准数据集得到证实,证明了其在泛化方面的可比性,特别是在药物或靶标信息有限的情况下。值得注意的是,与之前的先进型号相比,MMF-DTI仅使用了一半的参数和17%的VRAM就实现了这一目标。这使得它即使在受限的计算环境中也能发挥作用。性能和效率的结合突出了多模态数据融合在提高模型整体适用性方面的潜力,为未来的药物发现工作提供了有希望的机会。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
期刊最新文献
Assessing Molecular Dynamics in Predicting Aptamer-Ligand Binding Thermodynamics: Insights from the OTA Binding Aptamers. Identification of Subtype-Selective Binding Sites in the Opioid Receptor Family. Wavelet-Enhanced Data-Driven Collective Variables for Efficient Sampling of Protein Folding Landscapes. EVODEX: A Cheminformatics Framework for Mechanistic Abstraction and Prediction of Enzymatic Reactions. CHAOS - A Large-scale Database for σ-Profiles and Other Molecular Descriptors
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1