LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening.

IF 4.7 | Zone 3, Medicine | Q1, MEDICAL INFORMATICS | Health Information Science and Systems | Pub Date: 2023-09-02 | eCollection Date: 2023-12-01 | DOI: 10.1007/s13755-023-00243-w
Yang Wang, Zuxian Zhang, Chenghong Piao, Ying Huang, Yihan Zhang, Chi Zhang, Yu-Jing Lu, Dongning Liu
Health Information Science and Systems, volume 11, issue 1, article 42. Publication type: Journal Article. Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475000/pdf/
Citations: 1

Abstract

Background: Drug-target interaction (DTI) analysis is a vital drug design strategy, and DTIs play a significant role in many processes underlying complex diseases and cellular events. Given the volume of protein data and the cost of experiments, bioinformatics approaches are suggested as a way to uncover potential interactions and design new targeted medications. Differing data and interaction types complicate such studies because their formats are incompatible and heterogeneous. Analyzing drug-target interactions in a comprehensive, unified model therefore remains a significant challenge.

Method: Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, the Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN). It uses a unified encoding so that different data formats can be processed in a single integrated model, which performs feature abstraction and predicts potential interactions.
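The abstract does not say how the unified encoding is built, so the following is only a minimal sketch of one way two input formats could share a single integer representation. It assumes drugs are given as SMILES strings and proteins as amino-acid sequences. The function names, the modality tags, and the fixed lengths are all hypothetical and are not taken from the paper.

```python
# Hypothetical sketch: NOT the encoding used by LDS-CNN, which the abstract
# does not describe. Characters from both formats share one index space.

PAD = 0  # index reserved for padding


def build_vocab(smiles_list, protein_list):
    """Index every (modality, character) pair, starting at 1 (0 is padding)."""
    tokens = {("drug", ch) for s in smiles_list for ch in s}
    tokens |= {("prot", ch) for p in protein_list for ch in p}
    return {tok: i + 1 for i, tok in enumerate(sorted(tokens))}


def encode(seq, kind, vocab, max_len):
    """Map a string to a fixed-length list of indices (truncate, then pad)."""
    ids = [vocab[(kind, ch)] for ch in seq[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


def encode_pair(smiles, protein, vocab, drug_len=16, prot_len=24):
    """Concatenate the encoded drug and protein into one input vector."""
    drug_ids = encode(smiles, "drug", vocab, drug_len)
    prot_ids = encode(protein, "prot", vocab, prot_len)
    return drug_ids + prot_ids


# Toy inputs for illustration only: aspirin SMILES and an arbitrary peptide.
smiles = "CC(=O)OC1=CC=CC=C1C(=O)O"
protein = "MKTAYIAKQR"
vocab = build_vocab([smiles], [protein])
x = encode_pair(smiles, protein, vocab)
```

Tagging each character with its modality keeps a SMILES carbon `C` distinct from a cysteine `C`, so a downstream embedding or convolution layer would not conflate the two. The fixed-length vector `x` is the kind of input a 1D CNN could consume.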

Result: On 898,412 interaction records involving 1,683 small-molecule compounds and 14,350 human proteins, drawn from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. These results show that the method attains high accuracy on the test set, indicating strong predictive ability for drug-target interactions. LDS-CNN is effective on large-scale datasets and on datasets that combine data in different formats.
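For readers unfamiliar with the three metrics reported above, the sketch below shows one common way to compute them from binary labels and predicted scores. It assumes AUC means the area under the ROC curve, estimates AUPRC as average precision, and uses a 0.5 decision threshold for accuracy. The abstract confirms none of these choices, and the labels and scores are made-up toy values, not data from the paper.

```python
# Illustrative metric definitions on toy data; not the paper's evaluation code.

def roc_auc(labels, scores):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """AUPRC estimate: mean of the precision values at each positive in ranked order."""
    ranked = sorted(zip(scores, labels), key=lambda t: -t[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


def accuracy(labels, scores, threshold=0.5):
    """Fraction of examples whose thresholded score matches the label."""
    preds = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


# Toy example: 1 = interacting pair, 0 = non-interacting pair.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

auc = roc_auc(labels, scores)
auprc = average_precision(labels, scores)
acc = accuracy(labels, scores)
```

AUC and AUPRC are threshold-free ranking measures, while accuracy depends on the chosen cutoff. This is why a paper may report all three side by side.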

Conclusion: In this study, we propose a DTI prediction method that addresses the unified encoding of large-scale data in multiple formats. It offers a feasible way to efficiently abstract features across different types of drug-related data, thereby reducing experimental cost and time. The method can be used to identify potential drug targets and drug candidates for the treatment of complex diseases. This work serves as a reference for applying deep learning to large-scale, multi-format DTI data and offers suggestions for future research.

Source journal metrics: CiteScore 11.30; self-citation rate 5.00%; articles published: 30
Journal description: Health Information Science and Systems is a multidisciplinary journal that integrates artificial intelligence, computer science, and information technology with health science and services. It embraces information science research coupled with topics related to the modeling, design, development, integration, and management of health information systems, smart health, artificial intelligence in medicine, computer-aided diagnosis, and medical expert systems. The scope includes: (i) smart health, artificial intelligence in medicine, computer-aided diagnosis, medical image processing, and medical expert systems; (ii) medical big data and medical, health, and biomedical information resources such as patient medical records, devices and equipment, and software and tools to capture, store, retrieve, process, analyze, and optimize the use of information in the health domain; (iii) data management, data mining, and knowledge discovery, all of which play a key role in decision making, management of public health, examination of standards, and privacy and security issues; (iv) development of new architectures and applications for health information systems.