LDS-CN:一个基于大规模药物筛选的药物-靶标相互作用预测的深度学习框架。

IF 4.7 3区医学 Q1 MEDICAL INFORMATICS Health Information Science and Systems Pub Date : 2023-09-02 eCollection Date: 2023-12-01 DOI:10.1007/s13755-023-00243-w

Yang Wang, Zuxian Zhang, Chenghong Piao, Ying Huang, Yihan Zhang, Chi Zhang, Yu-Jing Lu, Dongning Liu

{"title":"LDS-CN:一个基于大规模药物筛选的药物-靶标相互作用预测的深度学习框架。","authors":"Yang Wang, Zuxian Zhang, Chenghong Piao, Ying Huang, Yihan Zhang, Chi Zhang, Yu-Jing Lu, Dongning Liu","doi":"10.1007/s13755-023-00243-w","DOIUrl":null,"url":null,"abstract":"Background: Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.Method: Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.Result: On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.Conclusion: In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"42"},"PeriodicalIF":4.7000,"publicationDate":"2023-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475000/pdf/","citationCount":"0","resultStr":"{\"title\":\"LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening.\",\"authors\":\"Yang Wang, Zuxian Zhang, Chenghong Piao, Ying Huang, Yihan Zhang, Chi Zhang, Yu-Jing Lu, Dongning Liu\",\"doi\":\"10.1007/s13755-023-00243-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.Method: Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.Result: On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.Conclusion: In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.\",\"PeriodicalId\":46312,\"journal\":{\"name\":\"Health Information Science and Systems\",\"volume\":\"11 1\",\"pages\":\"42\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2023-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475000/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Information Science and Systems\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13755-023-00243-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/12/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Information Science and Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13755-023-00243-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

摘要

背景：药物-靶标相互作用（DTI）是一种重要的药物设计策略，在复杂疾病和细胞事件的许多过程中发挥着重要作用。面对广泛的蛋白质数据和实验成本等挑战，建议应用生物信息学方法来利用潜在的相互作用来设计新的靶向药物。不同的数据和交互类型给涉及不兼容和异质格式的研究带来了困难。在一个全面统一的模型中分析药物-靶标相互作用是一个重大挑战。方法：在这里，我们提出了一种预测小分子药物和蛋白质靶标之间相互作用的通用方法，即大规模药物靶标筛选卷积神经网络（LDS-CNN），该网络使用统一编码来实现对集成模型中不同数据格式的计算，以实现特征提取和潜在靶标预测。结果：在88亿条记录中涉及1683个小分子化合物和14350个人类蛋白质的898142个相互作用数据上，该方法的曲线下面积（AUC）为0.96，精密度-召回曲线下面积为0.95，准确度为90.13%。实验结果表明，该方法在测试集上具有较高的准确度，表明其在药物-靶标相互作用预测方面具有较高的预测能力。LDS-NN对于大规模数据集和由不同格式的数据组成的数据集的预测是有效的。结论：在本研究中，我们提出了一种DTI预测方法来解决多格式大规模数据的统一编码问题。它提供了一种有效提取不同类型药物相关数据特征的可行方法，从而降低了实验成本和时间消耗。所提出的方法可用于确定治疗复杂疾病的潜在药物靶点和候选药物。这项工作为DTI用深度学习方法处理大规模数据和不同格式的数据提供了参考，并为未来的研究提供了一定的建议。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening.

Background: Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.

Method: Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.

Result: On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.

Conclusion: In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Health Information Science and Systems MEDICAL INFORMATICS-

CiteScore

11.30

自引率

5.00%

发文量

期刊介绍： Health Information Science and Systems is a multidisciplinary journal that integrates artificial intelligence/computer science/information technology with health science and services, embracing information science research coupled with topics related to the modeling, design, development, integration and management of health information systems, smart health, artificial intelligence in medicine, and computer aided diagnosis, medical expert systems. The scope includes: i.) smart health, artificial Intelligence in medicine, computer aided diagnosis, medical image processing, medical expert systems ii.) medical big data, medical/health/biomedicine information resources such as patient medical records, devices and equipments, software and tools to capture, store, retrieve, process, analyze, optimize the use of information in the health domain, iii.) data management, data mining, and knowledge discovery, all of which play a key role in decision making, management of public health, examination of standards, privacy and security issues, iv.) development of new architectures and applications for health information systems.