{"title":"GFLearn: Generalized Feature Learning for Drug-Target Binding Affinity Prediction.","authors":"Zibo Huang, Xinrui Weng, Le Ou-Yang","doi":"10.1109/JBHI.2025.3538497","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting drug-target binding affinity is critical for drug discovery, as it helps identify promising drug candidates and predict their effectiveness. Recent advancements in deep learning have made significant progress in tackling this task. However, existing methods heavily rely on training data, and their performance is often limited when predicting binding affinities for new drugs and targets. To address this challenge, we propose a novel Generalized Feature Learning (GFLearn) model for drug-target binding affinity prediction. By integrating Graph Neural Networks (GNNs) with a self-supervised invariant feature learning module, our GFLearn model can extract robust and highly generalizable features from both drugs and targets, significantly enhancing prediction performance. This innovation enables the model to effectively predict binding affinities for previously unseen drugs or targets, while also mitigates the common issue of prediction performance degrading due to shifts in data distribution. Extensive experiments were conducted on two diverse datasets across three challenging scenarios: new drugs, new targets, and combinations of both. Comparisons with state-of-the-art methods demonstrated that our GFLearn model consistently outperformed others, showcasing its robustness across various prediction tasks. Additionally, cross-dataset evaluations and noise perturbation experiments further validated the model's generalizability across different data distributions. Case studies on two drug-target pairs, Canertinib-PIK3C2G and MLN8054-FLT1, provided further evidence of GFLearn's ability to make accurate binding affinity predictions, offering valuable insights for drug screening and repurposing efforts.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3538497","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
GFLearn: Generalized Feature Learning for Drug-Target Binding Affinity Prediction.
Predicting drug-target binding affinity is critical for drug discovery, as it helps identify promising drug candidates and predict their effectiveness. Recent advancements in deep learning have made significant progress in tackling this task. However, existing methods heavily rely on training data, and their performance is often limited when predicting binding affinities for new drugs and targets. To address this challenge, we propose a novel Generalized Feature Learning (GFLearn) model for drug-target binding affinity prediction. By integrating Graph Neural Networks (GNNs) with a self-supervised invariant feature learning module, our GFLearn model can extract robust and highly generalizable features from both drugs and targets, significantly enhancing prediction performance. This innovation enables the model to effectively predict binding affinities for previously unseen drugs or targets, while also mitigates the common issue of prediction performance degrading due to shifts in data distribution. Extensive experiments were conducted on two diverse datasets across three challenging scenarios: new drugs, new targets, and combinations of both. Comparisons with state-of-the-art methods demonstrated that our GFLearn model consistently outperformed others, showcasing its robustness across various prediction tasks. Additionally, cross-dataset evaluations and noise perturbation experiments further validated the model's generalizability across different data distributions. Case studies on two drug-target pairs, Canertinib-PIK3C2G and MLN8054-FLT1, provided further evidence of GFLearn's ability to make accurate binding affinity predictions, offering valuable insights for drug screening and repurposing efforts.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.