Autoencoder-based drug-virus association prediction with reliable negative sample selection: A case study with COVID-19

IF 3.3 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Biophysical chemistry Pub Date : 2025-03-10 DOI:10.1016/j.bpc.2025.107434
A.S. Aruna , K.R. Remesh Babu , K. Deepthi
{"title":"Autoencoder-based drug-virus association prediction with reliable negative sample selection: A case study with COVID-19","authors":"A.S. Aruna ,&nbsp;K.R. Remesh Babu ,&nbsp;K. Deepthi","doi":"10.1016/j.bpc.2025.107434","DOIUrl":null,"url":null,"abstract":"<div><div>Emergence of viruses cause unprecedented challenges and thus leading to wide-ranging consequences today. The world has faced massive disruptions like COVID-19 and continues to suffer in terms of public health and world economy. Fighting with this emergence of viruses and its reemergence plays a critical role in the health care industry. Identification of novel virus-drug associations is a vital step in drug discovery. Prediction and prioritization of novel virus-drug associations through computational approaches is an alternative and best choice considering the cost and risk of biological experiments. This study proposes a method, KR-AEVDA that relies on k-nearest neighbor based reliable negative sample selection and autoencoder based feature extraction to explore promising virus-drug associations for further experimental validation. The method analyzes complex relationships among drugs and viruses by investigating similarity and association data between drugs and viruses. It generates feature vectors from the similarity data, and reliable negative samples are extracted through an effective distance-based algorithm from the unlabeled samples in the dataset. Then high level features are extracted via an autoencoder and is fed to an ensemble classifier for inferring novel associations. Experimental results on three different datasets showed that KR-AEVDA reliably attained better performance than other state-of-the-art methods. Molecular docking is carried out between the top-predicted drugs and the crystal structure of the SARS-CoV-2's main protease to further validate the predictions. Case studies for SARS-CoV-2 illustrate the effectiveness of KR-AEVDA in identifying potential virus-drug associations.</div></div>","PeriodicalId":8979,"journal":{"name":"Biophysical chemistry","volume":"322 ","pages":"Article 107434"},"PeriodicalIF":3.3000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biophysical chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0301462225000468","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Emergence of viruses cause unprecedented challenges and thus leading to wide-ranging consequences today. The world has faced massive disruptions like COVID-19 and continues to suffer in terms of public health and world economy. Fighting with this emergence of viruses and its reemergence plays a critical role in the health care industry. Identification of novel virus-drug associations is a vital step in drug discovery. Prediction and prioritization of novel virus-drug associations through computational approaches is an alternative and best choice considering the cost and risk of biological experiments. This study proposes a method, KR-AEVDA that relies on k-nearest neighbor based reliable negative sample selection and autoencoder based feature extraction to explore promising virus-drug associations for further experimental validation. The method analyzes complex relationships among drugs and viruses by investigating similarity and association data between drugs and viruses. It generates feature vectors from the similarity data, and reliable negative samples are extracted through an effective distance-based algorithm from the unlabeled samples in the dataset. Then high level features are extracted via an autoencoder and is fed to an ensemble classifier for inferring novel associations. Experimental results on three different datasets showed that KR-AEVDA reliably attained better performance than other state-of-the-art methods. Molecular docking is carried out between the top-predicted drugs and the crystal structure of the SARS-CoV-2's main protease to further validate the predictions. Case studies for SARS-CoV-2 illustrate the effectiveness of KR-AEVDA in identifying potential virus-drug associations.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
病毒的出现带来了前所未有的挑战,从而导致了今天广泛的后果。世界已经面临了像 COVID-19 这样的大规模破坏,在公共卫生和世界经济方面继续遭受损失。应对这种病毒的出现和再次出现在医疗保健行业中起着至关重要的作用。鉴定新型病毒与药物的关联是药物发现的重要一步。考虑到生物实验的成本和风险,通过计算方法预测新型病毒与药物的关联并确定其优先次序是一种可供选择的最佳方法。本研究提出了一种名为 KR-AEVDA 的方法,它依赖于基于 k 近邻的可靠负样本选择和基于自动编码器的特征提取来探索有前景的病毒-药物关联,以便进一步进行实验验证。该方法通过研究药物和病毒之间的相似性和关联数据,分析药物和病毒之间的复杂关系。它从相似性数据中生成特征向量,并通过有效的基于距离的算法从数据集中的未标记样本中提取可靠的阴性样本。然后通过自动编码器提取高级特征,并将其输入集合分类器以推断新的关联。在三个不同数据集上的实验结果表明,KR-AEVDA 能可靠地获得比其他先进方法更好的性能。为了进一步验证预测结果,KR-AEVDA 将预测结果最好的药物与 SARS-CoV-2 主要蛋白酶的晶体结构进行了分子对接。针对 SARS-CoV-2 的案例研究说明了 KR-AEVDA 在识别潜在病毒-药物关联方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biophysical chemistry
Biophysical chemistry 生物-生化与分子生物学
CiteScore
6.10
自引率
10.50%
发文量
121
审稿时长
20 days
期刊介绍: Biophysical Chemistry publishes original work and reviews in the areas of chemistry and physics directly impacting biological phenomena. Quantitative analysis of the properties of biological macromolecules, biologically active molecules, macromolecular assemblies and cell components in terms of kinetics, thermodynamics, spatio-temporal organization, NMR and X-ray structural biology, as well as single-molecule detection represent a major focus of the journal. Theoretical and computational treatments of biomacromolecular systems, macromolecular interactions, regulatory control and systems biology are also of interest to the journal.
期刊最新文献
Characterization of metabolism associated with outcomes in severe acute pancreatitis: Insights from serum metabolomic analysis Interaction of native and aggregated albumin with DMPC bilayers Decoding the interaction of an imidazo-pyrimidine derivative with serum proteins: Spectroscopic, computational and structure-activity relationship analysis Equilibrium solubility, solvent effect, solvation and thermodynamic modeling of 1, 3-dinitropyrazole in solutions of methyl alcohol /ethyl alcohol + water Autoencoder-based drug-virus association prediction with reliable negative sample selection: A case study with COVID-19
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1