Kaiyi Xu , Minhui Wang , Xin Zou , Jingjing Liu , Ao Wei , Jiajia Chen , Chang Tang
{"title":"HSTrans: Homogeneous substructures transformer for predicting frequencies of drug-side effects","authors":"Kaiyi Xu , Minhui Wang , Xin Zou , Jingjing Liu , Ao Wei , Jiajia Chen , Chang Tang","doi":"10.1016/j.neunet.2024.106779","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying the frequencies of drug-side effects is crucial for assessing drug risk-benefit. However, accurately determining these frequencies remains challenging due to the limitations of time and scale in clinical randomized controlled trials. As a result, several computational methods have been proposed to address these issues. Nonetheless, two primary problems still persist. Firstly, most of these methods face challenges in generating accurate predictions for novel drugs, as they heavily depend on the interaction graph between drugs and side effects (SEs) within their modeling framework. Secondly, some previous methods often simply concatenate the features of drugs and SEs, which fails to effectively capture their underlying association. In this work, we present HSTrans, a novel approach that treats drugs and SEs as sets of substructures, leveraging a transformer encoder for unified substructure embedding and incorporating an interaction module for association capture. Specifically, HSTrans extracts drug substructures through a specialized algorithm and identifies effective substructures for each SE by employing an indicator that measures the importance of each substructure and SE. Additionally, HSTrans applies convolutional neural network (CNN) in the interaction module to capture complex relationships between drugs and SEs. Experimental results on datasets from Galeano et al.’s study demonstrate that the proposed method outperforms other state-of-the-art approaches. The demo codes for HSTrans are available at <span><span>https://github.com/Dtdtxuky/HSTrans/tree/master</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"181 ","pages":"Article 106779"},"PeriodicalIF":6.0000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608024007032","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying the frequencies of drug-side effects is crucial for assessing drug risk-benefit. However, accurately determining these frequencies remains challenging due to the limitations of time and scale in clinical randomized controlled trials. As a result, several computational methods have been proposed to address these issues. Nonetheless, two primary problems still persist. Firstly, most of these methods face challenges in generating accurate predictions for novel drugs, as they heavily depend on the interaction graph between drugs and side effects (SEs) within their modeling framework. Secondly, some previous methods often simply concatenate the features of drugs and SEs, which fails to effectively capture their underlying association. In this work, we present HSTrans, a novel approach that treats drugs and SEs as sets of substructures, leveraging a transformer encoder for unified substructure embedding and incorporating an interaction module for association capture. Specifically, HSTrans extracts drug substructures through a specialized algorithm and identifies effective substructures for each SE by employing an indicator that measures the importance of each substructure and SE. Additionally, HSTrans applies convolutional neural network (CNN) in the interaction module to capture complex relationships between drugs and SEs. Experimental results on datasets from Galeano et al.’s study demonstrate that the proposed method outperforms other state-of-the-art approaches. The demo codes for HSTrans are available at https://github.com/Dtdtxuky/HSTrans/tree/master.
确定药物副作用的频率对于评估药物的风险-效益至关重要。然而,由于临床随机对照试验在时间和规模上的限制,准确确定这些频率仍具有挑战性。因此,人们提出了几种计算方法来解决这些问题。然而,两个主要问题仍然存在。首先,大多数这些方法在对新药进行准确预测时都面临挑战,因为它们在建模框架内严重依赖于药物与副作用(SEs)之间的相互作用图。其次,以前的一些方法往往只是简单地将药物和副作用的特征串联起来,无法有效捕捉它们之间的内在联系。在这项工作中,我们提出了 HSTrans,这是一种将药物和副作用作为子结构集来处理的新方法,它利用变换器编码器进行统一的子结构嵌入,并结合了一个用于关联捕捉的交互模块。具体来说,HSTrans 通过专门的算法提取药物子结构,并通过采用衡量每个子结构和 SE 重要性的指标来识别每个 SE 的有效子结构。此外,HSTrans 还在交互模块中应用了卷积神经网络 (CNN),以捕捉药物与 SE 之间的复杂关系。在 Galeano 等人的研究数据集上的实验结果表明,所提出的方法优于其他最先进的方法。HSTrans 的演示代码请访问 https://github.com/Dtdtxuky/HSTrans/tree/master。
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.