Pseudo-Siamese Neural Network Based Graph and Sequence Representation Learning for Molecular Property Prediction

Chaoran Zhang, Xiangfeng Yan, Yong Liu
{"title":"Pseudo-Siamese Neural Network Based Graph and Sequence Representation Learning for Molecular Property Prediction","authors":"Chaoran Zhang, Xiangfeng Yan, Yong Liu","doi":"10.1109/BIBM55620.2022.9994859","DOIUrl":null,"url":null,"abstract":"Molecular property prediction has received great attention due to its wide application in biomedical field. Effective molecular representation learning is of substantial significance to facilitate molecular property prediction. In recent years, with the development of artificial intelligence technology, more and more computer scientists began to apply deep learning methods to molecular property prediction instead of traditional machine learning methods. However, these methods only utilize the SMILES sequences to learn sequence representation or use the molecular graphs to learn graph representation to predict molecular property, which fails to integrate the capabilities of both approaches in preserving molecular characteristics for further improvement. In this study, we propose a joint graph and sequence representation learning model for molecular property prediction, called PSGS. Specifically, PSGS utilizes a fusion layer to combine graph and sequence representation and capture the critical features of the molecular. In addition, PSGS is trained by a new self-supervised task, which maximizes the similarity between graph and sequence representations of the same molecular by using a pseudo-Siamese neural network. We conduct extensive experiments to compare our model with state-of-the-art models. Experimental results show that our model significantly outperforms the current state-of-the-art methods on four independent datasets.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM55620.2022.9994859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Molecular property prediction has received great attention due to its wide application in biomedical field. Effective molecular representation learning is of substantial significance to facilitate molecular property prediction. In recent years, with the development of artificial intelligence technology, more and more computer scientists began to apply deep learning methods to molecular property prediction instead of traditional machine learning methods. However, these methods only utilize the SMILES sequences to learn sequence representation or use the molecular graphs to learn graph representation to predict molecular property, which fails to integrate the capabilities of both approaches in preserving molecular characteristics for further improvement. In this study, we propose a joint graph and sequence representation learning model for molecular property prediction, called PSGS. Specifically, PSGS utilizes a fusion layer to combine graph and sequence representation and capture the critical features of the molecular. In addition, PSGS is trained by a new self-supervised task, which maximizes the similarity between graph and sequence representations of the same molecular by using a pseudo-Siamese neural network. We conduct extensive experiments to compare our model with state-of-the-art models. Experimental results show that our model significantly outperforms the current state-of-the-art methods on four independent datasets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于伪连体神经网络的图和序列表示学习的分子性质预测
分子性质预测因其在生物医学领域的广泛应用而备受关注。有效的分子表征学习对促进分子性质预测具有重要意义。近年来,随着人工智能技术的发展,越来越多的计算机科学家开始将深度学习方法应用于分子性质预测,取代传统的机器学习方法。然而,这些方法仅利用SMILES序列学习序列表示或利用分子图学习图表示来预测分子性质,未能整合两种方法在保留分子特征方面的能力,以供进一步改进。在这项研究中,我们提出了一种用于分子性质预测的联合图和序列表示学习模型,称为PSGS。具体来说,PSGS利用融合层将图和序列表示结合起来,并捕获分子的关键特征。此外,PSGS通过一种新的自监督任务进行训练,该任务通过使用伪暹罗神经网络最大化相同分子的图和序列表示之间的相似性。我们进行了大量的实验,将我们的模型与最先进的模型进行比较。实验结果表明,我们的模型在四个独立的数据集上显著优于当前最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A framework for associating structural variants with cell-specific transcription factors and histone modifications in defect phenotypes Secure Password Using EEG-based BrainPrint System: Unlock Smartphone Password Using Brain-Computer Interface Technology On functional annotation with gene co-expression networks ST-ChIP: Accurate prediction of spatiotemporal ChIP-seq data with recurrent neural networks Discovering the Knowledge in Unstructured Early Drug Development Data Using NLP and Advanced Analytics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1