Deep structure-level N-glycan identification using feature-induced structure diagnosis integrated with a deep learning model.
Authors: Suideng Qin, Zhixin Tian
Journal: Analytical and Bioanalytical Chemistry (IF 3.8; JCR Q1, Biochemical Research Methods)
Published: 2024-08-30
DOI: 10.1007/s00216-024-05505-4 (https://doi.org/10.1007/s00216-024-05505-4)
N-glycosylation is a widely occurring protein post-translational modification with unique multi-dimensional structures, including sequence and linkage isomers. Bioinformatics efforts have succeeded in identifying N-glycan structures from N-glycoproteomics data; however, symmetric "mirror" branch isomers and linkage isomers remain largely unresolved. Here, we report deep structure-level N-glycan identification using feature-induced structure diagnosis (FISD) integrated with a deep learning model. The neural network model is integrated to identify featured N-glycan motifs and to boost structure diagnosis and the distinction of linkage isomers. Using publicly available N-glycoproteomics datasets from five mouse tissues (17,136 intact N-glycopeptide spectrum matches) and 23 motif features, a deep learning model combining a convolutional autoencoder with a multilayer perceptron was trained to predict featured N-glycan motifs in MS/MS spectra with previously identified compositions. In testing, the trained model achieved a prediction accuracy of 0.8 and an AUC of 0.95; 5701 previously unresolved N-glycan structures were assigned by matched structure-diagnostic ions; and an explainable learning algorithm found two new fragmentation features, m/z = 674.25 and m/z = 835.28, to be significant for three N-glycan structure motifs with fucose, NeuAc, and NeuGc, demonstrating the capability of FISD to discover new features in MS/MS spectra.
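The abstract does not say how structure-diagnostic ions are matched against a spectrum. As a rough illustration only, the sketch below assumes a simple relative (ppm) mass tolerance and checks whether the two reported feature ions, m/z 674.25 and 835.28, appear in a peak list. The function names, the 20 ppm tolerance, and the example peak list are all hypothetical and are not taken from the paper.

```python
from bisect import bisect_left

# Feature ions reported in the abstract.
FEATURE_IONS = (674.25, 835.28)


def ppm_window(mz, tol_ppm):
    """Return the (low, high) m/z bounds for a relative tolerance in ppm."""
    delta = mz * tol_ppm / 1e6
    return mz - delta, mz + delta


def match_diagnostic_ions(peaks_mz, diagnostic_mz, tol_ppm=20.0):
    """Map each diagnostic m/z to the closest observed peak inside the
    tolerance window, or to None when no peak falls inside it.

    tol_ppm=20.0 is an assumed default, not a value from the paper.
    """
    sorted_peaks = sorted(peaks_mz)
    matches = {}
    for target in diagnostic_mz:
        low, high = ppm_window(target, tol_ppm)
        # Jump to the first peak at or above the lower bound.
        i = bisect_left(sorted_peaks, low)
        best = None
        while i < len(sorted_peaks) and sorted_peaks[i] <= high:
            if best is None or abs(sorted_peaks[i] - target) < abs(best - target):
                best = sorted_peaks[i]
            i += 1
        matches[target] = best
    return matches


# Hypothetical MS/MS peak list (m/z values only), for illustration.
spectrum = [204.087, 366.140, 657.235, 674.252, 1021.400]
result = match_diagnostic_ions(spectrum, FEATURE_IONS)
for ion, observed in result.items():
    print(ion, "matched" if observed is not None else "not found", observed)
```

A real pipeline would also weigh peak intensity and charge state; this sketch considers m/z only.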
About the journal:
Analytical and Bioanalytical Chemistry’s mission is the rapid publication of excellent and high-impact research articles on fundamental and applied topics of analytical and bioanalytical measurement science. Its scope is broad, and ranges from novel measurement platforms and their characterization to multidisciplinary approaches that effectively address important scientific problems. The Editors encourage submissions presenting innovative analytical research in concept, instrumentation, methods, and/or applications, including: mass spectrometry, spectroscopy, and electroanalysis; advanced separations; analytical strategies in “-omics” and imaging, bioanalysis, and sampling; miniaturized devices, medical diagnostics, sensors; analytical characterization of nano- and biomaterials; chemometrics and advanced data analysis.