FTIR和CD光谱数据的人工智能分析,用于预测和定量蛋白质二级结构的长度和含量

IF 0.3 Q4 SPECTROSCOPY Biomedical Spectroscopy and Imaging Pub Date : 2021-03-24 DOI:10.3233/BSI-210210
P. Haris, J. A. Hering
{"title":"FTIR和CD光谱数据的人工智能分析,用于预测和定量蛋白质二级结构的长度和含量","authors":"P. Haris, J. A. Hering","doi":"10.3233/BSI-210210","DOIUrl":null,"url":null,"abstract":"Besides NMR and X-ray crystallography, FTIR and CD spectroscopy are widely considered to be useful for determining protein secondary structure. These techniques can be used to obtain data in few minutes, using small quantities of proteins, which make them amenable for proteomics research. Here we explore the possibility of using artificial intelligence techniques to simultaneously analyse both FTIR and CD spectroscopic data for an identical set of proteins. Neural network analysis was carried out on normalised regions of FTIR (1700-1600 cm−1) and CD (180-259 nm) spectral data both with and without boxcar averaging in order to quantify the average length and percentages of secondary structures. A hybrid genetic algorithm/neural network approach, that automatically selects structure-sensitive wavelength/frequency, was used for the quantification of the protein secondary structure. Using this algorithm we also successfully identified the region of the CD spectrum that contains the most structure-sensitive information. This was located between 214-251 nm, suggesting that this region alone may be sufficient to rapidly determine the secondary structure content from CD spectral data. Overall, CD spectroscopic analysis produced better results compared to FTIR spectroscopy when selected wavelengths were used, although FTIR was better when the entire region between 1700-1600 cm−1 (FTIR), and 180-259 nm (CD), was subjected to neural network analysis. Application of Adaptive Neuro-Fuzzy Inference System (ANFIS) with fuzzy subtractive clustering for the analysis of the spectral data led to a slightly better prediction of the average helix/sheet length for FTIR spectroscopy compared to CD. Our findings reveal the potential of using artificial intelligence techniques for not only extracting structural information but also for better understanding of the relationship between complex spectral data and biologically important information.","PeriodicalId":44239,"journal":{"name":"Biomedical Spectroscopy and Imaging","volume":"1 1","pages":"1-7"},"PeriodicalIF":0.3000,"publicationDate":"2021-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3233/BSI-210210","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence analysis of FTIR and CD spectroscopic data for predicting and quantifying the length and content of protein secondary structures\",\"authors\":\"P. Haris, J. A. Hering\",\"doi\":\"10.3233/BSI-210210\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Besides NMR and X-ray crystallography, FTIR and CD spectroscopy are widely considered to be useful for determining protein secondary structure. These techniques can be used to obtain data in few minutes, using small quantities of proteins, which make them amenable for proteomics research. Here we explore the possibility of using artificial intelligence techniques to simultaneously analyse both FTIR and CD spectroscopic data for an identical set of proteins. Neural network analysis was carried out on normalised regions of FTIR (1700-1600 cm−1) and CD (180-259 nm) spectral data both with and without boxcar averaging in order to quantify the average length and percentages of secondary structures. A hybrid genetic algorithm/neural network approach, that automatically selects structure-sensitive wavelength/frequency, was used for the quantification of the protein secondary structure. Using this algorithm we also successfully identified the region of the CD spectrum that contains the most structure-sensitive information. This was located between 214-251 nm, suggesting that this region alone may be sufficient to rapidly determine the secondary structure content from CD spectral data. Overall, CD spectroscopic analysis produced better results compared to FTIR spectroscopy when selected wavelengths were used, although FTIR was better when the entire region between 1700-1600 cm−1 (FTIR), and 180-259 nm (CD), was subjected to neural network analysis. Application of Adaptive Neuro-Fuzzy Inference System (ANFIS) with fuzzy subtractive clustering for the analysis of the spectral data led to a slightly better prediction of the average helix/sheet length for FTIR spectroscopy compared to CD. Our findings reveal the potential of using artificial intelligence techniques for not only extracting structural information but also for better understanding of the relationship between complex spectral data and biologically important information.\",\"PeriodicalId\":44239,\"journal\":{\"name\":\"Biomedical Spectroscopy and Imaging\",\"volume\":\"1 1\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2021-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.3233/BSI-210210\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Spectroscopy and Imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/BSI-210210\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Spectroscopy and Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/BSI-210210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

摘要

除了核磁共振和x射线晶体学外,FTIR和CD光谱被广泛认为是测定蛋白质二级结构的有用方法。这些技术可以在几分钟内获得数据,使用少量的蛋白质,这使得它们适用于蛋白质组学研究。在这里,我们探索了使用人工智能技术同时分析同一组蛋白质的FTIR和CD光谱数据的可能性。神经网络分析对FTIR (1700-1600 cm−1)和CD (180-259 nm)光谱数据的归一化区域进行了神经网络分析,以量化平均长度和二级结构的百分比。采用自动选择结构敏感波长/频率的混合遗传算法/神经网络方法定量蛋白质二级结构。利用该算法,我们还成功地识别出了CD光谱中包含最多结构敏感信息的区域。这一区域位于214-251 nm之间,表明仅这一区域就足以从CD光谱数据中快速确定二级结构的含量。总的来说,当使用选定的波长时,CD光谱分析比FTIR光谱分析产生更好的结果,尽管FTIR在1700-1600 cm−1 (FTIR)和180-259 nm (CD)之间的整个区域进行神经网络分析时效果更好。应用自适应神经模糊推理系统(ANFIS)和模糊减聚类对光谱数据进行分析,与CD相比,FTIR光谱的平均螺旋/片长预测略好。我们的研究结果揭示了使用人工智能技术不仅可以提取结构信息,还可以更好地理解复杂光谱数据与生物学重要信息之间的关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Artificial intelligence analysis of FTIR and CD spectroscopic data for predicting and quantifying the length and content of protein secondary structures
Besides NMR and X-ray crystallography, FTIR and CD spectroscopy are widely considered to be useful for determining protein secondary structure. These techniques can be used to obtain data in few minutes, using small quantities of proteins, which make them amenable for proteomics research. Here we explore the possibility of using artificial intelligence techniques to simultaneously analyse both FTIR and CD spectroscopic data for an identical set of proteins. Neural network analysis was carried out on normalised regions of FTIR (1700-1600 cm−1) and CD (180-259 nm) spectral data both with and without boxcar averaging in order to quantify the average length and percentages of secondary structures. A hybrid genetic algorithm/neural network approach, that automatically selects structure-sensitive wavelength/frequency, was used for the quantification of the protein secondary structure. Using this algorithm we also successfully identified the region of the CD spectrum that contains the most structure-sensitive information. This was located between 214-251 nm, suggesting that this region alone may be sufficient to rapidly determine the secondary structure content from CD spectral data. Overall, CD spectroscopic analysis produced better results compared to FTIR spectroscopy when selected wavelengths were used, although FTIR was better when the entire region between 1700-1600 cm−1 (FTIR), and 180-259 nm (CD), was subjected to neural network analysis. Application of Adaptive Neuro-Fuzzy Inference System (ANFIS) with fuzzy subtractive clustering for the analysis of the spectral data led to a slightly better prediction of the average helix/sheet length for FTIR spectroscopy compared to CD. Our findings reveal the potential of using artificial intelligence techniques for not only extracting structural information but also for better understanding of the relationship between complex spectral data and biologically important information.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊介绍: Biomedical Spectroscopy and Imaging (BSI) is a multidisciplinary journal devoted to the timely publication of basic and applied research that uses spectroscopic and imaging techniques in different areas of life science including biology, biochemistry, biotechnology, bionanotechnology, environmental science, food science, pharmaceutical science, physiology and medicine. Scientists are encouraged to submit their work for publication in the form of original articles, brief communications, rapid communications, reviews and mini-reviews. Techniques covered include, but are not limited, to the following: • Vibrational Spectroscopy (Infrared, Raman, Teraherz) • Circular Dichroism Spectroscopy • Magnetic Resonance Spectroscopy (NMR, ESR) • UV-vis Spectroscopy • Mössbauer Spectroscopy • X-ray Spectroscopy (Absorption, Emission, Photoelectron, Fluorescence) • Neutron Spectroscopy • Mass Spectroscopy • Fluorescence Spectroscopy • X-ray and Neutron Scattering • Differential Scanning Calorimetry • Atomic Force Microscopy • Surface Plasmon Resonance • Magnetic Resonance Imaging • X-ray Imaging • Electron Imaging • Neutron Imaging • Raman Imaging • Infrared Imaging • Terahertz Imaging • Fluorescence Imaging • Near-infrared spectroscopy.
期刊最新文献
Covid-19 pandemic has been a set-back for scientific productivity and the road to recovery must focus on improving the mental health and well-being of scientists Portable NMR for the investigation of models of mammographic density ex vivo: Androgens antagonise the promotional effect of oestrogen A method to detect thermal damage in bovine liver utilising diffuse reflectance spectroscopy Clinical applications of spectroscopic techniques in conjunction with multivariate analysis in virus diagnosis Determination of arsenic, cadmium, selenium, zinc and other trace elements in Bangladeshi fish and arsenic speciation study of Hilsa fish flesh and eggs: Implications for dietary intake
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1