首页 > 最新文献

Interdisciplinary Sciences: Computational Life Sciences最新文献

英文 中文
Predicting Promoters in Multiple Prokaryotes with Prompt. 利用 Prompt 预测多种原核生物的启动子
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-08-07 DOI: 10.1007/s12539-024-00637-8
Qimeng Du, Yixue Guo, Junpeng Zhang, Fuping Lu, Chong Peng, Chichun Zhou

Promoters are important cis-regulatory elements for the regulation of gene expression, and their accurate predictions are crucial for elucidating the biological functions and potential mechanisms of genes. Many previous prokaryotic promoter prediction methods are encouraging in terms of the prediction performance, but most of them focus on the recognition of promoters in only one or a few bacterial species. Moreover, due to ignoring the promoter sequence motifs, the interpretability of predictions with existing methods is limited. In this work, we present a generalized method Prompt (Promoters in multiple prokaryotes) to predict promoters in 16 prokaryotes and improve the interpretability of prediction results. Prompt integrates three methods including RSK (Regression based on Selected k-mer), CL (Contrastive Learning) and MLP (Multilayer Perception), and employs a voting strategy to divide the datasets into high-confidence and low-confidence categories. Results on the promoter prediction tasks in 16 prokaryotes show that the accuracy (Accuracy, Matthews correlation coefficient) of Prompt is greater than 80% in highly credible datasets of 16 prokaryotes, and is greater than 90% in 12 prokaryotes, and Prompt performs the best compared with other existing methods. Moreover, by identifying promoter sequence motifs, Prompt can improve the interpretability of the predictions. Prompt is freely available at https://github.com/duqimeng/PromptPrompt , and will contribute to the research of promoters in prokaryote.

启动子是调控基因表达的重要顺式调控元件,准确预测启动子对于阐明基因的生物学功能和潜在机制至关重要。以往的许多原核生物启动子预测方法在预测性能方面令人鼓舞,但它们大多只侧重于识别一种或少数几种细菌的启动子。此外,由于忽略了启动子序列的母题,现有方法的预测结果可解释性有限。在这项工作中,我们提出了一种通用方法 Prompt(多种原核生物中的启动子),用于预测 16 种原核生物中的启动子,并提高了预测结果的可解释性。Prompt 整合了三种方法,包括 RSK(基于选择 k-mer 的回归)、CL(对比学习)和 MLP(多层感知),并采用投票策略将数据集分为高置信度和低置信度两类。对16种原核生物启动子预测任务的结果表明,在16种原核生物的高可信度数据集中,Prompt的准确率(Accuracy,马修斯相关系数)大于80%,在12种原核生物中大于90%,与其他现有方法相比,Prompt的表现最佳。此外,通过识别启动子序列母题,Prompt 还能提高预测结果的可解释性。Prompt 可在 https://github.com/duqimeng/PromptPrompt 免费获取,它将为原核生物启动子的研究做出贡献。
{"title":"Predicting Promoters in Multiple Prokaryotes with Prompt.","authors":"Qimeng Du, Yixue Guo, Junpeng Zhang, Fuping Lu, Chong Peng, Chichun Zhou","doi":"10.1007/s12539-024-00637-8","DOIUrl":"10.1007/s12539-024-00637-8","url":null,"abstract":"<p><p>Promoters are important cis-regulatory elements for the regulation of gene expression, and their accurate predictions are crucial for elucidating the biological functions and potential mechanisms of genes. Many previous prokaryotic promoter prediction methods are encouraging in terms of the prediction performance, but most of them focus on the recognition of promoters in only one or a few bacterial species. Moreover, due to ignoring the promoter sequence motifs, the interpretability of predictions with existing methods is limited. In this work, we present a generalized method Prompt (Promoters in multiple prokaryotes) to predict promoters in 16 prokaryotes and improve the interpretability of prediction results. Prompt integrates three methods including RSK (Regression based on Selected k-mer), CL (Contrastive Learning) and MLP (Multilayer Perception), and employs a voting strategy to divide the datasets into high-confidence and low-confidence categories. Results on the promoter prediction tasks in 16 prokaryotes show that the accuracy (Accuracy, Matthews correlation coefficient) of Prompt is greater than 80% in highly credible datasets of 16 prokaryotes, and is greater than 90% in 12 prokaryotes, and Prompt performs the best compared with other existing methods. Moreover, by identifying promoter sequence motifs, Prompt can improve the interpretability of the predictions. Prompt is freely available at https://github.com/duqimeng/PromptPrompt , and will contribute to the research of promoters in prokaryote.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"814-828"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141897299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SCINet: A Segmentation and Classification Interaction CNN Method for Arteriosclerotic Retinopathy Grading. SCINet:用于动脉硬化性视网膜病变分级的分割与分类交互 CNN 方法。
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-09-02 DOI: 10.1007/s12539-024-00650-x
Xiongwen Quan, Xingyuan Ou, Li Gao, Wenya Yin, Guangyao Hou, Han Zhang

As a common disease, cardiovascular and cerebrovascular diseases pose a great harm threat to human wellness. Even using advanced and comprehensive treatment methods, there is still a high mortality rate. Arteriosclerosis, as an important factor reflecting the severity of cardiovascular and cerebrovascular diseases, is imperative to detect the arteriosclerotic retinopathy. However, the detection of arteriosclerosis retinopathy requires expensive and time-consuming manual evaluation, while end-to-end deep learning detection methods also need interpretable design to high light task-related features. Considering the importance of automatic arteriosclerotic retinopathy grading, we propose a segmentation and classification interaction network (SCINet). We propose a segmentation and classification interaction architecture for grading arteriosclerotic retinopathy. After IterNet is used to segment retinal vessel from original fundus images, the backbone feature extractor roughly extracts features from the segmented and original fundus arteriosclerosis images and further enhances them through the vessel aware module. The last classifier module generates fundus arteriosclerosis grading results. Specifically, the vessel aware module is designed to highlight the important areal vessel features segmented from original images by attention mechanism, thereby achieving information interaction. The attention mechanism selectively learns the vessel features of segmentation region information under the proposed interactive architecture, which leads to reweighting the extracted features and enhances significant feature information. Extensive experiments have confirmed the effect of our model. SCINet has the best performance on the task of arteriosclerotic retinopathy grading. Additionally, the CNN method is scalable to similar tasks by incorporating segmented images as auxiliary information.

心脑血管疾病作为一种常见病,对人类健康造成了极大的危害。即使采用先进的综合治疗方法,死亡率仍然很高。动脉硬化作为反映心脑血管疾病严重程度的重要因素,对动脉硬化性视网膜病变的检测势在必行。然而,动脉硬化性视网膜病变的检测需要昂贵而耗时的人工评估,而端到端的深度学习检测方法也需要对高亮任务相关特征进行可解释性设计。考虑到动脉硬化性视网膜病变自动分级的重要性,我们提出了一种分割与分类交互网络(SCINet)。我们提出了一种用于动脉硬化性视网膜病变分级的分割与分类交互架构。在使用 IterNet 从原始眼底图像中分割出视网膜血管后,主干特征提取器从分割后的原始眼底动脉硬化图像中粗略提取特征,并通过血管感知模块进一步增强这些特征。最后一个分类器模块生成眼底动脉硬化分级结果。具体来说,血管感知模块旨在通过注意力机制,突出从原始图像中分割出的重要区域血管特征,从而实现信息交互。在所提出的交互式架构下,注意力机制会选择性地学习分割区域信息中的血管特征,从而对提取的特征进行重新加权,增强重要的特征信息。大量实验证实了我们模型的效果。SCINet 在动脉硬化性视网膜病变分级任务中表现最佳。此外,通过将分割图像作为辅助信息,CNN 方法还可扩展到类似任务。
{"title":"SCINet: A Segmentation and Classification Interaction CNN Method for Arteriosclerotic Retinopathy Grading.","authors":"Xiongwen Quan, Xingyuan Ou, Li Gao, Wenya Yin, Guangyao Hou, Han Zhang","doi":"10.1007/s12539-024-00650-x","DOIUrl":"10.1007/s12539-024-00650-x","url":null,"abstract":"<p><p>As a common disease, cardiovascular and cerebrovascular diseases pose a great harm threat to human wellness. Even using advanced and comprehensive treatment methods, there is still a high mortality rate. Arteriosclerosis, as an important factor reflecting the severity of cardiovascular and cerebrovascular diseases, is imperative to detect the arteriosclerotic retinopathy. However, the detection of arteriosclerosis retinopathy requires expensive and time-consuming manual evaluation, while end-to-end deep learning detection methods also need interpretable design to high light task-related features. Considering the importance of automatic arteriosclerotic retinopathy grading, we propose a segmentation and classification interaction network (SCINet). We propose a segmentation and classification interaction architecture for grading arteriosclerotic retinopathy. After IterNet is used to segment retinal vessel from original fundus images, the backbone feature extractor roughly extracts features from the segmented and original fundus arteriosclerosis images and further enhances them through the vessel aware module. The last classifier module generates fundus arteriosclerosis grading results. Specifically, the vessel aware module is designed to highlight the important areal vessel features segmented from original images by attention mechanism, thereby achieving information interaction. The attention mechanism selectively learns the vessel features of segmentation region information under the proposed interactive architecture, which leads to reweighting the extracted features and enhances significant feature information. Extensive experiments have confirmed the effect of our model. SCINet has the best performance on the task of arteriosclerotic retinopathy grading. Additionally, the CNN method is scalable to similar tasks by incorporating segmented images as auxiliary information.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"926-935"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142107051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data. 基于对比学习的深度神经网络,通过整合多种光学数据进行癌症亚型分析
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-09-04 DOI: 10.1007/s12539-024-00641-y
Hua Chai, Weizhen Deng, Junyu Wei, Ting Guan, Minfan He, Yong Liang, Le Li

Background: Accurate identification of cancer subtypes is crucial for disease prognosis evaluation and personalized patient management. Recent advances in computational methods have demonstrated that multi-omics data provides valuable insights into tumor molecular subtyping. However, the high dimensionality and small sample size of the data may result in ambiguous and overlapping cancer subtypes during clustering. In this study, we propose a novel contrastive-learning-based approach to address this issue. The proposed end-to-end deep learning method can extract crucial information from the multi-omics features by self-supervised learning for patient clustering.

Results: By applying our method to nine public cancer datasets, we have demonstrated superior performance compared to existing methods in separating patients with different survival outcomes (p < 0.05). To further evaluate the impact of various omics data on cancer survival, we developed an XGBoost classification model and found that mRNA had the highest importance score, followed by DNA methylation and miRNA. In the presented case study, our method successfully clustered subtypes and identified 14 cancer-related genes, of which 12 (85.7%) were validated through literature review.

Conclusions: Our findings demonstrate that our method is capable of identifying cancer subtypes that are both statistically and biologically significant. The code about COLCS is given at: https://github.com/Mercuriiio/COLCS .

背景:准确识别癌症亚型对于疾病预后评估和个性化患者管理至关重要。计算方法的最新进展表明,多组学数据可为肿瘤分子亚型鉴定提供有价值的见解。然而,数据的高维度和小样本量可能会导致聚类过程中出现模糊和重叠的癌症亚型。在本研究中,我们提出了一种基于对比学习的新方法来解决这一问题。所提出的端到端深度学习方法可以通过自我监督学习从多组学特征中提取关键信息,用于患者聚类:通过将我们的方法应用于九个公共癌症数据集,与现有方法相比,我们的方法在分离不同生存结果的患者方面表现出了更优越的性能(p 结论:我们的研究结果表明,我们的方法能够从多组学特征中提取关键信息,并通过自我监督学习对患者进行聚类:我们的研究结果表明,我们的方法能够识别具有统计学和生物学意义的癌症亚型。有关 COLCS 的代码请访问:https://github.com/Mercuriiio/COLCS 。
{"title":"A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data.","authors":"Hua Chai, Weizhen Deng, Junyu Wei, Ting Guan, Minfan He, Yong Liang, Le Li","doi":"10.1007/s12539-024-00641-y","DOIUrl":"10.1007/s12539-024-00641-y","url":null,"abstract":"<p><strong>Background: </strong>Accurate identification of cancer subtypes is crucial for disease prognosis evaluation and personalized patient management. Recent advances in computational methods have demonstrated that multi-omics data provides valuable insights into tumor molecular subtyping. However, the high dimensionality and small sample size of the data may result in ambiguous and overlapping cancer subtypes during clustering. In this study, we propose a novel contrastive-learning-based approach to address this issue. The proposed end-to-end deep learning method can extract crucial information from the multi-omics features by self-supervised learning for patient clustering.</p><p><strong>Results: </strong>By applying our method to nine public cancer datasets, we have demonstrated superior performance compared to existing methods in separating patients with different survival outcomes (p < 0.05). To further evaluate the impact of various omics data on cancer survival, we developed an XGBoost classification model and found that mRNA had the highest importance score, followed by DNA methylation and miRNA. In the presented case study, our method successfully clustered subtypes and identified 14 cancer-related genes, of which 12 (85.7%) were validated through literature review.</p><p><strong>Conclusions: </strong>Our findings demonstrate that our method is capable of identifying cancer subtypes that are both statistically and biologically significant. The code about COLCS is given at: https://github.com/Mercuriiio/COLCS .</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"966-975"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142125654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Deniable Encryption Method for Modulation-Based DNA Storage. 基于调制的 DNA 存储的可抵赖加密方法。
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-08-19 DOI: 10.1007/s12539-024-00648-5
Ling Chu, Yanqing Su, Xiangzhen Zan, Wanmin Lin, Xiangyu Yao, Peng Xu, Wenbin Liu

Recent advancements in synthesis and sequencing techniques have made deoxyribonucleic acid (DNA) a promising alternative for next-generation digital storage. As it approaches practical application, ensuring the security of DNA-stored information has become a critical problem. Deniable encryption allows the decryption of different information from the same ciphertext, ensuring that the "plausible" fake information can be provided when users are coerced to reveal the real information. In this paper, we propose a deniable encryption method that uniquely leverages DNA noise channels. Specifically, true and fake messages are encrypted by two similar modulation carriers and subsequently obfuscated by inherent errors. Experiment results demonstrate that our method not only can conceal true information among fake ones indistinguishably, but also allow both the coercive adversary and the legitimate receiver to decrypt the intended information accurately. Further security analysis validates the resistance of our method against various typical attacks. Compared with conventional DNA cryptography methods based on complex biological operations, our method offers superior practicality and reliability, positioning it as an ideal solution for data encryption in future large-scale DNA storage applications.

合成和测序技术的最新进展使脱氧核糖核酸(DNA)成为下一代数字存储的理想选择。随着脱氧核糖核酸越来越接近实际应用,确保其存储信息的安全性已成为一个关键问题。可抵赖加密允许对同一密文中的不同信息进行解密,确保用户在被迫透露真实信息时,可以提供 "似是而非 "的假信息。在本文中,我们提出了一种独特利用 DNA 噪音通道的可抵赖加密方法。具体来说,真假信息通过两个相似的调制载波进行加密,然后通过固有误差进行混淆。实验结果表明,我们的方法不仅能将真实信息无差别地隐藏在虚假信息中,还能让胁迫对手和合法接收者准确解密预期信息。进一步的安全性分析验证了我们的方法可以抵御各种典型攻击。与基于复杂生物操作的传统 DNA 密码学方法相比,我们的方法具有更高的实用性和可靠性,是未来大规模 DNA 存储应用中数据加密的理想解决方案。
{"title":"A Deniable Encryption Method for Modulation-Based DNA Storage.","authors":"Ling Chu, Yanqing Su, Xiangzhen Zan, Wanmin Lin, Xiangyu Yao, Peng Xu, Wenbin Liu","doi":"10.1007/s12539-024-00648-5","DOIUrl":"10.1007/s12539-024-00648-5","url":null,"abstract":"<p><p>Recent advancements in synthesis and sequencing techniques have made deoxyribonucleic acid (DNA) a promising alternative for next-generation digital storage. As it approaches practical application, ensuring the security of DNA-stored information has become a critical problem. Deniable encryption allows the decryption of different information from the same ciphertext, ensuring that the \"plausible\" fake information can be provided when users are coerced to reveal the real information. In this paper, we propose a deniable encryption method that uniquely leverages DNA noise channels. Specifically, true and fake messages are encrypted by two similar modulation carriers and subsequently obfuscated by inherent errors. Experiment results demonstrate that our method not only can conceal true information among fake ones indistinguishably, but also allow both the coercive adversary and the legitimate receiver to decrypt the intended information accurately. Further security analysis validates the resistance of our method against various typical attacks. Compared with conventional DNA cryptography methods based on complex biological operations, our method offers superior practicality and reliability, positioning it as an ideal solution for data encryption in future large-scale DNA storage applications.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"872-881"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141999872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unraveling Brain Synchronisation Dynamics by Explainable Neural Networks using EEG Signals: Application to Dyslexia Diagnosis. 利用脑电信号的可解释神经网络揭示大脑同步动态:应用于阅读障碍诊断。
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-07-02 DOI: 10.1007/s12539-024-00634-x
Nicolás J Gallego-Molina, Andrés Ortiz, Juan E Arco, Francisco J Martinez-Murcia, Wai Lok Woo

The electrical activity of the neural processes involved in cognitive functions is captured in EEG signals, allowing the exploration of the integration and coordination of neuronal oscillations across multiple spatiotemporal scales. We have proposed a novel approach that combines the transformation of EEG signal into image sequences, considering cross-frequency phase synchronisation (CFS) dynamics involved in low-level auditory processing, with the development of a two-stage deep learning model for the detection of developmental dyslexia (DD). This deep learning model exploits spatial and temporal information preserved in the image sequences to find discriminative patterns of phase synchronisation over time achieving a balanced accuracy of up to 83%. This result supports the existence of differential brain synchronisation dynamics between typical and dyslexic seven-year-old readers. Furthermore, we have obtained interpretable representations using a novel feature mask to link the most relevant regions during classification with the cognitive processes attributed to normal reading and those corresponding to compensatory mechanisms found in dyslexia.

脑电信号可以捕捉到认知功能所涉及的神经过程的电活动,从而探索神经元振荡在多个时空尺度上的整合与协调。我们提出了一种将脑电图信号转化为图像序列的新方法,该方法考虑了低级听觉处理过程中的跨频相位同步(CFS)动力学,并开发了一种用于检测发育性阅读障碍(DD)的两阶段深度学习模型。该深度学习模型利用图像序列中保留的空间和时间信息,找到随时间变化的相位同步的判别模式,实现了高达 83% 的均衡准确率。这一结果证明,在典型阅读障碍和阅读障碍的七岁读者之间存在着不同的大脑同步动态。此外,我们还利用一种新颖的特征掩码获得了可解释的表征,将分类过程中最相关的区域与正常阅读的认知过程和阅读障碍的补偿机制联系起来。
{"title":"Unraveling Brain Synchronisation Dynamics by Explainable Neural Networks using EEG Signals: Application to Dyslexia Diagnosis.","authors":"Nicolás J Gallego-Molina, Andrés Ortiz, Juan E Arco, Francisco J Martinez-Murcia, Wai Lok Woo","doi":"10.1007/s12539-024-00634-x","DOIUrl":"10.1007/s12539-024-00634-x","url":null,"abstract":"<p><p>The electrical activity of the neural processes involved in cognitive functions is captured in EEG signals, allowing the exploration of the integration and coordination of neuronal oscillations across multiple spatiotemporal scales. We have proposed a novel approach that combines the transformation of EEG signal into image sequences, considering cross-frequency phase synchronisation (CFS) dynamics involved in low-level auditory processing, with the development of a two-stage deep learning model for the detection of developmental dyslexia (DD). This deep learning model exploits spatial and temporal information preserved in the image sequences to find discriminative patterns of phase synchronisation over time achieving a balanced accuracy of up to 83%. This result supports the existence of differential brain synchronisation dynamics between typical and dyslexic seven-year-old readers. Furthermore, we have obtained interpretable representations using a novel feature mask to link the most relevant regions during classification with the cognitive processes attributed to normal reading and those corresponding to compensatory mechanisms found in dyslexia.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"1005-1018"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512920/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141491790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble Machine Learning and Predicted Properties Promote Antimicrobial Peptide Identification. 集合机器学习和预测特性促进了抗菌肽的鉴定。
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-07-07 DOI: 10.1007/s12539-024-00640-z
Guolun Zhong, Hui Liu, Lei Deng

The emergence of antibiotic-resistant microbes raises a pressing demand for novel alternative treatments. One promising alternative is the antimicrobial peptides (AMPs), a class of innate immunity mediators within the therapeutic peptide realm. AMPs offer salient advantages such as high specificity, cost-effective synthesis, and reduced toxicity. Although some computational methodologies have been proposed to identify potential AMPs with the rapid development of artificial intelligence techniques, there is still ample room to improve their performance. This study proposes a predictive framework which ensembles deep learning and statistical learning methods to screen peptides with antimicrobial activity. We integrate multiple LightGBM classifiers and convolution neural networks which leverages various predicted sequential, structural and physicochemical properties from their residue sequences extracted by diverse machine learning paradigms. Comparative experiments exhibit that our method outperforms other state-of-the-art approaches on an independent test dataset, in terms of representative capability measures. Besides, we analyse the discrimination quality under different varieties of attribute information and it reveals that combination of multiple features could improve prediction. In addition, a case study is carried out to illustrate the exemplary favorable identification effect. We establish a web application at http://amp.denglab.org to provide convenient usage of our proposal and make the predictive framework, source code, and datasets publicly accessible at https://github.com/researchprotein/amp .

抗生素耐药微生物的出现提出了对新型替代疗法的迫切需求。抗菌肽(AMPs)是治疗肽领域的一类先天性免疫介质,是一种前景广阔的替代疗法。AMPs 具有特异性强、合成成本低、毒性小等显著优势。虽然随着人工智能技术的快速发展,人们已经提出了一些计算方法来识别潜在的 AMPs,但其性能仍有很大的提升空间。本研究提出了一种预测框架,它集合了深度学习和统计学习方法来筛选具有抗菌活性的多肽。我们整合了多个 LightGBM 分类器和卷积神经网络,利用从不同机器学习范式提取的残基序列中预测出的各种序列、结构和理化特性。对比实验表明,在一个独立的测试数据集上,就代表性能力指标而言,我们的方法优于其他最先进的方法。此外,我们还分析了不同属性信息下的判别质量,结果表明多种特征的组合可以提高预测效果。此外,我们还进行了案例研究,以说明良好的识别效果。为了方便使用我们的建议,我们在 http://amp.denglab.org 上建立了一个网络应用程序,并在 https://github.com/researchprotein/amp 上公开了预测框架、源代码和数据集。
{"title":"Ensemble Machine Learning and Predicted Properties Promote Antimicrobial Peptide Identification.","authors":"Guolun Zhong, Hui Liu, Lei Deng","doi":"10.1007/s12539-024-00640-z","DOIUrl":"10.1007/s12539-024-00640-z","url":null,"abstract":"<p><p>The emergence of antibiotic-resistant microbes raises a pressing demand for novel alternative treatments. One promising alternative is the antimicrobial peptides (AMPs), a class of innate immunity mediators within the therapeutic peptide realm. AMPs offer salient advantages such as high specificity, cost-effective synthesis, and reduced toxicity. Although some computational methodologies have been proposed to identify potential AMPs with the rapid development of artificial intelligence techniques, there is still ample room to improve their performance. This study proposes a predictive framework which ensembles deep learning and statistical learning methods to screen peptides with antimicrobial activity. We integrate multiple LightGBM classifiers and convolution neural networks which leverages various predicted sequential, structural and physicochemical properties from their residue sequences extracted by diverse machine learning paradigms. Comparative experiments exhibit that our method outperforms other state-of-the-art approaches on an independent test dataset, in terms of representative capability measures. Besides, we analyse the discrimination quality under different varieties of attribute information and it reveals that combination of multiple features could improve prediction. In addition, a case study is carried out to illustrate the exemplary favorable identification effect. We establish a web application at http://amp.denglab.org to provide convenient usage of our proposal and make the predictive framework, source code, and datasets publicly accessible at https://github.com/researchprotein/amp .</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"951-965"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141544779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Luteinizing Hormone Receptor Mutation (LHRN316S) Causes Abnormal Follicular Development Revealed by Follicle Single-Cell Analysis and CRISPR/Cas9. 卵泡单细胞分析和 CRISPR/Cas9 发现促黄体生成素受体突变(LHRN316S)导致卵泡发育异常
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-01 Epub Date: 2024-08-16 DOI: 10.1007/s12539-024-00646-7
Chen Zhang, Yongqiang Nie, Bufang Xu, Chunlan Mu, Geng G Tian, Xiaoyong Li, Weiwei Cheng, Aijun Zhang, Dali Li, Ji Wu

Abnormal interaction between granulosa cells and oocytes causes disordered development of ovarian follicles. However, the interactions between oocytes and cumulus granulosa cells (CGs), oocytes and mural granulosa cells (MGs), and CGs and MGs remain to be fully explored. Using single-cell RNA-sequencing (scRNA-seq), we determined the transcriptional profiles of oocytes, CGs and MGs in antral follicles. Analysis of scRNA-seq data revealed that CGs may regulate follicular development through the BMP15-KITL-KIT-PI3K-ARF6 pathway with elevated expression of luteinizing hormone receptor (LHR). Because internalization of the LHR is regulated by Arf6, we constructed LHRN316S mice by CRISPR/Cas9 to further explore mechanisms of follicular development and novel treatment strategies for female infertility. Ovaries of LHRN316S mice exhibited reduced numbers of corpora lutea and ovulation. The LHRN316S mice had a reduced rate of oocyte maturation in vitro and decreased serum progesterone levels. Mating LHRN316S female mice with ICR wild type male mice revealed that the infertility rate of LHRN316S mice was 21.4% (3/14). Litter sizes from LHRN316S mice were smaller than those from control wild type female mice. The oocytes from LHRN316S mice had an increased rate of maturation in vitro after progesterone administration in vitro. Furthermore, progesterone treated LHRN316S mice produced offspring numbers per litter equivalent to WT mice. These findings provide key insights into cellular interactions in ovarian follicles and provide important clues for infertility treatment.

颗粒细胞和卵母细胞之间的异常相互作用会导致卵泡发育紊乱。然而,卵母细胞与积层颗粒细胞(CGs)、卵母细胞与壁层颗粒细胞(MGs)以及CGs与MGs之间的相互作用仍有待充分探索。我们利用单细胞RNA测序(scRNA-seq)测定了前卵泡中卵母细胞、CGs和MGs的转录谱。scRNA-seq数据分析显示,CGs可能通过BMP15-KITL-KIT-PI3K-ARF6通路调控卵泡的发育,同时升高黄体生成素受体(LHR)的表达。由于LHR的内化受Arf6调控,我们通过CRISPR/Cas9构建了LHRN316S小鼠,以进一步探索卵泡发育机制和女性不孕症的新型治疗策略。LHRN316S小鼠的卵巢表现出黄体数量和排卵减少。LHRN316S 小鼠体外卵母细胞成熟率降低,血清孕酮水平下降。LHRN316S雌性小鼠与ICR野生型雄性小鼠交配显示,LHRN316S小鼠的不育率为21.4%(3/14)。与对照野生型雌性小鼠相比,LHRN316S小鼠的产仔数较少。体外注射黄体酮后,LHRN316S小鼠卵母细胞的体外成熟率增加。此外,经黄体酮处理的 LHRN316S 小鼠每胎产生的后代数量与 WT 小鼠相当。这些发现为卵巢卵泡中的细胞相互作用提供了重要见解,并为不孕症的治疗提供了重要线索。
{"title":"Luteinizing Hormone Receptor Mutation (LHR<sup>N316S</sup>) Causes Abnormal Follicular Development Revealed by Follicle Single-Cell Analysis and CRISPR/Cas9.","authors":"Chen Zhang, Yongqiang Nie, Bufang Xu, Chunlan Mu, Geng G Tian, Xiaoyong Li, Weiwei Cheng, Aijun Zhang, Dali Li, Ji Wu","doi":"10.1007/s12539-024-00646-7","DOIUrl":"10.1007/s12539-024-00646-7","url":null,"abstract":"<p><p>Abnormal interaction between granulosa cells and oocytes causes disordered development of ovarian follicles. However, the interactions between oocytes and cumulus granulosa cells (CGs), oocytes and mural granulosa cells (MGs), and CGs and MGs remain to be fully explored. Using single-cell RNA-sequencing (scRNA-seq), we determined the transcriptional profiles of oocytes, CGs and MGs in antral follicles. Analysis of scRNA-seq data revealed that CGs may regulate follicular development through the BMP15-KITL-KIT-PI3K-ARF6 pathway with elevated expression of luteinizing hormone receptor (LHR). Because internalization of the LHR is regulated by Arf6, we constructed LHR<sup>N316S</sup> mice by CRISPR/Cas9 to further explore mechanisms of follicular development and novel treatment strategies for female infertility. Ovaries of LHR<sup>N316S</sup> mice exhibited reduced numbers of corpora lutea and ovulation. The LHR<sup>N316S</sup> mice had a reduced rate of oocyte maturation in vitro and decreased serum progesterone levels. Mating LHR<sup>N316S</sup> female mice with ICR wild type male mice revealed that the infertility rate of LHR<sup>N316S</sup> mice was 21.4% (3/14). Litter sizes from LHR<sup>N316S</sup> mice were smaller than those from control wild type female mice. The oocytes from LHR<sup>N316S</sup> mice had an increased rate of maturation in vitro after progesterone administration in vitro. Furthermore, progesterone treated LHR<sup>N316S</sup> mice produced offspring numbers per litter equivalent to WT mice. These findings provide key insights into cellular interactions in ovarian follicles and provide important clues for infertility treatment.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"976-989"},"PeriodicalIF":3.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512921/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141987894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovery of Active Ingredient of Yinchenhao Decoction Targeting TLR4 for Hepatic Inflammatory Diseases Based on Deep Learning Approach. 基于深度学习方法发现靶向 TLR4 治疗肝脏炎症疾病的银翘解毒片有效成分
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-19 DOI: 10.1007/s12539-024-00670-7
Sizhe Zhang, Peng Han, Haiqing Sun, Ying Su, Chen Chen, Cheng Chen, Jinyao Li, Xiaoyi Lv, Xuecong Tian, Yandan Xu

Yinchenhao Decoction (YCHD), a classic formula in traditional Chinese medicine, is believed to have the potential to treat liver diseases by modulating the Toll-like receptor 4 (TLR4) target. Therefore, a thorough exploration of the effective components and therapeutic mechanisms targeting TLR4 in YCHD is a promising strategy for liver diseases. In this study, the AIGO-DTI deep learning framework was proposed to predict the targeting probability of major components in YCHD for TLR4. Comparative evaluations with four machine learning models (RF, SVM, KNN, XGBoost) and two deep learning models (GCN, GAT) demonstrated that the AIGO-DTI framework exhibited the best overall performance, with Recall and AUC reaching 0.968 and 0.991, respectively.This study further utilized the AIGO-DTI model to identify the potential impact of Isoscopoletin, a major component of YCHD, on TLR4. Subsequent wet experiments revealed that Isoscopoletin could influence the maturation of Dendritic Cells (DCs) induced by Lipopolysaccharide (LPS) through TLR4, suggesting its therapeutic potential for liver diseases, especially hepatitis. Additionally, based on the AIGO-DTI framework, this study established an online platform named TLR4-Predict to facilitate domain experts in discovering more compounds related to TLR4. Overall, the proposed AIGO-DTI framework accurately predicts unique compounds in YCHD that interact with TLR4, providing new insights for identifying and screening lead compounds targeting TLR4.

银翘散(YCHD)是传统中药中的经典方剂,被认为具有通过调节Toll样受体4(TLR4)靶点治疗肝病的潜力。因此,深入探讨 "养生堂 "中针对 TLR4 的有效成分和治疗机制,是治疗肝病的一项前景广阔的策略。本研究提出了AIGO-DTI深度学习框架来预测YCHD中主要成分对TLR4的靶向概率。与四种机器学习模型(RF、SVM、KNN、XGBoost)和两种深度学习模型(GCN、GAT)的比较评估表明,AIGO-DTI 框架表现出最佳的整体性能,其 Recall 和 AUC 分别达到 0.968 和 0.991。随后的湿实验表明,异莨菪亭可通过TLR4影响由脂多糖(LPS)诱导的树突状细胞(DCs)的成熟,这表明它对肝脏疾病,尤其是肝炎具有治疗潜力。此外,基于 AIGO-DTI 框架,本研究建立了一个名为 TLR4-Predict 的在线平台,以方便领域专家发现更多与 TLR4 相关的化合物。总之,所提出的 AIGO-DTI 框架能准确预测 YCHD 中与 TLR4 相互作用的独特化合物,为识别和筛选靶向 TLR4 的先导化合物提供了新的见解。
{"title":"Discovery of Active Ingredient of Yinchenhao Decoction Targeting TLR4 for Hepatic Inflammatory Diseases Based on Deep Learning Approach.","authors":"Sizhe Zhang, Peng Han, Haiqing Sun, Ying Su, Chen Chen, Cheng Chen, Jinyao Li, Xiaoyi Lv, Xuecong Tian, Yandan Xu","doi":"10.1007/s12539-024-00670-7","DOIUrl":"10.1007/s12539-024-00670-7","url":null,"abstract":"<p><p>Yinchenhao Decoction (YCHD), a classic formula in traditional Chinese medicine, is believed to have the potential to treat liver diseases by modulating the Toll-like receptor 4 (TLR4) target. Therefore, a thorough exploration of the effective components and therapeutic mechanisms targeting TLR4 in YCHD is a promising strategy for liver diseases. In this study, the AIGO-DTI deep learning framework was proposed to predict the targeting probability of major components in YCHD for TLR4. Comparative evaluations with four machine learning models (RF, SVM, KNN, XGBoost) and two deep learning models (GCN, GAT) demonstrated that the AIGO-DTI framework exhibited the best overall performance, with Recall and AUC reaching 0.968 and 0.991, respectively.This study further utilized the AIGO-DTI model to identify the potential impact of Isoscopoletin, a major component of YCHD, on TLR4. Subsequent wet experiments revealed that Isoscopoletin could influence the maturation of Dendritic Cells (DCs) induced by Lipopolysaccharide (LPS) through TLR4, suggesting its therapeutic potential for liver diseases, especially hepatitis. Additionally, based on the AIGO-DTI framework, this study established an online platform named TLR4-Predict to facilitate domain experts in discovering more compounds related to TLR4. Overall, the proposed AIGO-DTI framework accurately predicts unique compounds in YCHD that interact with TLR4, providing new insights for identifying and screening lead compounds targeting TLR4.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142667815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RAEPI: Predicting Enhancer-Promoter Interactions Based on Restricted Attention Mechanism. RAEPI:RAEPI:基于受限注意力机制预测促进剂与增强剂之间的相互作用
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-15 DOI: 10.1007/s12539-024-00669-0
Wanjing Zhang, Mingyang Zhang, Min Zhu

Enhancer-promoter interactions (EPIs) are crucial in gene transcription regulation and cell differentiation. Traditional biological experiments are costly and time-consuming, motivating the development of computational prediction methods. However, existing EPI prediction methods inadequately capture the intricate direct interactions between enhancer and promoter sequences, which limits their prediction performance to some extent. In this work, we propose an innovative attention-based approach RAEPI, which uses convolutional neural networks to extract initial features of enhancers and promoters, combined with a specially designed Restricted Attention mechanism with Query-Key-Value constrained to simulate the interactions between them for further feature extraction. To improve cross-cell line prediction, we employ a transfer learning strategy for pre-training. Furthermore, we extracted sequence motifs to evaluate the RAEPI's effectiveness from a visualization perspective. Experimental results show that RAEPI achieves competitive prediction performance to existing methods on the benchmark dataset.

增强子-启动子相互作用(EPIs)在基因转录调控和细胞分化中至关重要。传统的生物学实验既费钱又费时,因此人们开始开发计算预测方法。然而,现有的 EPI 预测方法不能充分捕捉增强子和启动子序列之间错综复杂的直接相互作用,这在一定程度上限制了它们的预测性能。在这项工作中,我们提出了一种基于注意力的创新方法 RAEPI,该方法利用卷积神经网络提取增强子和启动子的初始特征,并结合专门设计的限制注意力机制(Restricted Attention mechanism)和查询键值(Query-Key-Value)约束来模拟它们之间的相互作用,从而进一步提取特征。为了改进跨细胞系预测,我们采用了迁移学习策略进行预训练。此外,我们还提取了序列主题,从可视化角度评估 RAEPI 的有效性。实验结果表明,RAEPI 在基准数据集上取得了与现有方法相当的预测性能。
{"title":"RAEPI: Predicting Enhancer-Promoter Interactions Based on Restricted Attention Mechanism.","authors":"Wanjing Zhang, Mingyang Zhang, Min Zhu","doi":"10.1007/s12539-024-00669-0","DOIUrl":"https://doi.org/10.1007/s12539-024-00669-0","url":null,"abstract":"<p><p>Enhancer-promoter interactions (EPIs) are crucial in gene transcription regulation and cell differentiation. Traditional biological experiments are costly and time-consuming, motivating the development of computational prediction methods. However, existing EPI prediction methods inadequately capture the intricate direct interactions between enhancer and promoter sequences, which limits their prediction performance to some extent. In this work, we propose an innovative attention-based approach RAEPI, which uses convolutional neural networks to extract initial features of enhancers and promoters, combined with a specially designed Restricted Attention mechanism with Query-Key-Value constrained to simulate the interactions between them for further feature extraction. To improve cross-cell line prediction, we employ a transfer learning strategy for pre-training. Furthermore, we extracted sequence motifs to evaluate the RAEPI's effectiveness from a visualization perspective. Experimental results show that RAEPI achieves competitive prediction performance to existing methods on the benchmark dataset.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142638852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
iAmyP: A Multi-view Learning for Amyloidogenic Hexapeptides Identification Based on Sequence Least Squares Programming. iAmyP:基于序列最小二乘法编程的淀粉样蛋白六肽识别多视角学习。
IF 3.9 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-15 DOI: 10.1007/s12539-024-00666-3
Jinling Cai, Jianping Zhao, Yannan Bin, Junfeng Xia, Chunhou Zheng

The development of peptide drug is hindered by the risk of amyloidogenic aggregation; if peptides tend to aggregate in this manner, they may be unsuitable for drug design. Computational methods aimed at predicting amyloidogenic sequences often face challenges in extracting high-quality features, and their predictive performance can be enchanced. To surmount these challenges, iAmyP was introduced as a specialized computational tool designed for predicting amyloidogenic hexapeptides. Utilizing multi-view learning, iAmyP incorporated sequence, structural, and evolutionary features, performing feature selection and feature fusion through recursive feature elimination and attention mechanisms. This amalgamation of features and subsequent feature selection and fusion lead to optimal performance facilitated by an optimization algorithm based on sequence least squares programming. Notably, iAmyP exhibited robust generalization for peptides with lengths of 7-10 amino acids. The role of hydrophobic amino acids in the aggregation process is critical, and a thorough analysis have significantly enhanced our insight into their significance in amyloidogenic hexapeptides. This tool represented an advancement in the development of peptide therapeutics by providing an understanding of amyloidogenic aggregation, establishing itself as a valuable framework for assessing amyloidogenic sequences. The data and code can be freely accessed at https://github.com/xialab-ahu/iAmyP .

多肽药物的开发受到淀粉样蛋白聚集风险的阻碍;如果多肽倾向于以这种方式聚集,则可能不适合药物设计。旨在预测淀粉样蛋白生成序列的计算方法往往在提取高质量特征方面面临挑战,因此可以提高其预测性能。为了克服这些挑战,我们推出了 iAmyP,它是一种专门用于预测淀粉样蛋白生成六肽的计算工具。iAmyP 利用多视角学习,纳入了序列、结构和进化特征,通过递归特征消除和注意机制进行特征选择和特征融合。通过基于序列最小二乘法编程的优化算法,这种特征合并以及随后的特征选择和融合实现了最佳性能。值得注意的是,iAmyP 对长度为 7-10 个氨基酸的肽表现出强大的泛化能力。疏水氨基酸在聚合过程中的作用至关重要,对它们的深入分析大大提高了我们对它们在淀粉样蛋白六肽中的重要性的认识。该工具提供了对淀粉样蛋白致性聚集的理解,成为评估淀粉样蛋白致性序列的重要框架,从而推动了肽疗法的开发。数据和代码可在 https://github.com/xialab-ahu/iAmyP 免费获取。
{"title":"iAmyP: A Multi-view Learning for Amyloidogenic Hexapeptides Identification Based on Sequence Least Squares Programming.","authors":"Jinling Cai, Jianping Zhao, Yannan Bin, Junfeng Xia, Chunhou Zheng","doi":"10.1007/s12539-024-00666-3","DOIUrl":"https://doi.org/10.1007/s12539-024-00666-3","url":null,"abstract":"<p><p>The development of peptide drug is hindered by the risk of amyloidogenic aggregation; if peptides tend to aggregate in this manner, they may be unsuitable for drug design. Computational methods aimed at predicting amyloidogenic sequences often face challenges in extracting high-quality features, and their predictive performance can be enchanced. To surmount these challenges, iAmyP was introduced as a specialized computational tool designed for predicting amyloidogenic hexapeptides. Utilizing multi-view learning, iAmyP incorporated sequence, structural, and evolutionary features, performing feature selection and feature fusion through recursive feature elimination and attention mechanisms. This amalgamation of features and subsequent feature selection and fusion lead to optimal performance facilitated by an optimization algorithm based on sequence least squares programming. Notably, iAmyP exhibited robust generalization for peptides with lengths of 7-10 amino acids. The role of hydrophobic amino acids in the aggregation process is critical, and a thorough analysis have significantly enhanced our insight into their significance in amyloidogenic hexapeptides. This tool represented an advancement in the development of peptide therapeutics by providing an understanding of amyloidogenic aggregation, establishing itself as a valuable framework for assessing amyloidogenic sequences. The data and code can be freely accessed at https://github.com/xialab-ahu/iAmyP .</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142638847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Interdisciplinary Sciences: Computational Life Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1