首页 > 最新文献

Current Bioinformatics最新文献

英文 中文
Identification of Spatial Domains, Spatially Variable Genes, and Genetic Association Studies of Alzheimer Disease with an Autoencoder-based Fuzzy Clustering Algorithm 利用基于自动编码器的模糊聚类算法识别阿尔茨海默病的空间域、空间变异基因和遗传关联研究
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-01-21 DOI: 10.2174/0115748936278884240102094058
Yaxuan Cui, Leyi Wei, Ruheng Wang, Xiucai Ye, Tetsuya Sakurai
Introduction: Transcriptional gene expressions and their corresponding spatial information are critical for understanding the biological function, mutual regulation, and identification of various cell types. Materials and Methods: Recently, several computational methods have been proposed for clustering using spatial transcriptional expression. Although these algorithms have certain practicability, they cannot utilize spatial information effectively and are highly sensitive to noise and outliers. In this study, we propose ACSpot, an autoencoder-based fuzzy clustering algorithm, as a solution to tackle these problems. Specifically, we employed a self-supervised autoencoder to reduce feature dimensionality, mitigate nonlinear noise, and learn high-quality representations. Additionally, a commonly used clustering method, Fuzzy c-means, is used to achieve improved clustering results. In particular, we utilize spatial neighbor information to optimize the clustering process and to fine-tune each spot to its associated cluster category using probabilistic and statistical methods. Result and Discussion: The comparative analysis on the 10x Visium human dorsolateral prefrontal cortex (DLPFC) dataset demonstrates that ACSpot outperforms other clustering algorithms. Subsequently, spatially variable genes were identified based on the clustering outcomes, revealing a striking similarity between their spatial distribution and the subcluster spatial distribution from the clustering results. Notably, these spatially variable genes include APP, PSEN1, APOE, SORL1, BIN1, and PICALM, all of which are well-known Alzheimer's disease-associated genes. Conclusion: In addition, we applied our model to explore some potential Alzheimer's disease correlated genes within the dataset and performed Gene Ontology (GO) enrichment and gene-pathway analyses for validation, illustrating the capability of our model to pinpoint genes linked to Alzheimer’s disease.
引言转录基因表达及其相应的空间信息对于了解各种细胞类型的生物功能、相互调控和识别至关重要。材料与方法:最近,人们提出了几种利用空间转录表达进行聚类的计算方法。虽然这些算法具有一定的实用性,但它们不能有效利用空间信息,而且对噪声和异常值非常敏感。在本研究中,我们提出了基于自动编码器的模糊聚类算法 ACSpot 来解决这些问题。具体来说,我们采用了一种自监督自动编码器来降低特征维度、减轻非线性噪声并学习高质量的表示。此外,我们还采用了一种常用的聚类方法--模糊 c-means 来改善聚类结果。特别是,我们利用空间邻域信息来优化聚类过程,并使用概率和统计方法对每个点的相关聚类类别进行微调。结果与讨论:对 10 倍 Visium 人类背外侧前额叶皮层(DLPFC)数据集的比较分析表明,ACSpot 优于其他聚类算法。随后,根据聚类结果确定了空间可变基因,发现这些基因的空间分布与聚类结果中的子聚类空间分布具有惊人的相似性。值得注意的是,这些空间可变基因包括 APP、PSEN1、APOE、SORL1、BIN1 和 PICALM,它们都是众所周知的阿尔茨海默病相关基因。结论此外,我们还应用我们的模型探索了数据集中一些潜在的阿尔茨海默病相关基因,并进行了基因本体(GO)富集和基因通路分析进行验证,这说明我们的模型有能力准确定位与阿尔茨海默病相关的基因。
{"title":"Identification of Spatial Domains, Spatially Variable Genes, and Genetic Association Studies of Alzheimer Disease with an Autoencoder-based Fuzzy Clustering Algorithm","authors":"Yaxuan Cui, Leyi Wei, Ruheng Wang, Xiucai Ye, Tetsuya Sakurai","doi":"10.2174/0115748936278884240102094058","DOIUrl":"https://doi.org/10.2174/0115748936278884240102094058","url":null,"abstract":"Introduction: Transcriptional gene expressions and their corresponding spatial information are critical for understanding the biological function, mutual regulation, and identification of various cell types. Materials and Methods: Recently, several computational methods have been proposed for clustering using spatial transcriptional expression. Although these algorithms have certain practicability, they cannot utilize spatial information effectively and are highly sensitive to noise and outliers. In this study, we propose ACSpot, an autoencoder-based fuzzy clustering algorithm, as a solution to tackle these problems. Specifically, we employed a self-supervised autoencoder to reduce feature dimensionality, mitigate nonlinear noise, and learn high-quality representations. Additionally, a commonly used clustering method, Fuzzy c-means, is used to achieve improved clustering results. In particular, we utilize spatial neighbor information to optimize the clustering process and to fine-tune each spot to its associated cluster category using probabilistic and statistical methods. Result and Discussion: The comparative analysis on the 10x Visium human dorsolateral prefrontal cortex (DLPFC) dataset demonstrates that ACSpot outperforms other clustering algorithms. Subsequently, spatially variable genes were identified based on the clustering outcomes, revealing a striking similarity between their spatial distribution and the subcluster spatial distribution from the clustering results. Notably, these spatially variable genes include APP, PSEN1, APOE, SORL1, BIN1, and PICALM, all of which are well-known Alzheimer's disease-associated genes. Conclusion: In addition, we applied our model to explore some potential Alzheimer's disease correlated genes within the dataset and performed Gene Ontology (GO) enrichment and gene-pathway analyses for validation, illustrating the capability of our model to pinpoint genes linked to Alzheimer’s disease.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"240 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139518523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Mitophagy-Related Genes in Sepsis 鉴定败血症中与丝裂噬相关的基因
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-01-05 DOI: 10.2174/0115748936266722231116050255
Xiao-Yan Zeng, Min Zhang, Si-Jing Liao, Yong Wang, Ying-Bo Ren, Run Li, Tian-Mei Li, An-Qiong Mao, Guang-Zhen Li, Ying Zhang
Background: Numerous studies have shown that mitochondrial damage induces inflammation and activates inflammatory cells, leading to sepsis, while sepsis, a systemic inflammatory response syndrome, also exacerbates mitochondrial damage and hyperactivation. Mitochondrial autophagy eliminates aged, abnormal or damaged mitochondria to reduce intracellular mitochondrial stress and the release of mitochondria-associated molecules, thereby reducing the inflammatory response and cellular damage caused by sepsis. In addition, mitochondrial autophagy may also influence the onset and progression of sepsis, but the exact mechanisms are unclear. background: Sepsis is a critical systemic infection, a syndrome of severe inflammatory response of the organism to various pathogenic microorganisms. Methods: In this study, we mined the available publicly available microarray data in the GEO database (Home - GEO - NCBI (nih.gov)) with the aim of identifying key genes associated with mitochondrial autophagy in sepsis. objective: In this study, we used a bioinformatics approach to integrate multiple microarray data to screen for mitochondrial autophagy-related hub genes associated with sepsis onset and progression in a more scientific and systematic manner. Results: We identified four mitophagy-related genes in sepsis, TOMM20, TOMM22, TOMM40, and MFN1. method: Robust rank aggregation (RRA) Conclusion: This study provides preliminary evidence for the treatment of sepsis and may provide a solid foundation for subsequent biological studies. result: we constructed a PPI network combined with RRA analysis method to finally identify 4 key genes, namely TOMM20, TOMM22, TOMM40, and MFN1. conclusion: In this study, we used a bioinformatics analysis method, RRA, to integrate five gene microarray datasets to identify pivotal genes associated with mitochondrial autophagy in sepsis. Gene ontology (GO) functional annotation results show that these hub genes are mainly enriched in mitochondrial transport and establishment of protein localization to mitochondrion. Finally, we constructed the PPI network with the top 100 genes obtained from the rra method analysis. Based on the RRA results, the PPI results and the mitochondrial autophagy-related genes we found in the Reactome Pathway Database, we finally identified four key genes as TOMM20, TOMM22, TOMM40, and MFN1, respectively.
背景:大量研究表明,线粒体损伤会诱发炎症并激活炎症细胞,导致败血症,而败血症作为一种全身炎症反应综合征,也会加剧线粒体损伤和过度激活。线粒体自噬可消除老化、异常或受损的线粒体,减轻细胞内线粒体应激和线粒体相关分子的释放,从而减轻败血症引起的炎症反应和细胞损伤。此外,线粒体自噬也可能影响败血症的发生和发展,但具体机制尚不清楚:败血症是一种危重的全身性感染,是机体对各种病原微生物产生严重炎症反应的综合征。方法:在本研究中,我们挖掘了 GEO 数据库(Home - GEO - NCBI (nih.gov))中可公开获得的微阵列数据,目的是找出与败血症中线粒体自噬相关的关键基因:在本研究中,我们采用生物信息学方法整合多种微阵列数据,以更科学、更系统的方式筛选与脓毒症发病和进展相关的线粒体自噬相关枢纽基因。结果我们发现了四个与脓毒症相关的线粒体自噬基因:TOMM20、TOMM22、TOMM40 和 MFN1:稳健秩聚合(RRA) 结论:该研究为脓毒症的治疗提供了初步证据:本研究为脓毒症的治疗提供了初步证据,可为后续生物学研究奠定坚实基础。结果:我们构建了一个 PPI 网络,结合 RRA 分析方法,最终确定了 4 个关键基因,即 TOMM20、TOMM22、TOMM40 和 MFN1。结论:本研究为脓毒症的治疗提供了初步证据,可为后续生物学研究奠定坚实基础:在这项研究中,我们利用生物信息学分析方法 RRA 整合了 5 个基因芯片数据集,以确定脓毒症中与线粒体自噬相关的关键基因。基因本体论(GO)功能注释结果表明,这些枢纽基因主要富集于线粒体转运和蛋白质线粒体定位的建立。最后,我们利用 RRA 方法分析得到的前 100 个基因构建了 PPI 网络。根据RRA结果、PPI结果以及我们在Reactome Pathway数据库中发现的线粒体自噬相关基因,我们最终确定了四个关键基因,分别是TOMM20、TOMM22、TOMM40和MFN1。
{"title":"Identification of Mitophagy-Related Genes in Sepsis","authors":"Xiao-Yan Zeng, Min Zhang, Si-Jing Liao, Yong Wang, Ying-Bo Ren, Run Li, Tian-Mei Li, An-Qiong Mao, Guang-Zhen Li, Ying Zhang","doi":"10.2174/0115748936266722231116050255","DOIUrl":"https://doi.org/10.2174/0115748936266722231116050255","url":null,"abstract":"Background: Numerous studies have shown that mitochondrial damage induces inflammation and activates inflammatory cells, leading to sepsis, while sepsis, a systemic inflammatory response syndrome, also exacerbates mitochondrial damage and hyperactivation. Mitochondrial autophagy eliminates aged, abnormal or damaged mitochondria to reduce intracellular mitochondrial stress and the release of mitochondria-associated molecules, thereby reducing the inflammatory response and cellular damage caused by sepsis. In addition, mitochondrial autophagy may also influence the onset and progression of sepsis, but the exact mechanisms are unclear. background: Sepsis is a critical systemic infection, a syndrome of severe inflammatory response of the organism to various pathogenic microorganisms. Methods: In this study, we mined the available publicly available microarray data in the GEO database (Home - GEO - NCBI (nih.gov)) with the aim of identifying key genes associated with mitochondrial autophagy in sepsis. objective: In this study, we used a bioinformatics approach to integrate multiple microarray data to screen for mitochondrial autophagy-related hub genes associated with sepsis onset and progression in a more scientific and systematic manner. Results: We identified four mitophagy-related genes in sepsis, TOMM20, TOMM22, TOMM40, and MFN1. method: Robust rank aggregation (RRA) Conclusion: This study provides preliminary evidence for the treatment of sepsis and may provide a solid foundation for subsequent biological studies. result: we constructed a PPI network combined with RRA analysis method to finally identify 4 key genes, namely TOMM20, TOMM22, TOMM40, and MFN1. conclusion: In this study, we used a bioinformatics analysis method, RRA, to integrate five gene microarray datasets to identify pivotal genes associated with mitochondrial autophagy in sepsis. Gene ontology (GO) functional annotation results show that these hub genes are mainly enriched in mitochondrial transport and establishment of protein localization to mitochondrion. Finally, we constructed the PPI network with the top 100 genes obtained from the rra method analysis. Based on the RRA results, the PPI results and the mitochondrial autophagy-related genes we found in the Reactome Pathway Database, we finally identified four key genes as TOMM20, TOMM22, TOMM40, and MFN1, respectively.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"21 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139376298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Natural Graph for Efficient Clustering of Virus Genome Sequences 高效聚类病毒基因组序列的新型自然图谱
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-15 DOI: 10.2174/0115748936269106231025064143
Harris Song, Nan Sun, Wenping Yu, Stephen Yau
Background: This study addresses the need for analyzing viral genome sequences and understanding their genetic relationships. The focus is on introducing a novel natural graph approach as a solution. Objective: The objective of this study is to demonstrate the effectiveness and advantages of the proposed natural graph approach in clustering viral genome sequences into distinct clades, subtypes, or districts. Additionally, the aim is to explore its interpretability, potential applications, and implications for pandemic control and public health interventions. Methods: The study utilizes the proposed natural graph algorithm to cluster viral genome sequences. The results are compared with existing methods and multidimensional scaling to evaluate the performance and effectiveness of the approach. Results: The natural graph approach successfully clusters viral genome sequences, providing valuable insights into viral evolution and transmission dynamics. The ability to generate directed connections between nodes enhances the interpretability of the results, facilitating the investigation of transmission pathways and viral fitness. Conclusion: The findings highlight the potential applications of the natural graph algorithm in pandemic control, transmission tracing, and vaccine design. Future research directions may involve scaling up the analysis to larger datasets and incorporating additional genetic features for improved resolution. The natural graph approach presents a promising tool for viral genomics research with implications for public health interventions.
研究背景本研究解决了分析病毒基因组序列和了解其遗传关系的需求。重点是引入一种新颖的自然图方法作为解决方案。研究目的本研究的目的是证明所提出的自然图方法在将病毒基因组序列聚类为不同的支系、亚型或区方面的有效性和优势。此外,目的还在于探索其可解释性、潜在应用以及对流行病控制和公共卫生干预的影响。研究方法本研究利用提出的自然图算法对病毒基因组序列进行聚类。研究结果与现有方法和多维尺度进行了比较,以评估该方法的性能和有效性。结果:自然图方法成功聚类了病毒基因组序列,为了解病毒进化和传播动态提供了宝贵的信息。节点之间产生有向连接的能力增强了结果的可解释性,为研究传播途径和病毒适应性提供了便利。结论研究结果凸显了自然图算法在大流行病控制、传播追踪和疫苗设计方面的潜在应用。未来的研究方向可能包括将分析扩展到更大的数据集,并纳入更多遗传特征以提高分辨率。自然图方法为病毒基因组学研究提供了一种前景广阔的工具,对公共卫生干预具有重要意义。
{"title":"A Novel Natural Graph for Efficient Clustering of Virus Genome Sequences","authors":"Harris Song, Nan Sun, Wenping Yu, Stephen Yau","doi":"10.2174/0115748936269106231025064143","DOIUrl":"https://doi.org/10.2174/0115748936269106231025064143","url":null,"abstract":"Background: This study addresses the need for analyzing viral genome sequences and understanding their genetic relationships. The focus is on introducing a novel natural graph approach as a solution. Objective: The objective of this study is to demonstrate the effectiveness and advantages of the proposed natural graph approach in clustering viral genome sequences into distinct clades, subtypes, or districts. Additionally, the aim is to explore its interpretability, potential applications, and implications for pandemic control and public health interventions. Methods: The study utilizes the proposed natural graph algorithm to cluster viral genome sequences. The results are compared with existing methods and multidimensional scaling to evaluate the performance and effectiveness of the approach. Results: The natural graph approach successfully clusters viral genome sequences, providing valuable insights into viral evolution and transmission dynamics. The ability to generate directed connections between nodes enhances the interpretability of the results, facilitating the investigation of transmission pathways and viral fitness. Conclusion: The findings highlight the potential applications of the natural graph algorithm in pandemic control, transmission tracing, and vaccine design. Future research directions may involve scaling up the analysis to larger datasets and incorporating additional genetic features for improved resolution. The natural graph approach presents a promising tool for viral genomics research with implications for public health interventions.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"33 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138687256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Single-cell and Bulk RNA Sequencing Reveals Stemness Phenotype Associated with Clinical Outcomes and Potential Immune Evasion Mechanisms in Hepatocellular Carcinoma 整合单细胞和大容量 RNA 测序揭示与肝细胞癌临床结果和潜在免疫逃避机制相关的干性表型
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-15 DOI: 10.2174/0115748936268168231114103440
Xiaojing Zhu, Jiaxing Zhang, Zixin Zhang, Hongyan Yuan, Aimin xie, Nan Zhang, Mingwei wang, Minghui jiang, Yanqi Xiao, Hao Wang, Xing Wang, Yan Xu
Aims: Bulk and single-cell RNA sequencing data were analyzed to explore the association of stemness phenotype with dysfunctional anti-tumor immunity and its impact on clinical outcomes of primary and relapse HCC. Background: The stemness phenotype is gradually acquired during cancer progression; however, it remains unclear the effect of stemness phenotype on recurrence and clinical outcomes in hepatocellular carcinoma (HCC). Methods: The stemness index (mRNAsi) calculated by a one-class logistic regression algorithm in multiple HCC cohorts was defined as the stemness phenotype of the patient. Using single-cell profiling in primary or early-relapse HCC, cell stemness phenotypes were evaluated by developmental potential. Differential analysis of stemness phenotype, gene expression and interactions between primary and recurrent samples revealed the underlying immune evasion mechanisms. Results: A significant mRNAsi association with HCC patient clinical outcomes was found. The high and low mRNAsi groups had distinct tumor immune microenvironments. Cellular stemness phenotype varied by cell type. Moreover, compared with primary tumors, early-relapse tumors had increased stemness of dendritic cells and tumor cells and reduced stemness of T cells and B cells. Moreover, in relapse tumors, CD8+ T cells displayed a low stemness state, with a high exhausted state, unlike the high stemness state observed in primary HCC. Conclusions: The comprehensive characterization of the HCC stemness phenotype provides insights into the clinical outcomes and immune escape mechanisms associated with recurrence.
目的:通过分析大量和单细胞 RNA 测序数据,探讨干性表型与抗肿瘤免疫功能失调的关联及其对原发性和复发性 HCC 临床预后的影响。研究背景干性表型是在癌症进展过程中逐渐获得的;然而,干性表型对肝细胞癌(HCC)复发和临床预后的影响仍不清楚。研究方法在多个HCC队列中通过单类逻辑回归算法计算出的干性指数(mRNAsi)被定义为患者的干性表型。在原发性或早期复发的HCC中使用单细胞图谱,通过发育潜能评估细胞的干性表型。对原发样本和复发样本的干性表型、基因表达和相互作用的差异分析揭示了潜在的免疫逃避机制。结果显示发现mRNAsi与HCC患者的临床结果有明显的关联。高mRNAsi组和低mRNAsi组具有不同的肿瘤免疫微环境。细胞干表型因细胞类型而异。此外,与原发性肿瘤相比,早期复发肿瘤的树突状细胞和肿瘤细胞的干性增强,而T细胞和B细胞的干性降低。此外,在复发肿瘤中,CD8+ T细胞显示出低干性状态和高耗竭状态,这与原发性HCC中观察到的高干性状态不同。结论对HCC干性表型的全面描述有助于深入了解与复发相关的临床结果和免疫逃逸机制。
{"title":"Integrating Single-cell and Bulk RNA Sequencing Reveals Stemness Phenotype Associated with Clinical Outcomes and Potential Immune Evasion Mechanisms in Hepatocellular Carcinoma","authors":"Xiaojing Zhu, Jiaxing Zhang, Zixin Zhang, Hongyan Yuan, Aimin xie, Nan Zhang, Mingwei wang, Minghui jiang, Yanqi Xiao, Hao Wang, Xing Wang, Yan Xu","doi":"10.2174/0115748936268168231114103440","DOIUrl":"https://doi.org/10.2174/0115748936268168231114103440","url":null,"abstract":"Aims: Bulk and single-cell RNA sequencing data were analyzed to explore the association of stemness phenotype with dysfunctional anti-tumor immunity and its impact on clinical outcomes of primary and relapse HCC. Background: The stemness phenotype is gradually acquired during cancer progression; however, it remains unclear the effect of stemness phenotype on recurrence and clinical outcomes in hepatocellular carcinoma (HCC). Methods: The stemness index (mRNAsi) calculated by a one-class logistic regression algorithm in multiple HCC cohorts was defined as the stemness phenotype of the patient. Using single-cell profiling in primary or early-relapse HCC, cell stemness phenotypes were evaluated by developmental potential. Differential analysis of stemness phenotype, gene expression and interactions between primary and recurrent samples revealed the underlying immune evasion mechanisms. Results: A significant mRNAsi association with HCC patient clinical outcomes was found. The high and low mRNAsi groups had distinct tumor immune microenvironments. Cellular stemness phenotype varied by cell type. Moreover, compared with primary tumors, early-relapse tumors had increased stemness of dendritic cells and tumor cells and reduced stemness of T cells and B cells. Moreover, in relapse tumors, CD8+ T cells displayed a low stemness state, with a high exhausted state, unlike the high stemness state observed in primary HCC. Conclusions: The comprehensive characterization of the HCC stemness phenotype provides insights into the clinical outcomes and immune escape mechanisms associated with recurrence.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"10 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138686838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel In silico Filtration Method for Discovery of Encrypted Antimicrobial Peptides 发现加密抗菌肽的新型硅学过滤方法
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-07 DOI: 10.2174/0115748936274103231114105340
Farnoosh Barneh, Ahmad Nazarian, Rezvan Mousavi-nadushan, Kamran Pooshang Bagheri
Background:: Antibacterial resistance has been one of the most important causes of death in the last few decades, necessitating the need to discover new antibiotics. Antimicrobial peptides (AMPs) are among the best candidates due to their broad-spectrum and potent activity against bacteria and low probability of developing resistance against them. Objective:: In this study, we proposed a novel filtration method using knowledge-based approaches to discover encrypted AMPs within a protein sequence Methods:: The encrypted AMPs were selected from a protein sequence, in this case, lactoferrin, based on hydrophobicity, cationicity, alpha-helix structure, helical wheel projection, and binding affinities to gram-negative and positive bacterial membranes. Results:: Six out of 20 potential encrypted AMPs were ultimately selected for further assays. Molecular docking of the selected AMPs with outer and inner membranes of gram-negative bacteria and also gram-positive bacterial membranes showed reasonable binding affinity ranging from ‘-6.7 to -7.5’ and ‘- 4.5 to -5.7’ and ‘-4.6 to -5.7’ kcal/mol, respectively. No toxicity was shown in the candidate AMPs. Conclusion:: According to in silico results, our method succeeded to discover six new encrypted AMPs from human lactoferrin, designated as lactoferrin-derived peptides (LDPs). Further in silico and experimental assays should also be performed to prove the efficiency of our knowledge-based filtration method.
背景过去几十年来,抗菌药耐药性已成为导致死亡的最重要原因之一,因此有必要发现新的抗生素。抗菌肽(AMPs)具有广谱、强效的抗菌活性,而且产生抗药性的可能性较低,因此是最佳候选药物之一。研究目的在这项研究中,我们提出了一种新颖的过滤方法,利用基于知识的方法来发现蛋白质序列中的加密 AMPs 方法::根据疏水性、阳离子性、α-螺旋结构、螺旋轮投影以及与革兰氏阴性和阳性细菌膜的结合亲和力,从蛋白质序列(本例中为乳铁蛋白)中筛选出加密的 AMPs。研究结果最终从 20 个潜在的加密 AMP 中选出了 6 个进行进一步检测。所选 AMP 与革兰氏阴性细菌外膜和内膜以及革兰氏阳性细菌膜的分子对接显示出合理的结合亲和力,分别为"-6.7 至 -7.5"、"-4.5 至 -5.7 "和"-4.6 至 -5.7 "千卡/摩尔。候选 AMP 未显示出毒性。结论根据硅学结果,我们的方法成功地从人类乳铁蛋白中发现了六种新的加密 AMPs,并将其命名为乳铁蛋白衍生肽(LDPs)。为了证明我们基于知识的过滤方法的效率,还需要进行进一步的硅学和实验分析。
{"title":"A Novel In silico Filtration Method for Discovery of Encrypted Antimicrobial Peptides","authors":"Farnoosh Barneh, Ahmad Nazarian, Rezvan Mousavi-nadushan, Kamran Pooshang Bagheri","doi":"10.2174/0115748936274103231114105340","DOIUrl":"https://doi.org/10.2174/0115748936274103231114105340","url":null,"abstract":"Background:: Antibacterial resistance has been one of the most important causes of death in the last few decades, necessitating the need to discover new antibiotics. Antimicrobial peptides (AMPs) are among the best candidates due to their broad-spectrum and potent activity against bacteria and low probability of developing resistance against them. Objective:: In this study, we proposed a novel filtration method using knowledge-based approaches to discover encrypted AMPs within a protein sequence Methods:: The encrypted AMPs were selected from a protein sequence, in this case, lactoferrin, based on hydrophobicity, cationicity, alpha-helix structure, helical wheel projection, and binding affinities to gram-negative and positive bacterial membranes. Results:: Six out of 20 potential encrypted AMPs were ultimately selected for further assays. Molecular docking of the selected AMPs with outer and inner membranes of gram-negative bacteria and also gram-positive bacterial membranes showed reasonable binding affinity ranging from ‘-6.7 to -7.5’ and ‘- 4.5 to -5.7’ and ‘-4.6 to -5.7’ kcal/mol, respectively. No toxicity was shown in the candidate AMPs. Conclusion:: According to in silico results, our method succeeded to discover six new encrypted AMPs from human lactoferrin, designated as lactoferrin-derived peptides (LDPs). Further in silico and experimental assays should also be performed to prove the efficiency of our knowledge-based filtration method.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"11 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138555680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimized Hybrid Deep Learning for Real-Time Pandemic Data Forecasting: Long and Short-Term Perspectives 用于实时流行病数据预测的优化混合深度学习:长期和短期视角
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-07 DOI: 10.2174/0115748936257412231120113648
Sujata Dash, Sourav Kumar Giri, Subhendu Kumar Pani, Saurav Mallik, Mingqiang Wang, Hong Qin
Background:: With new variants of COVID-19 causing challenges, we need to focus on integrating multiple deep-learning frameworks to develop intelligent healthcare systems for early detection and diagnosis. Objective:: This article suggests three hybrid deep learning models, namely CNN-LSTM, CNN-Bi- LSTM, and CNN-GRU, to address the pressing need for an intelligent healthcare system. These models are designed to capture spatial and temporal patterns in COVID-19 data, thereby improving the accuracy and timeliness of predictions. An output forecasting framework integrates these models, and an optimization algorithm automatically selects the hyperparameters for the 13 baselines and the three proposed hybrid models. Methods:: Real-time time series data from the five most affected countries were used to test the effectiveness of the proposed models. Baseline models were compared, and optimization algorithms were employed to improve forecasting capabilities. Results:: CNN-GRU and CNN-LSTM are the top short- and long-term forecasting models. CNNGRU had the best performance with the lowest SMAPE and MAPE values for long-term forecasting in India at 3.07% and 3.17%, respectively, and impressive results for short-term forecasting with SMAPE and MAPE values of 1.46% and 1.47%. Conclusion:: Hybrid deep learning models, like CNN-GRU, can aid in early COVID-19 assessment and diagnosis. They detect patterns in data for effective governmental strategies and forecasting. This helps manage and mitigate the pandemic faster and more accurately.
背景::随着 COVID-19 的新变种带来挑战,我们需要专注于整合多种深度学习框架,以开发用于早期检测和诊断的智能医疗系统。目标本文提出了三种混合深度学习模型,即 CNN-LSTM、CNN-Bi-LSTM 和 CNN-GRU,以满足对智能医疗系统的迫切需求。这些模型旨在捕捉 COVID-19 数据中的空间和时间模式,从而提高预测的准确性和及时性。一个输出预测框架集成了这些模型,一个优化算法自动为 13 个基线模型和三个拟议的混合模型选择超参数。方法使用五个受影响最严重国家的实时时间序列数据来测试所提模型的有效性。对基线模型进行比较,并采用优化算法提高预测能力。结果CNN-GRU 和 CNN-LSTM 是最优秀的短期和长期预测模型。CNNGRU 性能最佳,在印度的长期预测中 SMAPE 和 MAPE 值最低,分别为 3.07% 和 3.17%,在短期预测中 SMAPE 和 MAPE 值分别为 1.46% 和 1.47%,结果令人印象深刻。结论混合深度学习模型(如 CNN-GRU)可以帮助进行早期 COVID-19 评估和诊断。它们能检测数据中的模式,从而制定有效的政府战略并进行预测。这有助于更快、更准确地管理和缓解大流行病。
{"title":"Optimized Hybrid Deep Learning for Real-Time Pandemic Data Forecasting: Long and Short-Term Perspectives","authors":"Sujata Dash, Sourav Kumar Giri, Subhendu Kumar Pani, Saurav Mallik, Mingqiang Wang, Hong Qin","doi":"10.2174/0115748936257412231120113648","DOIUrl":"https://doi.org/10.2174/0115748936257412231120113648","url":null,"abstract":"Background:: With new variants of COVID-19 causing challenges, we need to focus on integrating multiple deep-learning frameworks to develop intelligent healthcare systems for early detection and diagnosis. Objective:: This article suggests three hybrid deep learning models, namely CNN-LSTM, CNN-Bi- LSTM, and CNN-GRU, to address the pressing need for an intelligent healthcare system. These models are designed to capture spatial and temporal patterns in COVID-19 data, thereby improving the accuracy and timeliness of predictions. An output forecasting framework integrates these models, and an optimization algorithm automatically selects the hyperparameters for the 13 baselines and the three proposed hybrid models. Methods:: Real-time time series data from the five most affected countries were used to test the effectiveness of the proposed models. Baseline models were compared, and optimization algorithms were employed to improve forecasting capabilities. Results:: CNN-GRU and CNN-LSTM are the top short- and long-term forecasting models. CNNGRU had the best performance with the lowest SMAPE and MAPE values for long-term forecasting in India at 3.07% and 3.17%, respectively, and impressive results for short-term forecasting with SMAPE and MAPE values of 1.46% and 1.47%. Conclusion:: Hybrid deep learning models, like CNN-GRU, can aid in early COVID-19 assessment and diagnosis. They detect patterns in data for effective governmental strategies and forecasting. This helps manage and mitigate the pandemic faster and more accurately.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"92 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138556751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying Pathological Myopia Associated Genes with A Random Walk- Based Method in Protein-Protein Interaction Network 用基于随机漫步的方法在蛋白质-蛋白质相互作用网络中识别病理性近视相关基因
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-07 DOI: 10.2174/0115748936268218231114070754
Jiyu Zhang, Tao Huang, Qiao Sun, Jian Zhang
Background:: Pathological myopia, a severe variant of myopia, extends beyond the typical refractive error associated with nearsightedness. While the condition has a strong genetic component, the intricate mechanisms of inheritance remain elusive. Some genes have been associated with the development of pathological myopia, but their exact roles are not fully understood. Objective:: This study aimed to identify novel genes associated with pathological myopia Methods:: Our study leveraged DisGeNET to identify 184 genes linked with high myopia and 39 genes related to degenerative myopia. To uncover additional pathological myopia-associated genes, we employed the random walk with restart algorithm to investigate the protein-protein interactions network. We used the previously identified 184 high myopia and 39 degenerative myopia genes as seed nodes. Results:: Through subsequent screening tests, we discarded genes with weak associations, yielding 103 new genes for high myopia and 33 for degenerative myopia. Conclusion:: We confirmed the association of certain genes, including six genes that were confirmed to be associated with both high and degenerative myopia. The newly discovered genes are helpful to uncover and understand the pathogenesis of myopia.
背景病理性近视是近视的一种严重变异,超出了与近视相关的典型屈光不正。虽然这种病症有很强的遗传因素,但其复杂的遗传机制仍然难以捉摸。一些基因与病理性近视的发生有关,但其确切作用尚不完全清楚。研究目的本研究旨在确定与病理性近视相关的新基因 方法::我们的研究利用 DisGeNET 发现了 184 个与高度近视相关的基因和 39 个与退化性近视相关的基因。为了发现更多的病理性近视相关基因,我们采用了随机行走与重启算法来研究蛋白质-蛋白质相互作用网络。我们将之前确定的 184 个高度近视基因和 39 个退化性近视基因作为种子节点。结果通过随后的筛选测试,我们剔除了关联性较弱的基因,得到了 103 个新的高度近视基因和 33 个新的退化性近视基因。结论我们证实了某些基因之间的关联,其中有 6 个基因被证实与高度近视和退化性近视都有关联。新发现的基因有助于揭示和了解近视的发病机理。
{"title":"Identifying Pathological Myopia Associated Genes with A Random Walk- Based Method in Protein-Protein Interaction Network","authors":"Jiyu Zhang, Tao Huang, Qiao Sun, Jian Zhang","doi":"10.2174/0115748936268218231114070754","DOIUrl":"https://doi.org/10.2174/0115748936268218231114070754","url":null,"abstract":"Background:: Pathological myopia, a severe variant of myopia, extends beyond the typical refractive error associated with nearsightedness. While the condition has a strong genetic component, the intricate mechanisms of inheritance remain elusive. Some genes have been associated with the development of pathological myopia, but their exact roles are not fully understood. Objective:: This study aimed to identify novel genes associated with pathological myopia Methods:: Our study leveraged DisGeNET to identify 184 genes linked with high myopia and 39 genes related to degenerative myopia. To uncover additional pathological myopia-associated genes, we employed the random walk with restart algorithm to investigate the protein-protein interactions network. We used the previously identified 184 high myopia and 39 degenerative myopia genes as seed nodes. Results:: Through subsequent screening tests, we discarded genes with weak associations, yielding 103 new genes for high myopia and 33 for degenerative myopia. Conclusion:: We confirmed the association of certain genes, including six genes that were confirmed to be associated with both high and degenerative myopia. The newly discovered genes are helpful to uncover and understand the pathogenesis of myopia.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"82 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138555910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering Microbe-disease Associations with Weighted Graph Convolution Networks and Taxonomy Common Tree 利用加权图卷积网络和分类公共树发现微生物与疾病的关联
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-04 DOI: 10.2174/0115748936270441231116093650
Jieqi Xing, Yu Shi, Xiaoquan Su, Shunyao Wu
Background:: Microbe-disease associations are integral to understanding complex dis-eases and their screening procedures. Objective:: While numerous computational methods have been developed to detect these associa-tions, their performance remains limited due to inadequate utilization of weighted inherent similari-ties and microbial taxonomy hierarchy. To address this limitation, we have introduced WTHMDA (weighted taxonomic heterogeneous network-based microbe-disease association), a novel deep learning framework. Methods:: WTHMDA combines a weighted graph convolution network and the microbial taxono-my common tree to predict microbe-disease associations effectively. The framework extracts mul-tiple microbe similarities from the taxonomy common tree, facilitating the construction of a mi-crobe-disease heterogeneous interaction network. Utilizing a weighted DeepWalk algorithm, node embeddings in the network incorporate weight information from the similarities. Subsequently, a deep neural network (DNN) model accurately predicts microbe-disease associations based on this interaction network. Results:: Extensive experiments on multiple datasets and case studies demonstrate WTHMDA's su-periority over existing approaches, particularly in predicting unknown associations. Conclusion:: Our proposed method offers a new strategy for discovering microbe-disease linkages, showcasing remarkable performance and enhancing the feasibility of identifying disease risk.
背景:微生物与疾病的关联是理解复杂疾病及其筛查程序的必要条件。虽然已经开发了许多计算方法来检测这些关联,但由于加权固有相似性和微生物分类层次的利用不足,它们的性能仍然有限。为了解决这一限制,我们引入了一种新的深度学习框架WTHMDA(加权分类异构网络微生物-疾病关联)。方法:WTHMDA将加权图卷积网络与微生物分类树相结合,有效预测微生物与疾病的关联。该框架从分类共同树中提取多个微生物相似性,促进了微生物-微生物-疾病异质相互作用网络的构建。利用加权DeepWalk算法,网络中的节点嵌入结合了相似度的权重信息。随后,深度神经网络(DNN)模型基于该相互作用网络准确预测微生物与疾病的关联。结果:对多个数据集的广泛实验和案例研究表明,WTHMDA优于现有方法,特别是在预测未知关联方面。结论:我们提出的方法为发现微生物与疾病的联系提供了一种新的策略,表现出显著的性能,并提高了识别疾病风险的可行性。
{"title":"Discovering Microbe-disease Associations with Weighted Graph Convolution Networks and Taxonomy Common Tree","authors":"Jieqi Xing, Yu Shi, Xiaoquan Su, Shunyao Wu","doi":"10.2174/0115748936270441231116093650","DOIUrl":"https://doi.org/10.2174/0115748936270441231116093650","url":null,"abstract":"Background:: Microbe-disease associations are integral to understanding complex dis-eases and their screening procedures. Objective:: While numerous computational methods have been developed to detect these associa-tions, their performance remains limited due to inadequate utilization of weighted inherent similari-ties and microbial taxonomy hierarchy. To address this limitation, we have introduced WTHMDA (weighted taxonomic heterogeneous network-based microbe-disease association), a novel deep learning framework. Methods:: WTHMDA combines a weighted graph convolution network and the microbial taxono-my common tree to predict microbe-disease associations effectively. The framework extracts mul-tiple microbe similarities from the taxonomy common tree, facilitating the construction of a mi-crobe-disease heterogeneous interaction network. Utilizing a weighted DeepWalk algorithm, node embeddings in the network incorporate weight information from the similarities. Subsequently, a deep neural network (DNN) model accurately predicts microbe-disease associations based on this interaction network. Results:: Extensive experiments on multiple datasets and case studies demonstrate WTHMDA's su-periority over existing approaches, particularly in predicting unknown associations. Conclusion:: Our proposed method offers a new strategy for discovering microbe-disease linkages, showcasing remarkable performance and enhancing the feasibility of identifying disease risk.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"2 2 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138514996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metabolomics: Recent Advances and Future Prospects Unveiled 代谢组学:最新进展和未来展望
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-04 DOI: 10.2174/0115748936270744231115110329
Shweta Sharma, Garima Singh, Mymoona Akhter
: In the era of genomics, fueled by advanced technologies and analytical tools, metabo-lomics has become a vital component in biomedical research. Its significance spans various do-mains, encompassing biomarker identification, uncovering underlying mechanisms and pathways, as well as the exploration of new drug targets and precision medicine. This article presents a com-prehensive overview of the latest developments in metabolomics techniques, emphasizing their wide-ranging applications across diverse research fields and underscoring their immense potential for future advancements.
在基因组学时代,在先进技术和分析工具的推动下,代谢组学已成为生物医学研究的重要组成部分。它的意义涵盖了各种各样的领域,包括生物标志物鉴定,揭示潜在的机制和途径,以及探索新的药物靶点和精准医学。本文全面概述了代谢组学技术的最新发展,强调了它们在不同研究领域的广泛应用,并强调了它们未来发展的巨大潜力。
{"title":"Metabolomics: Recent Advances and Future Prospects Unveiled","authors":"Shweta Sharma, Garima Singh, Mymoona Akhter","doi":"10.2174/0115748936270744231115110329","DOIUrl":"https://doi.org/10.2174/0115748936270744231115110329","url":null,"abstract":": In the era of genomics, fueled by advanced technologies and analytical tools, metabo-lomics has become a vital component in biomedical research. Its significance spans various do-mains, encompassing biomarker identification, uncovering underlying mechanisms and pathways, as well as the exploration of new drug targets and precision medicine. This article presents a com-prehensive overview of the latest developments in metabolomics techniques, emphasizing their wide-ranging applications across diverse research fields and underscoring their immense potential for future advancements.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"1 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138514997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stacking-Kcr: A Stacking Model for Predicting the Crotonylation Sites of Lysine by Fusing Serial and Automatic Encoder 堆叠- kcr:融合序列和自动编码器预测赖氨酸Crotonylation位点的堆叠模型
IF 4 3区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS Pub Date : 2023-12-04 DOI: 10.2174/0115748936272040231117114252
Ying Liang, Suhui Li, Xiya You, You Guo, Jianjun Tang
Background:: Protein lysine crotonylation (Kcr), a newly discovered important post-translational modification (PTM), is typically localized at the transcription start site and regulates gene expression, which is associated with a variety of pathological conditions such as developmen-tal defects and malignant transformation. Objective:: Identifying Kcr sites is advantageous for the discovery of its biological mechanism and the development of new drugs for related diseases. However, traditional experimental methods for identifying Kcr sites are expensive and inefficient, necessitating the development of new computa-tional techniques. Methods:: In this work, to accurately identify Kcr sites, we propose a model for ensemble learning called Stacking-Kcr. Firstly, extract features from sequence information, physicochemical proper-ties, and sequence fragment similarity. Then, the two characteristics of sequence information and physicochemical properties are fused using automatic encoder and serial, respectively. Finally, the fused two features and sequence fragment similarity features are then respectively input into the four base classifiers, a meta classifier is constructed using the first level prediction results, and the final forecasting results are obtained. method: In this work, to accurately identify Kcr sites, we propose a model for ensemble learning called Stacking-Kcr. Firstly, extract features from sequence information, physicochemical properties, and sequence fragment similarity. Then, the two characteristics of sequence information and physicochemical properties are fused using automatic encoder and serial, respectively. Finally, the fused two features and sequence fragment similarity features are then respectively input into the four base classifiers, a meta classifier is constructed using the first level prediction results, and the final forecasting results are obtained. Results:: The five-fold cross-validation of this model has achieved an accuracy of 0.828 and an AUC of 0.910. This shows that the Stacking-Kcr method has obvious advantages over traditional machine learning methods. On independent test sets, Stacking-Kcr achieved an accuracy of 84.89% and an AUC of 92.21%, which was higher than 1.7% and 0.8% of other state-of-the-art tools. Addi-tionally, we trained Stacking-Kcr on the phosphorylation site, and the result is superior to the cur-rent model. Conclusion:: These outcomes are additional evidence that Stacking-Kcr has strong application po-tential and generalization performance.
背景:蛋白质赖氨酸巴丁酰化(Protein lysine crotonylation, Kcr)是一种新发现的重要的翻译后修饰(PTM),它通常定位于转录起始位点,调控基因表达,与发育缺陷和恶性转化等多种病理状况有关。目的:确定Kcr位点有利于发现其生物学机制和开发治疗相关疾病的新药。然而,传统的实验方法是昂贵和低效的,需要新的计算技术的发展。在这项工作中,为了准确地识别Kcr位点,我们提出了一个称为堆叠-Kcr的集成学习模型。首先,从序列信息、理化性质和序列片段相似性中提取特征;然后,分别利用自动编码器和串行将序列信息和理化性质两个特征融合。最后,将融合后的两个特征和序列片段相似特征分别输入到四个基分类器中,利用一级预测结果构建元分类器,得到最终的预测结果。方法:在这项工作中,为了准确地识别Kcr位点,我们提出了一个称为堆叠-Kcr的集成学习模型。首先,从序列信息、理化性质、序列片段相似性等方面提取特征;然后,分别利用自动编码器和串行将序列信息和理化性质两个特征融合。最后,将融合后的两个特征和序列片段相似特征分别输入到四个基分类器中,利用一级预测结果构建元分类器,得到最终的预测结果。结果:该模型经五重交叉验证,准确率为0.828,AUC为0.910。这表明stack - kcr方法相对于传统的机器学习方法具有明显的优势。在独立测试集上,stack - kcr的准确率为84.89%,AUC为92.21%,高于其他先进工具的1.7%和0.8%。此外,我们在磷酸化位点上训练stack - kcr,结果优于目前的模型。结论:这些结果进一步证明了堆叠- kcr具有较强的应用潜力和泛化性能。
{"title":"Stacking-Kcr: A Stacking Model for Predicting the Crotonylation Sites of Lysine by Fusing Serial and Automatic Encoder","authors":"Ying Liang, Suhui Li, Xiya You, You Guo, Jianjun Tang","doi":"10.2174/0115748936272040231117114252","DOIUrl":"https://doi.org/10.2174/0115748936272040231117114252","url":null,"abstract":"Background:: Protein lysine crotonylation (Kcr), a newly discovered important post-translational modification (PTM), is typically localized at the transcription start site and regulates gene expression, which is associated with a variety of pathological conditions such as developmen-tal defects and malignant transformation. Objective:: Identifying Kcr sites is advantageous for the discovery of its biological mechanism and the development of new drugs for related diseases. However, traditional experimental methods for identifying Kcr sites are expensive and inefficient, necessitating the development of new computa-tional techniques. Methods:: In this work, to accurately identify Kcr sites, we propose a model for ensemble learning called Stacking-Kcr. Firstly, extract features from sequence information, physicochemical proper-ties, and sequence fragment similarity. Then, the two characteristics of sequence information and physicochemical properties are fused using automatic encoder and serial, respectively. Finally, the fused two features and sequence fragment similarity features are then respectively input into the four base classifiers, a meta classifier is constructed using the first level prediction results, and the final forecasting results are obtained. method: In this work, to accurately identify Kcr sites, we propose a model for ensemble learning called Stacking-Kcr. Firstly, extract features from sequence information, physicochemical properties, and sequence fragment similarity. Then, the two characteristics of sequence information and physicochemical properties are fused using automatic encoder and serial, respectively. Finally, the fused two features and sequence fragment similarity features are then respectively input into the four base classifiers, a meta classifier is constructed using the first level prediction results, and the final forecasting results are obtained. Results:: The five-fold cross-validation of this model has achieved an accuracy of 0.828 and an AUC of 0.910. This shows that the Stacking-Kcr method has obvious advantages over traditional machine learning methods. On independent test sets, Stacking-Kcr achieved an accuracy of 84.89% and an AUC of 92.21%, which was higher than 1.7% and 0.8% of other state-of-the-art tools. Addi-tionally, we trained Stacking-Kcr on the phosphorylation site, and the result is superior to the cur-rent model. Conclusion:: These outcomes are additional evidence that Stacking-Kcr has strong application po-tential and generalization performance.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"44 1","pages":""},"PeriodicalIF":4.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138515001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Current Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1