Pub Date : 2024-10-16DOI: 10.1016/j.compbiolchem.2024.108254
Esraa Hamdi Abdelaziz , Rasha Ismail , Mai S. Mabrouk , Eman Amin
Precision medicine has gained considerable popularity since the "one-size-fits-all" approach did not seem very effective or reflective of the complexity of the human body. Subsequently, since single-omics does not reflect the complexity of the human body’s inner workings, it did not result in the expected advancement in the medical field. Therefore, the multi-omics approach has emerged. The multi-omics approach involves integrating data from different omics technologies, such as DNA sequencing, RNA sequencing, mass spectrometry, and others, using computational methods and then analyzing the integrated result for different downstream analysis applications such as survival analysis, cancer classification, or biomarker identification. Most of the recent reviews were constrained to discussing one aspect of the multi-omics analysis pipeline, such as the dimensionality reduction step, the integration methods, or the interpretability aspect; however, very few provide a comprehensive review of every step of the analysis. This study aims to give an overview of the multi-omics analysis pipeline, starting with the most popular multi-omics databases used in recent literature, dimensionality reduction techniques, details the different types of data integration techniques and their downstream analysis applications, describes the most commonly used evaluation metrics, highlights the importance of model interpretability, and lastly discusses the challenges and potential future work for multi-omics data integration in precision medicine.
由于 "千篇一律 "的方法似乎并不十分有效,也不能反映人体的复杂性,精准医疗因此大受欢迎。随后,由于单一组学无法反映人体内部运作的复杂性,也就无法在医学领域取得预期的进步。因此,多组学方法应运而生。多组学方法包括利用计算方法整合来自不同组学技术(如 DNA 测序、RNA 测序、质谱分析等)的数据,然后将整合结果用于不同的下游分析应用,如生存分析、癌症分类或生物标记物鉴定。近期的大多数综述都局限于讨论多组学分析管道的一个方面,如降维步骤、整合方法或可解释性方面;然而,很少有综述对分析的每一个步骤进行全面评述。本研究旨在概述多组学分析流水线,从近期文献中最常用的多组学数据库、降维技术入手,详细介绍不同类型的数据整合技术及其下游分析应用,描述最常用的评估指标,强调模型可解释性的重要性,最后讨论精准医疗中多组学数据整合面临的挑战和未来可能开展的工作。
{"title":"Multi-omics data integration and analysis pipeline for precision medicine: Systematic review","authors":"Esraa Hamdi Abdelaziz , Rasha Ismail , Mai S. Mabrouk , Eman Amin","doi":"10.1016/j.compbiolchem.2024.108254","DOIUrl":"10.1016/j.compbiolchem.2024.108254","url":null,"abstract":"<div><div>Precision medicine has gained considerable popularity since the \"one-size-fits-all\" approach did not seem very effective or reflective of the complexity of the human body. Subsequently, since single-omics does not reflect the complexity of the human body’s inner workings, it did not result in the expected advancement in the medical field. Therefore, the multi-omics approach has emerged. The multi-omics approach involves integrating data from different omics technologies, such as DNA sequencing, RNA sequencing, mass spectrometry, and others, using computational methods and then analyzing the integrated result for different downstream analysis applications such as survival analysis, cancer classification, or biomarker identification. Most of the recent reviews were constrained to discussing one aspect of the multi-omics analysis pipeline, such as the dimensionality reduction step, the integration methods, or the interpretability aspect; however, very few provide a comprehensive review of every step of the analysis. This study aims to give an overview of the multi-omics analysis pipeline, starting with the most popular multi-omics databases used in recent literature, dimensionality reduction techniques, details the different types of data integration techniques and their downstream analysis applications, describes the most commonly used evaluation metrics, highlights the importance of model interpretability, and lastly discusses the challenges and potential future work for multi-omics data integration in precision medicine.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108254"},"PeriodicalIF":2.6,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-15DOI: 10.1016/j.compbiolchem.2024.108248
Umair Arif , Chunxia Zhang , Sajid Hussain , Abdul Rauf Abbasi
Lung cancer significantly contributes to global cancer mortality, posing challenges in clinical management. Early detection and accurate prognosis are crucial for improving patient outcomes. This study develops an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction and identifies key risk factors. Using a Kaggle dataset of 1000 patients with 22 variables, the model classifies prognosis into Low, Medium, and High-risk categories. The bootstrap method was employed for evaluation metrics, while SHAP (Shapley Additive Explanations) and LIME (Local Interpretable Model-agnostic Explanations) assessed model interpretability. Results showed SEM's superior interpretability over traditional models, such as Random Forest, Logistic Regression, Decision Tree, Gradient Boosting Machine, Extreme Gradient Boosting Machine, and Light Gradient Boosting Machine. SEM achieved an accuracy of 98.90 %, precision of 98.70 %, F1 score of 98.85 %, sensitivity of 98.77 %, specificity of 95.45 %, Cohen’s kappa value of 94.56 %, and an AUC of 98.10 %. The SEM demonstrated robust performance in lung cancer prognosis, revealing chronic lung cancer and genetic risk as major factors.
{"title":"An efficient interpretable stacking ensemble model for lung cancer prognosis","authors":"Umair Arif , Chunxia Zhang , Sajid Hussain , Abdul Rauf Abbasi","doi":"10.1016/j.compbiolchem.2024.108248","DOIUrl":"10.1016/j.compbiolchem.2024.108248","url":null,"abstract":"<div><div>Lung cancer significantly contributes to global cancer mortality, posing challenges in clinical management. Early detection and accurate prognosis are crucial for improving patient outcomes. This study develops an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction and identifies key risk factors. Using a Kaggle dataset of 1000 patients with 22 variables, the model classifies prognosis into Low, Medium, and High-risk categories. The bootstrap method was employed for evaluation metrics, while SHAP (Shapley Additive Explanations) and LIME (Local Interpretable Model-agnostic Explanations) assessed model interpretability. Results showed SEM's superior interpretability over traditional models, such as Random Forest, Logistic Regression, Decision Tree, Gradient Boosting Machine, Extreme Gradient Boosting Machine, and Light Gradient Boosting Machine. SEM achieved an accuracy of 98.90 %, precision of 98.70 %, F1 score of 98.85 %, sensitivity of 98.77 %, specificity of 95.45 %, Cohen’s kappa value of 94.56 %, and an AUC of 98.10 %. The SEM demonstrated robust performance in lung cancer prognosis, revealing chronic lung cancer and genetic risk as major factors.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108248"},"PeriodicalIF":2.6,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142483221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-15DOI: 10.1016/j.compbiolchem.2024.108247
Faisal Ali , Azhar Iqbal , Iqra Azhar , Adiba Qayyum , Syed Ali Hassan , Md. Sakib Al Hasan , Motasim Jawi , Hesham M. Hassan , Ahmed Al-Emam , Muhammad Sajid
Triple-negative breast cancer (TNBC) is a subtype of breast cancer with a poor prognosis. This research aims to find real hub genes for prognostic biomarkers of TNBC therapy. The GEO datasets GSE27447 and GSE233242 were analyzed using R package limma to explore DEGs. The PPI was generated using the STRING database. Cytoscape software plug-ins were used to screen the hub genes. Using the DAVID database, GO functional enrichment and KEGG pathway enrichment analysis were performed. Different online expression databases were employed to investigate the functions of real hub genes in tumor driving, diagnosis, and prognosis in TNBC patients with various clinicopathologic characteristics. A total of one hundred DEGs were identified between both datasets. The seven hub genes were identified after the topological parameter analysis of the PPI network. The KEGG pathway and GO analysis suggest that four genes (PSMB1, PSMC1, PSMF1, and PSMD8) are highly enriched in proteasome and were finally considered as real hub genes. Additionally, the expression analysis demonstrated that hub genes were notably up-regulated in TNBC patients compared to controls. Furthermore, correlational analyses revealed the positive and negative correlations among the expression of the real hub genes and various ancillary data, including tumor purity, promoter methylation status, overall survival (OS), genetic alterations, infiltration of CD8+ T and CD4+ immune cells, and a few more, across TNBC samples. Finally, our analysis identified a couple of significant chemotherapeutic drugs, miRNAs and transcription factors (TFS) with intriguing curative potential. In conclusion, we identified four real hub genes as novel biomarkers to overcome heterogenetic-particular challenges in diagnosis, prognosis, and therapy for TNBC patients.
三阴性乳腺癌(TNBC)是预后不良的乳腺癌亚型。本研究旨在寻找 TNBC 治疗预后生物标志物的真正枢纽基因。研究人员使用 R 软件包 limma 分析了 GEO 数据集 GSE27447 和 GSE233242,以探索 DEGs。PPI使用STRING数据库生成。使用 Cytoscape 软件插件筛选枢纽基因。利用 DAVID 数据库进行了 GO 功能富集和 KEGG 通路富集分析。利用不同的在线表达数据库,研究真实的枢纽基因在具有不同临床病理特征的 TNBC 患者的肿瘤驱动、诊断和预后中的功能。两个数据集共鉴定出 100 个 DEGs。在对 PPI 网络进行拓扑参数分析后,确定了七个枢纽基因。KEGG通路和GO分析表明,四个基因(PSMB1、PSMC1、PSMF1和PSMD8)高度富集于蛋白酶体,最终被认为是真正的枢纽基因。此外,表达分析表明,与对照组相比,TNBC 患者的枢纽基因明显上调。此外,相关性分析表明,在 TNBC 样本中,真正中心基因的表达与各种辅助数据(包括肿瘤纯度、启动子甲基化状态、总生存期(OS)、基因改变、CD8+ T 和 CD4+ 免疫细胞浸润等)之间存在正相关和负相关。最后,我们的分析确定了几种重要的化疗药物、miRNA 和转录因子 (TFS),它们具有令人感兴趣的治疗潜力。总之,我们发现了四个真正的枢纽基因,它们是新型生物标记物,能克服 TNBC 患者在诊断、预后和治疗方面面临的异质性挑战。
{"title":"Exploring a novel four-gene system as a diagnostic and prognostic biomarker for triple-negative breast cancer, using clinical variables","authors":"Faisal Ali , Azhar Iqbal , Iqra Azhar , Adiba Qayyum , Syed Ali Hassan , Md. Sakib Al Hasan , Motasim Jawi , Hesham M. Hassan , Ahmed Al-Emam , Muhammad Sajid","doi":"10.1016/j.compbiolchem.2024.108247","DOIUrl":"10.1016/j.compbiolchem.2024.108247","url":null,"abstract":"<div><div>Triple-negative breast cancer (TNBC) is a subtype of breast cancer with a poor prognosis. This research aims to find real hub genes for prognostic biomarkers of TNBC therapy. The GEO datasets GSE27447 and GSE233242 were analyzed using R package limma to explore DEGs. The PPI was generated using the STRING database. Cytoscape software plug-ins were used to screen the hub genes. Using the DAVID database, GO functional enrichment and KEGG pathway enrichment analysis were performed. Different online expression databases were employed to investigate the functions of real hub genes in tumor driving, diagnosis, and prognosis in TNBC patients with various clinicopathologic characteristics. A total of one hundred DEGs were identified between both datasets. The seven hub genes were identified after the topological parameter analysis of the PPI network. The KEGG pathway and GO analysis suggest that four genes (PSMB1, PSMC1, PSMF1, and PSMD8) are highly enriched in proteasome and were finally considered as real hub genes. Additionally, the expression analysis demonstrated that hub genes were notably up-regulated in TNBC patients compared to controls. Furthermore, correlational analyses revealed the positive and negative correlations among the expression of the real hub genes and various ancillary data, including tumor purity, promoter methylation status, overall survival (OS), genetic alterations, infiltration of CD8+ T and CD4+ immune cells, and a few more, across TNBC samples. Finally, our analysis identified a couple of significant chemotherapeutic drugs, miRNAs and transcription factors (TFS) with intriguing curative potential. In conclusion, we identified four real hub genes as novel biomarkers to overcome heterogenetic-particular challenges in diagnosis, prognosis, and therapy for TNBC patients.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108247"},"PeriodicalIF":2.6,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142483223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cystic fibrosis is an autosomal recessive condition caused by mutations in the CFTR gene, which encodes the CFTR protein. Currently, CF is a life-limiting illness that has a limited cure. The present study aimed to identify top leads against CFTR protein with F508del in comparison with Lumacaftor. In this study, a homology model of the NBD domain of CFTR protein was developed using the available NBD domain crystal structure. The protein model was refined through apo dynamics. Energy-optimized pharmacophore mapping was carried out to identify essential features for CFTR, resulting in a model with a hydrogen-bond donor, two hydrogen-bond acceptors, and three aromatic ring sites. A screening of a compound from the NPASS database using these DAARRR six-point-pharmacophore features led to the identification of potential ligands that could act against CFTR protein. Further studies such as ADME/T, molecular dynamics, MM_GBSA, and DFT were performed to identify the top-hit compound from the NPASS database. The compound Anguibactin (NPC41982) has been identified as a top lead that exhibits higher binding affinity and stability than the reference compound Lumacaftor, suggesting their potential to bind to the active site of the CFTR protein. These compounds could serve as starting points for the development of drug-like molecules for treating cystic fibrosis.
{"title":"E-pharmacophore based virtual screening of potent lead molecules against Cystic Fibrosis: An in silico study","authors":"Sabareeswari Jeyaraman , Jeyanthi Sankar , Ling Shing Wong , Karthikeyan Muthusamy","doi":"10.1016/j.compbiolchem.2024.108249","DOIUrl":"10.1016/j.compbiolchem.2024.108249","url":null,"abstract":"<div><div>Cystic fibrosis is an autosomal recessive condition caused by mutations in the CFTR gene, which encodes the CFTR protein. Currently, CF is a life-limiting illness that has a limited cure. The present study aimed to identify top leads against CFTR protein with F508del in comparison with Lumacaftor. In this study, a homology model of the NBD domain of CFTR protein was developed using the available NBD domain crystal structure. The protein model was refined through apo dynamics. Energy-optimized pharmacophore mapping was carried out to identify essential features for CFTR, resulting in a model with a hydrogen-bond donor, two hydrogen-bond acceptors, and three aromatic ring sites. A screening of a compound from the NPASS database using these DAARRR six-point-pharmacophore features led to the identification of potential ligands that could act against CFTR protein. Further studies such as ADME/T, molecular dynamics, MM_GBSA, and DFT were performed to identify the top-hit compound from the NPASS database. The compound Anguibactin (NPC41982) has been identified as a top lead that exhibits higher binding affinity and stability than the reference compound Lumacaftor, suggesting their potential to bind to the active site of the CFTR protein. These compounds could serve as starting points for the development of drug-like molecules for treating cystic fibrosis.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108249"},"PeriodicalIF":2.6,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142483222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Venous thromboembolism (VTE) is the third most common cardiovascular disease and is a major cause of mobility and mortality worldwide. VTE is a complex multifactorial disease and genetic mechanisms underlying its pathogenesis is yet to be completely elucidated. The aim of the present study was to identify hub genes and pathways involved in development and progression of blood clot during VTE using gene expression data from public repositories.
Methodology
Differential gene expression (DEG) data from two datasets, GSE48000 and GSE19151 were analysed using GEO2R tool. Gene expression data of VTE patients were compared to that of healthy controls using various bioinformatics tools.
Results
When the differentially expressed genes of the two datasets were compared, it was found that 19 genes were up-regulated while 134 genes were down-regulated. Gene ontology (GO) and pathway analysis revealed that pathways such as complement and coagulation cascade and B-cell receptor signalling along with DNA methylation, DNA alkylation and inflammatory genes were significantly up-regulated in VTE patients. On the other hand, differentially down-regulated genes included mitochondrial translation elongation, termination and biosysthesis along with heme biosynthesis, erythrocyte differentiation and homeostasis. The top 5 up-regulated hub genes obtained by protein-protein interaction (PPI) network analysis included MYC, FOS, SGK1, CR2 and CXCR4, whereas the top 5 down-regulated hub genes included MRPL13, MRPL3, MRPL11, RPS29 and RPL9. The up-regulated hub genes are functionally involved in maintain vascular integrity and complementation cascade while the down-regulated hub genes were mostly mitochondrial ribosomal proteins.
Conclusion
Present study highlights significantly enriched pathways and genes associated with VTE development and prognosis. The data hereby obtained could be used for designing newer diagnostic and therapeutic tools for VTE management.
{"title":"Gene expression profiling in Venous thromboembolism: Insights from publicly available datasets","authors":"Sunanda Arya, Rashi Khare, Iti Garg, Swati Srivastava","doi":"10.1016/j.compbiolchem.2024.108246","DOIUrl":"10.1016/j.compbiolchem.2024.108246","url":null,"abstract":"<div><h3>Background</h3><div>Venous thromboembolism (VTE) is the third most common cardiovascular disease and is a major cause of mobility and mortality worldwide. VTE is a complex multifactorial disease and genetic mechanisms underlying its pathogenesis is yet to be completely elucidated. The aim of the present study was to identify hub genes and pathways involved in development and progression of blood clot during VTE using gene expression data from public repositories.</div></div><div><h3>Methodology</h3><div>Differential gene expression (DEG) data from two datasets, GSE48000 and GSE19151 were analysed using GEO2R tool. Gene expression data of VTE patients were compared to that of healthy controls using various bioinformatics tools.</div></div><div><h3>Results</h3><div>When the differentially expressed genes of the two datasets were compared, it was found that 19 genes were up-regulated while 134 genes were down-regulated. Gene ontology (GO) and pathway analysis revealed that pathways such as complement and coagulation cascade and B-cell receptor signalling along with DNA methylation, DNA alkylation and inflammatory genes were significantly up-regulated in VTE patients. On the other hand, differentially down-regulated genes included mitochondrial translation elongation, termination and biosysthesis along with heme biosynthesis, erythrocyte differentiation and homeostasis. The top 5 up-regulated hub genes obtained by protein-protein interaction (PPI) network analysis included MYC, FOS, SGK1, CR2 and CXCR4, whereas the top 5 down-regulated hub genes included MRPL13, MRPL3, MRPL11, RPS29 and RPL9. The up-regulated hub genes are functionally involved in maintain vascular integrity and complementation cascade while the down-regulated hub genes were mostly mitochondrial ribosomal proteins.</div></div><div><h3>Conclusion</h3><div>Present study highlights significantly enriched pathways and genes associated with VTE development and prognosis. The data hereby obtained could be used for designing newer diagnostic and therapeutic tools for VTE management.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108246"},"PeriodicalIF":2.6,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142437805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-11DOI: 10.1016/j.compbiolchem.2024.108241
Shiyu Zhou, Jue Wang, Bo Li
Fundus images are crucial in the observation and detection of ophthalmic diseases. However, detecting multiple ophthalmic diseases from fundus images using deep learning techniques is a complex and challenging task One challenge is the complexity of fundus disease structures, which leads to low detection accuracy. Another challenge is the class imbalance problem common in multi-label image classification, which increases the difficulty of algorithm training and evaluation. To address these issues, this study leverages deep learning to propose an ophthalmic disease classification system. We first employ ResNet50 as the backbone network to extract image features, and then use our designed multi-dimensional attention module and adaptive scale discriminator to enhance the network's ability to detect disease features. During training, we innovatively propose a hybrid loss function method to improve the detection capability on imbalanced data. Finally, we conducted experiments on the ODRI-5K dataset with the proposed classification system. In the test set, our method achieved an AUC of 98.53 and an F1-score of 89.73. This result fully demonstrates the excellent disease classification capability of our method. In summary, the multi-label fundus image disease classification system we proposed exhibits outstanding recognition capability, providing an effective solution for the diagnosis of multi-label fundus image diseases.
{"title":"A multi-class fundus disease classification system based on an adaptive scale discriminator and hybrid loss","authors":"Shiyu Zhou, Jue Wang, Bo Li","doi":"10.1016/j.compbiolchem.2024.108241","DOIUrl":"10.1016/j.compbiolchem.2024.108241","url":null,"abstract":"<div><div>Fundus images are crucial in the observation and detection of ophthalmic diseases. However, detecting multiple ophthalmic diseases from fundus images using deep learning techniques is a complex and challenging task One challenge is the complexity of fundus disease structures, which leads to low detection accuracy. Another challenge is the class imbalance problem common in multi-label image classification, which increases the difficulty of algorithm training and evaluation. To address these issues, this study leverages deep learning to propose an ophthalmic disease classification system. We first employ ResNet50 as the backbone network to extract image features, and then use our designed multi-dimensional attention module and adaptive scale discriminator to enhance the network's ability to detect disease features. During training, we innovatively propose a hybrid loss function method to improve the detection capability on imbalanced data. Finally, we conducted experiments on the ODRI-5K dataset with the proposed classification system. In the test set, our method achieved an AUC of 98.53 and an F1-score of 89.73. This result fully demonstrates the excellent disease classification capability of our method. In summary, the multi-label fundus image disease classification system we proposed exhibits outstanding recognition capability, providing an effective solution for the diagnosis of multi-label fundus image diseases.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108241"},"PeriodicalIF":2.6,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142433266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-11DOI: 10.1016/j.compbiolchem.2024.108219
Qi Zhu , Wulin Shan , Xiaoyu Li , Yao Chen , Xu Huang , Bairong Xia , Liting Qian
Background
Uterine corpus endometrial carcinoma (UCEC) is a prevalent gynecological tumor with a bleak prognosis. Anomalous glycosylation plays a pivotal role in tumorigenesis. Currently, there is a lack of prognostic signatures based on glycosylation-related genes for UCEC. Thus, our research aims to construct a predictive model and validate the correlation between relevant genes and biological functions.
Methods
Using the TCGA database, we developed prognostic models and explored their relationships with survival outcomes. We further selected key genes to verify their expression in tissues and assess their impact on cellular behavior.
Results
The clinical prognosis of the high-risk group was significantly worse than that of the low-risk group. The nomogram model accurately predicted UCEC patient prognosis. Additionally, we identified OLFML1 as a unique signature gene that can inhibit UCEC progression and reduce radiation resistance in vitro.
Conclusions
Our model, which is based on glycosylation-related genes in UCEC, effectively identifies high-risk patients and provides valuable prognostic information. In addition, OLFML1 acts as a tumor suppressor in UCEC and enhances radiosensitivity, suggesting a new potential target for improving therapeutic efficacy.
{"title":"Unraveling the biological functions of UCEC: Insights from a prognostic signature model","authors":"Qi Zhu , Wulin Shan , Xiaoyu Li , Yao Chen , Xu Huang , Bairong Xia , Liting Qian","doi":"10.1016/j.compbiolchem.2024.108219","DOIUrl":"10.1016/j.compbiolchem.2024.108219","url":null,"abstract":"<div><h3>Background</h3><div>Uterine corpus endometrial carcinoma (UCEC) is a prevalent gynecological tumor with a bleak prognosis. Anomalous glycosylation plays a pivotal role in tumorigenesis. Currently, there is a lack of prognostic signatures based on glycosylation-related genes for UCEC. Thus, our research aims to construct a predictive model and validate the correlation between relevant genes and biological functions.</div></div><div><h3>Methods</h3><div>Using the TCGA database, we developed prognostic models and explored their relationships with survival outcomes. We further selected key genes to verify their expression in tissues and assess their impact on cellular behavior.</div></div><div><h3>Results</h3><div>The clinical prognosis of the high-risk group was significantly worse than that of the low-risk group. The nomogram model accurately predicted UCEC patient prognosis. Additionally, we identified OLFML1 as a unique signature gene that can inhibit UCEC progression and reduce radiation resistance in vitro.</div></div><div><h3>Conclusions</h3><div>Our model, which is based on glycosylation-related genes in UCEC, effectively identifies high-risk patients and provides valuable prognostic information. In addition, OLFML1 acts as a tumor suppressor in UCEC and enhances radiosensitivity, suggesting a new potential target for improving therapeutic efficacy.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108219"},"PeriodicalIF":2.6,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-10DOI: 10.1016/j.compbiolchem.2024.108220
Jie Li , Jing Dong , Ming Li , Hongbo Zhu , Peicheng Xin
Objective
This study aims to utilize multiple bioinformatics tools to elucidate the potential mechanisms underlying the comorbidity of Multiple Myeloma (MM) and Osteonecrosis of the Femoral Head (ONFH).
Method
High-throughput microarray datasets for MM and ONFH were retrieved from the GEO database, followed by separate preprocessing. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to construct co-expression networks within the MM datasets, further identifying modules and genes associated with MM clinical characteristics. Potential comorbid genes were enriched and analyzed using pathway and network analysis tools, and key genes for MM and ONFH comorbidity were preliminarily screened using Cytoscape. The gene expression capabilities and performance were validated using two disease-related datasets, and we evaluated the differences and consistencies in the immune microenvironment between the two diseases.
Results
Our screening identified 418 immune-related comorbid genes, showing consistent biological processes in ribosome synthesis, particularly protein synthesis across both diseases. Key genes were further identified through Protein-Protein Interaction (PPI) networks, and their performance was validated in a validation cohort, with Receiver Operating Characteristic (ROC) curve areas exceeding 0.8. The immune microenvironment analysis highlighted consistent plasma cell infiltration correlated with disease comorbidity, suggesting potential immune targets for future therapies.
Conclusion
MM and ONFH share common pathogenic genes that mediate changes in signaling pathways and immune cell dynamics, potentially influencing the comorbidity and progression of these diseases. Key genes identified, such as RPS19, RPL35, RPL24, RPL36, and EIF3G, along with plasma cell infiltration, may serve as central mechanisms in the development of both diseases. This study offers insights and references for further research into targeted treatments for these conditions.
{"title":"Potential mechanisms for predicting comorbidity between multiple myeloma and femoral head necrosis based on multiple bioinformatics","authors":"Jie Li , Jing Dong , Ming Li , Hongbo Zhu , Peicheng Xin","doi":"10.1016/j.compbiolchem.2024.108220","DOIUrl":"10.1016/j.compbiolchem.2024.108220","url":null,"abstract":"<div><h3>Objective</h3><div>This study aims to utilize multiple bioinformatics tools to elucidate the potential mechanisms underlying the comorbidity of Multiple Myeloma (MM) and Osteonecrosis of the Femoral Head (ONFH).</div></div><div><h3>Method</h3><div>High-throughput microarray datasets for MM and ONFH were retrieved from the GEO database, followed by separate preprocessing. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to construct co-expression networks within the MM datasets, further identifying modules and genes associated with MM clinical characteristics. Potential comorbid genes were enriched and analyzed using pathway and network analysis tools, and key genes for MM and ONFH comorbidity were preliminarily screened using Cytoscape. The gene expression capabilities and performance were validated using two disease-related datasets, and we evaluated the differences and consistencies in the immune microenvironment between the two diseases.</div></div><div><h3>Results</h3><div>Our screening identified 418 immune-related comorbid genes, showing consistent biological processes in ribosome synthesis, particularly protein synthesis across both diseases. Key genes were further identified through Protein-Protein Interaction (PPI) networks, and their performance was validated in a validation cohort, with Receiver Operating Characteristic (ROC) curve areas exceeding 0.8. The immune microenvironment analysis highlighted consistent plasma cell infiltration correlated with disease comorbidity, suggesting potential immune targets for future therapies.</div></div><div><h3>Conclusion</h3><div>MM and ONFH share common pathogenic genes that mediate changes in signaling pathways and immune cell dynamics, potentially influencing the comorbidity and progression of these diseases. Key genes identified, such as RPS19, RPL35, RPL24, RPL36, and EIF3G, along with plasma cell infiltration, may serve as central mechanisms in the development of both diseases. This study offers insights and references for further research into targeted treatments for these conditions<strong>.</strong></div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108220"},"PeriodicalIF":2.6,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142433268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-10DOI: 10.1016/j.compbiolchem.2024.108242
Aleksandra Ilic, Nemanja Djokovic, Teodora Djikic , Katarina Nikolic
Selective inhibitors of sirtuin-2 (SIRT2) are increasingly recognized as potential therapeutics for cancer and neurodegenerative diseases. Derivatives of 5-((3-amidobenzyl)oxy)nicotinamides have been identified as some of the most potent and selective SIRT2 inhibitors reported to date (Ai et al., 2016, Ai et al., 2023, Baroni et al., 2007). In this study, a 3D-QSAR (3D-Quantitative Structure-Activity Relationship) model was developed using a dataset of 86 nicotinamide-based SIRT2 inhibitors from the literature, along with GRIND-derived pharmacophore models for selected inhibitors. External validation parameters emphasized the reliability of the 3D-QSAR model in predicting SIRT2 inhibition within the defined applicability domain. The interpretation of the 3D-QSAR model facilitated the generation of GRIND-derived pharmacophore models, which in turn enabled the design of novel SIRT2 inhibitors. Furthermore, based on molecular docking results for the SIRT1–3 isoforms, two classification models were developed: a SIRT1/2 model using the Naive Bayes algorithm and a SIRT2/3 model using the k-nearest neighbors algorithm, to predict the selectivity of inhibitors for SIRT1/2 and SIRT2/3. External validation parameters of the selectivity models confirmed their predictive power. Ultimately, the integration of 3D-QSAR, selectivity models and prediction of ADMET properties facilitated the identification of the most promising selective SIRT2 inhibitors for further development.
{"title":"Integration of 3D-QSAR, molecular docking, and machine learning techniques for rational design of nicotinamide-based SIRT2 inhibitors","authors":"Aleksandra Ilic, Nemanja Djokovic, Teodora Djikic , Katarina Nikolic","doi":"10.1016/j.compbiolchem.2024.108242","DOIUrl":"10.1016/j.compbiolchem.2024.108242","url":null,"abstract":"<div><div>Selective inhibitors of sirtuin-2 (SIRT2) are increasingly recognized as potential therapeutics for cancer and neurodegenerative diseases. Derivatives of 5-((3-amidobenzyl)oxy)nicotinamides have been identified as some of the most potent and selective SIRT2 inhibitors reported to date (<span><span>Ai et al., 2016</span></span>, <span><span>Ai et al., 2023</span></span>, <span><span>Baroni et al., 2007</span></span>). In this study, a 3D-QSAR (3D-Quantitative Structure-Activity Relationship) model was developed using a dataset of 86 nicotinamide-based SIRT2 inhibitors from the literature, along with GRIND-derived pharmacophore models for selected inhibitors. External validation parameters emphasized the reliability of the 3D-QSAR model in predicting SIRT2 inhibition within the defined applicability domain. The interpretation of the 3D-QSAR model facilitated the generation of GRIND-derived pharmacophore models, which in turn enabled the design of novel SIRT2 inhibitors. Furthermore, based on molecular docking results for the SIRT1–3 isoforms, two classification models were developed: a SIRT1/2 model using the Naive Bayes algorithm and a SIRT2/3 model using the k-nearest neighbors algorithm, to predict the selectivity of inhibitors for SIRT1/2 and SIRT2/3. External validation parameters of the selectivity models confirmed their predictive power. Ultimately, the integration of 3D-QSAR, selectivity models and prediction of ADMET properties facilitated the identification of the most promising selective SIRT2 inhibitors for further development.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108242"},"PeriodicalIF":2.6,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142433267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-10DOI: 10.1016/j.compbiolchem.2024.108239
Abdullahi Ibrahim Uba , Nicholas Joseph Paradis , Chun Wu , Gokhan Zengin
Phosphodiesterase type 5 (PDE5) is a cyclic nucleotide-hydrolyzing enzyme that plays essential roles in the regulation of second messenger cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP) produced in response to various stimuli. Pharmacological inhibition of PDE5 has been shown to have several therapeutic uses, including treating cardiovascular diseases and erectile dysfunction. In search of PDE5A inhibitors with safer pharmacokinetic properties, computational analyses of the binding propensity of fifty natural compounds comprising flavonoids, polyphenols, and glycosides were conducted. Molecular dynamics simulation coupled with Molecular mechanics with generalized Born and surface area solvation (MM/GBSA) showed that verbascoside may inhibit the activity of PDE5 with a comparative binding energy (ΔG) of -87.8 ± 9.2 kcal/mol to that of the cocrystal ligand (PDB ID: 3BJC), having ΔG = -77.7±4.5 kcal/mol. However, the other top compounds studied were found to have lower binding propensities than the cocrystal ligand WAN: hesperidin (ΔG = -33.8 ± 3.4 kcal/mol), rutin (ΔG = -23.6 ± 26.3 kcal/mol), caftaric acid (ΔG = -21.2 ±3.6 kcal/mol), and chlorogenic acid (ΔG = 6.0 ± 16.5 kcal/mol). Therefore, verbascoside may serve as a potential PDE5A inhibitor while hesperidin, rutin, and caftaric acid may provide templates for further structural optimization for the designs of safer PDE5 inhibitors.
{"title":"Computational analysis of natural compounds as potential phosphodiesterase type 5A inhibitors","authors":"Abdullahi Ibrahim Uba , Nicholas Joseph Paradis , Chun Wu , Gokhan Zengin","doi":"10.1016/j.compbiolchem.2024.108239","DOIUrl":"10.1016/j.compbiolchem.2024.108239","url":null,"abstract":"<div><div>Phosphodiesterase type 5 (PDE5) is a cyclic nucleotide-hydrolyzing enzyme that plays essential roles in the regulation of second messenger cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP) produced in response to various stimuli. Pharmacological inhibition of PDE5 has been shown to have several therapeutic uses, including treating cardiovascular diseases and erectile dysfunction. In search of PDE5A inhibitors with safer pharmacokinetic properties, computational analyses of the binding propensity of fifty natural compounds comprising flavonoids, polyphenols, and glycosides were conducted. Molecular dynamics simulation coupled with Molecular mechanics with generalized Born and surface area solvation (MM/GBSA) showed that verbascoside may inhibit the activity of PDE5 with a comparative binding energy (ΔG) of -87.8 ± 9.2<!--> <!-->kcal/mol to that of the cocrystal ligand (PDB ID: 3BJC), having ΔG = -77.7±4.5<!--> <!-->kcal/mol. However, the other top compounds studied were found to have lower binding propensities than the cocrystal ligand WAN: hesperidin (ΔG = -33.8 ± 3.4<!--> <!-->kcal/mol), rutin (ΔG = -23.6 ± 26.3<!--> <!-->kcal/mol), caftaric acid (ΔG = -21.2 ±3.6<!--> <!-->kcal/mol), and chlorogenic acid (ΔG = 6.0 ± 16.5<!--> <!-->kcal/mol). Therefore, verbascoside may serve as a potential PDE5A inhibitor while hesperidin, rutin, and caftaric acid may provide templates for further structural optimization for the designs of safer PDE5 inhibitors.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108239"},"PeriodicalIF":2.6,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142428645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}