Pub Date : 2024-10-23DOI: 10.1016/j.compbiolchem.2024.108257
Negar Safinianaini , Camila P.E. De Souza , Andrew Roth , Hazal Koptagel , Hosein Toosi , Jens Lagergren
Investigating tumor heterogeneity using single-cell sequencing technologies is imperative to understand how tumors evolve since each cell subpopulation harbors a unique set of genomic features that yields a unique phenotype, which is bound to have clinical relevance. Clustering of cells based on copy number data obtained from single-cell DNA sequencing provides an opportunity to identify different tumor cell subpopulations. Accordingly, computational methods have emerged for single-cell copy number profiling and clustering; however, these two tasks have been handled sequentially by applying various ad-hoc pre- and post-processing steps; hence, a procedure vulnerable to introducing clustering artifacts. We avoid the clustering artifact issues in our method, CopyMix, a Variational Inference for a novel mixture model, by jointly inferring cell clusters and their underlying copy number profile. Our probabilistic graphical model is an improved version of the mixture of hidden Markov models, which is designed uniquely to infer single-cell copy number profiling and clustering. For the evaluation, we used likelihood-ratio test, CH index, Silhouette, V-measure, total variation scores. CopyMix performs well on both biological and simulated data. Our favorable results indicate a considerable potential to obtain clinical impact by using CopyMix in studies of cancer tumor heterogeneity.
利用单细胞测序技术研究肿瘤异质性是了解肿瘤如何演变的当务之急,因为每个细胞亚群都有一套独特的基因组特征,从而产生独特的表型,这必然与临床相关。根据单细胞 DNA 测序获得的拷贝数数据对细胞进行聚类,为识别不同的肿瘤细胞亚群提供了机会。因此,出现了用于单细胞拷贝数分析和聚类的计算方法;然而,这两项任务是通过应用各种临时的前处理和后处理步骤来顺序处理的;因此,这种程序很容易引入聚类伪影。在我们的方法 "CopyMix--新型混合模型的变量推理 "中,我们通过联合推断细胞簇及其基本拷贝数特征,避免了聚类伪影问题。我们的概率图形模型是隐马尔可夫模型混合物的改进版,其设计独特,可用于推断单细胞拷贝数剖析和聚类。在评估中,我们使用了似然比检验、CH 指数、Silhouette、V-measure 和总变异分数。CopyMix 在生物数据和模拟数据上都表现良好。我们的良好结果表明,在癌症肿瘤异质性研究中使用 CopyMix 有很大的潜力产生临床影响。
{"title":"CopyMix: Mixture model based single-cell clustering and copy number profiling using variational inference","authors":"Negar Safinianaini , Camila P.E. De Souza , Andrew Roth , Hazal Koptagel , Hosein Toosi , Jens Lagergren","doi":"10.1016/j.compbiolchem.2024.108257","DOIUrl":"10.1016/j.compbiolchem.2024.108257","url":null,"abstract":"<div><div>Investigating tumor heterogeneity using single-cell sequencing technologies is imperative to understand how tumors evolve since each cell subpopulation harbors a unique set of genomic features that yields a unique phenotype, which is bound to have clinical relevance. Clustering of cells based on copy number data obtained from single-cell DNA sequencing provides an opportunity to identify different tumor cell subpopulations. Accordingly, computational methods have emerged for single-cell copy number profiling and clustering; however, these two tasks have been handled sequentially by applying various ad-hoc pre- and post-processing steps; hence, a procedure vulnerable to introducing clustering artifacts. We avoid the clustering artifact issues in our method, CopyMix, a Variational Inference for a novel mixture model, by jointly inferring cell clusters and their underlying copy number profile. Our probabilistic graphical model is an improved version of the mixture of hidden Markov models, which is designed uniquely to infer single-cell copy number profiling and clustering. For the evaluation, we used likelihood-ratio test, CH index, Silhouette, V-measure, total variation scores. CopyMix performs well on both biological and simulated data. Our favorable results indicate a considerable potential to obtain clinical impact by using CopyMix in studies of cancer tumor heterogeneity.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108257"},"PeriodicalIF":2.6,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142578223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-19DOI: 10.1016/j.compbiolchem.2024.108243
Vinnakota Sai Durga Tejaswi, Venubabu Rachapudi
Liver cancer is a leading cause of cancer-related deaths, often diagnosed at advanced stages due to reliance on traditional imaging methods. Existing computer-aided diagnosis systems struggle with noise, anatomical complexity, and ineffective feature integration, leading to inaccuracies in lesion segmentation and classification. By effectively addressing these challenges, the model aims to enhance early detection and assist clinicians in making informed decisions. Ultimately, this research seeks to contribute to more efficient and accurate liver cancer diagnosis. This paper presents a novel model for liver cancer classification, called SegNet-based Liver Cancer Classification via SqueezeNet (SgN-LCC-SqN). The model effectively executes liver cancer segmentation and classification through four key steps: preprocessing, segmentation, feature extraction, and classification. During preprocessing, Quadratic Mean Estimated Wiener Filtering (QMEWF) is utilized to minimize image noise. Segmentation divides the image into segments using Enhanced Feature Pyramid SegNet (EFP-SgN), which is essential for precise diagnosis. Feature extraction encompasses color features, Local Directional Pattern Variance, and Correlation Filtering-Local Gradient Increasing Pattern (CF-LGIP) features. The extracted features are then processed through an ensemble model, Deep Convolutional, Recurrent, Long Short Term Memory with SqueezeNet (DCR-LSTM-SqN), which includes Deep Convolutional Neural Network (DCNN), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), and Modified Loss Function in SqueezeNet (MLF-SqN) classifiers, sequentially analyzing the feature sets through DCNN, RNN, and LSTM before classification by MLF-SqN. The performance of the suggested DCR-LSTM-SqN model is evaluated over conventional methods for positive, negative and other metrics. The DCR-LSTM-SqN model consistently demonstrates superior accuracy, ranging from 0.947 to 0.984, across all training data percentages. Thus, the proposed model effectively segments liver lesions and classifies cancerous areas, demonstrating its potential as a valuable resource for clinicians to enhance the efficiency and accuracy of liver cancer diagnosis.
{"title":"Computer-aided diagnosis of liver cancer with improved SegNet and deep stacking ensemble model","authors":"Vinnakota Sai Durga Tejaswi, Venubabu Rachapudi","doi":"10.1016/j.compbiolchem.2024.108243","DOIUrl":"10.1016/j.compbiolchem.2024.108243","url":null,"abstract":"<div><div>Liver cancer is a leading cause of cancer-related deaths, often diagnosed at advanced stages due to reliance on traditional imaging methods. Existing computer-aided diagnosis systems struggle with noise, anatomical complexity, and ineffective feature integration, leading to inaccuracies in lesion segmentation and classification. By effectively addressing these challenges, the model aims to enhance early detection and assist clinicians in making informed decisions. Ultimately, this research seeks to contribute to more efficient and accurate liver cancer diagnosis. This paper presents a novel model for liver cancer classification, called SegNet-based Liver Cancer Classification via SqueezeNet (SgN-LCC-SqN). The model effectively executes liver cancer segmentation and classification through four key steps: preprocessing, segmentation, feature extraction, and classification. During preprocessing, Quadratic Mean Estimated Wiener Filtering (QMEWF) is utilized to minimize image noise. Segmentation divides the image into segments using Enhanced Feature Pyramid SegNet (EFP-SgN), which is essential for precise diagnosis. Feature extraction encompasses color features, Local Directional Pattern Variance, and Correlation Filtering-Local Gradient Increasing Pattern (CF-LGIP) features. The extracted features are then processed through an ensemble model, Deep Convolutional, Recurrent, Long Short Term Memory with SqueezeNet (DCR-LSTM-SqN), which includes Deep Convolutional Neural Network (DCNN), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), and Modified Loss Function in SqueezeNet (MLF-SqN) classifiers, sequentially analyzing the feature sets through DCNN, RNN, and LSTM before classification by MLF-SqN. The performance of the suggested DCR-LSTM-SqN model is evaluated over conventional methods for positive, negative and other metrics. The DCR-LSTM-SqN model consistently demonstrates superior accuracy, ranging from 0.947 to 0.984, across all training data percentages. Thus, the proposed model effectively segments liver lesions and classifies cancerous areas, demonstrating its potential as a valuable resource for clinicians to enhance the efficiency and accuracy of liver cancer diagnosis.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108243"},"PeriodicalIF":2.6,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-19DOI: 10.1016/j.compbiolchem.2024.108261
Zhangjie Di , Bo Yang , Meng Li , Yue Wu , Hong Ji
Technical or biologically irrelevant differences caused by different experiments, times, or sequencing platforms can generate batch effects that mask the true biological information. Therefore, batch effects are typically removed when analyzing single-cell RNA sequencing (scRNA-seq) datasets for downstream tasks. Existing batch correction methods usually mitigate batch effects by reducing the data from different batches to a lower dimensional space before clustering, potentially leading to the loss of rare cell types. To address this problem, we introduce a novel single-cell data batch effect correction model using Biological-noise Decoupling Autoencoder (BDA) and Central-cross Loss termed BDACL. The model initially reconstructs raw data using an auto-encoder and conducts preliminary clustering. We then construct a similarity matrix and a hierarchical clustering tree to delineate relationships within and between different batches. Finally, we introduce a Central-cross Loss (CL). This loss leverages cross-entropy loss to prompt the model to better distinguish between different cluster labels. Additionally, it employs the Central Loss to encourage samples to form more compact clusters in the embedding space, thereby enhancing the consistency and interpretability of clustering results to mitigate differences between different batches. The primary innovation of this model lies in reconstructing data with an auto-encoder and gradually merging smaller clusters into larger ones using a hierarchical clustering tree. By using reallocated cluster labels as training labels and employing the Central-cross Loss, the model effectively eliminates batch effects in an unsupervised manner. Compared to current methods, BDACL can mitigate batch effects without losing rare cell types.
{"title":"Batch effects correction in scRNA-seq based on biological-noise decoupling autoencoder and central-cross loss","authors":"Zhangjie Di , Bo Yang , Meng Li , Yue Wu , Hong Ji","doi":"10.1016/j.compbiolchem.2024.108261","DOIUrl":"10.1016/j.compbiolchem.2024.108261","url":null,"abstract":"<div><div>Technical or biologically irrelevant differences caused by different experiments, times, or sequencing platforms can generate batch effects that mask the true biological information. Therefore, batch effects are typically removed when analyzing single-cell RNA sequencing (scRNA-seq) datasets for downstream tasks. Existing batch correction methods usually mitigate batch effects by reducing the data from different batches to a lower dimensional space before clustering, potentially leading to the loss of rare cell types. To address this problem, we introduce a novel single-cell data batch effect correction model using Biological-noise Decoupling Autoencoder (BDA) and Central-cross Loss termed BDACL. The model initially reconstructs raw data using an auto-encoder and conducts preliminary clustering. We then construct a similarity matrix and a hierarchical clustering tree to delineate relationships within and between different batches. Finally, we introduce a Central-cross Loss (CL). This loss leverages cross-entropy loss to prompt the model to better distinguish between different cluster labels. Additionally, it employs the Central Loss to encourage samples to form more compact clusters in the embedding space, thereby enhancing the consistency and interpretability of clustering results to mitigate differences between different batches. The primary innovation of this model lies in reconstructing data with an auto-encoder and gradually merging smaller clusters into larger ones using a hierarchical clustering tree. By using reallocated cluster labels as training labels and employing the Central-cross Loss, the model effectively eliminates batch effects in an unsupervised manner. Compared to current methods, BDACL can mitigate batch effects without losing rare cell types.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108261"},"PeriodicalIF":2.6,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-19DOI: 10.1016/j.compbiolchem.2024.108258
Bo Yang, Yu Wan, Jieqiong Wang, Yun Liu, Shaohua Wang
Oral squamous cell carcinoma (OSCC), a significant type of head and neck cancer, has witnessed increasing incidence and mortality rates. Immune-related genes (IRGs) and metabolic-related genes (MRGs) play essential roles in the pathogenesis, metastasis, and progression of OSCC. This study exploited data from The Cancer Genome Atlas (TCGA) to identify IRGs and MRGs related to OSCC through differential analysis. Univariate Cox analysis was utilized to determine immune-metabolic-related genes (IMRGs) associated with patient prognosis. A prognostic model for OSCC was constructed using Lasso-Cox regression and subsequently validated with datasets from the Gene Expression Omnibus (GEO). Non-Negative Matrix Factorization (NMF) clustering identified three molecular subtypes of OSCC, among which the C2 subtype showed better overall survival (OS) and progression-free survival (PFS). A prognostic model based on nine IMRGs was developed to categorize OSCC patients into high- and low-risk groups, with the low-risk group demonstrating significantly longer OS in both training and testing cohorts. The model showed strong predictive capabilities, and the risk score served as an independent prognostic factor. Additionally, expression levels of programmed death 1 (PD1) and cytotoxic T-lymphocyte-associated antigen 4 (CTLA4) differed between the risk groups. Gene Set Enrichment Analysis (GSEA) indicated distinct enriched pathways between high-risk and low-risk groups, highlighting the crucial roles of immune and metabolic processes in OSCC. The nine IMRGs prognostic model presented excellent predictive performance and has potential for clinical application.
口腔鳞状细胞癌(OSCC)是一种重要的头颈部癌症,其发病率和死亡率不断上升。免疫相关基因(IRGs)和代谢相关基因(MRGs)在口腔鳞状细胞癌的发病、转移和发展过程中起着至关重要的作用。本研究利用癌症基因组图谱(The Cancer Genome Atlas,TCGA)的数据,通过差异分析确定与OSCC相关的IRGs和MRGs。利用单变量考克斯分析确定与患者预后相关的免疫代谢相关基因(IMRGs)。利用Lasso-Cox回归法构建了OSCC的预后模型,随后利用基因表达总库(GEO)的数据集进行了验证。非负矩阵因子化(NMF)聚类确定了OSCC的三种分子亚型,其中C2亚型显示出较好的总生存期(OS)和无进展生存期(PFS)。基于九个IMRGs建立的预后模型将OSCC患者分为高风险组和低风险组,其中低风险组在训练组和测试组中的OS明显更长。该模型具有很强的预测能力,风险评分是一个独立的预后因素。此外,程序性死亡1(PD1)和细胞毒性T淋巴细胞相关抗原4(CTLA4)的表达水平在不同风险组之间存在差异。基因组富集分析(Gene Set Enrichment Analysis,GSEA)表明,高危组和低危组之间存在不同的富集通路,突出了免疫和代谢过程在 OSCC 中的关键作用。九个IMRGs预后模型具有出色的预测性能,有望应用于临床。
{"title":"Construction and validation of a prognostic model based on immune-metabolic-related genes in oral squamous cell carcinoma","authors":"Bo Yang, Yu Wan, Jieqiong Wang, Yun Liu, Shaohua Wang","doi":"10.1016/j.compbiolchem.2024.108258","DOIUrl":"10.1016/j.compbiolchem.2024.108258","url":null,"abstract":"<div><div>Oral squamous cell carcinoma (OSCC), a significant type of head and neck cancer, has witnessed increasing incidence and mortality rates. Immune-related genes (IRGs) and metabolic-related genes (MRGs) play essential roles in the pathogenesis, metastasis, and progression of OSCC. This study exploited data from The Cancer Genome Atlas (TCGA) to identify IRGs and MRGs related to OSCC through differential analysis. Univariate Cox analysis was utilized to determine immune-metabolic-related genes (IMRGs) associated with patient prognosis. A prognostic model for OSCC was constructed using Lasso-Cox regression and subsequently validated with datasets from the Gene Expression Omnibus (GEO). Non-Negative Matrix Factorization (NMF) clustering identified three molecular subtypes of OSCC, among which the C2 subtype showed better overall survival (OS) and progression-free survival (PFS). A prognostic model based on nine IMRGs was developed to categorize OSCC patients into high- and low-risk groups, with the low-risk group demonstrating significantly longer OS in both training and testing cohorts. The model showed strong predictive capabilities, and the risk score served as an independent prognostic factor. Additionally, expression levels of programmed death 1 (PD1) and cytotoxic T-lymphocyte-associated antigen 4 (CTLA4) differed between the risk groups. Gene Set Enrichment Analysis (GSEA) indicated distinct enriched pathways between high-risk and low-risk groups, highlighting the crucial roles of immune and metabolic processes in OSCC. The nine IMRGs prognostic model presented excellent predictive performance and has potential for clinical application.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108258"},"PeriodicalIF":2.6,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-18DOI: 10.1016/j.compbiolchem.2024.108253
Mohammad-Sadegh Lotfi , Majid Jafari-Sabet
This study aimed to investigate and compare the binding affinity of apigenin and its dimeric flavonoid forms to PIM1 kinase in glioblastoma multiforme (GBM), an aggressive and lethal brain cancer. Apigenin is a natural herbal product that has demonstrated anti-cancer effects in numerous studies, both in vitro and in vivo, on various cancers. Our in silico analysis showed that PIM1 expression was significantly higher in GBM tumor tissue compared to normal brain tissue, and high PIM1 expression correlated with worse survival rates in patients with GBM. Also, our molecular docking studies showed that apigenin and its dimeric flavonoids, such as amentoflavone and hinokiflavone, can bind to the ATP-binding site of PIM1 with significant binding affinity and form various intermolecular interactions with key amino acid residues. Notably, dimeric flavonoids have a stronger binding affinity than apigenin, indicating their potential as potent PIM1 inhibitors. Our findings demonstrated the therapeutic potential of apigenin and its dimeric flavonoid forms in treating GBM by targeting PIM1 kinase. The observed inhibitory effects of PIM1 can inhibit tumor growth, induce cell cycle arrest, and promote apoptosis. However, further in vitro and in vivo studies are needed to confirm their anticancer potentials and elucidate the underlying molecular mechanisms of these compounds in GBM treatment.
{"title":"Comparative in Silico study of apigenin and its dimeric forms on PIM1 kinase in glioblastoma multiform","authors":"Mohammad-Sadegh Lotfi , Majid Jafari-Sabet","doi":"10.1016/j.compbiolchem.2024.108253","DOIUrl":"10.1016/j.compbiolchem.2024.108253","url":null,"abstract":"<div><div>This study aimed to investigate and compare the binding affinity of apigenin and its dimeric flavonoid forms to PIM1 kinase in glioblastoma multiforme (GBM), an aggressive and lethal brain cancer. Apigenin is a natural herbal product that has demonstrated anti-cancer effects in numerous studies, both in vitro and in vivo, on various cancers. Our in silico analysis showed that PIM1 expression was significantly higher in GBM tumor tissue compared to normal brain tissue, and high PIM1 expression correlated with worse survival rates in patients with GBM. Also, our molecular docking studies showed that apigenin and its dimeric flavonoids, such as amentoflavone and hinokiflavone, can bind to the ATP-binding site of PIM1 with significant binding affinity and form various intermolecular interactions with key amino acid residues. Notably, dimeric flavonoids have a stronger binding affinity than apigenin, indicating their potential as potent PIM1 inhibitors. Our findings demonstrated the therapeutic potential of apigenin and its dimeric flavonoid forms in treating GBM by targeting PIM1 kinase. The observed inhibitory effects of PIM1 can inhibit tumor growth, induce cell cycle arrest, and promote apoptosis. However, further in vitro and in vivo studies are needed to confirm their anticancer potentials and elucidate the underlying molecular mechanisms of these compounds in GBM treatment.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108253"},"PeriodicalIF":2.6,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-18DOI: 10.1016/j.compbiolchem.2024.108252
Utkarsh A. Jagtap , Sanket Rathod , Ravi Shukla , Atish T. Paul
The prevalence of obesity is rapidly increasing worldwide. Brown adipose tissue activates uncoupling protein 1 (UCP1) to generate heat through bypassing ATP synthesis, offering a potential target for obesity treatment. Targeting UCP1 activation to induce thermogenesis through small molecules presents a promising approach for obesity management. In this study, molecular docking of UCP1 activators, using 2,4-dinitrophenol (DNP) as a reference ligand (PDB ID: 8J1N, docking score: −5.343 kcal/mol), identified seven top-scoring compounds: naringin (-7.284 kcal/mol), quercetin (-6.661 kcal/mol), salsalate (-6.017 kcal/mol), rhein (-5.798 kcal/mol), mirabegron (-5.535 kcal/mol), curcumin (-5.479 kcal/mol), and formoterol (-5.451 kcal/mol). Prime MM-GBSA calculation of the top-scored molecule (i.e., naringin) in the docking study showed ΔGBind of −70.48 kcal/mol. Key interactions of these top 7 activators with UCP1 binding pocket residues Trp280, Arg276, Glu190, Arg83, and Arg91 were observed. Molecular dynamics simulations performed for 100 ns confirmed complex stability, with RMSD values below 6 Å. Additionally, most activators showed favorable intestinal absorption (>90 %) and lipophilicity (LogP 2–4), with pKa values supporting their pharmacological potential as UCP1-targeting therapeutics for obesity. These findings provide a foundation for designing potent UCP1 activators by integrating docking scores, interaction profiles, statistical profiles from MD simulations, and physicochemical assessments to develop effective anti-obesity therapies.
{"title":"Computational insights into human UCP1 activators through molecular docking, MM-GBSA, and molecular dynamics simulation studies","authors":"Utkarsh A. Jagtap , Sanket Rathod , Ravi Shukla , Atish T. Paul","doi":"10.1016/j.compbiolchem.2024.108252","DOIUrl":"10.1016/j.compbiolchem.2024.108252","url":null,"abstract":"<div><div>The prevalence of obesity is rapidly increasing worldwide. Brown adipose tissue activates uncoupling protein 1 (UCP1) to generate heat through bypassing ATP synthesis, offering a potential target for obesity treatment. Targeting UCP1 activation to induce thermogenesis through small molecules presents a promising approach for obesity management. In this study, molecular docking of UCP1 activators, using 2,4-dinitrophenol (DNP) as a reference ligand (PDB ID: 8J1N, docking score: −5.343 kcal/mol), identified seven top-scoring compounds: naringin (-7.284 kcal/mol), quercetin (-6.661 kcal/mol), salsalate (-6.017 kcal/mol), rhein (-5.798 kcal/mol), mirabegron (-5.535 kcal/mol), curcumin (-5.479 kcal/mol), and formoterol (-5.451 kcal/mol). Prime MM-GBSA calculation of the top-scored molecule (i.e., naringin) in the docking study showed ΔGBind of −70.48 kcal/mol. Key interactions of these top 7 activators with UCP1 binding pocket residues Trp280, Arg276, Glu190, Arg83, and Arg91 were observed. Molecular dynamics simulations performed for 100 ns confirmed complex stability, with RMSD values below 6 Å. Additionally, most activators showed favorable intestinal absorption (>90 %) and lipophilicity (LogP 2–4), with pKa values supporting their pharmacological potential as UCP1-targeting therapeutics for obesity. These findings provide a foundation for designing potent UCP1 activators by integrating docking scores, interaction profiles, statistical profiles from MD simulations, and physicochemical assessments to develop effective anti-obesity therapies.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108252"},"PeriodicalIF":2.6,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-18DOI: 10.1016/j.compbiolchem.2024.108245
Sabkat Mahmud , Alvira Ajadee , Md. Bayazid Hossen , Md. Saiful Islam , Reaz Ahmmed , Md. Ahad Ali , Md. Manir Hossain Mollah , Md. Selim Reza , Md. Nurul Haque Mollah
The most frequent endocrine cancer of the head and neck is thyroid carcinoma (THCA). Although there is increasing evidence linking THCA to genetic alterations, the exact molecular mechanism behind this relationship is not yet completely known to the researchers. There is still much to learn about THCA's molecular roots and genetic biomarkers. Though drug therapies are the best choice after metastasis, unfortunately, the majority of the patients progressively develop resistance against the therapeutic drugs after receiving them for a few years. Therefore, multi-targeted different variants of therapeutic drugs may be essential for effective treatment against THCA. To understand molecular mechanisms of THCA development and progression and explore multi-targeted different variants of therapeutic drugs, we detected 80 common differentially expressed genes (cDEGs) between THCA and non-THCA samples from six microarray gene expression datasets using the statistical LIMMA approach. Through protein-protein interaction (PPI) network analysis, we identified the top-ranked eight differentially expressed genes (TIMP1, FN1, THBS1, RUNX2, SHANK2, TOP2A, LRP2, and ACTN1) as the THCA-causing key genes (KGs), where 6 KGs (TIMP1, TOP2A, FN1, ACTN1, RUNX2, THBS1) are upregulated and 2 KGs (LRP2, SHANK2) are downregulated. The expression pattern analysis of KGs with the independent TCGA database by Box plots also confirmed their upregulated and downregulated patterns. The expression analysis of KGs in different stages of THCA development indicated that these KGs might be utilized as early diagnostic and prognostic biomarkers. The pan-cancer analysis of KGs indicated a substantial correlation of KGs with multiple cancers, including THCA. Some transcription factors (TFs) and microRNAs were detected as the key transcriptional and post-transcriptional regulators of KGs using gene regulatory network (GRN) analysis. The enrichment analysis of the cDEGs revealed several key molecular functions, biological processes, cellular components, and pathways significantly associated with THCA. These findings highlight critical mechanisms influenced by the identified key genes (KGs), providing deeper insight into their roles in THCA development. Then we detected 6 repurposable drug molecules (Entrectinib, Imatinib, Ponatinib, Sorafenib, Retevmo, and Pazopanib) by molecular docking with KGs-mediated receptor proteins, ADME/T analysis, and cross-validation with the independent receptors. Therefore, these findings might be useful resources for wet lab researchers and clinicians to consider an effective treatment strategy against THCA.
{"title":"Gene-expression profile analysis to disclose diagnostics and therapeutics biomarkers for thyroid carcinoma","authors":"Sabkat Mahmud , Alvira Ajadee , Md. Bayazid Hossen , Md. Saiful Islam , Reaz Ahmmed , Md. Ahad Ali , Md. Manir Hossain Mollah , Md. Selim Reza , Md. Nurul Haque Mollah","doi":"10.1016/j.compbiolchem.2024.108245","DOIUrl":"10.1016/j.compbiolchem.2024.108245","url":null,"abstract":"<div><div>The most frequent endocrine cancer of the head and neck is thyroid carcinoma (THCA). Although there is increasing evidence linking THCA to genetic alterations, the exact molecular mechanism behind this relationship is not yet completely known to the researchers. There is still much to learn about THCA's molecular roots and genetic biomarkers. Though drug therapies are the best choice after metastasis, unfortunately, the majority of the patients progressively develop resistance against the therapeutic drugs after receiving them for a few years. Therefore, multi-targeted different variants of therapeutic drugs may be essential for effective treatment against THCA. To understand molecular mechanisms of THCA development and progression and explore multi-targeted different variants of therapeutic drugs, we detected 80 common differentially expressed genes (cDEGs) between THCA and non-THCA samples from six microarray gene expression datasets using the statistical LIMMA approach. Through protein-protein interaction (PPI) network analysis, we identified the top-ranked eight differentially expressed genes (TIMP1, FN1, THBS1, RUNX2, SHANK2, TOP2A, LRP2, and ACTN1) as the THCA-causing key genes (KGs), where 6 KGs (TIMP1, TOP2A, FN1, ACTN1, RUNX2, THBS1) are upregulated and 2 KGs (LRP2, SHANK2) are downregulated. The expression pattern analysis of KGs with the independent TCGA database by Box plots also confirmed their upregulated and downregulated patterns. The expression analysis of KGs in different stages of THCA development indicated that these KGs might be utilized as early diagnostic and prognostic biomarkers. The pan-cancer analysis of KGs indicated a substantial correlation of KGs with multiple cancers, including THCA. Some transcription factors (TFs) and microRNAs were detected as the key transcriptional and post-transcriptional regulators of KGs using gene regulatory network (GRN) analysis. The enrichment analysis of the cDEGs revealed several key molecular functions, biological processes, cellular components, and pathways significantly associated with THCA. These findings highlight critical mechanisms influenced by the identified key genes (KGs), providing deeper insight into their roles in THCA development. Then we detected 6 repurposable drug molecules (Entrectinib, Imatinib, Ponatinib, Sorafenib, Retevmo, and Pazopanib) by molecular docking with KGs-mediated receptor proteins, ADME/T analysis, and cross-validation with the independent receptors. Therefore, these findings might be useful resources for wet lab researchers and clinicians to consider an effective treatment strategy against THCA.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108245"},"PeriodicalIF":2.6,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-17DOI: 10.1016/j.compbiolchem.2024.108259
Tong Hu , Jianguo Chen , Lili Qiao
Breast cancer poses a significant health threat to women, necessitating advancements in diagnostic technologies. Breast dynamic optical imaging (DOI) technology, recognized for its non-invasive and radiation-free properties, is extensively utilized for the early screening and quantitative analysis of breast tumors. The integration of deep learning, a robust technology for automatic image feature extraction, with breast DOI has the potential to enhance tumor detection and diagnosis significantly. This paper introduces a deep learning-enhanced image optimization approach to overcome challenges such as poor image quality and distorted projection data commonly encountered in existing DOI methods. The approach utilizes convolutional neural networks (CNNs) to extract features from raw images and employs generative adversarial networks (GANs) to enhance these images, thereby improving their quality and contrast. Additionally, a novel correction algorithm is developed to address projection data distortion, enabling the reconstruction and correction of this data for more accurate and reliable imaging results. Experimental findings confirm that the proposed method markedly enhances both image quality and projection data accuracy in breast DOI, offering a reliable foundation for clinical diagnosis. This study not only provides a new perspective and methodology for the early screening and diagnosis of breast cancer but also holds substantial clinical importance and prospective applications.
乳腺癌对妇女的健康构成严重威胁,因此诊断技术必须不断进步。乳腺动态光学成像(DOI)技术因其无创伤、无辐射的特性,被广泛用于乳腺肿瘤的早期筛查和定量分析。深度学习是一种强大的自动图像特征提取技术,将其与乳腺动态光学成像技术相结合,有望显著提高肿瘤的检测和诊断水平。本文介绍了一种深度学习增强型图像优化方法,以克服现有 DOI 方法中常见的图像质量差和投影数据失真等难题。该方法利用卷积神经网络(CNN)从原始图像中提取特征,并利用生成对抗网络(GAN)增强这些图像,从而提高图像质量和对比度。此外,还开发了一种新颖的校正算法来解决投影数据失真问题,从而能够重建和校正这些数据,获得更准确、更可靠的成像结果。实验结果证实,所提出的方法显著提高了乳腺 DOI 的图像质量和投影数据的准确性,为临床诊断提供了可靠的依据。这项研究不仅为乳腺癌的早期筛查和诊断提供了新的视角和方法,而且具有重要的临床意义和应用前景。
{"title":"Optimization and correction of breast dynamic optical imaging projection data based on deep learning","authors":"Tong Hu , Jianguo Chen , Lili Qiao","doi":"10.1016/j.compbiolchem.2024.108259","DOIUrl":"10.1016/j.compbiolchem.2024.108259","url":null,"abstract":"<div><div>Breast cancer poses a significant health threat to women, necessitating advancements in diagnostic technologies. Breast dynamic optical imaging (DOI) technology, recognized for its non-invasive and radiation-free properties, is extensively utilized for the early screening and quantitative analysis of breast tumors. The integration of deep learning, a robust technology for automatic image feature extraction, with breast DOI has the potential to enhance tumor detection and diagnosis significantly. This paper introduces a deep learning-enhanced image optimization approach to overcome challenges such as poor image quality and distorted projection data commonly encountered in existing DOI methods. The approach utilizes convolutional neural networks (CNNs) to extract features from raw images and employs generative adversarial networks (GANs) to enhance these images, thereby improving their quality and contrast. Additionally, a novel correction algorithm is developed to address projection data distortion, enabling the reconstruction and correction of this data for more accurate and reliable imaging results. Experimental findings confirm that the proposed method markedly enhances both image quality and projection data accuracy in breast DOI, offering a reliable foundation for clinical diagnosis. This study not only provides a new perspective and methodology for the early screening and diagnosis of breast cancer but also holds substantial clinical importance and prospective applications.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108259"},"PeriodicalIF":2.6,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142560564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-16DOI: 10.1016/j.compbiolchem.2024.108250
Tamizhini Loganathan , Madhulekha S. , Hatem Zayed , George Priya Doss C
Endometrial and Ovarian cancers are two highly prevalent and fatal reproductive diseases with poor prognoses among women. Elevated estrogen levels in Ovarian Cancer (OC) stimulate the endometrium, causing Endometrial Cancer (EC). Although numerous studies have reported the crucial genes and pathways in this cancer, the pathogenesis of this disease remains unclear. In this study, used bioinformatics tools to analyse GSE63678, GSE115810, GSE36389, GSE26712, GSE36668, GSE27651, GSE6008, GSE69429, GSE69428, GSE18521, GSE185209, GSE54388 gene expression microarray datasets for both the cancers. We analyzed the differential gene expression, functional association, and structural studies. The analysis identified crucial differentially expressed genes (DEGs) in both cancers associated with DNA damage, DNA integrity, and cell-cycle checkpoint signaling pathways. CLDN7, UBE2I, WT1, JAM2, FOXL2, F11R, JAM3, ZFPM2, MEF2C, and PIAS1 are the top 10 hub genes commonly identified in both cancer types. Only CLDN7 and F11R are upregulated, whereas the remaining hub genes are downregulated in both cancers, suggesting a common framework for contributing to tumorigenesis. Molecular docking and dynamics were performed on the UBE2I protein with Irinotecan Hydrochloride, which could serve as the new approach for treating and managing both cancers. The study reveals the common molecular pathways, pointing out the role of cell cycle and DNA damage and integrity checkpoint signaling in the pathogenesis of both cancer types. This study explored the UBE2I gene as a potential biomarker in OC and EC. Further, this study concludes that the irinotecan hydrochloride drug has higher therapeutic effects on UBE2I protein through docking and dynamics studies.
{"title":"Computational insights into irinotecan's interaction with UBE2I in ovarian and endometrial cancers","authors":"Tamizhini Loganathan , Madhulekha S. , Hatem Zayed , George Priya Doss C","doi":"10.1016/j.compbiolchem.2024.108250","DOIUrl":"10.1016/j.compbiolchem.2024.108250","url":null,"abstract":"<div><div>Endometrial and Ovarian cancers are two highly prevalent and fatal reproductive diseases with poor prognoses among women. Elevated estrogen levels in Ovarian Cancer (OC) stimulate the endometrium, causing Endometrial Cancer (EC). Although numerous studies have reported the crucial genes and pathways in this cancer, the pathogenesis of this disease remains unclear. In this study, used bioinformatics tools to analyse GSE63678, GSE115810, GSE36389, GSE26712, GSE36668, GSE27651, GSE6008, GSE69429, GSE69428, GSE18521, GSE185209, GSE54388 gene expression microarray datasets for both the cancers. We analyzed the differential gene expression, functional association, and structural studies. The analysis identified crucial differentially expressed genes (DEGs) in both cancers associated with DNA damage, DNA integrity, and cell-cycle checkpoint signaling pathways. <em>CLDN7, UBE2I, WT1, JAM2, FOXL2, F11R, JAM3, ZFPM2, MEF2C</em>, and <em>PIAS1</em> are the top 10 hub genes commonly identified in both cancer types. Only <em>CLDN7</em> and <em>F11R</em> are upregulated, whereas the remaining hub genes are downregulated in both cancers, suggesting a common framework for contributing to tumorigenesis. Molecular docking and dynamics were performed on the UBE2I protein with Irinotecan Hydrochloride, which could serve as the new approach for treating and managing both cancers. The study reveals the common molecular pathways, pointing out the role of cell cycle and DNA damage and integrity checkpoint signaling in the pathogenesis of both cancer types. This study explored the <em>UBE2I</em> gene as a potential biomarker in OC and EC. Further, this study concludes that the irinotecan hydrochloride drug has higher therapeutic effects on UBE2I protein through docking and dynamics studies.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108250"},"PeriodicalIF":2.6,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-16DOI: 10.1016/j.compbiolchem.2024.108240
YiMing Wang, Chun Fang
The rising prevalence of invasive fungal infections and the emergence of antifungal resistance highlight the urgent need for new antifungal medications. Antifungal peptides have emerged as promising alternatives to traditional antimicrobial agents. The identification of natural or synthetic antifungal peptides is crucial for advancing antifungal drug development. Typically, the availability of antifungal samples is limited, and significant sequence diversity exists among antifungal peptides, posing challenges for high-throughput screening. To address the identification challenge of antifungal peptides with limited sample availability, this study introduces the Cycle ESM method. Initially, the method utilises the ESM protein language model to generate additional data on antifungal peptides, serving as a data augmentation technique to enhance model training effectiveness. Subsequently, the ESM is employed in conjunction with a textCNN model to construct a classifier for peptide prediction, with a comprehensive exploration of peptide characteristics to improve prediction accuracy. Experimental results demonstrate that the performance of the Cycle ESM method surpasses that of existing methods across three distinct antifungal peptide datasets. This study presents a novel approach to antifungal peptide prediction and offers innovative insights for addressing classification problems with limited sample availability.
{"title":"Cycle-ESM: Generation-assisted classification of antifungal peptides using ESM protein language model","authors":"YiMing Wang, Chun Fang","doi":"10.1016/j.compbiolchem.2024.108240","DOIUrl":"10.1016/j.compbiolchem.2024.108240","url":null,"abstract":"<div><div>The rising prevalence of invasive fungal infections and the emergence of antifungal resistance highlight the urgent need for new antifungal medications. Antifungal peptides have emerged as promising alternatives to traditional antimicrobial agents. The identification of natural or synthetic antifungal peptides is crucial for advancing antifungal drug development. Typically, the availability of antifungal samples is limited, and significant sequence diversity exists among antifungal peptides, posing challenges for high-throughput screening. To address the identification challenge of antifungal peptides with limited sample availability, this study introduces the Cycle ESM method. Initially, the method utilises the ESM protein language model to generate additional data on antifungal peptides, serving as a data augmentation technique to enhance model training effectiveness. Subsequently, the ESM is employed in conjunction with a textCNN model to construct a classifier for peptide prediction, with a comprehensive exploration of peptide characteristics to improve prediction accuracy. Experimental results demonstrate that the performance of the Cycle ESM method surpasses that of existing methods across three distinct antifungal peptide datasets. This study presents a novel approach to antifungal peptide prediction and offers innovative insights for addressing classification problems with limited sample availability.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108240"},"PeriodicalIF":2.6,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}