Computational Biology and Chemistry最新文献_第9页

In-silico exploration of Attukal Kizhangu L. compounds: Promising candidates for periodontitis treatment 对Attukal Kizhangu L.复合物的分子内探索：有望治疗牙周炎的候选化合物

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-09-07 DOI: 10.1016/j.compbiolchem.2024.108186

Pragati Dubey , Manjit , Asha Rani , Neelam Mittal , Brahmeshwar Mishra

A medicinal pteridophyte known as Attukal Kizhangu L. has been used to cure patients for centuries by administering plant parts based on conventional and common practices. Regarding its biological functions, significant use and advancement have been made. Extract of Attukal Kizhangu L. is the subject of the current study, which uses network pharmacology as its foundation. Three targeted compounds such as α-Lapachone, Dihydrochalcone, and Piperine were chosen for additional research from the 17 Phytoconstituents that were filtered out by the Coupled UPLC-HRMS study since they followed to Lipinski rule and showed no toxicity. The pharmacokinetics and physicochemical properties of these targeted compounds were analyzed by using three online web servers pkCSM, Swiss ADME, and Protox-II. This is the first in silico study to document these compound's effectiveness against the standard drug DOX in treating Periodontitis. The Swiss target prediction database was used to retrieve the targets of these compounds. DisGeNET and GeneCards were used to extract the targets of periodontitis. The top five hub genes were identified by Cytoscape utilizing the protein-protein interaction of common genes, from which two hub genes and three binding proteins of collagenase enzymes were used for further studies AA2, PGE2, PI2, TNFA, and PGP. The minimal binding energy observed in molecular docking, indicative of the optimal docking score, corresponds to the highest affinity between the protein and ligand. To corroborate the findings of the docking study, molecular dynamics (MD) simulations, and MMPBSA calculations were conducted for the complexes involving AA2-α-LPHE, AA2-DHC, and AA2-PPR. This research concluded that AA2-DHC was the most stable complex among the investigated interactions, surpassing the stability of the other complexes examined in comparison with the standard drug DOX. Overall, the findings supported the promotion of widespread use of Attukal Kizhangu L. in clinics as a potential therapeutic agent or may be employed for the treatment of acute and chronic Periodontitis.

几个世纪以来，人们一直在使用一种名为阿图卡尔-奇占古（Attukal Kizhangu L.）的药用翼手目植物，根据传统和常见的做法，通过施用植物部分来治疗病人。关于它的生物功能，已经有了重要的应用和进展。本研究以阿图卡尔-基赞古提取物为主题，以网络药理学为基础。由于α-拉帕醌、二氢查尔酮和胡椒碱符合利宾斯基规则且无毒性，因此从耦合 UPLC-HRMS 研究筛选出的 17 种植物成分中选择了三种目标化合物进行进一步研究。我们使用 pkCSM、Swiss ADME 和 Protox-II 这三个在线网络服务器分析了这些目标化合物的药代动力学和理化性质。这是首次在硅学研究中证明这些化合物在治疗牙周炎方面对标准药物 DOX 的有效性。瑞士靶点预测数据库用于检索这些化合物的靶点。DisGeNET 和 GeneCards 被用来提取牙周炎的靶点。Cytoscape利用常见基因的蛋白质-蛋白质相互作用确定了前五个中心基因，并从中选出两个中心基因和三个胶原酶结合蛋白用于进一步研究：AA2、PGE2、PI2、TNFA和PGP。分子对接中观察到的最小结合能（表明最佳对接得分）与蛋白质和配体之间的最高亲和力相对应。为了证实对接研究的结果，对涉及 AA2-α-LPHE、AA2-DHC 和 AA2-PPR 的复合物进行了分子动力学（MD）模拟和 MMPBSA 计算。研究结果表明，在所研究的相互作用中，AA2-DHC 是最稳定的复合物，与标准药物 DOX 相比，其稳定性超过了所研究的其他复合物。总之，研究结果支持在临床上广泛使用阿图卡尔-奇占古作为一种潜在的治疗剂，或可用于治疗急性和慢性牙周炎。

{"title":"In-silico exploration of Attukal Kizhangu L. compounds: Promising candidates for periodontitis treatment","authors":"Pragati Dubey , Manjit , Asha Rani , Neelam Mittal , Brahmeshwar Mishra","doi":"10.1016/j.compbiolchem.2024.108186","DOIUrl":"10.1016/j.compbiolchem.2024.108186","url":null,"abstract":"<div><p>A medicinal pteridophyte known as <em>Attukal Kizhangu L.</em> has been used to cure patients for centuries by administering plant parts based on conventional and common practices. Regarding its biological functions, significant use and advancement have been made. Extract of <em>Attukal Kizhangu L</em>. is the subject of the current study, which uses network pharmacology as its foundation. Three targeted compounds such as α-Lapachone, Dihydrochalcone, and Piperine were chosen for additional research from the 17 Phytoconstituents that were filtered out by the Coupled UPLC-HRMS study since they followed to Lipinski rule and showed no toxicity. The pharmacokinetics and physicochemical properties of these targeted compounds were analyzed by using three online web servers pkCSM, Swiss ADME, and Protox-II. This is the first in silico study to document these compound's effectiveness against the standard drug DOX in treating Periodontitis. The Swiss target prediction database was used to retrieve the targets of these compounds. DisGeNET and GeneCards were used to extract the targets of periodontitis. The top five hub genes were identified by Cytoscape utilizing the protein-protein interaction of common genes, from which two hub genes and three binding proteins of collagenase enzymes were used for further studies AA2, PGE2, PI2, TNFA, and PGP. The minimal binding energy observed in molecular docking, indicative of the optimal docking score, corresponds to the highest affinity between the protein and ligand. To corroborate the findings of the docking study, molecular dynamics (MD) simulations, and MMPBSA calculations were conducted for the complexes involving AA2-α-LPHE, AA2-DHC, and AA2-PPR. This research concluded that AA2-DHC was the most stable complex among the investigated interactions, surpassing the stability of the other complexes examined in comparison with the standard drug DOX. Overall, the findings supported the promotion of widespread use of <em>Attukal Kizhangu L</em>. in clinics as a potential therapeutic agent or may be employed for the treatment of acute and chronic Periodontitis.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108186"},"PeriodicalIF":2.6,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1476927124001749/pdfft?md5=cc078a48df0d99b41835b4a5653a05a8&pid=1-s2.0-S1476927124001749-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142164612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Co-expression network and survival analysis of breast cancer inflammation and immune system hallmark genes 乳腺癌炎症和免疫系统标志基因的共表达网络和生存分析

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-09-06 DOI: 10.1016/j.compbiolchem.2024.108204

Ayaka Yakushi , Masahiro Sugimoto , Takanori Sasaki

The tertiary lymphoid structure (TLS) plays a central role in cancer immune response, and its gene expression pattern, called the TLS signature, has shown prognostic value in breast cancer. The formation of TLS and tumor-associated high endothelial venules (TA-HEVs), responsible for lymphocytic infiltration within the TLS, is associated with the expression of cancer hallmark genes (CHGs) related to immunity and inflammation. In this study, we performed co-expression network analysis of immune- and inflammation-related CHGs to identify predictive genes for breast cancer. In total, 382 immune- and inflammation-related CHGs with high expression variance were extracted from the GSE86166 microarray dataset of patients with breast cancer. CHGs were classified into five modules by applying weighted gene co-expression network analysis. The survival analysis results for each module showed that one module comprising 45 genes was statistically significant for relapse-free and overall survival. Four network properties identified key genes in this module with high prognostic prediction abilities: CD34, CXCL12, F2RL2, JAM2, PROS1, RAPGEF3, and SELP. The prognostic accuracy of the seven genes in breast cancer was synergistic and exceeded that of other predictors in both small and large public datasets. Enrichment analysis predicted that these genes had functions related to leukocyte infiltration of TA-HEVs. There was a positive correlation between key gene expression and the TLS signature, suggesting that gene expression levels are associated with TLS density. Co-expression network analysis of inflammation- and immune-related CHGs allowed us to identify genes that share a standard function in cancer immunity and have a high prognostic predictive value. This analytical approach may contribute to the identification of prognostic genes in TLS.

三级淋巴结构（TLS）在癌症免疫反应中起着核心作用，其基因表达模式被称为TLS特征，在乳腺癌中显示出预后价值。TLS和肿瘤相关高内皮静脉（TA-HEVs）的形成负责TLS内的淋巴细胞浸润，与免疫和炎症相关的癌症标志基因（CHGs）的表达有关。在这项研究中，我们对免疫和炎症相关的 CHGs 进行了共表达网络分析，以确定乳腺癌的预测基因。我们从乳腺癌患者的 GSE86166 微阵列数据集中共提取了 382 个具有高表达差异的免疫和炎症相关 CHGs。通过加权基因共表达网络分析，将CHGs分为五个模块。每个模块的生存分析结果显示，由 45 个基因组成的一个模块对无复发生存率和总生存率具有统计学意义。四个网络属性确定了该模块中具有较高预后预测能力的关键基因：CD34、CXCL12、F2RL2、JAM2、PROS1、RAPGEF3 和 SELP。这七个基因在乳腺癌中的预后准确性具有协同作用，在小型和大型公共数据集中都超过了其他预测因子。富集分析预测，这些基因的功能与TA-HEV的白细胞浸润有关。关键基因表达与TLS特征之间存在正相关，表明基因表达水平与TLS密度相关。通过对炎症和免疫相关CHG的共表达网络分析，我们确定了在癌症免疫中具有相同标准功能并具有较高预后预测价值的基因。这种分析方法可能有助于确定TLS的预后基因。

{"title":"Co-expression network and survival analysis of breast cancer inflammation and immune system hallmark genes","authors":"Ayaka Yakushi , Masahiro Sugimoto , Takanori Sasaki","doi":"10.1016/j.compbiolchem.2024.108204","DOIUrl":"10.1016/j.compbiolchem.2024.108204","url":null,"abstract":"<div><p>The tertiary lymphoid structure (TLS) plays a central role in cancer immune response, and its gene expression pattern, called the TLS signature, has shown prognostic value in breast cancer. The formation of TLS and tumor-associated high endothelial venules (TA-HEVs), responsible for lymphocytic infiltration within the TLS, is associated with the expression of cancer hallmark genes (CHGs) related to immunity and inflammation. In this study, we performed co-expression network analysis of immune- and inflammation-related CHGs to identify predictive genes for breast cancer. In total, 382 immune- and inflammation-related CHGs with high expression variance were extracted from the GSE86166 microarray dataset of patients with breast cancer. CHGs were classified into five modules by applying weighted gene co-expression network analysis. The survival analysis results for each module showed that one module comprising 45 genes was statistically significant for relapse-free and overall survival. Four network properties identified key genes in this module with high prognostic prediction abilities: <em>CD34</em>, <em>CXCL12</em>, <em>F2RL2</em>, <em>JAM2</em>, <em>PROS1</em>, <em>RAPGEF3</em>, and <em>SELP</em>. The prognostic accuracy of the seven genes in breast cancer was synergistic and exceeded that of other predictors in both small and large public datasets. Enrichment analysis predicted that these genes had functions related to leukocyte infiltration of TA-HEVs. There was a positive correlation between key gene expression and the TLS signature, suggesting that gene expression levels are associated with TLS density. Co-expression network analysis of inflammation- and immune-related CHGs allowed us to identify genes that share a standard function in cancer immunity and have a high prognostic predictive value. This analytical approach may contribute to the identification of prognostic genes in TLS.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108204"},"PeriodicalIF":2.6,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1476927124001920/pdfft?md5=1c407a366ff621563e7bacfdd48a6bb7&pid=1-s2.0-S1476927124001920-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142172702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

AScirRNA: A novel computational approach to discover abiotic stress-responsive circular RNAs in plant genome AScirRNA：发现植物基因组中对非生物胁迫有反应的环状 RNA 的新型计算方法

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-09-06 DOI: 10.1016/j.compbiolchem.2024.108205

Upendra Kumar Pradhan , Prasanjit Behera , Ritwika Das , Sanchita Naha , Ajit Gupta , Rajender Parsad , Sukanta Kumar Pradhan , Prabina Kumar Meher

In the realm of plant biology, understanding the intricate regulatory mechanisms governing stress responses stands as a pivotal pursuit. Circular RNAs (circRNAs), emerging as critical players in gene regulation, have garnered attention in recent days for their potential roles in abiotic stress adaptation. A comprehensive grasp of circRNAs' functions in stress response offers avenues for breeders to manipulating plants to develop abiotic stress resistant crop cultivars to thrive in challenging climates. This study pioneers a machine learning-based model for predicting abiotic stress-responsive circRNAs. The K-tuple nucleotide composition (KNC) and Pseudo KNC (PKNC) features were utilized to numerically represent circRNAs. Three different feature selection strategies were employed to select relevant and non-redundant features. Eight shallow and four deep learning algorithms were evaluated to build the final predictive model. Following five-fold cross-validation process, XGBoost learning algorithm demonstrated superior performance with LightGBM-chosen 260 KNC features (Accuracy: 74.55 %, auROC: 81.23 %, auPRC: 76.52 %) and 160 PKNC features (Accuracy: 74.32 %, auROC: 81.04 %, auPRC: 76.43 %), over other combinations of learning algorithms and feature selection techniques. Further, the robustness of the developed models were evaluated using an independent test dataset, where the overall accuracy, auROC and auPRC were found to be 73.13 %, 72.34 % and 72.68 % for KNC feature set and 73.52 %, 79.53 % and 73.09 % for PKNC feature set, respectively. This computational approach was also integrated into an online prediction tool, AScirRNA (https://iasri-sg.icar.gov.in/ascirna/) for easy prediction by the users. Both the proposed model and the developed tool are poised to augment ongoing efforts in identifying stress-responsive circRNAs in plants.

在植物生物学领域，了解支配胁迫反应的复杂调控机制是一项关键的追求。环状 RNA（circRNA）作为基因调控的关键角色，因其在非生物胁迫适应中的潜在作用而在近期备受关注。全面掌握 circRNAs 在应激反应中的功能为育种者提供了一条途径，他们可以通过操纵植物来培育抗非生物应激的作物栽培品种，从而在充满挑战的气候条件下茁壮成长。本研究开创了一种基于机器学习的模型，用于预测非生物胁迫响应性 circRNA。该模型利用 K 元组核苷酸组成（KNC）和伪 KNC（PKNC）特征对 circRNA 进行数字表示。研究人员采用了三种不同的特征选择策略来选择相关的非冗余特征。对八种浅层学习算法和四种深度学习算法进行了评估，以建立最终的预测模型。经过五倍交叉验证过程，XGBoost 学习算法在使用 LightGBM 选择的 260 个 KNC 特征（准确率：74.55 %，auROC：81.23 %，auPRC：76.52 %）和 160 个 PKNC 特征（准确率：74.32 %，auROC：81.04 %，auPRC：76.43 %）时表现出优于其他学习算法和特征选择技术组合的性能。此外，还使用独立测试数据集对所开发模型的鲁棒性进行了评估，发现 KNC 特征集的总体准确率、auROC 和 auPRC 分别为 73.13 %、72.34 % 和 72.68 %，PKNC 特征集的总体准确率、auROC 和 auPRC 分别为 73.52 %、79.53 % 和 73.09 %。这种计算方法还被集成到在线预测工具 AScirRNA (https://iasri-sg.icar.gov.in/ascirna/) 中，方便用户进行预测。所提出的模型和所开发的工具都将为目前鉴定植物胁迫响应性 circRNA 的工作提供帮助。

{"title":"AScirRNA: A novel computational approach to discover abiotic stress-responsive circular RNAs in plant genome","authors":"Upendra Kumar Pradhan , Prasanjit Behera , Ritwika Das , Sanchita Naha , Ajit Gupta , Rajender Parsad , Sukanta Kumar Pradhan , Prabina Kumar Meher","doi":"10.1016/j.compbiolchem.2024.108205","DOIUrl":"10.1016/j.compbiolchem.2024.108205","url":null,"abstract":"<div><p>In the realm of plant biology, understanding the intricate regulatory mechanisms governing stress responses stands as a pivotal pursuit. Circular RNAs (circRNAs), emerging as critical players in gene regulation, have garnered attention in recent days for their potential roles in abiotic stress adaptation. A comprehensive grasp of circRNAs' functions in stress response offers avenues for breeders to manipulating plants to develop abiotic stress resistant crop cultivars to thrive in challenging climates. This study pioneers a machine learning-based model for predicting abiotic stress-responsive circRNAs. The K-tuple nucleotide composition (KNC) and Pseudo KNC (PKNC) features were utilized to numerically represent circRNAs. Three different feature selection strategies were employed to select relevant and non-redundant features. Eight shallow and four deep learning algorithms were evaluated to build the final predictive model. Following five-fold cross-validation process, XGBoost learning algorithm demonstrated superior performance with LightGBM-chosen 260 KNC features (Accuracy: 74.55 %, auROC: 81.23 %, auPRC: 76.52 %) and 160 PKNC features (Accuracy: 74.32 %, auROC: 81.04 %, auPRC: 76.43 %), over other combinations of learning algorithms and feature selection techniques. Further, the robustness of the developed models were evaluated using an independent test dataset, where the overall accuracy, auROC and auPRC were found to be 73.13 %, 72.34 % and 72.68 % for KNC feature set and 73.52 %, 79.53 % and 73.09 % for PKNC feature set, respectively. This computational approach was also integrated into an online prediction tool, AScirRNA (<span><span>https://iasri-sg.icar.gov.in/ascirna/</span><svg><path></path></svg></span>) for easy prediction by the users. Both the proposed model and the developed tool are poised to augment ongoing efforts in identifying stress-responsive circRNAs in plants.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108205"},"PeriodicalIF":2.6,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142168671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DCSGMDA: A dual-channel convolutional model based on stacked deep learning collaborative gradient decomposition for predicting miRNA-disease associations DCSGMDA：基于堆叠式深度学习协作梯度分解的双通道卷积模型，用于预测 miRNA 与疾病的关联性

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-09-04 DOI: 10.1016/j.compbiolchem.2024.108201

Xu Cao, Pengli Lu

Numerous studies have shown that microRNAs (miRNAs) play a key role in human diseases as critical biomarkers. Its abnormal expression is often accompanied by the emergence of specific diseases. Therefore, studying the relationship between miRNAs and diseases can deepen the insights of their pathogenesis, grasp the process of disease onset and development, and promote drug research of specific diseases. However, many undiscovered relationships between miRNAs and diseases remain, significantly limiting research on miRNA-disease correlations. To explore more potential correlations, we propose a dual-channel convolutional model based on stacked deep learning collaborative gradient decomposition for predicting miRNA-disease associations (DCSGMDA). Firstly, we constructed similarity networks for miRNAs and diseases, as well as an association relationship network. Secondly, potential features were fully mined using stacked deep learning and gradient decomposition networks, along with dual-channel convolutional neural networks. Finally, correlations were scored by a multilayer perceptron. We performed 5-fold and 10-fold cross-validation experiments on DCSGMDA using two datasets based on the Human MicroRNA Disease Database (HMDD). Additionally, parametric, ablation, and comparative experiments, along with case studies, were conducted. The experimental results demonstrate that DCSGMDA performs well in predicting miRNA-disease associations.

大量研究表明，微小核糖核酸（miRNA）作为重要的生物标志物，在人类疾病中发挥着关键作用。其异常表达往往伴随着特定疾病的出现。因此，研究 miRNA 与疾病的关系可以加深对疾病发病机制的认识，把握疾病的发生和发展过程，促进特定疾病的药物研究。然而，miRNA 与疾病之间仍存在许多未被发现的关系，极大地限制了 miRNA 与疾病相关性的研究。为了探索更多潜在的相关性，我们提出了一种基于堆叠深度学习协作梯度分解的双通道卷积模型来预测miRNA与疾病的关联（DCSGMDA）。首先，我们构建了 miRNA 与疾病的相似性网络以及关联关系网络。其次，利用堆叠深度学习和梯度分解网络以及双通道卷积神经网络充分挖掘潜在特征。最后，通过多层感知器对相关性进行评分。我们使用基于人类微RNA疾病数据库（HMDD）的两个数据集对DCSGMDA进行了5倍和10倍交叉验证实验。此外，还进行了参数实验、消融实验、比较实验以及案例研究。实验结果表明，DCSGMDA 在预测 miRNA 与疾病的关联方面表现良好。

{"title":"DCSGMDA: A dual-channel convolutional model based on stacked deep learning collaborative gradient decomposition for predicting miRNA-disease associations","authors":"Xu Cao, Pengli Lu","doi":"10.1016/j.compbiolchem.2024.108201","DOIUrl":"10.1016/j.compbiolchem.2024.108201","url":null,"abstract":"<div><p>Numerous studies have shown that microRNAs (miRNAs) play a key role in human diseases as critical biomarkers. Its abnormal expression is often accompanied by the emergence of specific diseases. Therefore, studying the relationship between miRNAs and diseases can deepen the insights of their pathogenesis, grasp the process of disease onset and development, and promote drug research of specific diseases. However, many undiscovered relationships between miRNAs and diseases remain, significantly limiting research on miRNA-disease correlations. To explore more potential correlations, we propose a dual-channel convolutional model based on stacked deep learning collaborative gradient decomposition for predicting miRNA-disease associations (DCSGMDA). Firstly, we constructed similarity networks for miRNAs and diseases, as well as an association relationship network. Secondly, potential features were fully mined using stacked deep learning and gradient decomposition networks, along with dual-channel convolutional neural networks. Finally, correlations were scored by a multilayer perceptron. We performed 5-fold and 10-fold cross-validation experiments on DCSGMDA using two datasets based on the Human MicroRNA Disease Database (HMDD). Additionally, parametric, ablation, and comparative experiments, along with case studies, were conducted. The experimental results demonstrate that DCSGMDA performs well in predicting miRNA-disease associations.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108201"},"PeriodicalIF":2.6,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142157643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TMODINET: A trustworthy multi-omics dynamic learning integration network for cancer diagnostic TMODINET：用于癌症诊断的值得信赖的多组学动态学习集成网络。

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-09-03 DOI: 10.1016/j.compbiolchem.2024.108202

Ling Du , Peipei Gao , Zhuang Liu , Nan Yin , Xiaochao Wang

Multiple types of omics data contain a wealth of biomedical information which reflect different aspects of clinical samples. Multi-omics integrated analysis is more likely to lead to more accurate clinical decisions. Existing cancer diagnostic methods based on multi-omics data integration mainly focus on the classification accuracy of the model, while neglecting the interpretability of the internal mechanism and the reliability of the results, which are crucial in specific domains such as precision medicine and the life sciences. To overcome this limitation, we propose a trustworthy multi-omics dynamic learning framework (TMODINET) for cancer diagnostic. The framework employs multi-omics adaptive dynamic learning to process each sample to provide patient-centered personality diagnosis by using self-attentional learning of features and modalities. To characterize the correlation between samples well, we introduce a graph dynamic learning method which can adaptively adjust the graph structure according to the specific classification results for specific graph convolutional networks (GCN) learning. Moreover, we utilize an uncertainty mechanism by employing Dirichlet distribution and Dempster–Shafer theory to obtain uncertainty and integrate multi-omics data at the decision level, ensuring trustworthy for cancer diagnosis. Extensive experiments on four real-world multimodal medical datasets are conducted. Compared to state-of-the-art methods, the superior performance and trustworthiness of our proposed algorithm are clearly validated. Our model has great potential for clinical diagnosis.

多种类型的 omics 数据包含丰富的生物医学信息，反映了临床样本的不同方面。多组学集成分析更有可能带来更准确的临床决策。现有的基于多组学数据整合的癌症诊断方法主要关注模型的分类准确性，而忽视了内部机制的可解释性和结果的可靠性，而这两点在精准医疗和生命科学等特定领域至关重要。为了克服这一局限，我们提出了一种用于癌症诊断的可信多组学动态学习框架（TMODINET）。该框架采用多组学自适应动态学习来处理每个样本，通过对特征和模式的自我注意学习，提供以患者为中心的个性诊断。为了很好地表征样本之间的相关性，我们引入了一种图动态学习方法，该方法可以根据特定卷积网络（GCN）学习的具体分类结果自适应地调整图结构。此外，我们还利用不确定性机制，采用 Dirichlet 分布和 Dempster-Shafer 理论来获取不确定性，并在决策层整合多组学数据，确保癌症诊断的可信度。我们在四个真实世界的多模态医疗数据集上进行了广泛的实验。与最先进的方法相比，我们提出的算法的优越性能和可信度得到了明确验证。我们的模型在临床诊断中大有可为。

{"title":"TMODINET: A trustworthy multi-omics dynamic learning integration network for cancer diagnostic","authors":"Ling Du , Peipei Gao , Zhuang Liu , Nan Yin , Xiaochao Wang","doi":"10.1016/j.compbiolchem.2024.108202","DOIUrl":"10.1016/j.compbiolchem.2024.108202","url":null,"abstract":"<div><p>Multiple types of omics data contain a wealth of biomedical information which reflect different aspects of clinical samples. Multi-omics integrated analysis is more likely to lead to more accurate clinical decisions. Existing cancer diagnostic methods based on multi-omics data integration mainly focus on the classification accuracy of the model, while neglecting the interpretability of the internal mechanism and the reliability of the results, which are crucial in specific domains such as precision medicine and the life sciences. To overcome this limitation, we propose a trustworthy multi-omics dynamic learning framework (TMODINET) for cancer diagnostic. The framework employs multi-omics adaptive dynamic learning to process each sample to provide patient-centered personality diagnosis by using self-attentional learning of features and modalities. To characterize the correlation between samples well, we introduce a graph dynamic learning method which can adaptively adjust the graph structure according to the specific classification results for specific graph convolutional networks (GCN) learning. Moreover, we utilize an uncertainty mechanism by employing Dirichlet distribution and Dempster–Shafer theory to obtain uncertainty and integrate multi-omics data at the decision level, ensuring trustworthy for cancer diagnosis. Extensive experiments on four real-world multimodal medical datasets are conducted. Compared to state-of-the-art methods, the superior performance and trustworthiness of our proposed algorithm are clearly validated. Our model has great potential for clinical diagnosis.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108202"},"PeriodicalIF":2.6,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142147153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A knowledge-transfer-based approach for combining ordinal regression and medical scoring system in the early prediction of sepsis with electronic health records 基于知识转移的方法，结合序数回归和医疗评分系统，利用电子健康记录对败血症进行早期预测

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-09-02 DOI: 10.1016/j.compbiolchem.2024.108203

Yu Ji , Kaipeng Wang , Yuan Yuan , Yueguo Wang , Qingyuan Liu , Yulan Wang , Jian Sun , Wenwen Wang , Huanli Wang , Shusheng Zhou , Kui Jin , Mengping Zhang , Yinglei Lai

Objective:

The prediction of sepsis, especially early diagnosis, has received a significant attention in biomedical research. In order to improve current medical scoring system and overcome the limitations of class imbalance and sample size of local EHR (electronic health records), we propose a novel knowledge-transfer-based approach, which combines a medical scoring system and an ordinal logistic regression model.

Materials and Methods:

Medical scoring systems (i.e. NEWS, SIRS and QSOFA) are generally robust and useful for sepsis diagnosis. With local EHR, machine-learning-based methods have been widely used for building prediction models/methods, but they are often impacted by class imbalance and sample size. Knowledge distillation and knowledge transfer have recently been proposed as a combination approach for improving the prediction performance and model generalization. In this study, we developed a novel knowledge-transfer-based method for combining a medical scoring system (after a proposed score transformation) and an ordinal logistic regression model. We mathematically confirmed that it was equivalent to a specific form of the weighted regression. Furthermore, we theoretically explored its effectiveness in the scenario of class imbalance.

Results:

For the local dataset and the MIMIC-IV dataset, the VUS (the volume under the multi-dimensional ROC surface, a generalization measure of AUC-ROC for ordinal categories) of the knowledge-transfer-based model (ORNEWS) based on the NEWS scoring system were 0.384 and 0.339, respectively, while the VUS of the traditional ordinal regression model (OR) were 0.352 and 0.322, respectively. Consistent analysis results were also observed for the knowledge-transfer-based models based on the SIRS/QSOFA scoring systems in the ordinal scenarios. Additionally, the predicted probabilities and the binary classification ROC curves of the knowledge-transfer-based models indicated that this approach enhanced the predicted probabilities for the minority classes while reducing the predicted probabilities for the majority classes, which improved AUCs/VUSs on imbalanced data.

Discussion:

Knowledge transfer, which combines a medical scoring system and a machine-learning-based model, improves the prediction performance for early diagnosis of sepsis, especially in the scenarios of class imbalance and limited sample size.

目的：败血症的预测，尤其是早期诊断，在生物医学研究中受到了极大的关注。材料与方法：医学评分系统（即 NEWS、SIRS 和 QSOFA）一般都很稳健，对败血症诊断很有用。在本地电子病历中，基于机器学习的方法已被广泛用于建立预测模型/方法，但这些方法往往受到类别不平衡和样本大小的影响。最近有人提出了知识提炼和知识转移相结合的方法，以提高预测性能和模型泛化。在本研究中，我们开发了一种基于知识转移的新方法，用于将医学评分系统（经过提议的评分转换后）与序数逻辑回归模型相结合。我们从数学上证实，该方法等同于加权回归的一种特定形式。结果：对于本地数据集和 MIMIC-IV 数据集，基于 NEWS 评分系统的知识转移模型（ORNEWS）的 VUS（多维 ROC 面下的体积，是 AUC-ROC 对序数类别的概括度量）分别为 0.384 和 0.339，而传统序数回归模型（OR）的 VUS 分别为 0.352 和 0.322。基于 SIRS/QSOFA 评分系统的知识转移模型在序数情景中也观察到了一致的分析结果。此外，基于知识转移的模型的预测概率和二元分类 ROC 曲线表明，这种方法提高了少数类别的预测概率，同时降低了多数类别的预测概率，从而提高了不平衡数据的 AUCs/VUS。

{"title":"A knowledge-transfer-based approach for combining ordinal regression and medical scoring system in the early prediction of sepsis with electronic health records","authors":"Yu Ji , Kaipeng Wang , Yuan Yuan , Yueguo Wang , Qingyuan Liu , Yulan Wang , Jian Sun , Wenwen Wang , Huanli Wang , Shusheng Zhou , Kui Jin , Mengping Zhang , Yinglei Lai","doi":"10.1016/j.compbiolchem.2024.108203","DOIUrl":"10.1016/j.compbiolchem.2024.108203","url":null,"abstract":"<div><h3>Objective:</h3><p>The prediction of sepsis, especially early diagnosis, has received a significant attention in biomedical research. In order to improve current medical scoring system and overcome the limitations of class imbalance and sample size of local EHR (electronic health records), we propose a novel knowledge-transfer-based approach, which combines a medical scoring system and an ordinal logistic regression model.</p></div><div><h3>Materials and Methods:</h3><p>Medical scoring systems (i.e. NEWS, SIRS and QSOFA) are generally robust and useful for sepsis diagnosis. With local EHR, machine-learning-based methods have been widely used for building prediction models/methods, but they are often impacted by class imbalance and sample size. Knowledge distillation and knowledge transfer have recently been proposed as a combination approach for improving the prediction performance and model generalization. In this study, we developed a novel knowledge-transfer-based method for combining a medical scoring system (after a proposed score transformation) and an ordinal logistic regression model. We mathematically confirmed that it was equivalent to a specific form of the weighted regression. Furthermore, we theoretically explored its effectiveness in the scenario of class imbalance.</p></div><div><h3>Results:</h3><p>For the local dataset and the MIMIC-IV dataset, the VUS (the volume under the multi-dimensional ROC surface, a generalization measure of AUC-ROC for ordinal categories) of the knowledge-transfer-based model (ORNEWS) based on the NEWS scoring system were 0.384 and 0.339, respectively, while the VUS of the traditional ordinal regression model (OR) were 0.352 and 0.322, respectively. Consistent analysis results were also observed for the knowledge-transfer-based models based on the SIRS/QSOFA scoring systems in the ordinal scenarios. Additionally, the predicted probabilities and the binary classification ROC curves of the knowledge-transfer-based models indicated that this approach enhanced the predicted probabilities for the minority classes while reducing the predicted probabilities for the majority classes, which improved AUCs/VUSs on imbalanced data.</p></div><div><h3>Discussion:</h3><p>Knowledge transfer, which combines a medical scoring system and a machine-learning-based model, improves the prediction performance for early diagnosis of sepsis, especially in the scenarios of class imbalance and limited sample size.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108203"},"PeriodicalIF":2.6,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142149775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Substituent effect on the chemical and biological properties of diisatin dihydrazone Schiff bases: DFT and docking studies 取代基对二靛红二腙席夫碱化学和生物特性的影响：DFT 和对接研究

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-08-31 DOI: 10.1016/j.compbiolchem.2024.108190

Mohamed Shaker S. Adam , Zahraa H.A. Al-Ateya , Mohamed M. Makhlouf , Obadah S. Abdel-Rahman , Amneh Shtaiwi , Ahmed Khalil

According to the considered role of lipophilicity-hydrophobicity on organic Schiff base hydrazones, different substituents of phenyl, ethyl, and methyl groups were inserted in the synthetic strategy of diisatin dihydrazones (L1–4). The biochemical enhancement was evaluated depending on their inhibitive potential of the growth power of three human tumor cells, fungi, and bacteria. The biochemical assays assigned the effected role of different substituents of phenyl, ethyl, and methyl groups on the effectiveness of their diisatin dihydrazone reagents. The interacting modes with calf thymus DNA (i.e. Ct-DNA) were studied via viscometric and spectrophotometric titration.

The organo-reagent L1 with the oxalic derivative assigned a performed inhibitive action for the examined microbes and the human tumor cell lines growing up over the terephthalic (L4) > malonic (L2) > succinic (L3) ones. From K_b = binding constant, and $∆ G_{b}^{\neq}$ = Gibb’s free energy values, the binding of interaction within Ct-DNA was evaluated for all compounds (L1–4), in which L1, L3, and L4 assigned the highest reactivity referring to the covalent/non-covalent modes of interaction, as given for (L1–4), 14.32, 13.28, 10.87, and 12.41 × 10⁷ mol⁻¹ dm³, and −45^.17, −43.24, −43.75, and −44.05 kJ mol⁻¹, respectively. DFT and docking studies were achieved to support the current work.

根据亲油疏水性对有机席夫碱肼的作用，在二靛基二肼（L1-4）的合成策略中加入了不同的苯基、乙基和甲基取代基。根据它们对三种人类肿瘤细胞、真菌和细菌生长的抑制潜力，对其生化增强效果进行了评估。生化检测结果表明，苯基、乙基和甲基的不同取代基对二靛红二氢腙试剂的有效性有影响。草酸衍生物有机试剂 L1 对所研究的微生物和人类肿瘤细胞株的抑制作用超过了对苯二甲酸（L4）>；丙二酸（L2）>；琥珀酸（L3）衍生物。根据 Kb = 结合常数和 ∆Gb≠ = 吉布斯自由能值，评估了所有化合物（L1-4）在 Ct-DNA 内的相互作用结合情况，其中 L1、L3 和 L4 具有最高的反应活性，这指的是共价/非共价相互作用模式，如 (L1-4) 所示，14.32、13.28、10.28、14.32、14.32、13.28、10.28。32、13.28、10.87 和 12.41 × 107 mol-1 dm3，以及-45.17、-43.24、-43.75 和 -44.05 kJ mol-1。DFT 和对接研究为当前工作提供了支持。

{"title":"Substituent effect on the chemical and biological properties of diisatin dihydrazone Schiff bases: DFT and docking studies","authors":"Mohamed Shaker S. Adam , Zahraa H.A. Al-Ateya , Mohamed M. Makhlouf , Obadah S. Abdel-Rahman , Amneh Shtaiwi , Ahmed Khalil","doi":"10.1016/j.compbiolchem.2024.108190","DOIUrl":"10.1016/j.compbiolchem.2024.108190","url":null,"abstract":"<div><p>According to the considered role of lipophilicity-hydrophobicity on organic Schiff base hydrazones, different substituents of phenyl, ethyl, and methyl groups were inserted in the synthetic strategy of diisatin dihydrazones (L1–4). The biochemical enhancement was evaluated depending on their inhibitive potential of the growth power of three human tumor cells, fungi, and bacteria. The biochemical assays assigned the effected role of different substituents of phenyl, ethyl, and methyl groups on the effectiveness of their diisatin dihydrazone reagents. The interacting modes with calf thymus DNA (<em>i.e.</em> Ct-DNA) were studied <em>via</em> viscometric and spectrophotometric titration.</p><p>The organo-reagent L1 with the oxalic derivative assigned a performed inhibitive action for the examined microbes and the human tumor cell lines growing up over the terephthalic (L4) > malonic (L2) > succinic (L3) ones. From <em>K</em><sub>b</sub> = binding constant, and <span><math><mrow><mo>∆</mo><msubsup><mrow><mi>G</mi></mrow><mrow><mi>b</mi></mrow><mrow><mo>≠</mo></mrow></msubsup></mrow></math></span> = Gibb’s free energy values, the binding of interaction within Ct-DNA was evaluated for all compounds (L1–4), in which L1, L3, and L4 assigned the highest reactivity referring to the covalent/non-covalent modes of interaction, as given for (L1–4), 14.32, 13.28, 10.87, and 12.41 × 10<sup>7</sup> mol<sup>−1</sup> dm<sup>3</sup>, and −45<sup>.1</sup>7, −43.24, −43.75, and −44.05 kJ mol<sup>−1</sup>, respectively. DFT and docking studies were achieved to support the current work.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108190"},"PeriodicalIF":2.6,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142129909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prognostic significance of a 3-gene ferroptosis-related signature in lung cancer via LASSO analysis and cellular functions of UBE2Z 通过 LASSO 分析和 UBE2Z 的细胞功能发现肺癌中 3 个基因铁突变相关特征的预后意义

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-08-30 DOI: 10.1016/j.compbiolchem.2024.108192

Bin Xie , Qiong Chen , Ziyu Dai , Chen Jiang , Jingyi Sun , Anqi Guan , Xi Chen

Ferroptosis is a newly identified form of non-apoptotic programmed cell death resulting from iron-dependent lipid peroxidation. It is controlled by integrated oxidation and antioxidant systems. Ferroptosis exerts a crucial effect on the carcinogenesis of several cancers, including pulmonary cancer. Herein, a ferroptosis-associated gene signature for lung cancer prognosis and diagnosis was identified using integrative bioinformatics analyses. From the FerrDB database, 256 ferroptotic regulators and markers were identified. Of these, 25 exhibited differential expression between lung cancer and non-cancerous samples, as evidenced by the GSE19804 and GSE7670 datasets from the GEO database. Utilizing LASSO Cox regression analysis on TCGA-LUAD data, a potent 3-gene risk signature comprising CAV1, RRM2, and EGFR was established. This signature adeptly differentiates various survival outcomes in lung cancer patients, including overall survival and disease-specific intervals. Based on the 3-gene risk signature, lung cancer patients were categorized into high-risk and low-risk groups. Comparative analysis revealed 69 differentially expressed genes between these groups, with UBE2Z significantly associated with overall survival in TCGA-LUAD. UBE2Z was found to be upregulated in LUAD tissues and cells compared to normal controls. Functionally, the knockdown of UBE2Z curtailed aggressive behaviors in LUAD cells, including viability, migration, and invasion. Moreover, this knockdown led to a decrease in the mesenchymal marker vimentin while elevating the epithelial marker E-cadherin within LUAD cell lines. In conclusion, the ferroptosis-associated 3-gene risk signature effectively differentiates prognosis and clinical features in patients with lung cancer. UBE2Z was identified through this model, and it is upregulated in LUAD samples. Its knockdown inhibits aggressive cellular behaviors, suggesting UBE2Z's potential as a therapeutic target for lung cancer treatment.

铁凋亡是一种新发现的非凋亡性程序性细胞死亡形式，由铁依赖性脂质过氧化引起。它受综合氧化和抗氧化系统的控制。铁凋亡对包括肺癌在内的多种癌症的发生具有重要影响。本文通过综合生物信息学分析，确定了用于肺癌预后和诊断的铁氧化相关基因特征。从 FerrDB 数据库中确定了 256 个铁变态调节因子和标记物。其中，25个基因在肺癌和非癌症样本之间表现出差异表达，GEO数据库中的GSE19804和GSE7670数据集证明了这一点。通过对TCGA-LUAD数据进行LASSO Cox回归分析，建立了由CAV1、RRM2和表皮生长因子受体（EGFR）组成的强效3基因风险特征。该特征能有效区分肺癌患者的各种生存结果，包括总生存期和疾病特异性间隔期。根据 3 个基因的风险特征，肺癌患者被分为高风险组和低风险组。对比分析发现，这些组别之间有69个差异表达基因，其中UBE2Z与TCGA-LUAD中的总生存期显著相关。与正常对照组相比，UBE2Z在LUAD组织和细胞中上调。从功能上讲，敲除 UBE2Z 可抑制 LUAD 细胞的侵袭行为，包括活力、迁移和侵袭。此外，这种敲除导致间质标志物波形蛋白（vimentin）下降，而上皮标志物E-cadherin在LUAD细胞系中升高。总之，铁蛋白沉积相关 3 基因风险特征能有效区分肺癌患者的预后和临床特征。通过该模型确定了 UBE2Z，它在 LUAD 样本中上调。敲除UBE2Z可抑制细胞的侵袭行为，这表明UBE2Z有望成为肺癌治疗的靶点。

{"title":"Prognostic significance of a 3-gene ferroptosis-related signature in lung cancer via LASSO analysis and cellular functions of UBE2Z","authors":"Bin Xie , Qiong Chen , Ziyu Dai , Chen Jiang , Jingyi Sun , Anqi Guan , Xi Chen","doi":"10.1016/j.compbiolchem.2024.108192","DOIUrl":"10.1016/j.compbiolchem.2024.108192","url":null,"abstract":"<div><p>Ferroptosis is a newly identified form of non-apoptotic programmed cell death resulting from iron-dependent lipid peroxidation. It is controlled by integrated oxidation and antioxidant systems. Ferroptosis exerts a crucial effect on the carcinogenesis of several cancers, including pulmonary cancer. Herein, a ferroptosis-associated gene signature for lung cancer prognosis and diagnosis was identified using integrative bioinformatics analyses. From the FerrDB database, 256 ferroptotic regulators and markers were identified. Of these, 25 exhibited differential expression between lung cancer and non-cancerous samples, as evidenced by the GSE19804 and GSE7670 datasets from the GEO database. Utilizing LASSO Cox regression analysis on TCGA-LUAD data, a potent 3-gene risk signature comprising CAV1, RRM2, and EGFR was established. This signature adeptly differentiates various survival outcomes in lung cancer patients, including overall survival and disease-specific intervals. Based on the 3-gene risk signature, lung cancer patients were categorized into high-risk and low-risk groups. Comparative analysis revealed 69 differentially expressed genes between these groups, with UBE2Z significantly associated with overall survival in TCGA-LUAD. UBE2Z was found to be upregulated in LUAD tissues and cells compared to normal controls. Functionally, the knockdown of UBE2Z curtailed aggressive behaviors in LUAD cells, including viability, migration, and invasion. Moreover, this knockdown led to a decrease in the mesenchymal marker vimentin while elevating the epithelial marker E-cadherin within LUAD cell lines. In conclusion, the ferroptosis-associated 3-gene risk signature effectively differentiates prognosis and clinical features in patients with lung cancer. UBE2Z was identified through this model, and it is upregulated in LUAD samples. Its knockdown inhibits aggressive cellular behaviors, suggesting UBE2Z's potential as a therapeutic target for lung cancer treatment.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108192"},"PeriodicalIF":2.6,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142147152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Integrated data driven analysis identifies potential candidate genes associated with PCOS 综合数据驱动分析确定了与多囊卵巢综合症相关的潜在候选基因。

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-08-30 DOI: 10.1016/j.compbiolchem.2024.108191

Shaini Joseph , Krutika Patil , Niharika Rahate , Jatin Shah , Srabani Mukherjee , Smita D. Mahale

Polycystic ovary syndrome (PCOS) is one of the most common anovulatory disorder observed in women presenting with infertility. Several high and low throughput studies on PCOS have led to accumulation of vast amount of information on PCOS. Despite the availability of several resources which index the advances in PCOS, information on its etiology still remains inadequate. Analysis of the existing information using an integrated evidence based approach may aid identification of novel potential candidate genes with a role in PCOS pathophysiology. This work focuses on integrating existing information on PCOS from literature and gene expression studies and evaluating the application of gene prioritization and network analysis to predict missing novel candidates. Further, it assesses the utility of evidence-based scoring to rank genes for their association with PCOS. The results of this study led to identification of ∼2000 plausible candidate genes associated with PCOS. Insilico validation of these identified candidates confirmed the role of 938 genes in PCOS. Further, experimental validation was carried out for four of the potential candidate genes, a high-scoring (PROS1), two mid-scoring (C1QA and KNG1), and a low-scoring gene (VTN) involved in the complement and coagulation pathway by comparing protein levels in follicular fluid in women with PCOS and healthy controls. While the expression of PROS1, C1QA, and KNG1 was found to be significantly downregulated in women with PCOS, the expression of VTN was found to be unchanged in PCOS. The findings of this study reiterate the utility of employing insilico approaches to identify and prioritize the most promising candidate genes in diseases with a complex pathophysiology like PCOS. Further, the study also helps in gaining clearer insights into the molecular mechanisms associated with the manifestation of the PCOS phenotype by contributing to the existing repertoire of genes associated with PCOS.

多囊卵巢综合征（PCOS）是女性不孕症患者中最常见的无排卵性疾病之一。关于多囊卵巢综合症的多项高通量和低通量研究积累了大量有关多囊卵巢综合症的信息。尽管有一些资料显示了多囊卵巢综合症的进展，但有关其病因的信息仍然不足。采用基于证据的综合方法分析现有信息，有助于发现在多囊卵巢综合症病理生理学中发挥作用的潜在候选基因。这项工作的重点是从文献和基因表达研究中整合有关多囊卵巢综合症的现有信息，并评估基因优先排序和网络分析的应用，以预测缺失的新候选基因。此外，该研究还评估了基于证据的评分法对与多囊卵巢综合症相关的基因进行排序的实用性。研究结果发现了 2000 个与多囊卵巢综合症相关的可信候选基因。对这些已确定的候选基因进行的内部验证确认了 938 个基因在多囊卵巢综合症中的作用。此外，通过比较多囊卵巢综合症妇女和健康对照组卵泡液中的蛋白水平，对四个潜在候选基因进行了实验验证，其中包括一个高分基因（PROS1）、两个中分基因（C1QA 和 KNG1）和一个低分基因（VTN），这些基因涉及补体和凝血途径。结果发现，PROS1、C1QA 和 KNG1 的表达在多囊卵巢综合症女性患者中明显下调，而 VTN 的表达在多囊卵巢综合症患者中却没有变化。这项研究的结果再次证明，在像多囊卵巢综合症这样病理生理学复杂的疾病中，采用非分子方法来鉴定和优先选择最有希望的候选基因是非常有用的。此外，这项研究还有助于更清楚地了解与多囊卵巢综合症表型表现相关的分子机制，为现有的多囊卵巢综合症相关基因库做出贡献。

{"title":"Integrated data driven analysis identifies potential candidate genes associated with PCOS","authors":"Shaini Joseph , Krutika Patil , Niharika Rahate , Jatin Shah , Srabani Mukherjee , Smita D. Mahale","doi":"10.1016/j.compbiolchem.2024.108191","DOIUrl":"10.1016/j.compbiolchem.2024.108191","url":null,"abstract":"<div><p>Polycystic ovary syndrome (PCOS) is one of the most common anovulatory disorder observed in women presenting with infertility. Several high and low throughput studies on PCOS have led to accumulation of vast amount of information on PCOS. Despite the availability of several resources which index the advances in PCOS, information on its etiology still remains inadequate. Analysis of the existing information using an integrated evidence based approach may aid identification of novel potential candidate genes with a role in PCOS pathophysiology. This work focuses on integrating existing information on PCOS from literature and gene expression studies and evaluating the application of gene prioritization and network analysis to predict missing novel candidates. Further, it assesses the utility of evidence-based scoring to rank genes for their association with PCOS. The results of this study led to identification of ∼2000 plausible candidate genes associated with PCOS. Insilico validation of these identified candidates confirmed the role of 938 genes in PCOS. Further, experimental validation was carried out for four of the potential candidate genes, a high-scoring (PROS1), two mid-scoring (C1QA and KNG1), and a low-scoring gene (VTN) involved in the complement and coagulation pathway by comparing protein levels in follicular fluid in women with PCOS and healthy controls. While the expression of PROS1, C1QA, and KNG1 was found to be significantly downregulated in women with PCOS, the expression of VTN was found to be unchanged in PCOS. The findings of this study reiterate the utility of employing insilico approaches to identify and prioritize the most promising candidate genes in diseases with a complex pathophysiology like PCOS. Further, the study also helps in gaining clearer insights into the molecular mechanisms associated with the manifestation of the PCOS phenotype by contributing to the existing repertoire of genes associated with PCOS.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108191"},"PeriodicalIF":2.6,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142147174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fractional whale driving training-based optimization enabled transfer learning for detecting autism spectrum disorder 基于分鲸驱动训练的优化迁移学习用于检测自闭症谱系障碍

IF 2.6 4区生物学 Q2 BIOLOGY

Computational Biology and Chemistry

Pub Date : 2024-08-30 DOI: 10.1016/j.compbiolchem.2024.108200

Sriramakrishnan GV , P. Mano Paul , Hemachandra Gudimindla , Venubabu Rachapudi

Autism Spectrum Disorder (ASD) is a neurological illness that degrades communication and interaction among others. Autism can be detected at any stage. Early detection of ASD is important in preventing the communication, interaction and behavioral outcomes of individuals. Hence, this research introduced the Fractional Whale-driving Driving Training-based Based Optimization with Convolutional Neural Network-based Transfer learning (FWDTBO-CNN_TL) for identifying ASD. Here, the FWDTBO is modelled by the incorporation of Fractional calculus (FC), Whale optimization algorithm (WOA) and Driving Training-based Optimization (DTBO) that trains the hyperparameters of CNN-TL. Moreover, the Convolutional Neural Networks (CNN) utilize the hyperparameters from trained models, like Alex Net and Shuffle Net in such a way that the CNN-TL is designed. To improve the detection efficiency, the nub region was extracted and carried out with the functional connectivity-based Whale Driving Training Optimization (WDTBO) algorithm. Moreover, the TL is tuned by the FWDTBO algorithm. The result reveals that the ASD detection technique, FWDTBO-CNN-TL acquired 90.7 % accuracy, 95.4 % sensitivity, 93.7 % specificity and 93 % f-measure with the ABIDE-II dataset.

自闭症谱系障碍（ASD）是一种神经系统疾病，会降低交流和互动能力。自闭症可在任何阶段被发现。早期发现自闭症对预防个体的交流、互动和行为后果非常重要。因此，本研究引入了基于卷积神经网络迁移学习的分鲸驾驶训练优化（FWDTBO-CN_TL）来识别自闭症。在这里，FWDTBO 是通过结合分数微积分（FC）、鲸鱼优化算法（WOA）和基于驾驶训练的优化（DTBO）来训练 CNN-TL 的超参数。此外，卷积神经网络（CNN）利用经过训练的模型（如 Alex Net 和 Shuffle Net）的超参数设计了 CNN-TL。为了提高检测效率，使用基于功能连接的鲸鱼驱动训练优化（WDTBO）算法提取并执行了 nub 区域。此外，还利用 FWDTBO 算法对 TL 进行了调整。结果表明，FWDTBO-CNN-TL ASD 检测技术在 ABIDE-II 数据集上获得了 90.7 % 的准确率、95.4 % 的灵敏度、93.7 % 的特异性和 93 % 的 f-measure。

{"title":"Fractional whale driving training-based optimization enabled transfer learning for detecting autism spectrum disorder","authors":"Sriramakrishnan GV , P. Mano Paul , Hemachandra Gudimindla , Venubabu Rachapudi","doi":"10.1016/j.compbiolchem.2024.108200","DOIUrl":"10.1016/j.compbiolchem.2024.108200","url":null,"abstract":"<div><p>Autism Spectrum Disorder (ASD) is a neurological illness that degrades communication and interaction among others. Autism can be detected at any stage. Early detection of ASD is important in preventing the communication, interaction and behavioral outcomes of individuals. Hence, this research introduced the Fractional Whale-driving Driving Training-based Based Optimization with Convolutional Neural Network-based Transfer learning (FWDTBO-CNN_TL) for identifying ASD. Here, the FWDTBO is modelled by the incorporation of Fractional calculus (FC), Whale optimization algorithm (WOA) and Driving Training-based Optimization (DTBO) that trains the hyperparameters of CNN-TL. Moreover, the Convolutional Neural Networks (CNN) utilize the hyperparameters from trained models, like Alex Net and Shuffle Net in such a way that the CNN-TL is designed. To improve the detection efficiency, the nub region was extracted and carried out with the functional connectivity-based Whale Driving Training Optimization (WDTBO) algorithm. Moreover, the TL is tuned by the FWDTBO algorithm. The result reveals that the ASD detection technique, FWDTBO-CNN-TL acquired 90.7 % accuracy, 95.4 % sensitivity, 93.7 % specificity and 93 % f-measure with the ABIDE-II dataset.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108200"},"PeriodicalIF":2.6,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142164681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0