Frontiers in bioinformatics最新文献_第4页

Network-based insights into miR-30a-5p-mediated regulation and EGCG targeting in triple-negative breast cancer. 三阴性乳腺癌中mir -30a-5p介导的调控和EGCG靶向的基于网络的见解

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-19 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1735106

Loganathan Chandramani Priya Dharshini, Abul Kalam Azad Mandal

Background: Triple-negative breast cancer (TNBC) is defined by the absence of ER, PR, and HER2 expression. This limits the targeted therapies, resulting in poor clinical outcomes. Identifying the molecular targets that can be regulated through miRNAs and natural compounds offers a potential therapeutic platform.

Methods: We combined transcriptomic profiling with miRNA target prediction to identify genes regulated by miR-30a-5p and assess their interaction with the green tea polyphenol, epigallocatechin gallate (EGCG). Differentially expressed genes (DEGs) from TCGA-TNBC datasets and miRNA targets from miRDB, TargetScan, and miRTarBase were screened for common genes. Then, the protein-protein interaction and network topology analyses were performed to identify key hub genes. Molecular docking and simulation were carried out with the four key genes against EGCG.

Results: Data integration yielded 393 overlapping genes and identified ten hub genes- RRM2, KIF11, ANLN, CDC20, CCNA1, AGO2, YWHAZ, DTL, SKP2, and PCNA. Pathway enrichment showed that all these hubs are involved in cell cycle and mitotic regulation, which was associated with poor TNBC prognosis. Mutation profiling revealed high alteration rates in KIF11, ANLN, CDC20, and YWHAZ, with increased missense mutations and C>T transitions. Molecular docking and simulations identified YWHAZ as the most favorable and structurally stable EGCG-binding target, compared to the other three key genes.

Conclusion: The results emphasizes that EGCG has strong binding affinity towards YWHAZ, revealing that miR-30a-EGCG targets TNBC synergistically through cell-cycle-mediated pathways. The findings give rational support for miRNA-guided phytochemical-based TNBC therapeutic development.

背景：三阴性乳腺癌（TNBC）的定义是缺乏ER、PR和HER2的表达。这限制了靶向治疗，导致临床效果不佳。确定可以通过mirna和天然化合物调节的分子靶点提供了一个潜在的治疗平台。方法：我们将转录组学分析与miRNA靶标预测相结合，鉴定miR-30a-5p调控的基因，并评估它们与绿茶多酚表没食子儿茶素没食子酸酯（EGCG）的相互作用。对来自TCGA-TNBC数据集的差异表达基因（DEGs）和来自miRDB、TargetScan和miRTarBase的miRNA靶标进行共同基因筛选。然后，进行蛋白-蛋白相互作用和网络拓扑分析，以确定关键枢纽基因。对EGCG的4个关键基因进行了分子对接和模拟。结果：数据整合得到393个重叠基因，并鉴定出10个枢纽基因——RRM2、KIF11、ANLN、CDC20、CCNA1、AGO2、YWHAZ、DTL、SKP2和PCNA。途径富集表明，所有这些枢纽都参与细胞周期和有丝分裂调节，这与TNBC预后不良有关。突变谱显示，KIF11、ANLN、CDC20和YWHAZ的变化率很高，错义突变和C>T转换增加。分子对接和模拟表明，与其他三个关键基因相比，YWHAZ是最有利且结构稳定的egcg结合靶点。结论：结果强调EGCG对YWHAZ具有较强的结合亲和力，揭示miR-30a-EGCG通过细胞周期介导的途径协同作用于TNBC。这些发现为mirna引导的基于植物化学的TNBC治疗开发提供了合理的支持。

{"title":"Network-based insights into miR-30a-5p-mediated regulation and EGCG targeting in triple-negative breast cancer.","authors":"Loganathan Chandramani Priya Dharshini, Abul Kalam Azad Mandal","doi":"10.3389/fbinf.2025.1735106","DOIUrl":"10.3389/fbinf.2025.1735106","url":null,"abstract":"Background: Triple-negative breast cancer (TNBC) is defined by the absence of ER, PR, and HER2 expression. This limits the targeted therapies, resulting in poor clinical outcomes. Identifying the molecular targets that can be regulated through miRNAs and natural compounds offers a potential therapeutic platform.Methods: We combined transcriptomic profiling with miRNA target prediction to identify genes regulated by miR-30a-5p and assess their interaction with the green tea polyphenol, epigallocatechin gallate (EGCG). Differentially expressed genes (DEGs) from TCGA-TNBC datasets and miRNA targets from miRDB, TargetScan, and miRTarBase were screened for common genes. Then, the protein-protein interaction and network topology analyses were performed to identify key hub genes. Molecular docking and simulation were carried out with the four key genes against EGCG.Results: Data integration yielded 393 overlapping genes and identified ten hub genes- RRM2, KIF11, ANLN, CDC20, CCNA1, AGO2, YWHAZ, DTL, SKP2, and PCNA. Pathway enrichment showed that all these hubs are involved in cell cycle and mitotic regulation, which was associated with poor TNBC prognosis. Mutation profiling revealed high alteration rates in KIF11, ANLN, CDC20, and YWHAZ, with increased missense mutations and C>T transitions. Molecular docking and simulations identified YWHAZ as the most favorable and structurally stable EGCG-binding target, compared to the other three key genes.Conclusion: The results emphasizes that EGCG has strong binding affinity towards YWHAZ, revealing that miR-30a-EGCG targets TNBC synergistically through cell-cycle-mediated pathways. The findings give rational support for miRNA-guided phytochemical-based TNBC therapeutic development.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1735106"},"PeriodicalIF":3.9,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12757378/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145901661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Pan-cancer analyses identify oncogenic drivers, expression signatures, and therapeutic vulnerabilities in RHO GTPase pathway genes. 泛癌分析确定了RHO GTPase途径基因的致癌驱动因素、表达特征和治疗脆弱性。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-17 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1708800

Rubén Fernández, L Francisco Lorenzo-Martín, Víctor Quesada, Xosé R Bustelo

RHO family GTPases are key regulators of cancer-related processes such as cytoskeletal dynamics and cell migration, proliferation, and survival. Despite this, a comprehensive understanding of RHO signaling alterations across tumors is still lacking. In this study, we present a pan-cancer analysis of 484 genes encoding RHO GTPases, regulators, proximal effectors, distal downstream signaling elements, and components of their proximal interactomes using data from over 10,000 tumor samples and 33 tumor types present in The Cancer Genome Atlas (TCGA). In addition, we have utilized available data from genome-wide functional dependency screens performed in more than 1,000 gene-edited cancer cell lines. This study has uncovered positively selected mutations in both well-known and previously uncharacterized RHO pathway genes. Transcriptomic profiling reveals widespread and tumor-specific differential expression patterns, with some of them correlating with copy number changes. Interestingly, certain regulators exhibit consistent expression profiles across tumors opposite to those predicted from their canonical roles. Co-expression and gene set enrichment analyses highlight coordinated transcriptional programs involving some RHO GTPase pathway genes and their linkage to key cancer hallmarks, including extracellular matrix reorganization, cell motility, cell cycle progression, cell survival, and immune modulation. Functional screens further identify context-specific dependencies on several deregulated RHO GTPase pathway genes. Altogether, this study provides a comprehensive map of RHO GTPase pathway alterations in cancer and identifies new oncogenic drivers, expression-based signatures, and therapeutic vulnerabilities that could guide future mechanistic and translational research in this area.

RHO家族gtpase是癌症相关过程的关键调控因子，如细胞骨架动力学和细胞迁移、增殖和存活。尽管如此，对肿瘤中RHO信号的改变仍缺乏全面的了解。在这项研究中，我们利用来自癌症基因组图谱（TCGA）中超过10,000个肿瘤样本和33种肿瘤类型的数据，对484个编码RHO gtpase、调节因子、近端效应因子、远端下游信号元件及其近端相互作用组成分的基因进行了泛癌症分析。此外，我们还利用了在1000多个基因编辑的癌细胞系中进行的全基因组功能依赖筛选的可用数据。这项研究发现了众所周知的和以前未表征的RHO通路基因的正选择突变。转录组学分析揭示了广泛和肿瘤特异性的差异表达模式，其中一些与拷贝数变化相关。有趣的是，某些调节因子在肿瘤中表现出一致的表达谱，与它们的规范作用预测相反。共表达和基因集富集分析强调了涉及一些RHO GTPase途径基因的协调转录程序及其与关键癌症标志的联系，包括细胞外基质重组、细胞运动、细胞周期进展、细胞存活和免疫调节。功能筛选进一步确定了对几种不受调控的RHO GTPase通路基因的上下文特异性依赖性。总之，本研究提供了癌症中RHO GTPase通路改变的全面图谱，并确定了新的致癌驱动因素、基于表达的特征和治疗脆弱性，可以指导该领域未来的机制和转化研究。

{"title":"Pan-cancer analyses identify oncogenic drivers, expression signatures, and therapeutic vulnerabilities in RHO GTPase pathway genes.","authors":"Rubén Fernández, L Francisco Lorenzo-Martín, Víctor Quesada, Xosé R Bustelo","doi":"10.3389/fbinf.2025.1708800","DOIUrl":"10.3389/fbinf.2025.1708800","url":null,"abstract":"RHO family GTPases are key regulators of cancer-related processes such as cytoskeletal dynamics and cell migration, proliferation, and survival. Despite this, a comprehensive understanding of RHO signaling alterations across tumors is still lacking. In this study, we present a pan-cancer analysis of 484 genes encoding RHO GTPases, regulators, proximal effectors, distal downstream signaling elements, and components of their proximal interactomes using data from over 10,000 tumor samples and 33 tumor types present in The Cancer Genome Atlas (TCGA). In addition, we have utilized available data from genome-wide functional dependency screens performed in more than 1,000 gene-edited cancer cell lines. This study has uncovered positively selected mutations in both well-known and previously uncharacterized RHO pathway genes. Transcriptomic profiling reveals widespread and tumor-specific differential expression patterns, with some of them correlating with copy number changes. Interestingly, certain regulators exhibit consistent expression profiles across tumors opposite to those predicted from their canonical roles. Co-expression and gene set enrichment analyses highlight coordinated transcriptional programs involving some RHO GTPase pathway genes and their linkage to key cancer hallmarks, including extracellular matrix reorganization, cell motility, cell cycle progression, cell survival, and immune modulation. Functional screens further identify context-specific dependencies on several deregulated RHO GTPase pathway genes. Altogether, this study provides a comprehensive map of RHO GTPase pathway alterations in cancer and identifies new oncogenic drivers, expression-based signatures, and therapeutic vulnerabilities that could guide future mechanistic and translational research in this area.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1708800"},"PeriodicalIF":3.9,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12753894/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identification and functional analysis of hub genes in knee osteoarthritis via bioinformatics and experimental validation. 基于生物信息学和实验验证的膝关节骨关节炎中枢基因鉴定和功能分析。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-17 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1671693

Shanyong Jiang, Jingjing Cao, Jianshu Lu, Jianxiao Liang, Lianxin Li, Yanqiang Song, Jincheng Gao, Baoen Jiang

Objective: Knee osteoarthritis (KOA) is a prevalent chronic degenerative joint disease that causes chronic pain and mobility restrictions in the elderly, significantly impacting quality of life. Current treatments focus on symptom relief, lacking effective interventions targeting the underlying mechanisms. Understanding KOA's molecular mechanisms and identifying key pathogenic genes are essential for developing targeted therapies.

Methods: Gene expression data from KOA patients and healthy controls were obtained from the Gene Expression Omnibus (GEO) database, and differentially expressed genes (DEGs) were identified. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed to reveal the associated biological processes and signaling pathways. Protein-protein interaction (PPI) network analysis and Gene Ontology-based semantic similarity calculations were used to identify hub genes. Gene Set Variation Analysis (GSVA) assessed enrichment in KOA-related pathways. Immune infiltration analysis (CIBERSORT) assessed the immune cell distribution in KOA tissues. Finally, hub gene expression changes were validated using the IL-1β-treated CHON-001 cell model and real-time quantitative PCR (RT-qPCR).

Results: A total of 3,290 upregulated and 2,536 downregulated DEGs were identified. GO and KEGG enrichment analyses revealed these genes were primarily involved in extracellular matrix remodeling, transmembrane transport, and inflammation-related pathways. Key hub genes, including HSPA5, FOXO1, and YWHAE, were identified. GSVA showed that these genes were significantly enriched in multiple KOA-associated signaling pathways. Immune infiltration analysis revealed significant differences in the levels of six immune cell types in KOA tissues, which were associated with the hub genes expression. In CHON-001 cell, the expression levels of GRB2, IKBKG, and HSPA12A were upregulated, whereas YWHAE, HSPB1, and DCAF8 were downregulated, consistent with the tissue samples.

Conclusion: This study identified key pathogenic genes and their regulatory pathways in KOA, highlighting their potential role in disease progression via inflammation and immune modulation. These findings provide insights for developing targeted therapeutic strategies for KOA.

目的：膝关节骨性关节炎（KOA）是一种常见的慢性退行性关节疾病，导致老年人慢性疼痛和活动受限，严重影响生活质量。目前的治疗侧重于症状缓解，缺乏针对潜在机制的有效干预措施。了解KOA的分子机制和确定关键致病基因对开发靶向治疗至关重要。方法：从Gene expression Omnibus （GEO）数据库中获取KOA患者和健康对照者的基因表达数据，鉴定差异表达基因（differential expression genes, DEGs）。基因本体（GO）和京都基因与基因组百科全书（KEGG）富集分析揭示了相关的生物学过程和信号通路。利用蛋白质-蛋白质相互作用（PPI）网络分析和基于基因本体的语义相似度计算来识别中心基因。基因集变异分析（GSVA）评估了koa相关通路的富集程度。免疫浸润分析（CIBERSORT）评估KOA组织中免疫细胞的分布。最后，利用il -1β处理的CHON-001细胞模型和实时定量PCR （RT-qPCR）验证hub基因表达变化。结果：共鉴定出3290个上调的deg和2536个下调的deg。GO和KEGG富集分析显示，这些基因主要参与细胞外基质重塑、跨膜运输和炎症相关途径。鉴定出关键枢纽基因，包括HSPA5、fox01和YWHAE。GSVA显示这些基因在多个koa相关信号通路中显著富集。免疫浸润分析显示，KOA组织中6种免疫细胞类型的水平存在显著差异，这些免疫细胞类型与枢纽基因的表达有关。在CHON-001细胞中，GRB2、IKBKG和HSPA12A的表达水平上调，而YWHAE、HSPB1和DCAF8的表达水平下调，与组织样本一致。结论：本研究确定了KOA的关键致病基因及其调控途径，强调了它们通过炎症和免疫调节在疾病进展中的潜在作用。这些发现为开发针对KOA的靶向治疗策略提供了见解。

{"title":"Identification and functional analysis of hub genes in knee osteoarthritis via bioinformatics and experimental validation.","authors":"Shanyong Jiang, Jingjing Cao, Jianshu Lu, Jianxiao Liang, Lianxin Li, Yanqiang Song, Jincheng Gao, Baoen Jiang","doi":"10.3389/fbinf.2025.1671693","DOIUrl":"10.3389/fbinf.2025.1671693","url":null,"abstract":"Objective: Knee osteoarthritis (KOA) is a prevalent chronic degenerative joint disease that causes chronic pain and mobility restrictions in the elderly, significantly impacting quality of life. Current treatments focus on symptom relief, lacking effective interventions targeting the underlying mechanisms. Understanding KOA's molecular mechanisms and identifying key pathogenic genes are essential for developing targeted therapies.Methods: Gene expression data from KOA patients and healthy controls were obtained from the Gene Expression Omnibus (GEO) database, and differentially expressed genes (DEGs) were identified. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed to reveal the associated biological processes and signaling pathways. Protein-protein interaction (PPI) network analysis and Gene Ontology-based semantic similarity calculations were used to identify hub genes. Gene Set Variation Analysis (GSVA) assessed enrichment in KOA-related pathways. Immune infiltration analysis (CIBERSORT) assessed the immune cell distribution in KOA tissues. Finally, hub gene expression changes were validated using the IL-1β-treated CHON-001 cell model and real-time quantitative PCR (RT-qPCR).Results: A total of 3,290 upregulated and 2,536 downregulated DEGs were identified. GO and KEGG enrichment analyses revealed these genes were primarily involved in extracellular matrix remodeling, transmembrane transport, and inflammation-related pathways. Key hub genes, including HSPA5, FOXO1, and YWHAE, were identified. GSVA showed that these genes were significantly enriched in multiple KOA-associated signaling pathways. Immune infiltration analysis revealed significant differences in the levels of six immune cell types in KOA tissues, which were associated with the hub genes expression. In CHON-001 cell, the expression levels of GRB2, IKBKG, and HSPA12A were upregulated, whereas YWHAE, HSPB1, and DCAF8 were downregulated, consistent with the tissue samples.Conclusion: This study identified key pathogenic genes and their regulatory pathways in KOA, highlighting their potential role in disease progression via inflammation and immune modulation. These findings provide insights for developing targeted therapeutic strategies for KOA.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1671693"},"PeriodicalIF":3.9,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12753952/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bioengineering hybrid artificial life. 生物工程混合人工生命。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-16 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1676359

Innocent Sibanda, Geoff Nitschke

The goal of bioengineering in synthetic biology is to redesign, reprogram, and rewire biological systems for specific applications using standardized parts such as promoters and ribosomes. For example, bioengineered micro-organisms capable of cleaning up environmental pollution or producing antibodies de novo to defend against viral pandemics have been predicted. Artificial Life (ALife) facilitates the design and understanding of living systems, not just those found in nature, but life as it could be, while synthetic biology provides the means to realize life as it can be engineered. Despite significant advances, the synthesis of evolving, adaptable, and bioengineered problem-solving ALife has yet to achieve practical feasibility. This is primarily due to limitations in directed evolution, fitness landscape mapping, and fitness approximation. Thus, currently synthetic (biological) ALife does not continue to evolve and adapt to changing tasks and environments. This is in stark contrast to current digital based ALife that continues to adapt and evolve in simulated environments demonstrating the dictum of life as it could be. We posit that if the bioengineering (on-demand design) of problem solving ALife is to ever become a reality then open issues pervading the directed evolution of synthetic ALife must first be addressed. This review examines open challenges in directed evolution, genetic diversity generation, fitness mapping, and fitness estimation, and outlines future directions toward a hybrid synthetic ALife design methodology. This review provides a novel perspective for a singular (hybridized) evolutionary design methodology, combining digital (in silico) and synthetic (in vitro) evolutionary design methods drawn from various bioengineering, digital and robotic ALife applications, while addressing highlighted directed evolution deficiencies.

合成生物学中生物工程的目标是利用启动子和核糖体等标准化部件为特定应用重新设计、重新编程和重新连接生物系统。例如，有人预测生物工程微生物能够清除环境污染或产生新的抗体来抵御病毒大流行。人工生命（ALife）促进了对生命系统的设计和理解，不仅仅是那些在自然界中发现的生命，还有生命的可能，而合成生物学提供了实现生命的手段，因为它可以被设计。尽管取得了重大进展，但进化、适应性和生物工程解决问题的生命的合成尚未实现实际可行性。这主要是由于定向进化、适应度景观映射和适应度近似的局限性。因此，目前合成（生物）生命不能继续进化和适应不断变化的任务和环境。这与当前基于数字的生命形成鲜明对比，后者在模拟环境中不断适应和进化，证明了生命的格言。我们认为，如果解决人工生命问题的生物工程（按需设计）要成为现实，那么必须首先解决人工生命定向进化中普遍存在的开放性问题。本文综述了定向进化、遗传多样性产生、适应度映射和适应度估计方面的开放性挑战，并概述了混合合成生命设计方法的未来方向。这篇综述为单一（杂交）进化设计方法提供了一个新的视角，结合了从各种生物工程、数字和机器人生命应用中提取的数字（计算机）和合成（体外）进化设计方法，同时解决了突出的定向进化缺陷。

{"title":"Bioengineering hybrid artificial life.","authors":"Innocent Sibanda, Geoff Nitschke","doi":"10.3389/fbinf.2025.1676359","DOIUrl":"10.3389/fbinf.2025.1676359","url":null,"abstract":"The goal of bioengineering in synthetic biology is to redesign, reprogram, and rewire biological systems for specific applications using standardized parts such as promoters and ribosomes. For example, bioengineered micro-organisms capable of cleaning up environmental pollution or producing antibodies de novo to defend against viral pandemics have been predicted. Artificial Life (ALife) facilitates the design and understanding of living systems, not just those found in nature, but life as it could be, while synthetic biology provides the means to realize life as it can be engineered. Despite significant advances, the synthesis of evolving, adaptable, and bioengineered problem-solving ALife has yet to achieve practical feasibility. This is primarily due to limitations in directed evolution, fitness landscape mapping, and fitness approximation. Thus, currently synthetic (biological) ALife does not continue to evolve and adapt to changing tasks and environments. This is in stark contrast to current digital based ALife that continues to adapt and evolve in simulated environments demonstrating the dictum of life as it could be. We posit that if the bioengineering (on-demand design) of problem solving ALife is to ever become a reality then open issues pervading the directed evolution of synthetic ALife must first be addressed. This review examines open challenges in directed evolution, genetic diversity generation, fitness mapping, and fitness estimation, and outlines future directions toward a hybrid synthetic ALife design methodology. This review provides a novel perspective for a singular (hybridized) evolutionary design methodology, combining digital (in silico) and synthetic (in vitro) evolutionary design methods drawn from various bioengineering, digital and robotic ALife applications, while addressing highlighted directed evolution deficiencies.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1676359"},"PeriodicalIF":3.9,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12748196/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

In silico identification of novel natural compounds as potential KIFC1 inhibitors for the therapeutic intervention of triple-negative breast cancer. 新型天然化合物作为三阴性乳腺癌治疗干预的潜在KIFC1抑制剂的计算机鉴定。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-16 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1689172

Prashant Kumar Tiwari, Mukesh Kumar, Richa Mishra, Xiaomeng Zhang, Sanjay Kumar

TNBC is an aggressive and various subtype of breast cancer, notable by the lack of specific oestrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2), consequential in limited treatment options and poor prognosis. Kinesin Family Member C1 (KIFC1), a mitotic motor protein critical for centrosome clustering and spindle formation, has critical role in TNBC progress. In this situation, natural compounds were explored as probable inhibitors of this protein. we utilized molecular docking, ADMET profiling, density functional theory calculations, molecular dynamics simulations, MM/GBSA binding free energy analysis, and principal component analysis to thoroughly evaluate binding affinity, stability, and drug-likeness property of natural compounds against KIFC1. Of the 36,900 compounds utilized, five natural compounds were carefully chosen for further assessment. All five compounds Fosfocytocin, Molybdopterin Compound Z, 5-amino-2-(3-hydroxy-13-methyltetradecanamido) pentanoic acid, TMC-52A, and Muscimol exhibited significant inhibitory efficacy against KIFC1. These compounds demonstrated persistent interactions with critical residues and had advantageous binding properties in computational evaluations. The results collectively indicate their potential as effective inhibitors for targeting KIFC1 in forthcoming studies. These data collectively identify all five natural compounds as possible inhibitors of KIFC1. Nonetheless, their effectiveness and safety must be confirmed through in vivo and in vitro study prior to consideration for clinical application.

TNBC是一种侵袭性的多种亚型乳腺癌，其特点是缺乏特异性雌激素受体（ER）、孕激素受体（PR）和人表皮生长因子受体2 (HER2)，因此治疗选择有限，预后差。Kinesin家族成员C1 （KIFC1）是一种对中心体聚集和纺锤体形成至关重要的有丝分裂运动蛋白，在TNBC的进展中起着关键作用。在这种情况下，天然化合物被探索作为这种蛋白质的可能抑制剂。我们利用分子对接、ADMET分析、密度泛函理论计算、分子动力学模拟、MM/GBSA结合自由能分析和主成分分析来全面评估天然化合物对KIFC1的结合亲和力、稳定性和药物相似性。在使用的36900种化合物中，精心选择了5种天然化合物进行进一步评估。磷霉素、钼钼素化合物Z、5-氨基-2-（3-羟基-13-甲基十四烷酰胺）戊酸、TMC-52A和Muscimol对KIFC1均有显著的抑制作用。这些化合物表现出与关键残基的持续相互作用，并在计算评估中具有有利的结合特性。这些结果共同表明，在未来的研究中，它们有可能成为靶向KIFC1的有效抑制剂。这些数据共同确定了所有五种天然化合物可能是KIFC1的抑制剂。然而，在考虑临床应用之前，它们的有效性和安全性必须通过体内和体外研究来证实。

{"title":"In silico identification of novel natural compounds as potential KIFC1 inhibitors for the therapeutic intervention of triple-negative breast cancer.","authors":"Prashant Kumar Tiwari, Mukesh Kumar, Richa Mishra, Xiaomeng Zhang, Sanjay Kumar","doi":"10.3389/fbinf.2025.1689172","DOIUrl":"10.3389/fbinf.2025.1689172","url":null,"abstract":"TNBC is an aggressive and various subtype of breast cancer, notable by the lack of specific oestrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2), consequential in limited treatment options and poor prognosis. Kinesin Family Member C1 (KIFC1), a mitotic motor protein critical for centrosome clustering and spindle formation, has critical role in TNBC progress. In this situation, natural compounds were explored as probable inhibitors of this protein. we utilized molecular docking, ADMET profiling, density functional theory calculations, molecular dynamics simulations, MM/GBSA binding free energy analysis, and principal component analysis to thoroughly evaluate binding affinity, stability, and drug-likeness property of natural compounds against KIFC1. Of the 36,900 compounds utilized, five natural compounds were carefully chosen for further assessment. All five compounds Fosfocytocin, Molybdopterin Compound Z, 5-amino-2-(3-hydroxy-13-methyltetradecanamido) pentanoic acid, TMC-52A, and Muscimol exhibited significant inhibitory efficacy against KIFC1. These compounds demonstrated persistent interactions with critical residues and had advantageous binding properties in computational evaluations. The results collectively indicate their potential as effective inhibitors for targeting KIFC1 in forthcoming studies. These data collectively identify all five natural compounds as possible inhibitors of KIFC1. Nonetheless, their effectiveness and safety must be confirmed through in vivo and in vitro study prior to consideration for clinical application.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1689172"},"PeriodicalIF":3.9,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12748000/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Celline: a flexible tool for one-step retrieval and integrative analysis of public single-cell RNA sequencing data. Celline：一个灵活的工具，用于一步检索和综合分析公共单细胞RNA测序数据。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-11 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1684227

Yuya Sato, Toru Asahi, Kosuke Kataoka

Single-cell RNA sequencing (scRNA-seq) has generated a rapidly expanding collection of public datasets that provide insight into development, disease, and therapy. However, researchers lack an end-to-end solution for seamlessly retrieving, preprocessing, integrating, and analyzing these data because existing tools address only isolated steps and require manual curation of accessions, metadata, and technical variability, known as batch effects. In this study, we developed Celline, a Python package that executes an entire workflow using a single-line commands per step. Celline automatically gathers raw single-cell RNA-seq data from multiple public repositories and extracts metadata using large language models. It then wraps established tools, including Scrublet for doublet removal, Seurat and Scanpy for quality control and cell-type annotation, Harmony and scVI for batch correction, and Slingshot for trajectory inference, into one-line commands, enabling seamless integrative analyses. To validate Celline-acquired data quality and the integrated framework's practical utility, we applied it to 2 mouse brain cortex datasets from embryonic days 14.5 and 18. Technical validation demonstrated that Celline successfully retrieved data, standardized metadata, and enabled standard analyses that removed low-quality cells, annotated 11 major cell types, improved integration quality (scIB score +0.22), and completed trajectory analysis. Thus, Celline transforms scattered public scRNA-seq resources into unified, analysis-ready datasets with minimal effort. Its modular design allows pipeline extension, encourages community-driven advances, and accelerates the discovery of single-cell data.

单细胞RNA测序（scRNA-seq）产生了一个快速扩展的公共数据集，提供了对发展、疾病和治疗的深入了解。然而，研究人员缺乏无缝检索、预处理、集成和分析这些数据的端到端解决方案，因为现有的工具只能处理孤立的步骤，并且需要手动管理接入、元数据和技术可变性，即批处理效应。在本研究中，我们开发了Celline，这是一个Python包，每一步使用单行命令执行整个工作流。Celline自动收集来自多个公共存储库的原始单细胞RNA-seq数据，并使用大型语言模型提取元数据。然后，它将已建立的工具，包括用于双线去除的scrulet，用于质量控制和细胞类型注释的Seurat和Scanpy，用于批量校正的Harmony和scVI，以及用于轨迹推断的Slingshot，打包成一行命令，从而实现无缝集成分析。为了验证celline获取的数据质量和集成框架的实用性，我们将其应用于2只小鼠胚胎14.5天和18天的大脑皮层数据集。技术验证表明，Celline成功地检索了数据，标准化了元数据，并启用了标准分析，删除了低质量细胞，注释了11种主要细胞类型，提高了集成质量（scIB评分+0.22），并完成了轨迹分析。因此，Celline将分散的公共scRNA-seq资源转化为统一的、可用于分析的数据集。其模块化设计允许管道扩展，鼓励社区驱动的进步，并加速单细胞数据的发现。

{"title":"Celline: a flexible tool for one-step retrieval and integrative analysis of public single-cell RNA sequencing data.","authors":"Yuya Sato, Toru Asahi, Kosuke Kataoka","doi":"10.3389/fbinf.2025.1684227","DOIUrl":"10.3389/fbinf.2025.1684227","url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) has generated a rapidly expanding collection of public datasets that provide insight into development, disease, and therapy. However, researchers lack an end-to-end solution for seamlessly retrieving, preprocessing, integrating, and analyzing these data because existing tools address only isolated steps and require manual curation of accessions, metadata, and technical variability, known as batch effects. In this study, we developed Celline, a Python package that executes an entire workflow using a single-line commands per step. Celline automatically gathers raw single-cell RNA-seq data from multiple public repositories and extracts metadata using large language models. It then wraps established tools, including Scrublet for doublet removal, Seurat and Scanpy for quality control and cell-type annotation, Harmony and scVI for batch correction, and Slingshot for trajectory inference, into one-line commands, enabling seamless integrative analyses. To validate Celline-acquired data quality and the integrated framework's practical utility, we applied it to 2 mouse brain cortex datasets from embryonic days 14.5 and 18. Technical validation demonstrated that Celline successfully retrieved data, standardized metadata, and enabled standard analyses that removed low-quality cells, annotated 11 major cell types, improved integration quality (scIB score +0.22), and completed trajectory analysis. Thus, Celline transforms scattered public scRNA-seq resources into unified, analysis-ready datasets with minimal effort. Its modular design allows pipeline extension, encourages community-driven advances, and accelerates the discovery of single-cell data.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1684227"},"PeriodicalIF":3.9,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12738925/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Neurogenic locus notch homolog protein 1 (NOTCH 1) SNP informatics coupled with intrinsically disordered regions and post-translational modifications reveals the complex structural crosstalk of Lung Adenocarcinoma (LUAD). 神经源性基因座缺口同源蛋白1 (notch 1) SNP信息学结合内在无序区和翻译后修饰揭示了肺腺癌（LUAD）复杂的结构串扰。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-10 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1641521

Pearl John, C Sudandiradoss

Background: Lung adenocarcinoma (LUAD) is the predominant histological subtype of lung cancer, representing a major contributor to cancer mortality rate marked by a high frequency of mutations and intricate interactions between multiple signalling pathways.

Objective: Here we explore the role of NOTCH1 associated Single nucleotide polymorphisms (SNPs) IDR and PTM in LUAD progression. Although the NOTCH1 expression is downregulated, it has been validated as an important prognostic marker because of its complex biological roles under specific conditions.

Methods: With the aid of In silico tools we predicted and identified the deleterious SNPs. The Molecular Docking and dynamics simulations (MDS) were conducted to characterize these mutations.

Results: A total of 43 deleterious SNPs were found in the sequential SNP analysis with 13 SNPs resulted deleterious and damaging effects. The stabilizing SNPs such as S1464I, A1705V and T1602I are found within the conserved and functional domains of NOTCH1. In addition, 1660-2555 sequence region of the PEST domain was recognized as an Intrinsically Disordered Region (IDR) with a score of above 0.5. Moreover, the presence of the two phosphodegrons (SCF_FBW7_1 at 2129-2136 and SCF_FBW7_2 at 2508-2515) along with the Post Translational Modification (PTM) such as o-linked glycosylation and Phosphothreonine within the IDR region, PEST and conserved domains suggest functional significance in LUAD progression.

Conclusion: In conclusion our research highlights the potential regulatory role of identified SNPs, PTMs, and the functional domains of Notch1, particularly the PEST domain and IDR, in pathophysiology of LUAD particularly through the crosstalk of the EMT signalling.

背景：肺腺癌（LUAD）是肺癌的主要组织学亚型，是癌症死亡率的主要原因，其特征是高频率的突变和多种信号通路之间复杂的相互作用。目的：探讨NOTCH1相关的单核苷酸多态性（snp） IDR和PTM在LUAD进展中的作用。尽管NOTCH1表达下调，但由于其在特定条件下具有复杂的生物学作用，已被证实为重要的预后标志物。方法：利用计算机辅助工具对有害snp进行预测和鉴定。分子对接和动力学模拟（MDS）对这些突变进行了表征。结果：在序列SNP分析中共发现43个有害SNP，其中13个SNP产生有害和破坏性作用。稳定snp如S1464I、A1705V和T1602I位于NOTCH1的保守和功能结构域内。此外，PEST结构域1660 ~ 2555序列区被识别为内在无序区（IDR），评分在0.5以上。此外，两个磷酸化子（位于2129-2136的SCF_FBW7_1和位于2508-2515的SCF_FBW7_2）以及IDR区、PEST和保守结构域的翻译后修饰（PTM）如o-链糖基化和磷苏氨酸的存在表明在LUAD进展中的功能意义。结论：总之，我们的研究强调了已鉴定的snp、ptm和Notch1的功能域，特别是PEST结构域和IDR，在LUAD的病理生理中，特别是通过EMT信号的串扰，具有潜在的调节作用。

{"title":"Neurogenic locus notch homolog protein 1 (NOTCH 1) SNP informatics coupled with intrinsically disordered regions and post-translational modifications reveals the complex structural crosstalk of Lung Adenocarcinoma (LUAD).","authors":"Pearl John, C Sudandiradoss","doi":"10.3389/fbinf.2025.1641521","DOIUrl":"10.3389/fbinf.2025.1641521","url":null,"abstract":"Background: Lung adenocarcinoma (LUAD) is the predominant histological subtype of lung cancer, representing a major contributor to cancer mortality rate marked by a high frequency of mutations and intricate interactions between multiple signalling pathways.Objective: Here we explore the role of NOTCH1 associated Single nucleotide polymorphisms (SNPs) IDR and PTM in LUAD progression. Although the NOTCH1 expression is downregulated, it has been validated as an important prognostic marker because of its complex biological roles under specific conditions.Methods: With the aid of In silico tools we predicted and identified the deleterious SNPs. The Molecular Docking and dynamics simulations (MDS) were conducted to characterize these mutations.Results: A total of 43 deleterious SNPs were found in the sequential SNP analysis with 13 SNPs resulted deleterious and damaging effects. The stabilizing SNPs such as S1464I, A1705V and T1602I are found within the conserved and functional domains of NOTCH1. In addition, 1660-2555 sequence region of the PEST domain was recognized as an Intrinsically Disordered Region (IDR) with a score of above 0.5. Moreover, the presence of the two phosphodegrons (SCF_FBW7_1 at 2129-2136 and SCF_FBW7_2 at 2508-2515) along with the Post Translational Modification (PTM) such as o-linked glycosylation and Phosphothreonine within the IDR region, PEST and conserved domains suggest functional significance in LUAD progression.Conclusion: In conclusion our research highlights the potential regulatory role of identified SNPs, PTMs, and the functional domains of Notch1, particularly the PEST domain and IDR, in pathophysiology of LUAD particularly through the crosstalk of the EMT signalling.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1641521"},"PeriodicalIF":3.9,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12727990/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145835538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identification and validation of tumor microenvironment-related therapeutic targets in gastric cancer using integrated multi-omics and molecular docking approaches. 基于多组学和分子对接方法的胃癌肿瘤微环境相关治疗靶点鉴定与验证

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-10 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1654326

Mohamed Kalith Oli M, Jafar Ali Ibrahim Syed Masood

Introduction: With increased drug resistance and tumor heterogeneity accounting for limited therapeutic strategies, gastric cancer remains one of the major causes of cancer-related mortality around the globe. Targeting the components of the tumor microenvironment (TME) has become a promising therapeutic strategy due to their crucial roles in cancer cell proliferation, progression, and metastasis. One of the limitations of the previously identified therapeutic targets is their limited applicability to a broader patient population.

Methods: This study aims to identify (TME)-related therapeutic targets using an integrated bioinformatics and molecular docking approach that involves a larger number of datasets to cover a broader cohort of gastric cancer patients. It analyzed multiple publicly available transcriptomic datasets using Robust Rank Aggregation (RRA) meta-analysis and Weighted Gene Co-expression Network Analysis (WGCNA) to identify significant hub genes. Furthermore, protein-protein interaction (PPI) network analyses, conducted using multiple methods such as Cytohubba topology analysis and ClusterONE module analysis, refined the potential therapeutic candidates. Functional enrichment analyses were performed to identify vital genes involved in TME interactions and ECM remodeling.

Results: The enriched genes were validated for their significant dysregulation in the Cancer Genome Atlas gastric adenocarcinoma dataset (TCGA-STAD) and three independent GEO datasets to ensure differential expression across distinct cohorts. Genes with consistent dysregulation were used in survival analyses across TCGA and two GEO datasets to prioritize hub genes with prognostic significance. Finally, a targeted literature survey ensured the exclusion of previously targeted genes, and molecular docking analyses conducted using phytocompounds identified potential therapeutic leads with strong affinities for the identified targets.

Discussion: This integrated approach revealed notable, promising targets in the TME and natural compounds for developing potential personalized therapeutic strategies in gastric cancer.

导言：由于耐药增加和肿瘤异质性导致治疗策略有限，胃癌仍然是全球癌症相关死亡的主要原因之一。肿瘤微环境（tumor microenvironment， TME）的靶向治疗已成为一种很有前景的治疗策略，因为它们在癌细胞增殖、进展和转移中起着至关重要的作用。先前确定的治疗靶点的局限性之一是它们对更广泛的患者群体的有限适用性。方法：本研究旨在利用集成的生物信息学和分子对接方法识别（TME）相关的治疗靶点，该方法涉及更多的数据集，以覆盖更广泛的胃癌患者队列。该研究使用稳健秩聚集（RRA）荟萃分析和加权基因共表达网络分析（WGCNA）分析了多个公开可用的转录组数据集，以确定重要的枢纽基因。此外，使用多种方法（如Cytohubba拓扑分析和ClusterONE模块分析）进行的蛋白质-蛋白质相互作用（PPI）网络分析，改进了潜在的治疗候选药物。进行功能富集分析以确定参与TME相互作用和ECM重塑的重要基因。结果：富集的基因在癌症基因组图谱胃腺癌数据集（TCGA-STAD）和三个独立的GEO数据集中被证实存在显著的失调，以确保在不同队列中的差异表达。在TCGA和两个GEO数据集的生存分析中，使用一致失调的基因来优先考虑具有预后意义的中心基因。最后，一项有针对性的文献调查确保了先前靶向基因的排除，并利用植物化合物进行了分子对接分析，确定了与所鉴定靶点具有强亲和力的潜在治疗线索。讨论：这种综合方法揭示了TME和天然化合物中值得注意的、有希望的靶点，可用于开发潜在的胃癌个性化治疗策略。

{"title":"Identification and validation of tumor microenvironment-related therapeutic targets in gastric cancer using integrated multi-omics and molecular docking approaches.","authors":"Mohamed Kalith Oli M, Jafar Ali Ibrahim Syed Masood","doi":"10.3389/fbinf.2025.1654326","DOIUrl":"10.3389/fbinf.2025.1654326","url":null,"abstract":"Introduction: With increased drug resistance and tumor heterogeneity accounting for limited therapeutic strategies, gastric cancer remains one of the major causes of cancer-related mortality around the globe. Targeting the components of the tumor microenvironment (TME) has become a promising therapeutic strategy due to their crucial roles in cancer cell proliferation, progression, and metastasis. One of the limitations of the previously identified therapeutic targets is their limited applicability to a broader patient population.Methods: This study aims to identify (TME)-related therapeutic targets using an integrated bioinformatics and molecular docking approach that involves a larger number of datasets to cover a broader cohort of gastric cancer patients. It analyzed multiple publicly available transcriptomic datasets using Robust Rank Aggregation (RRA) meta-analysis and Weighted Gene Co-expression Network Analysis (WGCNA) to identify significant hub genes. Furthermore, protein-protein interaction (PPI) network analyses, conducted using multiple methods such as Cytohubba topology analysis and ClusterONE module analysis, refined the potential therapeutic candidates. Functional enrichment analyses were performed to identify vital genes involved in TME interactions and ECM remodeling.Results: The enriched genes were validated for their significant dysregulation in the Cancer Genome Atlas gastric adenocarcinoma dataset (TCGA-STAD) and three independent GEO datasets to ensure differential expression across distinct cohorts. Genes with consistent dysregulation were used in survival analyses across TCGA and two GEO datasets to prioritize hub genes with prognostic significance. Finally, a targeted literature survey ensured the exclusion of previously targeted genes, and molecular docking analyses conducted using phytocompounds identified potential therapeutic leads with strong affinities for the identified targets.Discussion: This integrated approach revealed notable, promising targets in the TME and natural compounds for developing potential personalized therapeutic strategies in gastric cancer.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1654326"},"PeriodicalIF":3.9,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12727970/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145835494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Uncovering human kinase substrates in nipah proteome. 揭示尼帕病毒蛋白质组中的人激酶底物。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-05 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1678189

Vineetha Shaji, Akash Anil, Ayisha A Jabbar, Althaf Mahin, Ahmad Rafi, Amjesh Revikumar, Sowmya Soman, Ganesh Prasad, Sneha M Pinto, Yashwanth Subbannayya, Abhithaj Jayanandan, Rajesh Raju

Nipah virus (NiV) is a zoonotic pathogen that causes recurrent outbreaks with considerable implications for public health. Viruses engage host kinases to phosphorylate viral proteins, aiding replication and host disruption. Identifying NiV phosphoproteins and their host kinases is therefore critical for understanding the mechanism of infection and developing therapeutics. We performed kinase-substrate phosphomotif analysis based on prior studies and employed computational tools to identify putative phosphosites in NiV proteins and corresponding host kinases. Redundancy analysis highlighted key kinases capable of phosphorylating multiple NiV proteins and high-potential viral substrates. Integration with human-viral protein-protein interaction data revealed human kinase substrate proteins in human that interact with NiV proteins, while conservation analysis assessed phosphosites across nine NiV proteins in various strains. The functional significance of the identified and predicted viral substrates and their corresponding host kinases was further validated through in silico docking and molecular dynamics simulation (MD). Motif-based kinase-substrate analysis identified 51 human kinases predicted to target 1180 phosphorylation sites across nine NiV proteins, including key human kinases such as Eukaryotic elongation factor 2 kinase [EEF2K], Haploid germ cell-specific nuclear protein kinase [HASPIN], Mitogen-activated protein kinase 9 [MAPK9], Microtubule-associated serine/threonine-protein kinase 2 [MAST2], and Spleen tyrosine kinase [SYK], with the potential to phosphorylate multiple sites across NiV proteins. Using computational prediction tools, we identified several potential phosphorylation sites on NiV proteins, along with their corresponding candidate human kinases. In silico docking revealed interactions between EEF2K and both the NiV Fusion Glycoprotein and NiV Phosphoprotein (P), MAPK9 with the NiV Matrix Protein, and HASPIN with NiV RNA-dependent RNA polymerase. MD simulations of the EEF2K-NiV Fusion Glycoprotein complex confirmed the stability of this interaction. Leucine-rich repeat serine/threonine-protein kinase 2 [LRRK2], HASPIN, MAST2, and EEF2K were the human kinases predicted to phosphorylate experimentally validated sites on NiV nucleocapsid (N), P, and W proteins. Furthermore, through an extensive literature review, we investigated the therapeutic potential of targeting these kinases using known inhibitors and identified compounds that could potentially be repurposed as antiviral agents against NiV infection. Our findings indicate that EEF2K phosphorylates key NiV proteins at conserved phosphosites across variants, underscoring the pathogenic significance of kinases in NiV infection and their potential as therapeutic targets.

尼帕病毒是一种人畜共患病原体，可引起反复暴发，对公共卫生造成重大影响。病毒利用宿主激酶磷酸化病毒蛋白，帮助复制和破坏宿主。因此，鉴定NiV磷酸化蛋白及其宿主激酶对于了解感染机制和开发治疗方法至关重要。我们在先前研究的基础上进行了激酶-底物磷酸化分析，并使用计算工具鉴定了NiV蛋白和相应宿主激酶中的推定磷酸化位点。冗余分析突出了能够磷酸化多个NiV蛋白和高潜力病毒底物的关键激酶。结合人-病毒蛋白-蛋白相互作用数据显示，人类激酶底物蛋白与NiV蛋白相互作用，而保守分析评估了不同菌株中9种NiV蛋白的磷酸化位点。通过计算机对接和分子动力学模拟（MD）进一步验证了鉴定和预测的病毒底物及其相应宿主激酶的功能意义。基于基元的激酶-底物分析确定了51种人类激酶，预计可靶向9种NiV蛋白的1180个磷酸化位点，包括真核延伸因子2激酶（EEF2K）、单倍体生殖细胞特异性核蛋白激酶（HASPIN）、丝裂原活化蛋白激酶9 （MAPK9）、微管相关丝氨酸/苏氨酸蛋白激酶2 （MAST2）和脾酪氨酸激酶（SYK）等关键人类激酶，它们具有磷酸化NiV蛋白多个位点的潜力。利用计算预测工具，我们确定了NiV蛋白上的几个潜在磷酸化位点，以及相应的候选人类激酶。通过硅对接，发现EEF2K与NiV融合糖蛋白和NiV磷酸化蛋白(P)、MAPK9与NiV基质蛋白、HASPIN与NiV RNA依赖的RNA聚合酶之间存在相互作用。EEF2K-NiV融合糖蛋白复合物的MD模拟证实了这种相互作用的稳定性。富含亮氨酸的重复丝氨酸/苏氨酸蛋白激酶2 [LRRK2]、HASPIN、MAST2和EEF2K是预测磷酸化NiV核衣壳(N)、P和W蛋白上实验验证的位点的人激酶。此外，通过广泛的文献回顾，我们研究了使用已知抑制剂靶向这些激酶的治疗潜力，并确定了可能被重新用作抗NiV感染的抗病毒药物的化合物。我们的研究结果表明，EEF2K磷酸化了不同变体中保守磷酸化位点上的关键NiV蛋白，强调了激酶在NiV感染中的致病意义及其作为治疗靶点的潜力。

{"title":"Uncovering human kinase substrates in nipah proteome.","authors":"Vineetha Shaji, Akash Anil, Ayisha A Jabbar, Althaf Mahin, Ahmad Rafi, Amjesh Revikumar, Sowmya Soman, Ganesh Prasad, Sneha M Pinto, Yashwanth Subbannayya, Abhithaj Jayanandan, Rajesh Raju","doi":"10.3389/fbinf.2025.1678189","DOIUrl":"10.3389/fbinf.2025.1678189","url":null,"abstract":"Nipah virus (NiV) is a zoonotic pathogen that causes recurrent outbreaks with considerable implications for public health. Viruses engage host kinases to phosphorylate viral proteins, aiding replication and host disruption. Identifying NiV phosphoproteins and their host kinases is therefore critical for understanding the mechanism of infection and developing therapeutics. We performed kinase-substrate phosphomotif analysis based on prior studies and employed computational tools to identify putative phosphosites in NiV proteins and corresponding host kinases. Redundancy analysis highlighted key kinases capable of phosphorylating multiple NiV proteins and high-potential viral substrates. Integration with human-viral protein-protein interaction data revealed human kinase substrate proteins in human that interact with NiV proteins, while conservation analysis assessed phosphosites across nine NiV proteins in various strains. The functional significance of the identified and predicted viral substrates and their corresponding host kinases was further validated through in silico docking and molecular dynamics simulation (MD). Motif-based kinase-substrate analysis identified 51 human kinases predicted to target 1180 phosphorylation sites across nine NiV proteins, including key human kinases such as Eukaryotic elongation factor 2 kinase [EEF2K], Haploid germ cell-specific nuclear protein kinase [HASPIN], Mitogen-activated protein kinase 9 [MAPK9], Microtubule-associated serine/threonine-protein kinase 2 [MAST2], and Spleen tyrosine kinase [SYK], with the potential to phosphorylate multiple sites across NiV proteins. Using computational prediction tools, we identified several potential phosphorylation sites on NiV proteins, along with their corresponding candidate human kinases. In silico docking revealed interactions between EEF2K and both the NiV Fusion Glycoprotein and NiV Phosphoprotein (P), MAPK9 with the NiV Matrix Protein, and HASPIN with NiV RNA-dependent RNA polymerase. MD simulations of the EEF2K-NiV Fusion Glycoprotein complex confirmed the stability of this interaction. Leucine-rich repeat serine/threonine-protein kinase 2 [LRRK2], HASPIN, MAST2, and EEF2K were the human kinases predicted to phosphorylate experimentally validated sites on NiV nucleocapsid (N), P, and W proteins. Furthermore, through an extensive literature review, we investigated the therapeutic potential of targeting these kinases using known inhibitors and identified compounds that could potentially be repurposed as antiviral agents against NiV infection. Our findings indicate that EEF2K phosphorylates key NiV proteins at conserved phosphosites across variants, underscoring the pathogenic significance of kinases in NiV infection and their potential as therapeutic targets.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1678189"},"PeriodicalIF":3.9,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12715814/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Predicting GD2 expression across cancer types by the integration of pathway topology and transcriptome data. 通过整合途径拓扑和转录组数据预测GD2在不同癌症类型中的表达。

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Frontiers in bioinformatics

Pub Date : 2025-12-04 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1705930

Arsenij Ustjanzew, Federico Marini, Saskia Wagner, Arthur Wingerter, Roger Sandhoff, Jörg Faber, Claudia Paret

Background: The disialoganglioside GD2 is a key cancer therapy target due to its overexpression in several cancers and limited presence in normal tissues. However, experimental assessment is technically challenging and not routinely available. We developed a computational framework that integrates reaction activity derived from transcriptomic data with the glycosphingolipid biosynthesis pathway to predict GD2 expression.

Methods: We computed Reaction Activity Scores from transcriptomic data and weighted the reactions of a glycosphingolipid metabolic network, refining edge weights with topology-based transition probabilities to account for enzyme promiscuity. Cumulative activities of GD2-promoting and -mitigating reactions served as features in a Support Vector Machine (SVM) to model GD2-associated differences between neuroblastoma and normal tissue. SVM decision values were used as a continuous proxy for GD2 expression. We validated the predicted GD2 scores across independent datasets by comparing them with literature-reported values and flow-cytometric confirmation of a model-predicted high-GD2 tumor. Copy-number alteration (CNA) data were integrated to identify candidate genomic biomarkers of GD2-positive samples.

Results: Our SVM-based GD2 score achieved balanced accuracy of 0.80 with a linear kernel, selected due to reduced overfitting risk and interpretability, while matching the accuracy of more complex kernels. The model transferred reliably across six independent RNA-seq datasets and reproduced known GD2 expression patterns, outperforming a two-gene signature in capturing subtype-specific heterogeneity and avoiding overestimation in normal brain tissue. Pan-cancer analyses revealed heterogeneous GD2 expression in several cancer subtypes. Notably, we experimentally confirmed high GD2 expression in clear cell sarcoma of the kidney, consistent with model predictions. CNA analysis implicated B4GALNT1 amplification as a GD2-promoting factor in dedifferentiated liposarcoma. To facilitate adoption of our approach, we developed GD2Viz, an R package with an interactive Shiny application for score computation, visualization, and analysis of user data.

Conclusion: Our computational framework provides a robust, interpretable, biologically grounded predictor of GD2 expression, offering greater consistency and clinical interpretability over existing gene-based signatures. Importantly, with over 20 GD2-directed trials ongoing, our approach may help prioritize tumor entities with high GD2 levels, delineate candidate patient subgroups, and generate testable hypotheses in underexplored cancers, thereby supporting patient stratification and eligibility screening for clinical trials.

背景：二对话神经节苷脂GD2在几种癌症中过表达，在正常组织中存在有限，是一个关键的癌症治疗靶点。然而，实验评估在技术上具有挑战性，并且不是常规可用的。我们开发了一个计算框架，将来自转录组学数据的反应活性与鞘糖脂生物合成途径相结合，以预测GD2的表达。方法：我们从转录组学数据中计算反应活性分数，并对糖鞘脂代谢网络的反应进行加权，用基于拓扑的转移概率来细化边缘权重，以解释酶的混杂性。gd2促进和缓解反应的累积活性作为支持向量机（SVM）的特征来模拟神经母细胞瘤和正常组织之间gd2相关的差异。使用SVM决策值作为GD2表达的连续代理。我们通过将预测的GD2评分与文献报道的值和流式细胞术对模型预测的高GD2肿瘤的确认进行比较，验证了独立数据集的预测GD2评分。整合拷贝数改变（CNA）数据来鉴定gd2阳性样本的候选基因组生物标志物。结果：我们基于svm的GD2评分与线性核的平衡精度达到0.80，选择线性核是由于降低了过拟合风险和可解释性，同时匹配更复杂核的精度。该模型在六个独立的RNA-seq数据集之间可靠地转移，并再现了已知的GD2表达模式，在捕获亚型特异性异质性和避免在正常脑组织中高估方面优于双基因标记。泛癌分析揭示了GD2在几种癌症亚型中的异质表达。值得注意的是，我们通过实验证实了GD2在肾透明细胞肉瘤中的高表达，与模型预测一致。CNA分析提示B4GALNT1扩增在去分化脂肪肉瘤中是gd2促进因子。为了便于采用我们的方法，我们开发了GD2Viz，这是一个带有交互式Shiny应用程序的R包，用于分数计算、可视化和用户数据分析。结论：我们的计算框架提供了一个强大的、可解释的、基于生物学的GD2表达预测器，与现有的基于基因的特征相比，提供了更大的一致性和临床可解释性。重要的是，有超过20个GD2导向的试验正在进行中，我们的方法可能有助于优先考虑具有高GD2水平的肿瘤实体，描绘候选患者亚组，并在未被探索的癌症中产生可测试的假设，从而支持患者分层和临床试验的资格筛选。

{"title":"Predicting GD2 expression across cancer types by the integration of pathway topology and transcriptome data.","authors":"Arsenij Ustjanzew, Federico Marini, Saskia Wagner, Arthur Wingerter, Roger Sandhoff, Jörg Faber, Claudia Paret","doi":"10.3389/fbinf.2025.1705930","DOIUrl":"10.3389/fbinf.2025.1705930","url":null,"abstract":"Background: The disialoganglioside GD2 is a key cancer therapy target due to its overexpression in several cancers and limited presence in normal tissues. However, experimental assessment is technically challenging and not routinely available. We developed a computational framework that integrates reaction activity derived from transcriptomic data with the glycosphingolipid biosynthesis pathway to predict GD2 expression.Methods: We computed Reaction Activity Scores from transcriptomic data and weighted the reactions of a glycosphingolipid metabolic network, refining edge weights with topology-based transition probabilities to account for enzyme promiscuity. Cumulative activities of GD2-promoting and -mitigating reactions served as features in a Support Vector Machine (SVM) to model GD2-associated differences between neuroblastoma and normal tissue. SVM decision values were used as a continuous proxy for GD2 expression. We validated the predicted GD2 scores across independent datasets by comparing them with literature-reported values and flow-cytometric confirmation of a model-predicted high-GD2 tumor. Copy-number alteration (CNA) data were integrated to identify candidate genomic biomarkers of GD2-positive samples.Results: Our SVM-based GD2 score achieved balanced accuracy of 0.80 with a linear kernel, selected due to reduced overfitting risk and interpretability, while matching the accuracy of more complex kernels. The model transferred reliably across six independent RNA-seq datasets and reproduced known GD2 expression patterns, outperforming a two-gene signature in capturing subtype-specific heterogeneity and avoiding overestimation in normal brain tissue. Pan-cancer analyses revealed heterogeneous GD2 expression in several cancer subtypes. Notably, we experimentally confirmed high GD2 expression in clear cell sarcoma of the kidney, consistent with model predictions. CNA analysis implicated B4GALNT1 amplification as a GD2-promoting factor in dedifferentiated liposarcoma. To facilitate adoption of our approach, we developed GD2Viz, an R package with an interactive Shiny application for score computation, visualization, and analysis of user data.Conclusion: Our computational framework provides a robust, interpretable, biologically grounded predictor of GD2 expression, offering greater consistency and clinical interpretability over existing gene-based signatures. Importantly, with over 20 GD2-directed trials ongoing, our approach may help prioritize tumor entities with high GD2 levels, delineate candidate patient subgroups, and generate testable hypotheses in underexplored cancers, thereby supporting patient stratification and eligibility screening for clinical trials.","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1705930"},"PeriodicalIF":3.9,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12711791/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0