首页 > 最新文献

Computational Biology and Chemistry最新文献

英文 中文
GCLSC: Single-cell clustering model based on graph contrastive learning GCLSC:基于图对比学习的单细胞聚类模型
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-11 DOI: 10.1016/j.compbiolchem.2026.108896
Hui An , Teng Zhang , Jianjun Tan
The advent of single-cell RNA sequencing (scRNA-seq) technology has enabled the analysis of cellular heterogeneity at the single-cell level. In scRNA-seq data analysis, cell clustering is a crucial downstream task, as it facilitates the discovery of novel cell subtypes and the identification of known cell types, laying the groundwork for subsequent analyses. However, scRNA-seq data pose significant challenges for cell clustering due to their high dimensionality, sparsity, and technical artifacts (including batch effects and dropout events). To address these challenges, we propose GCLSC (Graph Contrastive Learning for Single-Cell Clustering), a novel graph contrastive learning model. GCLSC integrates Graph Transformer and Graph Attention Network (GAT) to co-model local cellular interactions and global topological dependencies. Four data augmentation strategies enhance data diversity and mitigate overfitting. Experiments on nine real-world scRNA-seq datasets demonstrate that the model achieves superior clustering accuracy. Furthermore, leveraging its excellent clustering performance, this architecture provides a reliable computational tool for cell population profiling of scRNA-seq data — its accurate clustering results can effectively support core tasks such as novel cell subtype identification and known cell type annotation. GCLSC exemplifies the potential of combining GAT, Transformer, and contrastive learning for robust single-cell analysis. The source code is available at https://github.com/JianjunTan-Beijing/GCLSC.
单细胞RNA测序(scRNA-seq)技术的出现使得在单细胞水平上分析细胞异质性成为可能。在scRNA-seq数据分析中,细胞聚类是一项至关重要的下游任务,因为它有助于发现新的细胞亚型和鉴定已知的细胞类型,为后续分析奠定基础。然而,由于scRNA-seq数据的高维性、稀疏性和技术伪影(包括批处理效应和退出事件),scRNA-seq数据对细胞聚类构成了重大挑战。为了解决这些挑战,我们提出了一种新的图对比学习模型GCLSC (Graph contrast Learning for Single-Cell Clustering)。GCLSC集成了图转换器和图注意网络(GAT)来共同建模局部细胞相互作用和全局拓扑依赖性。四种数据增强策略增强了数据多样性并减轻了过拟合。在9个真实scRNA-seq数据集上的实验表明,该模型具有较好的聚类精度。此外,利用其优异的聚类性能,该架构为scRNA-seq数据的细胞群体分析提供了可靠的计算工具,其准确的聚类结果可以有效地支持诸如新细胞亚型鉴定和已知细胞类型注释等核心任务。GCLSC举例说明了将GAT、Transformer和对比学习结合起来进行稳健的单细胞分析的潜力。源代码可从https://github.com/JianjunTan-Beijing/GCLSC获得。
{"title":"GCLSC: Single-cell clustering model based on graph contrastive learning","authors":"Hui An ,&nbsp;Teng Zhang ,&nbsp;Jianjun Tan","doi":"10.1016/j.compbiolchem.2026.108896","DOIUrl":"10.1016/j.compbiolchem.2026.108896","url":null,"abstract":"<div><div>The advent of single-cell RNA sequencing (scRNA-seq) technology has enabled the analysis of cellular heterogeneity at the single-cell level. In scRNA-seq data analysis, cell clustering is a crucial downstream task, as it facilitates the discovery of novel cell subtypes and the identification of known cell types, laying the groundwork for subsequent analyses. However, scRNA-seq data pose significant challenges for cell clustering due to their high dimensionality, sparsity, and technical artifacts (including batch effects and dropout events). To address these challenges, we propose GCLSC (Graph Contrastive Learning for Single-Cell Clustering), a novel graph contrastive learning model. GCLSC integrates Graph Transformer and Graph Attention Network (GAT) to co-model local cellular interactions and global topological dependencies. Four data augmentation strategies enhance data diversity and mitigate overfitting. Experiments on nine real-world scRNA-seq datasets demonstrate that the model achieves superior clustering accuracy. Furthermore, leveraging its excellent clustering performance, this architecture provides a reliable computational tool for cell population profiling of scRNA-seq data — its accurate clustering results can effectively support core tasks such as novel cell subtype identification and known cell type annotation. GCLSC exemplifies the potential of combining GAT, Transformer, and contrastive learning for robust single-cell analysis. The source code is available at <span><span>https://github.com/JianjunTan-Beijing/GCLSC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108896"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145974166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling the genetic blueprint of geosmin synthesis, secondary metabolite pathways, and functional genome analysis of Streptomyces rubrogriseus RKDTS3 from tilapia fish pond sediment 揭示罗非鱼鱼塘沉积物中红褐链霉菌RKDTS3土臭素合成、次生代谢途径的遗传蓝图及功能基因组分析
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-15 DOI: 10.1016/j.compbiolchem.2026.108900
Ramakrishnan Govindharaj Kumar, Dhanasekaran Dharumadurai
Off-flavours such as geosmin and 2-methylisoborneol (MIB) are economically and sensorially problematic compounds in freshwater aquaculture. Although "geosmin" is produced by certain Streptomyces species living in lake sediments, we know very little about the genetic basis of this production or the biosynthetic precursors. Hence, we sequenced the draft genome of a Streptomyces rubrogriseus (RKDTS3), originally isolated from sediments in a tilapia pond near Tamil Nadu, India, to identify genes involved in producing geosmin and other secondary metabolites. The Illumina MiSeq-generated draft genome for RKDTS3 contains 5.32 Mb of sequence information, has a GC content of 71 %, and contains 6129 protein-coding genes, 61 tRNA genes, and one rRNA operon. The annotation of this genome indicated a significant number of metabolic genes required for productive biosynthetic pathways (as well as stress environment adaptation). There are 21 BGCs for producing various terpenoids, polyketides, nonribosomal peptide ligands (NRPBs); ribosomal peptide ligands (RiPPs); and siderophore compounds. The search for the biosynthetic cluster that produces geosmin and encodes the gene geoA identified a BGC that contained the KO K10187, determined using KofamKOALA, and provided strong evidence that the geosmin biosynthetic pathway is conserved and functional. A comparison of Streptomyces strains reveals 1994 core BGCs, along with a highly variable accessory genome that has adapted to various ecological environments. This strain has also acquired multiple copies of the CRISPR genome, three plasmids, and an incomplete prophage, indicating that it has undergone horizontal gene transfer, developed defence mechanisms to protect against phage, and has a dynamic genome. Overall, genome analysis revealed a GC-rich draft genome encoding 21 biosynthetic gene clusters, including a conserved geoA-containing terpene cluster responsible for geosmin biosynthesis, conserved core genome alongside a highly variable accessory genome, reflecting ecological adaptation in comparative genomics. Thus, the findings state the genomic origin of geosmin and secondary metabolite biosynthesis in S. rubrogriseus RKDTS3.
土臭素和2-甲基异龙脑(MIB)等异味在淡水水产养殖中是经济和感官上有问题的化合物。虽然“土臭素”是由生活在湖泊沉积物中的某些链霉菌产生的,但我们对这种生产或生物合成前体的遗传基础知之甚少。因此,我们对最初从印度泰米尔纳德邦附近的罗非鱼池塘沉积物中分离出来的一株rubbrogriseus链霉菌(RKDTS3)的基因组草图进行了测序,以确定与产生土臭素和其他次生代谢物有关的基因。Illumina misiq生成的RKDTS3基因组草图包含5.32 Mb的序列信息,GC含量为71 %,包含6129个蛋白质编码基因,61个tRNA基因和1个rRNA操纵子。该基因组的注释表明,生产生物合成途径(以及应激环境适应)所需的大量代谢基因。有21种bgc用于生产各种萜类、多酮类、非核糖体肽配体(NRPBs);核糖体肽配体;还有铁载体化合物。对产生土臭素和编码geoA基因的生物合成簇的研究发现了一个含有KO K10187的BGC,使用KofamKOALA测定,并提供了强有力的证据,证明土臭素生物合成途径是保守的和功能性的。链霉菌菌株的比较揭示了1994核心BGCs,以及适应各种生态环境的高度可变的辅助基因组。该菌株还获得了CRISPR基因组的多个拷贝、三个质粒和一个不完整的噬菌体,这表明它经历了水平基因转移,发展了防御噬菌体的防御机制,并且具有动态基因组。总体而言,基因组分析揭示了一个富含gc的草图基因组,编码21个生物合成基因簇,包括一个保守的含geoa萜烯簇,负责土精素的生物合成,保守的核心基因组和一个高度可变的辅助基因组,反映了比较基因组学的生态适应性。因此,这些发现阐明了S. rubrogriseus RKDTS3中土臭素和次生代谢物生物合成的基因组起源。
{"title":"Unveiling the genetic blueprint of geosmin synthesis, secondary metabolite pathways, and functional genome analysis of Streptomyces rubrogriseus RKDTS3 from tilapia fish pond sediment","authors":"Ramakrishnan Govindharaj Kumar,&nbsp;Dhanasekaran Dharumadurai","doi":"10.1016/j.compbiolchem.2026.108900","DOIUrl":"10.1016/j.compbiolchem.2026.108900","url":null,"abstract":"<div><div>Off-flavours such as geosmin and 2-methylisoborneol (MIB) are economically and sensorially problematic compounds in freshwater aquaculture. Although \"geosmin\" is produced by certain <em>Streptomyces</em> species living in lake sediments, we know very little about the genetic basis of this production or the biosynthetic precursors. Hence, we sequenced the draft genome of a <em>Streptomyces rubrogriseus</em> (RKDTS3), originally isolated from sediments in a tilapia pond near Tamil Nadu, India, to identify genes involved in producing geosmin and other secondary metabolites. The Illumina MiSeq-generated draft genome for RKDTS3 contains 5.32 Mb of sequence information, has a GC content of 71 %, and contains 6129 protein-coding genes, 61 tRNA genes, and one rRNA operon. The annotation of this genome indicated a significant number of metabolic genes required for productive biosynthetic pathways (as well as stress environment adaptation). There are 21 BGCs for producing various terpenoids, polyketides, nonribosomal peptide ligands (NRPBs); ribosomal peptide ligands (RiPPs); and siderophore compounds. The search for the biosynthetic cluster that produces geosmin and encodes the gene <em>geoA</em> identified a BGC that contained the KO K10187, determined using KofamKOALA, and provided strong evidence that the geosmin biosynthetic pathway is conserved and functional. A comparison of <em>Streptomyces</em> strains reveals 1994 core BGCs, along with a highly variable accessory genome that has adapted to various ecological environments. This strain has also acquired multiple copies of the CRISPR genome, three plasmids, and an incomplete prophage, indicating that it has undergone horizontal gene transfer, developed defence mechanisms to protect against phage, and has a dynamic genome. Overall, genome analysis revealed a GC-rich draft genome encoding 21 biosynthetic gene clusters, including a conserved <em>geoA</em>-containing terpene cluster responsible for geosmin biosynthesis, conserved core genome alongside a highly variable accessory genome, reflecting ecological adaptation in comparative genomics. Thus, the findings state the genomic origin of geosmin and secondary metabolite biosynthesis in <em>S. rubrogriseus</em> RKDTS3.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108900"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145974171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-fragmented derivatives of methotrexate to design effective and safer DHFR inhibitor: A computational breakthrough for ectopic pregnancy therapy 人工智能碎片化甲氨蝶呤衍生物设计有效和安全的DHFR抑制剂:异位妊娠治疗的计算突破
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-14 DOI: 10.1016/j.compbiolchem.2026.108906
Deniz Inan , Sinan Karageçili , Nouman Ali , Adeeba Ali
Ectopic pregnancy (EP) remains a major cause of maternal morbidity and mortality during the first trimester, affecting approximately 1–3 % of all pregnancies and often necessitating invasive surgery when medical therapy fails. Methotrexate (MTX), a dihydrofolate reductase (DHFR) inhibitor, is the current gold standard for medical management, yet its limitations include treatment failure rates of 10–20 % and significant systemic toxicity. This study aimed to design and evaluate novel AI-fragmented MTX derivatives as improved DHFR inhibitors for EP. The three-dimensional structure of DHFR was retrieved from the AlphaFold Protein Structure Database and validated with a pLDDT score of 96.15, a Ramachandran plot showing 93.1 % residues in most favored regions, and an ERRAT quality factor of 98.85, confirming its suitability for computational studies. Protein–protein interaction mapping confirmed DHFR’s central role in folate metabolism. Using an AI-driven fragmentation approach, 20 MTX analogues were generated, of which Derivative 8 (Replace) emerged as the most promising. Docking simulations demonstrated a binding affinity of –10.6 kcal/mol compared to MTX (–8.6 kcal/mol), with detailed interaction analysis revealing extensive hydrogen bonding, electrostatic stabilization, and hydrophobic packing within the DHFR active site. Molecular dynamics (500 ns) confirmed the stability of the DHFR–Derivative 8 complex, with backbone RMSD stabilizing at 0.15–0.20 nm, radius of gyration averaging 1.61 nm, consistent SASA (∼970–980 Ų), and stable hydrogen bonding throughout. Free energy calculations further supported binding, with MM-GBSA yielding a favorable ΔG_total of –45.54 kcal/mol. Pharmacophore characterization showed Derivative 8 possessed enhanced features (3 aromatic rings, 7 acceptors, 5 hydrophobic centers), while DFT analysis revealed a narrower HOMO–LUMO gap, indicating higher electronic reactivity. ADMET analysis predicted reduced toxicity for Derivative 8 (LD50 ≈ 135 mg/kg, class III) compared to MTX (LD50 ≈ 3 mg/kg, class I). Collectively, these results suggest that Derivative 8 is a more potent and safer DHFR inhibitor than MTX. Limitations include reliance on in-silico methods without experimental validation; thus, future directions involve synthesis, in vitro assays, and reproductive toxicity evaluation.
宫外孕(EP)仍然是妊娠早期产妇发病和死亡的主要原因,约占所有妊娠的1-3 - %,当药物治疗失败时,通常需要进行侵入性手术。甲氨蝶呤(MTX)是一种二氢叶酸还原酶(DHFR)抑制剂,是目前医疗管理的金标准,但其局限性包括治疗失败率为10 - 20% %和显著的全身毒性。本研究旨在设计和评估新型ai碎片化MTX衍生物作为EP的改进DHFR抑制剂。DHFR的三维结构从AlphaFold蛋白结构数据库中检索,pLDDT评分为96.15,Ramachandran图显示最有利区域的残留物为93.1 %,ERRAT质量因子为98.85,证实其适合计算研究。蛋白-蛋白相互作用图谱证实了DHFR在叶酸代谢中的核心作用。使用人工智能驱动的碎片化方法,生成了20种MTX类似物,其中衍生品8 (Replace)是最有希望的。对接模拟显示,与MTX(-8.6 kcal/mol)相比,其结合亲和力为-10.6 kcal/mol,详细的相互作用分析显示,DHFR活性位点内存在广泛的氢键、静电稳定和疏水堆积。分子动力学(500 ns)证实了DHFR-Derivative 8配合物的稳定性,主链RMSD稳定在0.15-0.20 nm,旋转半径平均为1.61 nm, SASA一致(~ 970-980 Ų),整个氢键稳定。自由能计算进一步支持结合,MM-GBSA的有利值ΔG_total为-45.54 kcal/mol。药效团表征显示衍生物8具有增强的特征(3个芳香环,7个受体,5个疏水中心),而DFT分析显示HOMO-LUMO间隙更窄,表明更高的电子反应性。ADMET分析预测,与MTX (LD50 ≈3 mg/kg, I类)相比,衍生物8的毒性降低(LD50 ≈135 mg/kg, III类)。综上所述,这些结果表明衍生物8是一种比MTX更有效和更安全的DHFR抑制剂。局限性包括依赖于没有实验验证的计算机方法;因此,未来的发展方向包括合成、体外试验和生殖毒性评价。
{"title":"AI-fragmented derivatives of methotrexate to design effective and safer DHFR inhibitor: A computational breakthrough for ectopic pregnancy therapy","authors":"Deniz Inan ,&nbsp;Sinan Karageçili ,&nbsp;Nouman Ali ,&nbsp;Adeeba Ali","doi":"10.1016/j.compbiolchem.2026.108906","DOIUrl":"10.1016/j.compbiolchem.2026.108906","url":null,"abstract":"<div><div>Ectopic pregnancy (EP) remains a major cause of maternal morbidity and mortality during the first trimester, affecting approximately 1–3 % of all pregnancies and often necessitating invasive surgery when medical therapy fails. Methotrexate (MTX), a dihydrofolate reductase (DHFR) inhibitor, is the current gold standard for medical management, yet its limitations include treatment failure rates of 10–20 % and significant systemic toxicity. This study aimed to design and evaluate novel AI-fragmented MTX derivatives as improved DHFR inhibitors for EP. The three-dimensional structure of DHFR was retrieved from the AlphaFold Protein Structure Database and validated with a pLDDT score of 96.15, a Ramachandran plot showing 93.1 % residues in most favored regions, and an ERRAT quality factor of 98.85, confirming its suitability for computational studies. Protein–protein interaction mapping confirmed DHFR’s central role in folate metabolism. Using an AI-driven fragmentation approach, 20 MTX analogues were generated, of which Derivative 8 (Replace) emerged as the most promising. Docking simulations demonstrated a binding affinity of –10.6 kcal/mol compared to MTX (–8.6 kcal/mol), with detailed interaction analysis revealing extensive hydrogen bonding, electrostatic stabilization, and hydrophobic packing within the DHFR active site. Molecular dynamics (500 ns) confirmed the stability of the DHFR–Derivative 8 complex, with backbone RMSD stabilizing at 0.15–0.20 nm, radius of gyration averaging 1.61 nm, consistent SASA (∼970–980 Ų), and stable hydrogen bonding throughout. Free energy calculations further supported binding, with MM-GBSA yielding a favorable ΔG_total of –45.54 kcal/mol. Pharmacophore characterization showed Derivative 8 possessed enhanced features (3 aromatic rings, 7 acceptors, 5 hydrophobic centers), while DFT analysis revealed a narrower HOMO–LUMO gap, indicating higher electronic reactivity. ADMET analysis predicted reduced toxicity for Derivative 8 (LD50 ≈ 135 mg/kg, class III) compared to MTX (LD50 ≈ 3 mg/kg, class I). Collectively, these results suggest that Derivative 8 is a more potent and safer DHFR inhibitor than MTX. Limitations include reliance on <em>in-silico</em> methods without experimental validation; thus, future directions involve synthesis, <em>in vitro</em> assays, and reproductive toxicity evaluation.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108906"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145974169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unraveling novel transposable elements (TEs)-driven gene dysregulation in non-small cell lung cancer (NSCLC) by integrated transcriptomic and TEs analysis 通过整合转录组学和TEs分析揭示非小细胞肺癌(NSCLC)中新型转座因子(TEs)驱动的基因失调
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-02-09 DOI: 10.1016/j.compbiolchem.2026.108942
Sahadevan Shrinidhi , R. Sagaya Jansi , Ameer Khusro
Transposable Elements (TEs) represent a class of mobile genomic sequences, which may seriously disrupt gene regulation and can contribute to tumorigenesis. Yet, their role in NSCLC has remained unexplored to a great degree. Therefore, an integrated transcriptomic and Transposable Element (TE) analysis was performed to investigate TE-driven gene dysregulation in NSCLC. Hierarchical clustering of differentially expressed TE revealed significant over-representation of LTR1A1 and HERVL18-int in the cancer samples, with notably high expression of LINE and ERV members, especially HERVL-int, L1MC5, and L1M5. The intersection of TE expression with differentially expressed genes revealed several TE-associated genes involved in cell cycle regulation, genomic stability, and tumor progression. Fusion transcript analysis highlighted unique cancer-specific events, offering insights into TE-mediated transcriptomic alterations. Molecular docking of TE-associated proteins, HMMR, and PBK suggested potential interactions that may influence oncogenic pathways. Collectively, our findings uncover novel TE-driven mechanisms of gene dysregulation in NSCLC and highlight specific TEs and associated genes as potential diagnostic markers and therapeutic targets, offering a framework for future experimental studies to explore their mechanistic and clinical significance.
转座因子(te)是一类可移动的基因组序列,它可能严重破坏基因调控,并可能促进肿瘤的发生。然而,它们在NSCLC中的作用在很大程度上仍未被探索。因此,我们进行了转录组学和转座因子(TE)综合分析,以研究TE驱动的非小细胞肺癌基因失调。差异表达TE的分层聚类结果显示,LTR1A1和HERVL18-int在癌症样本中的过度表达,LINE和ERV成员的高表达,尤其是herv1 -int、L1MC5和L1M5。TE表达与差异表达基因的交集揭示了一些TE相关基因参与细胞周期调节、基因组稳定性和肿瘤进展。融合转录分析突出了独特的癌症特异性事件,为te介导的转录组改变提供了见解。te相关蛋白、HMMR和PBK的分子对接提示可能影响致癌途径的潜在相互作用。总的来说,我们的研究结果揭示了te驱动的NSCLC基因失调的新机制,并强调了特异性te和相关基因作为潜在的诊断标记和治疗靶点,为未来的实验研究提供了框架,以探索其机制和临床意义。
{"title":"Unraveling novel transposable elements (TEs)-driven gene dysregulation in non-small cell lung cancer (NSCLC) by integrated transcriptomic and TEs analysis","authors":"Sahadevan Shrinidhi ,&nbsp;R. Sagaya Jansi ,&nbsp;Ameer Khusro","doi":"10.1016/j.compbiolchem.2026.108942","DOIUrl":"10.1016/j.compbiolchem.2026.108942","url":null,"abstract":"<div><div>Transposable Elements (TEs) represent a class of mobile genomic sequences, which may seriously disrupt gene regulation and can contribute to tumorigenesis. Yet, their role in NSCLC has remained unexplored to a great degree. Therefore, an integrated transcriptomic and Transposable Element (TE) analysis was performed to investigate TE-driven gene dysregulation in NSCLC. Hierarchical clustering of differentially expressed TE revealed significant over-representation of LTR1A1 and HERVL18-int in the cancer samples, with notably high expression of LINE and ERV members, especially HERVL-int, L1MC5, and L1M5. The intersection of TE expression with differentially expressed genes revealed several TE-associated genes involved in cell cycle regulation, genomic stability, and tumor progression. Fusion transcript analysis highlighted unique cancer-specific events, offering insights into TE-mediated transcriptomic alterations. Molecular docking of TE-associated proteins, HMMR, and PBK suggested potential interactions that may influence oncogenic pathways. Collectively, our findings uncover novel TE-driven mechanisms of gene dysregulation in NSCLC and highlight specific TEs and associated genes as potential diagnostic markers and therapeutic targets, offering a framework for future experimental studies to explore their mechanistic and clinical significance.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108942"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ningxue shengban decoction regulates T-cell immune balance in immune thrombocytopenia via the bone marrow hematopoietic microenvironment 宁血生板汤通过骨髓造血微环境调节免疫性血小板减少患者t细胞免疫平衡。
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-29 DOI: 10.1016/j.compbiolchem.2026.108925
Wuxia Yang , Huiying Kang , Yang Liu , Zhen Wang , Yanqi Song , Baoshan Liu , Aidi Wang

Background

Immune thrombocytopenia (ITP) is characterized by increased platelet clearance and decreased platelet production. Hematopoietic stem and progenitor cell (HSPCs) abnormalities are a key mechanism of ITP, contributing to megakaryocyte defects and aberrant lymphocyte differentiation. Ningxue Shengban Decoction (NXSBD) has proven to be an effective therapeutic option for ITP.

Aim

This study aimed to explore the therapeutic mechanism of NXSBD in ITP.

Method

First, we evaluated the therapeutic effect of NXSBD on thrombocytopenia. T-lymphocyte subsets were analyzed by flow cytometry, megakaryocyte features were examined by HE staining, serum cytokines were quantified by ELISA, and hepatic/renal safety indices were measured by MS. Following the identification of core targets and pathways via network pharmacology, their expression in bone marrow cells was confirmed by single-cell RNA sequencing analysis. Pseudo-temporal analysis then tracked the dynamics of these targets during HSC lineage commitment, and molecular docking finally confirmed strong binding affinities with NXSBD's constituents.

Results

NXSBD significantly ameliorated thrombocytopenia in ITP mice by rebalancing T-cell subsets and modulating key inflammatory cytokines, while also promoting the generation of thrombocytogenic megakaryocytes in the bone marrow. The core targets of NXSBD (RELA, IKBKB, AKT1, TP53, MAPK1, JUN, and FOS) were found to be present in hematopoietic stem cells (HSCs) and were involved in HSC differentiation into megakaryocytes and T lymphocytes. Molecular docking confirmed strong binding affinities between NXSBD constituents and these core targets. In conclusion, our findings demonstrate that NXSBD alleviates ITP through a multi-target mechanism that may participate in megakaryocyte production from HSCs and contribute to the restoration of peripheral T-cell homeostasis.
背景:免疫性血小板减少症(ITP)的特征是血小板清除率增加和血小板生成减少。造血干细胞和祖细胞(HSPCs)异常是ITP的关键机制,导致巨核细胞缺陷和淋巴细胞分化异常。宁血生板汤(NXSBD)是治疗ITP的有效选择。目的:探讨NXSBD治疗ITP的作用机制。方法:首先,评价NXSBD治疗血小板减少症的疗效。流式细胞术检测t淋巴细胞亚群,HE染色检测巨核细胞特征,ELISA检测血清细胞因子,ms检测肝/肾安全指标。通过网络药理学鉴定核心靶点和通路,通过单细胞RNA测序分析确认其在骨髓细胞中的表达。伪时间分析随后跟踪了这些靶点在HSC谱系承诺过程中的动态,分子对接最终证实了与NXSBD成分的强结合亲和力。结果:NXSBD通过平衡t细胞亚群和调节关键炎症细胞因子,显著改善ITP小鼠的血小板减少症,同时促进骨髓中血小板生成巨核细胞的产生。NXSBD的核心靶点(RELA、IKBKB、AKT1、TP53、MAPK1、JUN和FOS)存在于造血干细胞(HSC)中,并参与HSC向巨核细胞和T淋巴细胞的分化。分子对接证实了NXSBD成分与这些核心靶点之间的强结合亲和力。总之,我们的研究结果表明,NXSBD通过多靶点机制缓解ITP,该机制可能参与hsc巨核细胞的产生,并有助于恢复外周t细胞的稳态。
{"title":"Ningxue shengban decoction regulates T-cell immune balance in immune thrombocytopenia via the bone marrow hematopoietic microenvironment","authors":"Wuxia Yang ,&nbsp;Huiying Kang ,&nbsp;Yang Liu ,&nbsp;Zhen Wang ,&nbsp;Yanqi Song ,&nbsp;Baoshan Liu ,&nbsp;Aidi Wang","doi":"10.1016/j.compbiolchem.2026.108925","DOIUrl":"10.1016/j.compbiolchem.2026.108925","url":null,"abstract":"<div><h3>Background</h3><div>Immune thrombocytopenia (ITP) is characterized by increased platelet clearance and decreased platelet production. Hematopoietic stem and progenitor cell (HSPCs) abnormalities are a key mechanism of ITP, contributing to megakaryocyte defects and aberrant lymphocyte differentiation. Ningxue Shengban Decoction (NXSBD) has proven to be an effective therapeutic option for ITP.</div></div><div><h3>Aim</h3><div>This study aimed to explore the therapeutic mechanism of NXSBD in ITP.</div></div><div><h3>Method</h3><div>First, we evaluated the therapeutic effect of NXSBD on thrombocytopenia. T-lymphocyte subsets were analyzed by flow cytometry, megakaryocyte features were examined by HE staining, serum cytokines were quantified by ELISA, and hepatic/renal safety indices were measured by MS. Following the identification of core targets and pathways via network pharmacology, their expression in bone marrow cells was confirmed by single-cell RNA sequencing analysis. Pseudo-temporal analysis then tracked the dynamics of these targets during HSC lineage commitment, and molecular docking finally confirmed strong binding affinities with NXSBD's constituents.</div></div><div><h3>Results</h3><div>NXSBD significantly ameliorated thrombocytopenia in ITP mice by rebalancing T-cell subsets and modulating key inflammatory cytokines, while also promoting the generation of thrombocytogenic megakaryocytes in the bone marrow. The core targets of NXSBD (RELA, IKBKB, AKT1, TP53, MAPK1, JUN, and FOS) were found to be present in hematopoietic stem cells (HSCs) and were involved in HSC differentiation into megakaryocytes and T lymphocytes. Molecular docking confirmed strong binding affinities between NXSBD constituents and these core targets. In conclusion, our findings demonstrate that NXSBD alleviates ITP through a multi-target mechanism that may participate in megakaryocyte production from HSCs and contribute to the restoration of peripheral T-cell homeostasis.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108925"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146121397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic cell type identification methods for single-cell RNA sequencing based on coordinate convolutional neural network 基于坐标卷积神经网络的单细胞RNA测序细胞类型自动鉴定方法
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-22 DOI: 10.1016/j.compbiolchem.2026.108923
Shuang Xu , Wen Yan , Renchu Guan , Yu Jiang
Cell type identification is a fundamental step in the analysis of single-cell RNA sequencing (scRNA-seq) data. Among the various classification tools, support vector machine (SVM) classifiers have traditionally demonstrated strong overall performance. However, the rapid accumulation of scRNA-seq data has led to a significant increase in SVM training time. Convolutional neural networks (CNNs), known for their success in image recognition and their ability to handle high-dimensional data, offer a promising alternative. Nevertheless, the inherent translation invariance of CNNs proves counterproductive in the context of scRNA-seq data, often resulting in misclassification of cell types. To address this limitation, we propose a novel cell type identification method, termed BP-Coord, which incorporates coordinate information as additional channels to enhance the spatial awareness of the model. Furthermore, a bicubic interpolation upsampling layer is introduced prior to the CoordConv layers, enabling the CNN to capture more precise positional information and better adapt to translation variations in the data. Experimental results on five public scRNA-seq benchmark datasets demonstrate that the proposed BP-Coord model consistently outperforms state-of-the-art methods, including SVM-based classifiers and recent deep learning approaches such as SuperCT and scGAC. In particular, BP-Coord achieves accuracy improvements of up to 3.5 % over the best competing methods on large-scale PBMC datasets and shows superior robustness on imbalanced and small-sample datasets. These results highlight the effectiveness of incorporating explicit positional encoding into convolutional architectures for automatic cell type identification.
细胞类型鉴定是单细胞RNA测序(scRNA-seq)数据分析的基本步骤。在各种分类工具中,支持向量机(SVM)分类器传统上表现出较强的综合性能。然而,scRNA-seq数据的快速积累导致SVM的训练时间显著增加。卷积神经网络(cnn)以其在图像识别方面的成功和处理高维数据的能力而闻名,提供了一个有希望的替代方案。然而,cnn固有的翻译不变性在scRNA-seq数据的背景下被证明是适得其反的,经常导致细胞类型的错误分类。为了解决这一限制,我们提出了一种新的细胞类型识别方法,称为BP-Coord,它将坐标信息作为额外的通道来增强模型的空间意识。此外,在CoordConv层之前引入了双三次插值上采样层,使CNN能够捕获更精确的位置信息,更好地适应数据的平移变化。在5个公开的scRNA-seq基准数据集上的实验结果表明,BP-Coord模型始终优于最先进的方法,包括基于svm的分类器和最近的深度学习方法,如SuperCT和scGAC。特别是,BP-Coord在大规模PBMC数据集上的准确率比最佳竞争方法提高了3.5 %,并且在不平衡和小样本数据集上表现出优越的鲁棒性。这些结果强调了将显式位置编码纳入自动细胞类型识别的卷积架构的有效性。
{"title":"Automatic cell type identification methods for single-cell RNA sequencing based on coordinate convolutional neural network","authors":"Shuang Xu ,&nbsp;Wen Yan ,&nbsp;Renchu Guan ,&nbsp;Yu Jiang","doi":"10.1016/j.compbiolchem.2026.108923","DOIUrl":"10.1016/j.compbiolchem.2026.108923","url":null,"abstract":"<div><div>Cell type identification is a fundamental step in the analysis of single-cell RNA sequencing (scRNA-seq) data. Among the various classification tools, support vector machine (SVM) classifiers have traditionally demonstrated strong overall performance. However, the rapid accumulation of scRNA-seq data has led to a significant increase in SVM training time. Convolutional neural networks (CNNs), known for their success in image recognition and their ability to handle high-dimensional data, offer a promising alternative. Nevertheless, the inherent translation invariance of CNNs proves counterproductive in the context of scRNA-seq data, often resulting in misclassification of cell types. To address this limitation, we propose a novel cell type identification method, termed BP-Coord, which incorporates coordinate information as additional channels to enhance the spatial awareness of the model. Furthermore, a bicubic interpolation upsampling layer is introduced prior to the CoordConv layers, enabling the CNN to capture more precise positional information and better adapt to translation variations in the data. Experimental results on five public scRNA-seq benchmark datasets demonstrate that the proposed BP-Coord model consistently outperforms state-of-the-art methods, including SVM-based classifiers and recent deep learning approaches such as SuperCT and scGAC. In particular, BP-Coord achieves accuracy improvements of up to 3.5 % over the best competing methods on large-scale PBMC datasets and shows superior robustness on imbalanced and small-sample datasets. These results highlight the effectiveness of incorporating explicit positional encoding into convolutional architectures for automatic cell type identification.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108923"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146035279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel multi-objective optimization framework using NSGA-II for gene co-expression network inference 基于NSGA-II基因共表达网络推理的多目标优化框架
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-02-10 DOI: 10.1016/j.compbiolchem.2026.108944
Behnam Aghajan , Mohammad Reza Ghaemi , Ali M. Mosammam , Emran Heshmati , Khosrow Khalifeh
Gene co-expression networks (GCNs) provide a powerful framework for uncovering functional gene modules and biological pathways from complex transcriptomic data. However, constructing reliable GCNs from noisy datasets often yields spurious edges and biologically implausible topologies. To address this challenge, we propose a novel multi-objective optimization approach based on the Non-dominated Sorting Genetic Algorithm II (NSGA-II) to refine edge selection in GCNs. Our pipeline integrates Variance Stabilizing Transformation (VST) for RNA-seq normalization, Spearman rank correlation for robust co-expression estimation, permutation testing to establish an initial significance threshold, and bootstrap resampling to assess edge stability. We applied this framework to two heterogeneous datasets including GSE10245 (microarray, n = 58) and GSE102349 (RNA-seq, n = 113), to optimize multiple network properties simultaneously; including sparsity, modularity, scale-free topology, and edge reproducibility. Comparative analyses against conventional widely used methods; Weighted Gene Co-expression Network Analysis (WGCNA) and the Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNE), demonstrate that our approach consistently yields sparser, more modular networks that better conform to biologically expected scale-free architectures across both data types. This adaptive, optimization-driven strategy offers a robust foundation for integrative genomic studies and holds significant potential for advancing biomarker discovery and disease mechanism modeling.
基因共表达网络(GCNs)为从复杂的转录组学数据中揭示功能基因模块和生物学途径提供了一个强大的框架。然而,从有噪声的数据集构建可靠的GCNs通常会产生虚假的边缘和生物学上不可信的拓扑。为了解决这一挑战,我们提出了一种新的基于非支配排序遗传算法II (NSGA-II)的多目标优化方法来优化GCNs中的边缘选择。我们的流水线集成了方差稳定变换(VST)用于RNA-seq归一化,Spearman秩相关用于鲁棒共表达估计,排列检验用于建立初始显著性阈值,以及自举重采样用于评估边缘稳定性。我们将该框架应用于两个异构数据集GSE10245(微阵列,n = 58)和GSE102349 (RNA-seq, n = 113),以同时优化多个网络特性;包括稀疏性、模块化、无标度拓扑和边缘再现性。与常用常规方法的比较分析;加权基因共表达网络分析(WGCNA)和精确细胞网络重建算法(ARACNE)表明,我们的方法始终产生更稀疏、更模块化的网络,更好地符合两种数据类型的生物学预期的无标度架构。这种自适应、优化驱动的策略为整合基因组研究提供了坚实的基础,并具有推进生物标志物发现和疾病机制建模的巨大潜力。
{"title":"A novel multi-objective optimization framework using NSGA-II for gene co-expression network inference","authors":"Behnam Aghajan ,&nbsp;Mohammad Reza Ghaemi ,&nbsp;Ali M. Mosammam ,&nbsp;Emran Heshmati ,&nbsp;Khosrow Khalifeh","doi":"10.1016/j.compbiolchem.2026.108944","DOIUrl":"10.1016/j.compbiolchem.2026.108944","url":null,"abstract":"<div><div>Gene co-expression networks (GCNs) provide a powerful framework for uncovering functional gene modules and biological pathways from complex transcriptomic data. However, constructing reliable GCNs from noisy datasets often yields spurious edges and biologically implausible topologies. To address this challenge, we propose a novel multi-objective optimization approach based on the Non-dominated Sorting Genetic Algorithm II (NSGA-II) to refine edge selection in GCNs. Our pipeline integrates Variance Stabilizing Transformation (VST) for RNA-seq normalization, Spearman rank correlation for robust co-expression estimation, permutation testing to establish an initial significance threshold, and bootstrap resampling to assess edge stability. We applied this framework to two heterogeneous datasets including GSE10245 (microarray, n = 58) and GSE102349 (RNA-seq, n = 113), to optimize multiple network properties simultaneously; including sparsity, modularity, scale-free topology, and edge reproducibility. Comparative analyses against conventional widely used methods; Weighted Gene Co-expression Network Analysis (WGCNA) and the Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNE), demonstrate that our approach consistently yields sparser, more modular networks that better conform to biologically expected scale-free architectures across both data types. This adaptive, optimization-driven strategy offers a robust foundation for integrative genomic studies and holds significant potential for advancing biomarker discovery and disease mechanism modeling.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108944"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tailored pyrrole-based imidazothiazole scaffolds: Synthetic elaboration, enzyme kinetic profiling and DFT-guided molecular docking toward Antidiabetic therapeutics 定制吡咯基咪唑噻唑支架:合成细化,酶动力学分析和dft引导的抗糖尿病治疗分子对接
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-21 DOI: 10.1016/j.compbiolchem.2026.108920
Zanib Fiaz , Shoaib Khan , Tayyiaba Iqbal , Wajid Rehman , Anila Fareed , Rafaqat Hussain , Kayumov Khasan Yusuf Ogli , Merajuddin Khan , Mohammed Rafi Shaik
The current research study highlights the successful biological evaluation of novel imidazo-thiadiazole based pyrrole derivatives, with the aim of targeting diabetes mellitus through alpha-amylase and alpha-glucosidase inhibition. These compounds exhibited promising anti-diabetic activity, notably compound 8 emerged as a leading candidate (3.50 ± 0.20, and 4.10 ± 0.10 µM) which outperformed the potential of acarbose (6.20 ± 0.10 and 6.70 ± 0.20 µM), a reference drug. The enhanced biological potential of compound 8 is likely due to incorporation of hydroxyl substituents, which may strengthen its binding affinity and selectivity towards the targeted enzymes. Molecular docking revealed stable interactions with key amino acids residues of targeted enzymes, providing mechanistic basis for its potent inhibitory activity. To further established their therapeutic relevance, enzyme kinetic study was conducted which confirmed their mode of inhibition while ADMET analysis indicated favorable pharmacokinetics and safety profiles. Moreover, pharmacophore modeling and molecular dynamics simulations reinforced the stability and binding efficiency of lead compounds under dynamic biological conditions. All the experimental results and in silico validations demonstrate that potent compounds possess significant anti-diabetic activity profile. Their ability to outperform an existing diabetes mellitus inhibitor and maintaining a favorable safety profile suggest that these compounds have potential to be further used in drug development and optimization against Diabetes Mellitus.
目前的研究重点是新型咪唑-噻二唑吡咯衍生物的成功生物学评价,目的是通过抑制α -淀粉酶和α -葡萄糖苷酶靶向糖尿病。这些化合物显示出良好的抗糖尿病活性,特别是化合物8成为主要候选药物(3.50±0.20和4.10±0.10 µM),优于阿卡波糖(6.20 ± 0.10和6.70 ± 0.20 µM)作为参比药物。化合物8的生物潜力增强可能是由于羟基取代基的掺入,这可能增强了它对目标酶的结合亲和力和选择性。分子对接揭示了其与目标酶关键氨基酸残基的稳定相互作用,为其有效的抑制活性提供了机制基础。为了进一步确定它们的治疗相关性,进行了酶动力学研究,证实了它们的抑制模式,而ADMET分析显示了良好的药代动力学和安全性。此外,药效团建模和分子动力学模拟增强了先导化合物在动态生物条件下的稳定性和结合效率。所有的实验结果和硅验证表明,有效的化合物具有显著的抗糖尿病活性谱。它们的性能优于现有的糖尿病抑制剂,并保持良好的安全性,这表明这些化合物具有进一步用于糖尿病药物开发和优化的潜力。
{"title":"Tailored pyrrole-based imidazothiazole scaffolds: Synthetic elaboration, enzyme kinetic profiling and DFT-guided molecular docking toward Antidiabetic therapeutics","authors":"Zanib Fiaz ,&nbsp;Shoaib Khan ,&nbsp;Tayyiaba Iqbal ,&nbsp;Wajid Rehman ,&nbsp;Anila Fareed ,&nbsp;Rafaqat Hussain ,&nbsp;Kayumov Khasan Yusuf Ogli ,&nbsp;Merajuddin Khan ,&nbsp;Mohammed Rafi Shaik","doi":"10.1016/j.compbiolchem.2026.108920","DOIUrl":"10.1016/j.compbiolchem.2026.108920","url":null,"abstract":"<div><div>The current research study highlights the successful biological evaluation of novel imidazo-thiadiazole based pyrrole derivatives, with the aim of targeting diabetes mellitus through alpha-amylase and alpha-glucosidase inhibition. These compounds exhibited promising anti-diabetic activity, notably compound 8 emerged as a leading candidate (3.50 <strong>±</strong> 0.20, and 4.10 <strong>±</strong> 0.10 µM) which outperformed the potential of acarbose (6.20 ± 0.10 and 6.70 ± 0.20 µM), a reference drug. The enhanced biological potential of compound 8 is likely due to incorporation of hydroxyl substituents, which may strengthen its binding affinity and selectivity towards the targeted enzymes. Molecular docking revealed stable interactions with key amino acids residues of targeted enzymes, providing mechanistic basis for its potent inhibitory activity. To further established their therapeutic relevance, enzyme kinetic study was conducted which confirmed their mode of inhibition while ADMET analysis indicated favorable pharmacokinetics and safety profiles. Moreover, pharmacophore modeling and molecular dynamics simulations reinforced the stability and binding efficiency of lead compounds under dynamic biological conditions. All the experimental results and in silico validations demonstrate that potent compounds possess significant anti-diabetic activity profile. Their ability to outperform an existing diabetes mellitus inhibitor and maintaining a favorable safety profile suggest that these compounds have potential to be further used in drug development and optimization against Diabetes Mellitus.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108920"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146034881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of artificial intelligence in sarcopenia: Advances, applications, and future directions 人工智能在肌肉减少症中的作用:进展、应用和未来方向
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-27 DOI: 10.1016/j.compbiolchem.2026.108930
Muhammad Waleed Yousaf , Ahmad Hassan Nadeem , M. Faisal Nadeem , Rizwan Qaisar
Sarcopenia, the gradual loss of skeletal muscle mass, strength, and function, is a growing concern in aging populations. Early detection is vital to reduce the risk of frailty, disability, and mortality, yet traditional diagnostic methods such as imaging and physical performance tests are often costly, inconsistent, or difficult to implement in routine care. Artificial intelligence (AI), including machine learning (ML) and deep learning (DL), is emerging as a powerful tool in sarcopenia research and clinical practice. This review explores how AI is being applied to early detection, imaging-based diagnosis, prediction of functional outcomes, and personalized monitoring. Models trained on large datasets, such as NHANES, have demonstrated strong predictive performance using standard clinical variables. DL has enabled automated analysis of CT scans for muscle segmentation, reducing the need for manual interpretation. At the same time, ML systems integrated with wearable devices allow real-time tracking of physical function. Emerging approaches such as explainable AI, federated learning, and the integration of diverse data sources, including omics and microbiome profiles, are expanding opportunities for individualized care. Despite these advances, significant challenges remain, including variability in data quality, limited model transparency, algorithmic bias, and ethical concerns. Regulatory oversight and clinician engagement will be key to successful implementation. AI offers a promising path toward proactive, scalable, and personalized management of sarcopenia.
骨骼肌减少症,骨骼肌质量、力量和功能的逐渐丧失,是老年人日益关注的问题。早期发现对于降低虚弱、残疾和死亡的风险至关重要,但传统的诊断方法,如成像和身体表现测试,往往成本高昂,不一致,或难以在常规护理中实施。人工智能(AI),包括机器学习(ML)和深度学习(DL),正在成为肌肉减少症研究和临床实践的有力工具。这篇综述探讨了人工智能如何应用于早期检测、基于成像的诊断、功能结果预测和个性化监测。在NHANES等大型数据集上训练的模型,在使用标准临床变量时表现出了很强的预测性能。DL实现了对肌肉分割的CT扫描的自动分析,减少了人工解释的需要。与此同时,与可穿戴设备集成的机器学习系统可以实时跟踪身体功能。可解释的人工智能、联合学习以及包括组学和微生物组资料在内的多种数据源的整合等新兴方法正在扩大个性化护理的机会。尽管取得了这些进步,但仍然存在重大挑战,包括数据质量的可变性、有限的模型透明度、算法偏见和伦理问题。监管监督和临床医生参与将是成功实施的关键。人工智能为肌肉减少症的主动、可扩展和个性化管理提供了一条有前途的道路。
{"title":"The role of artificial intelligence in sarcopenia: Advances, applications, and future directions","authors":"Muhammad Waleed Yousaf ,&nbsp;Ahmad Hassan Nadeem ,&nbsp;M. Faisal Nadeem ,&nbsp;Rizwan Qaisar","doi":"10.1016/j.compbiolchem.2026.108930","DOIUrl":"10.1016/j.compbiolchem.2026.108930","url":null,"abstract":"<div><div>Sarcopenia, the gradual loss of skeletal muscle mass, strength, and function, is a growing concern in aging populations. Early detection is vital to reduce the risk of frailty, disability, and mortality, yet traditional diagnostic methods such as imaging and physical performance tests are often costly, inconsistent, or difficult to implement in routine care. Artificial intelligence (AI), including machine learning (ML) and deep learning (DL), is emerging as a powerful tool in sarcopenia research and clinical practice. This review explores how AI is being applied to early detection, imaging-based diagnosis, prediction of functional outcomes, and personalized monitoring. Models trained on large datasets, such as NHANES, have demonstrated strong predictive performance using standard clinical variables. DL has enabled automated analysis of CT scans for muscle segmentation, reducing the need for manual interpretation. At the same time, ML systems integrated with wearable devices allow real-time tracking of physical function. Emerging approaches such as explainable AI, federated learning, and the integration of diverse data sources, including omics and microbiome profiles, are expanding opportunities for individualized care. Despite these advances, significant challenges remain, including variability in data quality, limited model transparency, algorithmic bias, and ethical concerns. Regulatory oversight and clinician engagement will be key to successful implementation. AI offers a promising path toward proactive, scalable, and personalized management of sarcopenia.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108930"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146074078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multilayered integrated analysis of insomnia-related genes ATG7 and JAK2 in the autophagy-inflammation mechanism and clinical implications in major depressive disorder 失眠相关基因ATG7和JAK2在重度抑郁症自噬-炎症机制中的多层整合分析及其临床意义
IF 3.1 4区 生物学 Q2 BIOLOGY Pub Date : 2026-06-01 Epub Date: 2026-01-29 DOI: 10.1016/j.compbiolchem.2026.108931
Jinxiang Yu , Xiaochuan Feng , Haikun Zhang , Le Cao , Lifeng Jia , Pengcheng Ma , Nianliang Zhang , Tao Zhao

Objective

Insomnia is widely recognized as a key risk factor for major depressive disorder (MDD). However, the potential molecular mechanisms and the underlying interactions among them remain to be elucidate.

Methods

In NHANES, an insomnia-like high-risk sleep phenotype was linked to an increased risk of MDD, and Mendelian randomization (MR) provided evidence for a potential causal effect of the genetic liability to insomnia. Core genes were identified via PPI network construction and machine learning analyses, integrated with the GEO database and insomnia-related genes. Subsequently, the clinical relevance of these genes was validated, and the performance of the model was evaluated using external datasets. A functional enrichment analysis of the core genes was conducted to explore the related biological pathways. The impacts of core genes on the immune microenvironment were explored via immune infiltration and cell-cell interaction analyses. Moreover, promising candidate therapeutic compounds were identified through drug enrichment and molecular interaction analyses.

Results

An insomnia-like high-risk sleep phenotype showed a consistent association with an increased MDD risk, and MR suggested a potential causal connection. The diagnostic model, constructed using the core genes ATG7 and JAK2, demonstrated good predictive abilities and showed potential for clinical application. Enrichment analyses highlighted autophagy- and inflammation-related pathways, accompanied by altered immune-cell signatures, supporting the involvement of an autophagy–inflammation axis in MDD. Molecular docking and dynamics indicated that emodin could interact stably with JAK2, which warrants experimental validation.

Conclusion

ATG7 and JAK2 are insomnia-associated genes linked to autophagy- and inflammation-related pathways in MDD. The ATG7/JAK2-based diagnostic model and the in silico–prioritized compound emodin provide testable hypotheses for future mechanistic investigations and translational exploration, pending experimental validation.
目的:失眠被广泛认为是重度抑郁障碍(MDD)的关键危险因素。然而,潜在的分子机制和它们之间潜在的相互作用仍有待阐明。方法:在NHANES中,失眠样高风险睡眠表型与MDD风险增加有关,孟德尔随机化(MR)为失眠遗传倾向的潜在因果效应提供了证据。通过PPI网络构建和机器学习分析,结合GEO数据库和失眠相关基因,鉴定出核心基因。随后,验证这些基因的临床相关性,并使用外部数据集评估模型的性能。对核心基因进行功能富集分析,探索相关生物学途径。通过免疫浸润和细胞-细胞相互作用分析,探讨核心基因对免疫微环境的影响。此外,通过药物富集和分子相互作用分析,发现了有希望的候选治疗化合物。结果:失眠样高危睡眠表型与重度抑郁症风险增加一致,MR提示潜在的因果关系。利用核心基因ATG7和JAK2构建的诊断模型具有良好的预测能力,具有临床应用潜力。富集分析强调了自噬和炎症相关的途径,伴随着免疫细胞特征的改变,支持自噬-炎症轴参与MDD。分子对接和动力学分析表明,大黄素与JAK2具有稳定的相互作用,值得实验验证。结论:ATG7和JAK2是失眠相关基因,与MDD的自噬和炎症相关途径相关。基于ATG7/ jak2的诊断模型和硅优先化合物大黄素为未来的机制研究和转化探索提供了可测试的假设,有待实验验证。
{"title":"A multilayered integrated analysis of insomnia-related genes ATG7 and JAK2 in the autophagy-inflammation mechanism and clinical implications in major depressive disorder","authors":"Jinxiang Yu ,&nbsp;Xiaochuan Feng ,&nbsp;Haikun Zhang ,&nbsp;Le Cao ,&nbsp;Lifeng Jia ,&nbsp;Pengcheng Ma ,&nbsp;Nianliang Zhang ,&nbsp;Tao Zhao","doi":"10.1016/j.compbiolchem.2026.108931","DOIUrl":"10.1016/j.compbiolchem.2026.108931","url":null,"abstract":"<div><h3>Objective</h3><div>Insomnia is widely recognized as a key risk factor for major depressive disorder (MDD). However, the potential molecular mechanisms and the underlying interactions among them remain to be elucidate.</div></div><div><h3>Methods</h3><div>In NHANES, an insomnia-like high-risk sleep phenotype was linked to an increased risk of MDD, and Mendelian randomization (MR) provided evidence for a potential causal effect of the genetic liability to insomnia. Core genes were identified via PPI network construction and machine learning analyses, integrated with the GEO database and insomnia-related genes. Subsequently, the clinical relevance of these genes was validated, and the performance of the model was evaluated using external datasets. A functional enrichment analysis of the core genes was conducted to explore the related biological pathways. The impacts of core genes on the immune microenvironment were explored via immune infiltration and cell-cell interaction analyses. Moreover, promising candidate therapeutic compounds were identified through drug enrichment and molecular interaction analyses.</div></div><div><h3>Results</h3><div>An insomnia-like high-risk sleep phenotype showed a consistent association with an increased MDD risk, and MR suggested a potential causal connection. The diagnostic model, constructed using the core genes ATG7 and JAK2, demonstrated good predictive abilities and showed potential for clinical application. Enrichment analyses highlighted autophagy- and inflammation-related pathways, accompanied by altered immune-cell signatures, supporting the involvement of an autophagy–inflammation axis in MDD. Molecular docking and dynamics indicated that emodin could interact stably with JAK2, which warrants experimental validation.</div></div><div><h3>Conclusion</h3><div>ATG7 and JAK2 are insomnia-associated genes linked to autophagy- and inflammation-related pathways in MDD. The ATG7/JAK2-based diagnostic model and the in silico–prioritized compound emodin provide testable hypotheses for future mechanistic investigations and translational exploration, pending experimental validation.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"122 ","pages":"Article 108931"},"PeriodicalIF":3.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146115053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Biology and Chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1