Pub Date : 2026-01-23eCollection Date: 2025-01-01DOI: 10.3389/fgene.2025.1707053
Wei Zheng, Xinli Xing, Xuejing Sun, Na Wei
Polycystin-1 (PC1), encoded by the PKD1 gene, forms a complex with polycystin-2 (PKD2; 173910) that regulates multiple signaling pathways to maintain normal renal tubular structure and function. Mutations in the PKD1 gene are the primary cause of type 1 PKD (polycystic kidney disease), accounting for 78%-85% of all PKD cases. In this study, we report a case of a boy presenting with microscopic hematuria with multiple renal cysts and carrying an unreported intronic variant, c.12445-34_12445-10del, in the PKD1 gene inherited from his father who also presented PKD. Sanger sequencing and reverse transcription polymerase chain reaction (RT-PCR) for minigene splicing assays showed two abnormal splicing alterations with the c.12445-34_12445-10del variant at the mRNA level: one causes a 16-bp deletion in exon 46, resulting in premature protein termination (p.Phe4149GlyfsTer45), and the other results in a 205-bp deletion, leading to delayed termination (p.Phe4149ProfsTer139). Based on the clinical characteristics and gene mutations with functional verification, the patient was finally diagnosed with PKD caused by PKD1 function defection, as confirmed by the combined clinical features and genetic analysis. Management strategies include dietary management, blood pressure monitoring, and regular follow-up of kidney function. This is the first study to report an intronic deletion in the PKD1 gene that influences alternative splicing. Our findings expand the mutation spectrum leading to PKD1-related diseases and highlight the importance of genetic counseling for the family.
{"title":"An intronic micro-deletion impacts the transcription and translation of <i>PKD1</i> gene.","authors":"Wei Zheng, Xinli Xing, Xuejing Sun, Na Wei","doi":"10.3389/fgene.2025.1707053","DOIUrl":"10.3389/fgene.2025.1707053","url":null,"abstract":"<p><p>Polycystin-1 (PC1), encoded by the <i>PKD1</i> gene, forms a complex with polycystin-2 (<i>PKD2</i>; 173910) that regulates multiple signaling pathways to maintain normal renal tubular structure and function. Mutations in the <i>PKD1</i> gene are the primary cause of type 1 PKD (polycystic kidney disease), accounting for 78%-85% of all PKD cases. In this study, we report a case of a boy presenting with microscopic hematuria with multiple renal cysts and carrying an unreported intronic variant, c.12445-34_12445-10del, in the <i>PKD1</i> gene inherited from his father who also presented PKD. Sanger sequencing and reverse transcription polymerase chain reaction (RT-PCR) for minigene splicing assays showed two abnormal splicing alterations with the c.12445-34_12445-10del variant at the mRNA level: one causes a 16-bp deletion in exon 46, resulting in premature protein termination (p.Phe4149GlyfsTer45), and the other results in a 205-bp deletion, leading to delayed termination (p.Phe4149ProfsTer139). Based on the clinical characteristics and gene mutations with functional verification, the patient was finally diagnosed with PKD caused by PKD1 function defection, as confirmed by the combined clinical features and genetic analysis. Management strategies include dietary management, blood pressure monitoring, and regular follow-up of kidney function. This is the first study to report an intronic deletion in the <i>PKD1</i> gene that influences alternative splicing. Our findings expand the mutation spectrum leading to PKD1-related diseases and highlight the importance of genetic counseling for the family.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1707053"},"PeriodicalIF":2.8,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12875593/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146142077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Editorial: Genetics and epigenetics of melanoma and non-melanoma skin cancer.","authors":"Chiara Moltrasio, Paola Maura Tricarico, Muhammad Suleman, Sergio Crovella, Maurizio Romagnuolo","doi":"10.3389/fgene.2026.1770737","DOIUrl":"https://doi.org/10.3389/fgene.2026.1770737","url":null,"abstract":"","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"17 ","pages":"1770737"},"PeriodicalIF":2.8,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12875716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146141888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-22eCollection Date: 2025-01-01DOI: 10.3389/fgene.2025.1699333
Asif Bashir Shikari, Raheel Shafeeq Khan, Noor-Ul Ain, F A Mohiddin, Gazala Hassan Khan, Najeeb-Ul-Rehman Sofi, Zahoor A Dar, M Ashraf Ahangar, Gowhar Ali, Aflaq Hamid Wani, Bilal A Padder
Introduction: A novel set of pyramided lines for durable blast resistance was developed using marker-assisted backcross breeding (MABB) strategy in the genetic background of the aromatic landrace Mushk Budji (MB).
Methods: Simultaneous-but-stepwise transfer of the blast resistance genes Pi54 and Pi9 was achieved through the integration of foreground selection (FS) and background genome recovery processes, aided by genome-wide SSR and >1500 KASP markers. MABB, whole genome re-sequencing, coupled with stringent phenotypic selection for aroma, amylose content, kernel dimensions, and cooking quality, helped minimize the linkage drag and achieve early recurrent parent genome (RPG) recovery in the inter-cross BC2F2:3 generation.
Results: The two-gene lines carrying Pi9 + Pi54 were developed through inter-crossing corresponding near-isogenic lines (NILs) with an RPG of approximately 90%. With the help of sequencing of the derived NILs, we were able, for the first time, to establish the role of major alleles underlying rice quality and stress resilience in MB. In the process, we confirmed the retention of favorable alleles at key genetic loci, such as BADH2 (aroma), Wx (amylose content), Rc (white pericarp), Hd1/Hd4/Hd5 (heading date), and COLD1/COLD6 (cold tolerance) in the derived NILs. GGE biplot analysis revealed stable performance of five advanced lines across target ecologies.
Discussion: The set of NILs was useful in elucidating the phenotypic effects of 11 genes related to grain type, quality, and adaptability traits in the landrace MB. Multi-environment screening for blast resistance, at hot spot locations, in addition to artificial inoculation, validated the resistance response of the lines to both leaf and neck blast. This study demonstrates the successful integration of genomics-assisted breeding and phenotypic selection to improve a heritage rice variety for enhanced disease resistance and ecological adaptation. The development of improved MB lines represents a rare endeavor towards the area expansion and conservation of the heirloom rice.
{"title":"Genomics-informed elucidation of trait-phenotype relationships and MABB approaches deliver major gene blast resistance in the aromatic rice landrace <i>Mushk Budji</i>.","authors":"Asif Bashir Shikari, Raheel Shafeeq Khan, Noor-Ul Ain, F A Mohiddin, Gazala Hassan Khan, Najeeb-Ul-Rehman Sofi, Zahoor A Dar, M Ashraf Ahangar, Gowhar Ali, Aflaq Hamid Wani, Bilal A Padder","doi":"10.3389/fgene.2025.1699333","DOIUrl":"10.3389/fgene.2025.1699333","url":null,"abstract":"<p><strong>Introduction: </strong>A novel set of pyramided lines for durable blast resistance was developed using marker-assisted backcross breeding (MABB) strategy in the genetic background of the aromatic landrace <i>Mushk Budji</i> (MB).</p><p><strong>Methods: </strong>Simultaneous-but-stepwise transfer of the blast resistance genes <i>Pi54</i> and <i>Pi9</i> was achieved through the integration of foreground selection (FS) and background genome recovery processes, aided by genome-wide SSR and >1500 KASP markers. MABB, whole genome re-sequencing, coupled with stringent phenotypic selection for aroma, amylose content, kernel dimensions, and cooking quality, helped minimize the linkage drag and achieve early recurrent parent genome (RPG) recovery in the inter-cross BC<sub>2</sub>F<sub>2:3</sub> generation.</p><p><strong>Results: </strong>The two-gene lines carrying <i>Pi9</i> + <i>Pi54</i> were developed through inter-crossing corresponding near-isogenic lines (NILs) with an RPG of approximately 90%. With the help of sequencing of the derived NILs, we were able, for the first time, to establish the role of major alleles underlying rice quality and stress resilience in MB. In the process, we confirmed the retention of favorable alleles at key genetic loci, such as <i>BADH2</i> (aroma), <i>Wx</i> (amylose content), <i>Rc</i> (white pericarp), <i>Hd1/Hd4/Hd5</i> (heading date), and <i>COLD1/COLD6</i> (cold tolerance) in the derived NILs. GGE biplot analysis revealed stable performance of five advanced lines across target ecologies.</p><p><strong>Discussion: </strong>The set of NILs was useful in elucidating the phenotypic effects of 11 genes related to grain type, quality, and adaptability traits in the landrace MB. Multi-environment screening for blast resistance, at hot spot locations, in addition to artificial inoculation, validated the resistance response of the lines to both leaf and neck blast. This study demonstrates the successful integration of genomics-assisted breeding and phenotypic selection to improve a heritage rice variety for enhanced disease resistance and ecological adaptation. The development of improved MB lines represents a rare endeavor towards the area expansion and conservation of the heirloom rice.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1699333"},"PeriodicalIF":2.8,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12872327/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-21eCollection Date: 2025-01-01DOI: 10.3389/fgene.2025.1752811
Shulin Zhao, Baoyun Nan, Jun Guo, Wenkai Xu, Zhen Li
Introduction: Coronary atherosclerotic heart disease (CHD) is a leading cause of morbidity and mortality worldwide, making timely identification critical for improving patient prognosis. However, traditional imaging examinations are limited by high costs and patient selection bias, while existing prediction models often lack interpretability and generalization ability. This study aimed to develop a robust, interpretable machine learning approach to address these challenges.
Methods: This retrospective study analyzed hospitalized patients at Quzhou People's Hospital from July 2021 to March 2025. Patients diagnosed with CHD were categorized as positive samples, while those without cardiovascular disease served as negative controls. The dataset integrated demographic data, blood biomarkers, and vital signs. A Generative Adversarial Imputation Network (GAIN) was utilized to handle missing values, and multiple machine learning models were constructed and compared for prediction performance.
Results: Among the evaluated algorithms, the XGBoost model achieved superior performance on the test set with an Area Under the Curve (AUC) of 0.9053. To enhance clinical utility, the integration of SHAP (SHapley Additive exPlanations) values enabled both global and local interpretation of model decisions. Key predictive factors identified included mean respiratory rate during hospitalization, age, high-sensitivity troponin I (hs-cTnI), and hypertension.
Discussion: The developed model demonstrates robust prediction performance combined with high clinical interpretability. Unlike traditional "black box" models, this approach clarifies the contribution of specific risk factors. Crucially, the tool is well-suited for dual deployment: serving as an automated screening tool integrated into hospital electronic health records (EHRs) and as a self-monitoring aid for individuals with underlying health conditions via mobile health applications.
简介:冠状动脉粥样硬化性心脏病(CHD)是世界范围内发病率和死亡率的主要原因,及时识别对改善患者预后至关重要。然而,传统的影像学检查受到成本高和患者选择偏差的限制,而现有的预测模型往往缺乏可解释性和泛化能力。本研究旨在开发一种强大的、可解释的机器学习方法来应对这些挑战。方法:对衢州市人民医院2021年7月至2025年3月住院患者进行回顾性研究。诊断为冠心病的患者被归类为阳性样本,而没有心血管疾病的患者被归类为阴性对照。该数据集整合了人口统计数据、血液生物标志物和生命体征。利用生成式对抗Imputation网络(GAIN)处理缺失值,构建多个机器学习模型并比较其预测性能。结果:在评估的算法中,XGBoost模型在测试集上取得了较好的性能,曲线下面积(Area Under the Curve, AUC)为0.9053。为了提高临床效用,SHAP (SHapley加性解释)值的整合使模型决策的全局和局部解释成为可能。确定的关键预测因素包括住院期间的平均呼吸频率、年龄、高敏感性肌钙蛋白I (hs-cTnI)和高血压。讨论:开发的模型具有强大的预测性能,并具有较高的临床可解释性。与传统的“黑箱”模型不同,这种方法澄清了特定风险因素的贡献。至关重要的是,该工具非常适合双重部署:作为集成到医院电子健康记录(EHRs)中的自动筛查工具,并通过移动健康应用程序作为具有潜在健康状况的个人的自我监测辅助工具。
{"title":"Coronary heart disease risk prediction based on GAIN imputation and interpretable machine learning.","authors":"Shulin Zhao, Baoyun Nan, Jun Guo, Wenkai Xu, Zhen Li","doi":"10.3389/fgene.2025.1752811","DOIUrl":"10.3389/fgene.2025.1752811","url":null,"abstract":"<p><strong>Introduction: </strong>Coronary atherosclerotic heart disease (CHD) is a leading cause of morbidity and mortality worldwide, making timely identification critical for improving patient prognosis. However, traditional imaging examinations are limited by high costs and patient selection bias, while existing prediction models often lack interpretability and generalization ability. This study aimed to develop a robust, interpretable machine learning approach to address these challenges.</p><p><strong>Methods: </strong>This retrospective study analyzed hospitalized patients at Quzhou People's Hospital from July 2021 to March 2025. Patients diagnosed with CHD were categorized as positive samples, while those without cardiovascular disease served as negative controls. The dataset integrated demographic data, blood biomarkers, and vital signs. A Generative Adversarial Imputation Network (GAIN) was utilized to handle missing values, and multiple machine learning models were constructed and compared for prediction performance.</p><p><strong>Results: </strong>Among the evaluated algorithms, the XGBoost model achieved superior performance on the test set with an Area Under the Curve (AUC) of 0.9053. To enhance clinical utility, the integration of SHAP (SHapley Additive exPlanations) values enabled both global and local interpretation of model decisions. Key predictive factors identified included mean respiratory rate during hospitalization, age, high-sensitivity troponin I (hs-cTnI), and hypertension.</p><p><strong>Discussion: </strong>The developed model demonstrates robust prediction performance combined with high clinical interpretability. Unlike traditional \"black box\" models, this approach clarifies the contribution of specific risk factors. Crucially, the tool is well-suited for dual deployment: serving as an automated screening tool integrated into hospital electronic health records (EHRs) and as a self-monitoring aid for individuals with underlying health conditions via mobile health applications.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1752811"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867338/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146118768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Whole-genome sequence and assembly of the sporogenic <i>Bacillus paralicheniformis</i> T7 strain with high proteolytic and amylolytic activities.","authors":"Arman Mussakhmetov, Saniya Aktayeva, Arailym Sarsen, Asset Daniyarov, Bekbolat Khassenov, Ulykbek Kairov","doi":"10.3389/fgene.2026.1720096","DOIUrl":"10.3389/fgene.2026.1720096","url":null,"abstract":"","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"17 ","pages":"1720096"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867574/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146118843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-21eCollection Date: 2025-01-01DOI: 10.3389/fgene.2025.1588292
Yulin Yuan, Sheng-Xiao Ma, Heshi Liu, Yang Gong, Xuan Sun, Quan Wang, Weifu Zhang
Cancer progression involves dynamic crosstalk between tumor-intrinsic pathways and microenvironmental remodeling, and identifying pan-cancer biomarkers is critical for precision oncology. CDKN2AIPNL exhibits a paradoxical role in cancer, acting as a tumor suppressor in myeloid malignancies but promoting solid tumor progression, yet its systematic pan-cancer characteristics remain unelucidated. This study aimed to comprehensively analyze CDKN2AIPNL's expression patterns, prognostic value, genetic alterations, and molecular mechanisms across multiple tumor types using public datasets including TCGA, GTEx, HPA, and tools such as GEPIA2, cBioPortal, TIMER2, STRING, and BioGRID. We performed expression difference analysis, survival analysis (overall survival, disease-free survival, progression-free survival), genetic alteration analysis, cancer-associated fibroblast (CAF) infiltration analysis, and gene/protein interaction enrichment analysis. Results showed that CDKN2AIPNL was significantly upregulated in multiple tumors (e.g., LIHC, UVM, BRCA, LUAD) and downregulated in others (e.g., KICH, KIRP, THCA), with high tumor specificity. Elevated CDKN2AIPNL expression correlated with poor overall survival in LIHC (HR = 1.7, p = 0.0026), UVM (HR = 26, p = 2.2e-6), BRCA (HR = 26, p = 2.2e-6), LUAD (HR = 1.36, p = 0.049), PCPG (HR = 1.71, p = 0.0012), and TGCT (HR = 0.37, p=0.023), and was associated with advanced tumor stages in metabolically active cancers. Genetic alterations (amplifications and mutations) were frequent in KIRC (>5%) and ACC (>4%), with all mutations localized to the XTBD region, and amplification predicted poor prognosis in PRAD (p = 0.008) while mutations conferred favorable outcomes in BLCA. CDKN2AIPNL expression positively correlated with CAF infiltration in ESCA, KICH, UVM, and other tumors, and interacted with MYC, XRN2, and CHAMP1 to regulate metabolic reprogramming, cell cycle, and immune suppression. Our findings systematically reveal CDKN2AIPNL's dual role in tumorigenesis and validate it as a potential pan-cancer prognostic biomarker, providing novel insights for cancer diagnosis and targeted therapy.
{"title":"<i>CDKN2AIPNL</i>: a potential pan-cancer biomarker.","authors":"Yulin Yuan, Sheng-Xiao Ma, Heshi Liu, Yang Gong, Xuan Sun, Quan Wang, Weifu Zhang","doi":"10.3389/fgene.2025.1588292","DOIUrl":"10.3389/fgene.2025.1588292","url":null,"abstract":"<p><p>Cancer progression involves dynamic crosstalk between tumor-intrinsic pathways and microenvironmental remodeling, and identifying pan-cancer biomarkers is critical for precision oncology. <i>CDKN2AIPNL</i> exhibits a paradoxical role in cancer, acting as a tumor suppressor in myeloid malignancies but promoting solid tumor progression, yet its systematic pan-cancer characteristics remain unelucidated. This study aimed to comprehensively analyze <i>CDKN2AIPNL's</i> expression patterns, prognostic value, genetic alterations, and molecular mechanisms across multiple tumor types using public datasets including TCGA, GTEx, HPA, and tools such as GEPIA2, cBioPortal, TIMER2, STRING, and BioGRID. We performed expression difference analysis, survival analysis (overall survival, disease-free survival, progression-free survival), genetic alteration analysis, cancer-associated fibroblast (CAF) infiltration analysis, and gene/protein interaction enrichment analysis. Results showed that CDKN2AIPNL was significantly upregulated in multiple tumors (e.g., LIHC, UVM, BRCA, LUAD) and downregulated in others (e.g., KICH, KIRP, THCA), with high tumor specificity. Elevated CDKN2AIPNL expression correlated with poor overall survival in LIHC (HR = 1.7, p = 0.0026), UVM (HR = 26, p = 2.2e-6), BRCA (HR = 26, p = 2.2e-6), LUAD (HR = 1.36, p = 0.049), PCPG (HR = 1.71, p = 0.0012), and TGCT (HR = 0.37, p=0.023), and was associated with advanced tumor stages in metabolically active cancers. Genetic alterations (amplifications and mutations) were frequent in KIRC (>5%) and ACC (>4%), with all mutations localized to the XTBD region, and amplification predicted poor prognosis in PRAD (p = 0.008) while mutations conferred favorable outcomes in BLCA. CDKN2AIPNL expression positively correlated with CAF infiltration in ESCA, KICH, UVM, and other tumors, and interacted with MYC, XRN2, and CHAMP1 to regulate metabolic reprogramming, cell cycle, and immune suppression. Our findings systematically reveal CDKN2AIPNL's dual role in tumorigenesis and validate it as a potential pan-cancer prognostic biomarker, providing novel insights for cancer diagnosis and targeted therapy.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1588292"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867337/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146118795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-21eCollection Date: 2025-01-01DOI: 10.3389/fgene.2025.1737945
Xiang Ding, Xujiang Yu, Jing Chen, Runlang Su, Jinyou Li
The mitochondrial genome provides crucial information for understanding the evolution and phylogeny of various Apis cerana populations. Apis cerana-Southern China is a unique ecological type of Asian bee, A. cerana, primarily found in the coastal mountains of South China. Here, we used PacBio-HiFi sequencing to obtain the complete mitochondrial genome of A. cerana-Southern China and infer the phylogenetic relationships between A. cerana-Southern China and other A. cerana ecotypes. The mitochondrial genome of A. cerana-Southern China contains 16,137 bp and includes 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes, and an AT-rich region. The overall base composition is as follows: A (42.41%), C (9.59%), G (6.18%), and T (41.82%). The combined percentage of A and T (84.23%) is significantly higher than that of G and C. Among the 37 genes, 23 were located on the majority strand, while the remaining 14 were located on the minority strand. The phylogenetic tree based on the 13 PCGs showed that the genetic distance between A. cerana-Southern China, A. cerana-Central China, and A. cerana-Aba was closer. The complete mitochondrial genome sequence reported here would be useful for further phylogenetic analysis and conservation genetic studies in A. cerana-Southern China.
{"title":"The complete mitochondrial genome of <i>Apis cerana</i>-southern China (Hymenoptera: Apidae) and insights into the phylogenetics.","authors":"Xiang Ding, Xujiang Yu, Jing Chen, Runlang Su, Jinyou Li","doi":"10.3389/fgene.2025.1737945","DOIUrl":"10.3389/fgene.2025.1737945","url":null,"abstract":"<p><p>The mitochondrial genome provides crucial information for understanding the evolution and phylogeny of various <i>Apis cerana</i> populations. <i>Apis cerana</i>-Southern China is a unique ecological type of Asian bee, <i>A. cerana</i>, primarily found in the coastal mountains of South China. Here, we used PacBio-HiFi sequencing to obtain the complete mitochondrial genome of <i>A. cerana</i>-Southern China and infer the phylogenetic relationships between <i>A. cerana</i>-Southern China and other <i>A. cerana</i> ecotypes. The mitochondrial genome of <i>A. cerana</i>-Southern China contains 16,137 bp and includes 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes, and an AT-rich region. The overall base composition is as follows: A (42.41%), C (9.59%), G (6.18%), and T (41.82%). The combined percentage of A and T (84.23%) is significantly higher than that of G and C. Among the 37 genes, 23 were located on the majority strand, while the remaining 14 were located on the minority strand. The phylogenetic tree based on the 13 PCGs showed that the genetic distance between <i>A. cerana</i>-Southern China, <i>A. cerana</i>-Central China, and <i>A. cerana</i>-Aba was closer. The complete mitochondrial genome sequence reported here would be useful for further phylogenetic analysis and conservation genetic studies in <i>A. cerana</i>-Southern China.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1737945"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867336/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146118798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-21eCollection Date: 2026-01-01DOI: 10.3389/fgene.2026.1774569
Yanjuan Li, Yiben Lin, Dong Chen
The major histocompatibility complex (MHC) is the central genetic basis of adaptive immune responses, it plays a crucial role in antigen presentation, immune surveillance, and susceptibility to various diseases. Therefore, accurate MHC identification is essential for both immunological research and clinical applications. Most existing methods still depend on manually engineered features or a single protein language model (PLM for short), these methods cannot perfectly capture complementary information across sequence lengths or across different PLMs. Furthermore, most existing methods often adopt conventional machine learning algorithms or simple multilayer perceptron (MLP) classifiers to construct identification model, they have no ability to model deep semantic dependencies within sequences. To overcome these limitations, we introduce an MHC identification model based on dual-stage training and multi-view feature fusion, termed DFL-MHC, a novel framework that unifies multi-sequence and multi-model views within a dual-stage training strategy. In the feature extraction stage, we design a cross-sequence and cross-model multi-view scheme. In this scheme, a protein sequence is truncated into two different residue sequences with a length of 1,022, two PLMs are respectively employed to extract features from the two different residue sequences, these extracted features are fused to represent the protein sequence. The dimensionality reduction algorithm is applied to the fused features and obtain the optimal feature subset. The optimal feature subset can fully capture complementary information across sequence lengths and across different PLMs. In the feature modeling stage, we construct a bi-directional long short-term memory (BiLSTM) network incorporating an attention mechanism to capture long-range dependencies and deep semantic dependencies within sequences. On the MHC identification task, DFL-MHC achieves better performance than the existing methods. It is demonstrated that the effectiveness of leveraging both multi-view feature fusion and dual-stage training to achieve accurate and reliable MHC identification.
{"title":"DFL-MHC: MHC identification model based on dual-stage training and multi-view feature fusion.","authors":"Yanjuan Li, Yiben Lin, Dong Chen","doi":"10.3389/fgene.2026.1774569","DOIUrl":"10.3389/fgene.2026.1774569","url":null,"abstract":"<p><p>The major histocompatibility complex (MHC) is the central genetic basis of adaptive immune responses, it plays a crucial role in antigen presentation, immune surveillance, and susceptibility to various diseases. Therefore, accurate MHC identification is essential for both immunological research and clinical applications. Most existing methods still depend on manually engineered features or a single protein language model (PLM for short), these methods cannot perfectly capture complementary information across sequence lengths or across different PLMs. Furthermore, most existing methods often adopt conventional machine learning algorithms or simple multilayer perceptron (MLP) classifiers to construct identification model, they have no ability to model deep semantic dependencies within sequences. To overcome these limitations, we introduce an MHC identification model based on dual-stage training and multi-view feature fusion, termed DFL-MHC, a novel framework that unifies multi-sequence and multi-model views within a dual-stage training strategy. In the feature extraction stage, we design a cross-sequence and cross-model multi-view scheme. In this scheme, a protein sequence is truncated into two different residue sequences with a length of 1,022, two PLMs are respectively employed to extract features from the two different residue sequences, these extracted features are fused to represent the protein sequence. The dimensionality reduction algorithm is applied to the fused features and obtain the optimal feature subset. The optimal feature subset can fully capture complementary information across sequence lengths and across different PLMs. In the feature modeling stage, we construct a bi-directional long short-term memory (BiLSTM) network incorporating an attention mechanism to capture long-range dependencies and deep semantic dependencies within sequences. On the MHC identification task, DFL-MHC achieves better performance than the existing methods. It is demonstrated that the effectiveness of leveraging both multi-view feature fusion and dual-stage training to achieve accurate and reliable MHC identification.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"17 ","pages":"1774569"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12868131/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-21eCollection Date: 2025-01-01DOI: 10.3389/fgene.2025.1713817
Xin Li, Zhongli Zhao, Yang Cao, Yumin Zhao, Lihong Qin
In this study, we successfully isolated and cultured bovine skeletal muscle satellite cells (bSMSCs) and induced muscle cell formation in vitro. Skeletal muscle satellite cells (SMSCs) were isolated from the deep tissues of foetal bovine hind limbs and differentiated with 2% horse serum in vitro. The transcriptome sequencing results revealed a total of 1030 differentially expressed genes (DEGs) in the middle stage of differentiation (day 3) compared with the predifferentiation stage (day 0). A total of 374 DEGs were identified in the postdifferentiation stage (day 7) compared with the middle differentiation stage (day 3). We further investigated the regulatory effects of the DEG nerve growth factor (NGF) on the proliferation and myogenic differentiation of bSMSCs. The overexpression of NGF increased the mRNA and protein expression levels of myosin heavy chain (MyHC) and myogenin (MyoG), which are myoblast development markers, whereas NGF knockdown had the opposite effect; however, NGF did not affect the expression of the proliferation marker paired box gene 7 (Pax7) in bSMSCs. In addition, functional enrichment analysis of the DEGs revealed that the PI3K/Akt signalling pathway was significantly enriched in the DEGs and that NGF regulates myogenesis through the activation of the PI3K/Akt signalling pathway. Our results revealed that NGF was shown to be a putative regulator that controls myogenesis by activating the PI3K/Akt signalling pathway. The study provided a reference for further studies on the molecular mechanism of myogenic differentiation, regulatory network establishment, and beef quality improvement.
{"title":"RNA-seq analysis reveals a positive role for NGF in the myogenic differentiation of bovine skeletal muscle satellite cells.","authors":"Xin Li, Zhongli Zhao, Yang Cao, Yumin Zhao, Lihong Qin","doi":"10.3389/fgene.2025.1713817","DOIUrl":"10.3389/fgene.2025.1713817","url":null,"abstract":"<p><p>In this study, we successfully isolated and cultured bovine skeletal muscle satellite cells (bSMSCs) and induced muscle cell formation <i>in vitro</i>. Skeletal muscle satellite cells (SMSCs) were isolated from the deep tissues of foetal bovine hind limbs and differentiated with 2% horse serum <i>in vitro</i>. The transcriptome sequencing results revealed a total of 1030 differentially expressed genes (DEGs) in the middle stage of differentiation (day 3) compared with the predifferentiation stage (day 0). A total of 374 DEGs were identified in the postdifferentiation stage (day 7) compared with the middle differentiation stage (day 3). We further investigated the regulatory effects of the DEG nerve growth factor (NGF) on the proliferation and myogenic differentiation of bSMSCs. The overexpression of NGF increased the mRNA and protein expression levels of myosin heavy chain (MyHC) and myogenin (MyoG), which are myoblast development markers, whereas NGF knockdown had the opposite effect; however, NGF did not affect the expression of the proliferation marker paired box gene 7 (Pax7) in bSMSCs. In addition, functional enrichment analysis of the DEGs revealed that the PI3K/Akt signalling pathway was significantly enriched in the DEGs and that NGF regulates myogenesis through the activation of the PI3K/Akt signalling pathway. Our results revealed that NGF was shown to be a putative regulator that controls myogenesis by activating the PI3K/Akt signalling pathway. The study provided a reference for further studies on the molecular mechanism of myogenic differentiation, regulatory network establishment, and beef quality improvement.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1713817"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867674/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-21eCollection Date: 2026-01-01DOI: 10.3389/fgene.2026.1721789
Si-Ming Zhang
This review highlights recent advances and ongoing challenges in the genomics, genetics, and gene-editing (3G) of the freshwater snail Biomphalaria glabrata, based on insights gained from a novel model system we initiated two decades ago. B. glabrata is an intermediate host of the human blood fluke Schistosoma mansoni and serves as the principal model organism for schistosomiasis research. We developed two homozygous lines of B. glabrata, the iM line and iBS90, through 81 and 41 generations of selfing the commonly used M line and BS90, respectively. These lines display contrasting infection phenotypes: susceptibility or resistance to S. mansoni. High-quality scaffold-based genome assemblies were generated for both lines, followed by a chromosome-level assembly of the iM line genome. An F2 segregating population derived from these lines enabled the identification of three loci, two linked to resistance or susceptibility and one associated with pigmentation, using a double digest restriction-site associated DNA sequencing (ddRADseq) approach. Recombinant inbred lines (RILs) were developed through two crosses and ten generations of selfing. Genetic mapping with RILs refined the resistance locus on chromosome 5 from 8 to 3 Mb through individual-based whole-genome sequencing. Ongoing work includes comparative transcriptome analyses of the two homozygous lines and RILs in response to schistosome infection, along with fine-scale mapping of advanced intercross lines to elucidate the molecular basis of the snail's anti-schistosome defenses. Over the past 10 years, we have made extensive efforts to achieve germline delivery and generate genetically modified snails. Although pantropic lentiviral and yolk protein-mediated germline delivery methods were unsuccessful, these pioneering experiments provide valuable insights for future research. Finally, we successfully generated germline-edited B. glabrata, the first genetically modified schistosomiasis vector snail, by microinjecting CRISPR/Cas9 and guide RNA (gRNA) targeting the fibrinogen-related protein 3.1 (FREP3.1) gene into decapsulated embryos, followed by ex ovo culture. This breakthrough establishes a foundation for innovative genetic strategies to control this neglected tropical disease.
{"title":"Snail 3G: genomics, genetics, and gene-editing of <i>Biomphalaria glabrata</i>.","authors":"Si-Ming Zhang","doi":"10.3389/fgene.2026.1721789","DOIUrl":"10.3389/fgene.2026.1721789","url":null,"abstract":"<p><p>This review highlights recent advances and ongoing challenges in the genomics, genetics, and gene-editing (3G) of the freshwater snail <i>Biomphalaria glabrata</i>, based on insights gained from a novel model system we initiated two decades ago. <i>B. glabrata</i> is an intermediate host of the human blood fluke <i>Schistosoma mansoni</i> and serves as the principal model organism for schistosomiasis research. We developed two homozygous lines of <i>B. glabrata</i>, the iM line and iBS90, through 81 and 41 generations of selfing the commonly used M line and BS90, respectively. These lines display contrasting infection phenotypes: susceptibility or resistance to <i>S. mansoni</i>. High-quality scaffold-based genome assemblies were generated for both lines, followed by a chromosome-level assembly of the iM line genome. An F<sub>2</sub> segregating population derived from these lines enabled the identification of three loci, two linked to resistance or susceptibility and one associated with pigmentation, using a double digest restriction-site associated DNA sequencing (ddRADseq) approach. Recombinant inbred lines (RILs) were developed through two crosses and ten generations of selfing. Genetic mapping with RILs refined the resistance locus on chromosome 5 from 8 to 3 Mb through individual-based whole-genome sequencing. Ongoing work includes comparative transcriptome analyses of the two homozygous lines and RILs in response to schistosome infection, along with fine-scale mapping of advanced intercross lines to elucidate the molecular basis of the snail's anti-schistosome defenses. Over the past 10 years, we have made extensive efforts to achieve germline delivery and generate genetically modified snails. Although pantropic lentiviral and yolk protein-mediated germline delivery methods were unsuccessful, these pioneering experiments provide valuable insights for future research. Finally, we successfully generated germline-edited <i>B. glabrata</i>, the first genetically modified schistosomiasis vector snail, by microinjecting CRISPR/Cas9 and guide RNA (gRNA) targeting the <i>fibrinogen-related protein 3.1</i> (<i>FREP3.1</i>) gene into decapsulated embryos, followed by ex ovo culture. This breakthrough establishes a foundation for innovative genetic strategies to control this neglected tropical disease.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"17 ","pages":"1721789"},"PeriodicalIF":2.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867339/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146118814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}