Genomics & informatics最新文献_第3页

Genome-wide association study-driven identification of thrombomodulin and factor V as the best biomarker combination for deep vein thrombosis. 全基因组关联研究驱动的血栓调节蛋白和V因子作为深静脉血栓形成的最佳生物标志物组合的鉴定。

Genomics & informatics

Pub Date : 2025-05-15 DOI: 10.1186/s44342-025-00047-2

Usi Sukorini, Gisca Ajeng Widya Ninggar, Mohammad Hendra Setia Lesmana, Lalu Irham, Wirawan Adikusuma, Hegaria Rahmawati, Nur Imma Fatimah Harahap, Chiou-Feng Lin, Rahmat Dani Satria

Deep vein thrombosis (DVT) is a clinically significant condition characterized by the formation of thrombi in deep venous structures, leading to high morbidity and potential mortality. Identifying reliable biomarkers for DVT risk prediction remains challenging due to the intricate genetic and molecular mechanisms underlying the disease. This study aims to investigate the best biomarker for DVT. Our study utilized genome-wide association studies (GWAS) findings coupled with a functional annotation scoring system to identify and prioritize genetic markers with strong associations to DVT. Furthermore, gene expression levels were analyzed to determine the most promising genetic markers. Several databases were utilized, including the GWAS Catalog, HaploReg 4.2, WebGestalt, Enrichr, and the GTEx Portal. Through the comprehensive analysis, we found 5 potential biomarkers and highlighted thrombomodulin (THBD) and Factor V (F5) as the best blood-based biomarkers. THBD and F5 genes were selected based on their elevated expression levels in blood and the presence of eQTLs. Functionally, THBD modulates coagulation via protein C activation, while F5 is pivotal in thrombin formation and clot stabilization, underscoring their mechanistic relevance to DVT pathogenesis, and rendering them suitable for non-invasive clinical assessment. Our findings emphasize the potential of genetic biomarkers to transform DVT risk assessment and support advancements in precision medicine for thrombotic disorders.

深静脉血栓形成（DVT）是一种临床意义重大的疾病，其特征是在深静脉结构中形成血栓，具有很高的发病率和潜在的死亡率。由于该疾病复杂的遗传和分子机制，确定可靠的DVT风险预测生物标志物仍然具有挑战性。本研究旨在探讨深静脉血栓形成的最佳生物标志物。我们的研究利用全基因组关联研究（GWAS）的结果，结合功能注释评分系统来识别和优先考虑与DVT强关联的遗传标记。此外，还分析了基因表达水平，以确定最有希望的遗传标记。使用了几个数据库，包括GWAS Catalog、HaploReg 4.2、WebGestalt、plentir和GTEx Portal。通过综合分析，我们发现了5个潜在的生物标志物，并强调血栓调节素（THBD）和因子V （F5）是最好的血基生物标志物。根据THBD和F5基因在血液中的高表达水平和eqtl的存在来选择它们。在功能上，THBD通过蛋白C激活调节凝血，而F5在凝血酶形成和凝块稳定中起关键作用，强调了它们与DVT发病机制的机制相关性，并使它们适合用于无创临床评估。我们的研究结果强调了遗传生物标志物在改变深静脉血栓形成风险评估方面的潜力，并支持血栓性疾病精准医学的进步。

{"title":"Genome-wide association study-driven identification of thrombomodulin and factor V as the best biomarker combination for deep vein thrombosis.","authors":"Usi Sukorini, Gisca Ajeng Widya Ninggar, Mohammad Hendra Setia Lesmana, Lalu Irham, Wirawan Adikusuma, Hegaria Rahmawati, Nur Imma Fatimah Harahap, Chiou-Feng Lin, Rahmat Dani Satria","doi":"10.1186/s44342-025-00047-2","DOIUrl":"10.1186/s44342-025-00047-2","url":null,"abstract":"Deep vein thrombosis (DVT) is a clinically significant condition characterized by the formation of thrombi in deep venous structures, leading to high morbidity and potential mortality. Identifying reliable biomarkers for DVT risk prediction remains challenging due to the intricate genetic and molecular mechanisms underlying the disease. This study aims to investigate the best biomarker for DVT. Our study utilized genome-wide association studies (GWAS) findings coupled with a functional annotation scoring system to identify and prioritize genetic markers with strong associations to DVT. Furthermore, gene expression levels were analyzed to determine the most promising genetic markers. Several databases were utilized, including the GWAS Catalog, HaploReg 4.2, WebGestalt, Enrichr, and the GTEx Portal. Through the comprehensive analysis, we found 5 potential biomarkers and highlighted thrombomodulin (THBD) and Factor V (F5) as the best blood-based biomarkers. THBD and F5 genes were selected based on their elevated expression levels in blood and the presence of eQTLs. Functionally, THBD modulates coagulation via protein C activation, while F5 is pivotal in thrombin formation and clot stabilization, underscoring their mechanistic relevance to DVT pathogenesis, and rendering them suitable for non-invasive clinical assessment. Our findings emphasize the potential of genetic biomarkers to transform DVT risk assessment and support advancements in precision medicine for thrombotic disorders.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083130/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144083126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bioinformatics analysis reveals shared molecular pathways for relationship between ulcerative colitis and primary sclerosing cholangitis. 生物信息学分析揭示了溃疡性结肠炎和原发性硬化性胆管炎之间的共同分子通路。

Genomics & informatics

Pub Date : 2025-05-15 DOI: 10.1186/s44342-025-00045-4

Pooya Jalali, Malihe Rezaee, Alireza Yaghoobi, Moein Piroozkhah, Mohammad Reza Zabihi, Shahram Aliyari, Zahra Salehi

Background: Inflammatory bowel disease (IBD) is a group of chronic inflammatory disorders, including ulcerative colitis (UC) and Crohn's disease, affecting the gastrointestinal tract and is associated with high morbidity and mortality. Accumulating evidence indicates that IBD not only impacts the gastrointestinal tract but also affects multiple extraintestinal organs, which may manifest prior to the diagnosis of IBD. Among these extraintestinal manifestations associated with IBD, primary sclerosing cholangitis (PSC) stands out as a prominent example. PSC is recognized as a progressive cholestatic disorder, characterized by the narrowing of bile ducts, eventual development of liver cirrhosis, end-stage liver disease, and the potential emergence of cholangiocarcinoma. This study aimed to identify the molecular contributors in UC-induced PSC by detecting the essential regulatory genes that are differentially expressed in both diseases.

Materials and methods: The common single-nucleotide polymorphisms (SNPs) and differentially expressed genes (DEGs) were detected using DisGeNET and GEO databases, respectively. Then, the top module and hub genes within the protein-protein interaction network were identified. Furthermore, the co-expression network of the top module was constructed using the HIPPIE database. Additionally, the gene regulatory network was constructed based on miRNAs and circRNAs. Finally, we searched the DGIdb database for possible interacting drugs with UC-PSC top module genes.

Results: A total of 132 SNPs and their associated genes were found to be shared between UC and PSC. Gene expression analysis identified 56 common DEGs between the two diseases. Following functional enrichment analysis, 207 significant biological processes (BP), 48 molecular functions (MF), and 8 KEGG pathways, with notable enrichment in mRNA-related processes such as mRNA splicing and RNA binding, were defined. Particularly, the PTPN2 gene was the only gene common between UC and PSC at both the SNP level and the expression level. Additionally, the top cluster of PPI network analysis was consisted of PABPC1, SNRPA1, NOP56, NHP2L1, and HNRNPA2B1 genes. Finally, ceRNA network involving 4 mRNAs, 94 miRNAs, and 200 selected circRNAs was constructed.

Conclusion: The present study provides novel potential candidate genes that may be involved in the molecular association between ulcerative colitis and primary sclerosing cholangitis, resulting in the development of diagnostic tools and therapeutic targets to prevent the progression of PSC from UC.

背景：炎症性肠病（IBD）是一组慢性炎症性疾病，包括溃疡性结肠炎（UC）和克罗恩病，影响胃肠道，具有高发病率和死亡率。越来越多的证据表明，IBD不仅影响胃肠道，还影响多个肠外器官，这可能在IBD的诊断之前就表现出来。在这些与IBD相关的肠外表现中，原发性硬化性胆管炎（PSC）是一个突出的例子。PSC被认为是一种进行性胆汁淤积性疾病，其特征是胆管狭窄，最终发展为肝硬化、终末期肝病，并可能出现胆管癌。本研究旨在通过检测在两种疾病中差异表达的必要调控基因，确定uc诱导PSC的分子贡献者。材料和方法：分别使用DisGeNET和GEO数据库检测常见单核苷酸多态性（snp）和差异表达基因（DEGs）。然后，对蛋白-蛋白相互作用网络中的顶层模块和枢纽基因进行了鉴定。利用HIPPIE数据库构建了顶层模块的共表达网络。此外，构建了基于mirna和circrna的基因调控网络。最后，我们在DGIdb数据库中搜索可能与UC-PSC顶部模块基因相互作用的药物。结果：发现UC和PSC共有132个snp及其相关基因。基因表达分析确定了两种疾病之间56个共同的基因变异位点。通过功能富集分析，确定了207个重要生物过程（BP）， 48个分子功能（MF）和8个KEGG通路，其中mRNA剪接和RNA结合等mRNA相关过程显著富集。特别是，PTPN2基因是UC和PSC在SNP水平和表达水平上唯一共同的基因。此外，PPI网络分析的顶层聚类由PABPC1、SNRPA1、NOP56、NHP2L1和HNRNPA2B1基因组成。最后，构建了包含4个mrna、94个mirna和200个选定circrna的ceRNA网络。结论：本研究提供了新的潜在候选基因，可能参与溃疡性结肠炎和原发性硬化性胆管炎之间的分子关联，从而开发了诊断工具和治疗靶点，以防止UC的PSC进展。

{"title":"Bioinformatics analysis reveals shared molecular pathways for relationship between ulcerative colitis and primary sclerosing cholangitis.","authors":"Pooya Jalali, Malihe Rezaee, Alireza Yaghoobi, Moein Piroozkhah, Mohammad Reza Zabihi, Shahram Aliyari, Zahra Salehi","doi":"10.1186/s44342-025-00045-4","DOIUrl":"10.1186/s44342-025-00045-4","url":null,"abstract":"Background: Inflammatory bowel disease (IBD) is a group of chronic inflammatory disorders, including ulcerative colitis (UC) and Crohn's disease, affecting the gastrointestinal tract and is associated with high morbidity and mortality. Accumulating evidence indicates that IBD not only impacts the gastrointestinal tract but also affects multiple extraintestinal organs, which may manifest prior to the diagnosis of IBD. Among these extraintestinal manifestations associated with IBD, primary sclerosing cholangitis (PSC) stands out as a prominent example. PSC is recognized as a progressive cholestatic disorder, characterized by the narrowing of bile ducts, eventual development of liver cirrhosis, end-stage liver disease, and the potential emergence of cholangiocarcinoma. This study aimed to identify the molecular contributors in UC-induced PSC by detecting the essential regulatory genes that are differentially expressed in both diseases.Materials and methods: The common single-nucleotide polymorphisms (SNPs) and differentially expressed genes (DEGs) were detected using DisGeNET and GEO databases, respectively. Then, the top module and hub genes within the protein-protein interaction network were identified. Furthermore, the co-expression network of the top module was constructed using the HIPPIE database. Additionally, the gene regulatory network was constructed based on miRNAs and circRNAs. Finally, we searched the DGIdb database for possible interacting drugs with UC-PSC top module genes.Results: A total of 132 SNPs and their associated genes were found to be shared between UC and PSC. Gene expression analysis identified 56 common DEGs between the two diseases. Following functional enrichment analysis, 207 significant biological processes (BP), 48 molecular functions (MF), and 8 KEGG pathways, with notable enrichment in mRNA-related processes such as mRNA splicing and RNA binding, were defined. Particularly, the PTPN2 gene was the only gene common between UC and PSC at both the SNP level and the expression level. Additionally, the top cluster of PPI network analysis was consisted of PABPC1, SNRPA1, NOP56, NHP2L1, and HNRNPA2B1 genes. Finally, ceRNA network involving 4 mRNAs, 94 miRNAs, and 200 selected circRNAs was constructed.Conclusion: The present study provides novel potential candidate genes that may be involved in the molecular association between ulcerative colitis and primary sclerosing cholangitis, resulting in the development of diagnostic tools and therapeutic targets to prevent the progression of PSC from UC.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"12"},"PeriodicalIF":0.0,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12082998/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144083125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Single-cell network biology enabling cell-type-resolved disease genetics. 单细胞网络生物学使细胞类型解决疾病遗传学。

Genomics & informatics

Pub Date : 2025-03-27 DOI: 10.1186/s44342-025-00042-7

Junha Cha, Insuk Lee

Gene network models provide a foundation for graph theory approaches, aiding in the novel discovery of drug targets, disease genes, and genetic mechanisms for various biological functions. Disease genetics must be interpreted within the cellular context of disease-associated cell types, which cannot be achieved with datasets consisting solely of organism-level samples. Single-cell RNA sequencing (scRNA-seq) technology allows computational distinction of cell states which provides a unique opportunity to understand cellular biology that drives disease processes. Importantly, the abundance of cell samples with their transcriptome-wide profile allows the modeling of systemic cell-type-specific gene networks (CGNs), offering insights into gene-cell-disease relationships. In this review, we present reference-based and de novo inference of gene functional interaction networks that we have recently developed using scRNA-seq datasets. We also introduce a compendium of CGNs as a useful resource for cell-type-resolved disease genetics. By leveraging these advances, we envision single-cell network biology as the key approach for mapping the gene-cell-disease axis.

基因网络模型为图论方法提供了基础，有助于发现新的药物靶点、疾病基因和各种生物功能的遗传机制。疾病遗传学必须在与疾病相关的细胞类型的细胞背景下进行解释，而这是仅由生物体级样本组成的数据集无法实现的。单细胞 RNA 测序（scRNA-seq）技术可通过计算区分细胞状态，为了解驱动疾病进程的细胞生物学提供了独特的机会。重要的是，丰富的细胞样本及其转录组全貌可以建立系统细胞类型特异性基因网络（CGN）模型，从而深入了解基因-细胞-疾病之间的关系。在这篇综述中，我们介绍了我们最近利用 scRNA-seq 数据集开发的基于参考和全新推断的基因功能相互作用网络。我们还介绍了 CGN 简编，作为细胞类型解析疾病遗传学的有用资源。通过利用这些进展，我们设想单细胞网络生物学将成为绘制基因-细胞-疾病轴的关键方法。

{"title":"Single-cell network biology enabling cell-type-resolved disease genetics.","authors":"Junha Cha, Insuk Lee","doi":"10.1186/s44342-025-00042-7","DOIUrl":"10.1186/s44342-025-00042-7","url":null,"abstract":"Gene network models provide a foundation for graph theory approaches, aiding in the novel discovery of drug targets, disease genes, and genetic mechanisms for various biological functions. Disease genetics must be interpreted within the cellular context of disease-associated cell types, which cannot be achieved with datasets consisting solely of organism-level samples. Single-cell RNA sequencing (scRNA-seq) technology allows computational distinction of cell states which provides a unique opportunity to understand cellular biology that drives disease processes. Importantly, the abundance of cell samples with their transcriptome-wide profile allows the modeling of systemic cell-type-specific gene networks (CGNs), offering insights into gene-cell-disease relationships. In this review, we present reference-based and de novo inference of gene functional interaction networks that we have recently developed using scRNA-seq datasets. We also introduce a compendium of CGNs as a useful resource for cell-type-resolved disease genetics. By leveraging these advances, we envision single-cell network biology as the key approach for mapping the gene-cell-disease axis.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11951680/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143733850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Editorial for the special collection: frontiers in rare disease genetics. 专集编辑：罕见病遗传学前沿。

Genomics & informatics

Pub Date : 2025-03-07 DOI: 10.1186/s44342-025-00039-2

Murim Choi

引用次数: 0

Impact of cysteine mutations on the structural dynamics and functional impairment of SOD1: insights into the pathogenicity of amyotrophic lateral sclerosis. 半胱氨酸突变对SOD1结构动力学和功能损害的影响：对肌萎缩性侧索硬化症致病性的见解。

Genomics & informatics

Pub Date : 2025-03-06 DOI: 10.1186/s44342-025-00041-8

Jessica Jeejan, Lawanya Rao, Shivank Sadasivan, Richa Lopes, Norine Dsouza

Amyotrophic lateral sclerosis (ALS) is a rare neurodegenerative disease prevalent in American and European populations, with its onset and progression significantly influenced by mutations in the superoxide dismutase 1 (SOD1) protein. While previous studies have highlighted the effects of mutations in the metal-binding region and catalytic region and dimerisation of SOD1, the impact of mutations involving the Cysteine residue at the N-terminal end remains unexplored. This study investigates the effects of Cysteine-to-Trp, Phe, Ser, and Gly mutations at the 6th position of SOD1's N-terminal end on its structural dynamics and functional impairment. Our computational analysis using PolyPhen-2, PROVEAN, Meta-SNP, and PhD-SNP predicted mutations to be deleterious, with their negative impacts likely contributing to disease development. Furthermore, stability studies and bonding pattern changes due to the mutations, analysed by mCSM, SDM, DUET, Dynamut2, and PremPS revealed changes in free energy and disruption in intramolecular interactions. The molecular dynamics studies revealed distinct changes in stability patterns among the mutations, particularly in Cys6Trp and Cys6Phe. All the mutations primarily altered the catalytic region of the protein; additionally, Cys6Phe and Cys6Gly caused disruption in the metal-binding region. The impact of mutations on the dimerisation of SOD1, analysed using MM/PBSA showed destabilisation due to Cys6Phe mutation. These findings provide molecular insights into the clinical symptoms observed in patients, highlighting the critical impact of the Cys6Phe mutation on the metal-binding and catalytic loops of SOD1 along with destabilisation of dimer formation. Overall, our analysis offers valuable insights into the molecular mechanisms driving structural changes in SOD1 due to mutations, contributing to a deeper understanding of their role in ALS pathogenicity.

肌萎缩性侧索硬化症（ALS）是一种罕见的神经退行性疾病，常见于美国和欧洲人群，其发病和进展受超氧化物歧化酶1 （SOD1）蛋白突变的显著影响。虽然先前的研究已经强调了SOD1金属结合区和催化区突变和二聚化的影响，但涉及n端半胱氨酸残基的突变的影响仍未被探索。本研究探讨了SOD1 n端第6位半胱氨酸转色氨酸、Phe、Ser和Gly突变对其结构动力学和功能损伤的影响。我们使用polyphen2、PROVEAN、Meta-SNP和PhD-SNP进行计算分析，预测突变是有害的，它们的负面影响可能有助于疾病的发展。此外，通过mCSM、SDM、DUET、Dynamut2和PremPS分析的稳定性研究和键模式变化揭示了自由能的变化和分子内相互作用的破坏。分子动力学研究揭示了突变之间稳定性模式的明显变化，特别是在Cys6Trp和Cys6Phe中。所有的突变都主要改变了蛋白质的催化区；此外，Cys6Phe和Cys6Gly在金属结合区引起破坏。使用MM/PBSA分析突变对SOD1二聚化的影响显示，由于Cys6Phe突变，SOD1不稳定。这些发现为患者观察到的临床症状提供了分子见解，突出了Cys6Phe突变对SOD1金属结合和催化环以及二聚体形成不稳定的关键影响。总的来说，我们的分析为SOD1突变引起的结构变化的分子机制提供了有价值的见解，有助于更深入地了解它们在ALS致病性中的作用。

{"title":"Impact of cysteine mutations on the structural dynamics and functional impairment of SOD1: insights into the pathogenicity of amyotrophic lateral sclerosis.","authors":"Jessica Jeejan, Lawanya Rao, Shivank Sadasivan, Richa Lopes, Norine Dsouza","doi":"10.1186/s44342-025-00041-8","DOIUrl":"10.1186/s44342-025-00041-8","url":null,"abstract":"Amyotrophic lateral sclerosis (ALS) is a rare neurodegenerative disease prevalent in American and European populations, with its onset and progression significantly influenced by mutations in the superoxide dismutase 1 (SOD1) protein. While previous studies have highlighted the effects of mutations in the metal-binding region and catalytic region and dimerisation of SOD1, the impact of mutations involving the Cysteine residue at the N-terminal end remains unexplored. This study investigates the effects of Cysteine-to-Trp, Phe, Ser, and Gly mutations at the 6th position of SOD1's N-terminal end on its structural dynamics and functional impairment. Our computational analysis using PolyPhen-2, PROVEAN, Meta-SNP, and PhD-SNP predicted mutations to be deleterious, with their negative impacts likely contributing to disease development. Furthermore, stability studies and bonding pattern changes due to the mutations, analysed by mCSM, SDM, DUET, Dynamut2, and PremPS revealed changes in free energy and disruption in intramolecular interactions. The molecular dynamics studies revealed distinct changes in stability patterns among the mutations, particularly in Cys6Trp and Cys6Phe. All the mutations primarily altered the catalytic region of the protein; additionally, Cys6Phe and Cys6Gly caused disruption in the metal-binding region. The impact of mutations on the dimerisation of SOD1, analysed using MM/PBSA showed destabilisation due to Cys6Phe mutation. These findings provide molecular insights into the clinical symptoms observed in patients, highlighting the critical impact of the Cys6Phe mutation on the metal-binding and catalytic loops of SOD1 along with destabilisation of dimer formation. Overall, our analysis offers valuable insights into the molecular mechanisms driving structural changes in SOD1 due to mutations, contributing to a deeper understanding of their role in ALS pathogenicity.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11884185/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143574758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Lessons from national biobank projects utilizing whole-genome sequencing for population-scale genomics. 利用全基因组测序进行人口规模基因组学的国家生物银行项目的经验教训。

Genomics & informatics

Pub Date : 2025-03-06 DOI: 10.1186/s44342-025-00040-9

Hyeji Lee, Wooheon Kim, Nahyeon Kwon, Chanhee Kim, Sungmin Kim, Joon-Yong An

Large-scale national biobank projects utilizing whole-genome sequencing have emerged as transformative resources for understanding human genetic variation and its relationship to health and disease. These initiatives, which include the UK Biobank, All of Us Research Program, Singapore's PRECISE, Biobank Japan, and the National Project of Bio-Big Data of Korea, are generating unprecedented volumes of high-resolution genomic data integrated with comprehensive phenotypic, environmental, and clinical information. This review examines the methodologies, contributions, and challenges of major WGS-based national genome projects worldwide. We first discuss the landscape of national biobank initiatives, highlighting their distinct approaches to data collection, participant recruitment, and phenotype characterization. We then introduce recent technological advances that enable efficient processing and analysis of large-scale WGS data, including improvements in variant calling algorithms, innovative methods for creating multi-sample VCFs, optimized data storage formats, and cloud-based computing solutions. The review synthesizes key discoveries from these projects, particularly in identifying expression quantitative trait loci and rare variants associated with complex diseases. Our review introduces the latest findings from the National Project of Bio-Big Data of Korea, which has advanced our understanding of population-specific genetic variation and rare diseases in Korean and East Asian populations. Finally, we discuss future directions and challenges in maximizing the impact of these resources on precision medicine and global health equity. This comprehensive examination demonstrates how large-scale national genome projects are revolutionizing genetic research and healthcare delivery while highlighting the importance of continued investment in diverse, population-specific genomic resources.

利用全基因组测序的大规模国家生物银行项目已成为了解人类遗传变异及其与健康和疾病关系的变革性资源。包括英国生物银行、我们所有人研究计划、新加坡的PRECISE、日本生物银行和韩国国家生物大数据项目在内的这些项目正在产生前所未有的高分辨率基因组数据，这些数据集成了全面的表型、环境和临床信息。本文综述了全球主要基于wgs的国家基因组计划的方法、贡献和挑战。我们首先讨论了国家生物银行倡议的景观，强调了他们在数据收集、参与者招募和表型表征方面的独特方法。然后，我们介绍了能够有效处理和分析大规模WGS数据的最新技术进展，包括变体调用算法的改进、创建多样本vcf的创新方法、优化的数据存储格式和基于云的计算解决方案。这篇综述综合了这些项目的主要发现，特别是在鉴定与复杂疾病相关的表达数量性状位点和罕见变异方面。我们的综述介绍了韩国国家生物大数据项目的最新发现，这些发现提高了我们对韩国和东亚人群中人群特异性遗传变异和罕见疾病的理解。最后，我们讨论了未来的方向和挑战，以最大限度地发挥这些资源对精准医疗和全球卫生公平的影响。这一全面的研究表明，大规模的国家基因组项目正在彻底改变基因研究和医疗保健服务，同时强调继续投资于多样化、特定人群的基因组资源的重要性。

{"title":"Lessons from national biobank projects utilizing whole-genome sequencing for population-scale genomics.","authors":"Hyeji Lee, Wooheon Kim, Nahyeon Kwon, Chanhee Kim, Sungmin Kim, Joon-Yong An","doi":"10.1186/s44342-025-00040-9","DOIUrl":"10.1186/s44342-025-00040-9","url":null,"abstract":"Large-scale national biobank projects utilizing whole-genome sequencing have emerged as transformative resources for understanding human genetic variation and its relationship to health and disease. These initiatives, which include the UK Biobank, All of Us Research Program, Singapore's PRECISE, Biobank Japan, and the National Project of Bio-Big Data of Korea, are generating unprecedented volumes of high-resolution genomic data integrated with comprehensive phenotypic, environmental, and clinical information. This review examines the methodologies, contributions, and challenges of major WGS-based national genome projects worldwide. We first discuss the landscape of national biobank initiatives, highlighting their distinct approaches to data collection, participant recruitment, and phenotype characterization. We then introduce recent technological advances that enable efficient processing and analysis of large-scale WGS data, including improvements in variant calling algorithms, innovative methods for creating multi-sample VCFs, optimized data storage formats, and cloud-based computing solutions. The review synthesizes key discoveries from these projects, particularly in identifying expression quantitative trait loci and rare variants associated with complex diseases. Our review introduces the latest findings from the National Project of Bio-Big Data of Korea, which has advanced our understanding of population-specific genetic variation and rare diseases in Korean and East Asian populations. Finally, we discuss future directions and challenges in maximizing the impact of these resources on precision medicine and global health equity. This comprehensive examination demonstrates how large-scale national genome projects are revolutionizing genetic research and healthcare delivery while highlighting the importance of continued investment in diverse, population-specific genomic resources.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11887102/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143574765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comprehensive Analysis of TCR and BCR Repertoires: Insights into Methodologies, Challenges, and Applications. TCR和BCR的综合分析：方法、挑战和应用。

Genomics & informatics

Pub Date : 2025-02-24 DOI: 10.1186/s44342-024-00034-z

Kayoung Seo, Jung Kyoon Choi

The diversity of T-cell receptors (TCRs) and B-cell receptors (BCRs) underpins the adaptive immune system's ability to recognize and respond to a wide array of antigens. Recent advancements in RNA sequencing have expanded its application beyond transcriptomics to include the analysis of immune repertoires, enabling the exploration of TCR and BCR sequences across various physiological and pathological contexts. This review highlights key methodologies and considerations for TCR and BCR repertoire analysis, focusing on the technical aspects of receptor sequence extraction, data processing, and clonotype identification. We compare the use of bulk and single-cell sequencing, discuss computational tools and pipelines, and evaluate the implications of examining specific receptor regions such as CDR3. By integrating immunology, bioinformatics, and clinical research, immune repertoire analysis provides valuable insights into immune function, therapeutic responses, and precision medicine approaches, advancing our understanding of health and disease.

t细胞受体（TCRs）和b细胞受体（BCRs）的多样性支撑了适应性免疫系统识别和应答多种抗原的能力。RNA测序的最新进展已将其应用范围从转录组学扩展到免疫谱分析，从而能够在各种生理和病理背景下探索TCR和BCR序列。这篇综述强调了TCR和BCR库分析的关键方法和注意事项，重点是受体序列提取、数据处理和克隆型鉴定的技术方面。我们比较了批量测序和单细胞测序的使用，讨论了计算工具和管道，并评估了检查特定受体区域（如CDR3）的意义。通过整合免疫学、生物信息学和临床研究，免疫库分析为免疫功能、治疗反应和精准医学方法提供了有价值的见解，促进了我们对健康和疾病的理解。

引用次数: 0

Deciphering single-cell genomic architecture: insights into cellular heterogeneity and regulatory dynamics. 破译单细胞基因组结构：洞察细胞异质性和调控动力学。

Genomics & informatics

Pub Date : 2025-02-11 DOI: 10.1186/s44342-025-00037-4

Byunghee Kang, Hyeonji Lee, Tae-Young Roh

Background: The genomic architecture of eukaryotes exhibits dynamic spatial and temporal changes, enabling cellular processes critical for maintaining viability and functional diversity. Recent advances in sequencing technologies have facilitated the dissection of genomic architecture and functional activity at single-cell resolution, moving beyond the averaged signals typically derived from bulk cell analyses.

Main body: The advent of single-cell genomics and epigenomics has yielded transformative insights into cellular heterogeneity, behavior, and biological complexity with unparalleled genomic resolution and reproducibility. This review summarizes recent progress in the characterization of genomic architecture at the single-cell level, emphasizing the impact of structural variation and chromatin organization on gene regulatory networks and cellular identity.

Conclusion: Future directions in single-cell genomics and high-resolution epigenomic methodologies are explored, focusing on emerging challenges and potential impacts on the understanding of cellular states, regulatory dynamics, and the intricate mechanisms driving cellular function and diversity. Future perspectives on the challenges and potential implications of single-cell genomics, along with high-resolution genomic and epigenomic technologies for understanding cellular states and regulatory dynamics, are also discussed.

背景：真核生物的基因组结构表现出动态的空间和时间变化，使细胞过程对维持活力和功能多样性至关重要。测序技术的最新进展促进了在单细胞分辨率上对基因组结构和功能活动的解剖，超越了通常来自大细胞分析的平均信号。正文：单细胞基因组学和表观基因组学的出现，对细胞异质性、行为和生物复杂性产生了革命性的见解，具有无与伦比的基因组分辨率和可重复性。本文综述了单细胞水平基因组结构表征的最新进展，强调了结构变异和染色质组织对基因调控网络和细胞身份的影响。结论：探讨了单细胞基因组学和高分辨率表观基因组学方法的未来发展方向，重点关注对细胞状态、调控动力学以及驱动细胞功能和多样性的复杂机制的理解的新挑战和潜在影响。还讨论了单细胞基因组学的挑战和潜在影响的未来前景，以及用于理解细胞状态和调节动力学的高分辨率基因组学和表观基因组学技术。

{"title":"Deciphering single-cell genomic architecture: insights into cellular heterogeneity and regulatory dynamics.","authors":"Byunghee Kang, Hyeonji Lee, Tae-Young Roh","doi":"10.1186/s44342-025-00037-4","DOIUrl":"10.1186/s44342-025-00037-4","url":null,"abstract":"Background: The genomic architecture of eukaryotes exhibits dynamic spatial and temporal changes, enabling cellular processes critical for maintaining viability and functional diversity. Recent advances in sequencing technologies have facilitated the dissection of genomic architecture and functional activity at single-cell resolution, moving beyond the averaged signals typically derived from bulk cell analyses.Main body: The advent of single-cell genomics and epigenomics has yielded transformative insights into cellular heterogeneity, behavior, and biological complexity with unparalleled genomic resolution and reproducibility. This review summarizes recent progress in the characterization of genomic architecture at the single-cell level, emphasizing the impact of structural variation and chromatin organization on gene regulatory networks and cellular identity.Conclusion: Future directions in single-cell genomics and high-resolution epigenomic methodologies are explored, focusing on emerging challenges and potential impacts on the understanding of cellular states, regulatory dynamics, and the intricate mechanisms driving cellular function and diversity. Future perspectives on the challenges and potential implications of single-cell genomics, along with high-resolution genomic and epigenomic technologies for understanding cellular states and regulatory dynamics, are also discussed.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11817879/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143401027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Epigenetic regulation of angiogenesis and its therapeutics. 血管生成的表观遗传调控及其治疗。

Genomics & informatics

Pub Date : 2025-02-11 DOI: 10.1186/s44342-025-00038-3

Dong Kyu Choi

Angiogenesis, the formation of new blood vessels from preexisting ones, is essential for normal development, wound healing, and tissue repair. However, dysregulated angiogenesis is implicated in various pathological conditions, including cancer, diabetic retinopathy, and atherosclerosis. Epigenetic modifications, including DNA methylation, histone modification, and noncoding RNAs (e.g., miRNAs), play a crucial role in regulating angiogenic gene expression without altering the underlying DNA sequence. These modifications tightly regulate the balance between pro-angiogenic and anti-angiogenic factors, thereby influencing endothelial cell proliferation, migration, and tube formation. In recent years, epigenetic drugs, such as DNA methyltransferase inhibitors (e.g., azacitidine, decitabine), histone deacetylase inhibitors (e.g., vorinostat, romidepsin), and BET inhibitors (e.g., JQ1), have emerged as promising therapeutic strategies for targeting abnormal angiogenesis. These agents modulate gene expression patterns, reactivating silenced tumor suppressor genes while downregulating pro-angiogenic signaling pathways. Additionally, miRNA modulators, such as MRG-110 and MRG-201, provide precise regulation of angiogenesis-related pathways, demonstrating significant therapeutic potential in preclinical models. This review underscores the intricate interplay between epigenetic regulation and angiogenesis, highlighting key mechanisms and therapeutic applications. Advancing our understanding of these processes will enable the development of more effective and targeted epigenetic therapies for angiogenesis-related diseases, paving the way for innovative clinical interventions.

血管生成，即从原有血管中生成新血管，对正常发育、伤口愈合和组织修复至关重要。然而，血管生成失调与多种病理状况有关，包括癌症、糖尿病视网膜病变和动脉粥样硬化。表观遗传修饰，包括DNA甲基化、组蛋白修饰和非编码rna（如miRNAs），在不改变潜在DNA序列的情况下，在调节血管生成基因表达中起着至关重要的作用。这些修饰严格调节促血管生成和抗血管生成因子之间的平衡，从而影响内皮细胞的增殖、迁移和管的形成。近年来，表观遗传药物，如DNA甲基转移酶抑制剂（如阿扎胞苷、地西他滨）、组蛋白去乙酰化酶抑制剂（如vorinostat、roidepsin）和BET抑制剂（如JQ1），已成为针对异常血管生成的有希望的治疗策略。这些药物调节基因表达模式，重新激活沉默的肿瘤抑制基因，同时下调促血管生成信号通路。此外，miRNA调节剂，如MRG-110和MRG-201，提供血管生成相关途径的精确调控，在临床前模型中显示出显著的治疗潜力。这篇综述强调了表观遗传调控和血管生成之间复杂的相互作用，强调了关键机制和治疗应用。推进我们对这些过程的理解将有助于开发更有效和更有针对性的表观遗传治疗血管生成相关疾病，为创新的临床干预铺平道路。

{"title":"Epigenetic regulation of angiogenesis and its therapeutics.","authors":"Dong Kyu Choi","doi":"10.1186/s44342-025-00038-3","DOIUrl":"10.1186/s44342-025-00038-3","url":null,"abstract":"Angiogenesis, the formation of new blood vessels from preexisting ones, is essential for normal development, wound healing, and tissue repair. However, dysregulated angiogenesis is implicated in various pathological conditions, including cancer, diabetic retinopathy, and atherosclerosis. Epigenetic modifications, including DNA methylation, histone modification, and noncoding RNAs (e.g., miRNAs), play a crucial role in regulating angiogenic gene expression without altering the underlying DNA sequence. These modifications tightly regulate the balance between pro-angiogenic and anti-angiogenic factors, thereby influencing endothelial cell proliferation, migration, and tube formation. In recent years, epigenetic drugs, such as DNA methyltransferase inhibitors (e.g., azacitidine, decitabine), histone deacetylase inhibitors (e.g., vorinostat, romidepsin), and BET inhibitors (e.g., JQ1), have emerged as promising therapeutic strategies for targeting abnormal angiogenesis. These agents modulate gene expression patterns, reactivating silenced tumor suppressor genes while downregulating pro-angiogenic signaling pathways. Additionally, miRNA modulators, such as MRG-110 and MRG-201, provide precise regulation of angiogenesis-related pathways, demonstrating significant therapeutic potential in preclinical models. This review underscores the intricate interplay between epigenetic regulation and angiogenesis, highlighting key mechanisms and therapeutic applications. Advancing our understanding of these processes will enable the development of more effective and targeted epigenetic therapies for angiogenesis-related diseases, paving the way for innovative clinical interventions.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"23 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11817428/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143401032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparative analysis of generative LLMs for labeling entities in clinical notes. 临床记录中标记实体的生成式llm的比较分析。

Genomics & informatics

Pub Date : 2025-02-06 DOI: 10.1186/s44342-024-00036-x

Rodrigo Del Moral-González, Helena Gómez-Adorno, Orlando Ramos-Flores

This paper evaluates and compares different fine-tuned variations of generative large language models (LLM) in the zero-shot named entity recognition (NER) task for the clinical domain. As part of the 8th Biomedical Linked Annotation Hackathon, we examined Llama 2 and Mistral models, including base versions and those that have been fine-tuned for code, chat, and instruction-following tasks. We assess both the number of correctly identified entities and the models' ability to retrieve entities in structured formats. We used a publicly available set of clinical cases labeled with mentions of diseases, symptoms, and medical procedures for the evaluation. Results show that instruction fine-tuned models perform better than chat fine-tuned and base models in recognizing entities. It is also shown that models perform better when simple output structures are requested.

本文评估和比较了临床领域零射击命名实体识别（NER）任务中生成大语言模型（LLM）的不同微调变化。作为第八届生物医学链接注释黑客马拉松的一部分，我们检查了Llama 2和Mistral模型，包括基本版本和那些为代码、聊天和指令遵循任务进行了微调的模型。我们评估了正确识别实体的数量和模型以结构化格式检索实体的能力。我们使用了一组公开的临床病例，标记了疾病、症状和医疗程序进行评估。结果表明，指令微调模型在实体识别方面优于聊天微调模型和基本模型。当要求简单的输出结构时，模型的性能更好。

引用次数: 0