首页 > 最新文献

HGG Advances最新文献

英文 中文
Large-scale blood pressure GWAS accounting for gene-depression interactions in 564,680 individuals from diverse populations. 来自不同人群的564,680个个体的血压与基因抑郁症状相互作用的大规模全基因组关联研究
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2026-01-10 DOI: 10.1016/j.xhgg.2026.100566
Songmi Lee, Clint L Miller, Amy R Bentley, Michael R Brown, Pavithra Nagarajan, Raymond Noordam, John L Morrison, Karen Schwander, Kenneth Westerman, Minjung Kho, Aldi T Kraja, Paul S de Vries, Farah Ammous, Hughes Aschard, Traci M Bartz, Anh Do, Charles T Dupont, Mary F Feitosa, Valborg Gudmundsdottir, Xiuqing Guo, Sarah E Harris, Keiko Hikino, Zhijie Huang, Christophe Lefevre, Leo-Pekka Lyytikäinen, Yuri Milaneschi, Giuseppe Giovanni Nardone, Aurora Santin, Helena Schmidt, Botong Shen, Tamar Sofer, Quan Sun, Ye An Tan, Jingxian Tang, Sébastien Thériault, Peter J van der Most, Erin B Ware, Stefan Weiss, Wang Ya Xing, Chenglong Yu, Wei Zhao, Md Abu Yusuf Ansari, Pramod Anugu, John R Attia, Lydia A Bazzano, Joshua C Bis, Max Breyer, Brian Cade, Guanjie Chen, Stacey Collins, Janie Corley, Gail Davies, Marcus Dörr, Jiawen Du, Todd L Edwards, Tariq Faquih, Jessica D Faul, Alison E Fohner, Amanda M Fretts, Srushti Gangireddy, Adam Gepner, MariaElisa Graff, Edith Hofer, Georg Homuth, Michelle M Hood, Xu Jie, Mika Kähönen, Sharon L R Kardia, Carrie A Karvonen-Gutierrez, Lenore J Launer, Daniel Levy, Maitreiyi Maheshwari, Lisa W Martin, Koichi Matsuda, John J McNeil, Ilja M Nolte, Tomo Okochi, Laura M Raffield, Olli T Raitakari, Lorenz Risch, Martin Risch, Ana Diez Roux, Edward A Ruiz-Narvaez, Tom C Russ, Takeo Saito, Pamela J Schreiner, Rodney J Scott, James Shikany, Jennifer A Smith, Harold Snieder, Beatrice Spedicati, E Shyong Tai, Adele M Taylor, Kent D Taylor, Paola Tesolin, Rob M van Dam, Rujia Wang, Wei Wenbin, Tian Xie, Jie Yao, Kristin L Young, Ruiyuan Zhang, Alan B Zonderman, Maria Pina Concas, David Conen, Simon R Cox, Michele K Evans, Ervin R Fox, Lisa de Las Fuentes, Ayush Giri, Giorgia Girotto, Hans J Grabe, Charles Gu, Vilmundur Gudnason, Sioban D Harlow, Elizabeth Holliday, Jonas B Jost, Paul Lacaze, Seunggeun Lee, Terho Lehtimäki, Changwei Li, Ching-Ti Liu, Alanna C Morrison, Kari E North, Brenda W J H Penninx, Patricia A Peyser, Michael M Province, Bruce M Psaty, Susan Redline, Frits R Rosendaal, Charles N Rotimi, Jerome I Rotter, Reinhold Schmidt, Xueling Sim, Chikashi Terao, David R Weir, Xiaofeng Zhu, Nora Franceschini, Jeffrey R O'Connell, Cashell E Jaquish, Heming Wang, Alisa Manning, Patricia B Munroe, Dabeeru C Rao, Han Chen, W James Gauderman, Laura J Bierut, Thomas W Winkler, Myriam Fornage

Gene-environment interactions may enhance our understanding of blood pressure (BP) biology. We conducted a meta-analysis of multi-population genome-wide association studies (GWASs) of BP traits accounting for gene-depressive symptomatology (DEPR) interactions. Our study included 564,680 adults from 67 cohorts and four population backgrounds: African (5%), Asian (7%), European (85%), and Hispanic (3%). We discovered seven previously unreported BP loci showing gene-DEPR interaction. These loci mapped to genes implicated in neurogenesis (TGFA and CASP3), lipid metabolism (ACSL1), neuronal apoptosis (CASP3), and synaptic activity (CNTN6 and DBI). We also showed evidence for gene-DEPR interaction at nine known BP loci, further suggesting links between mood disturbance and BP regulation. Of the 16 identified loci, 11 were derived from non-European populations. Post-GWAS analyses prioritized 36 genes, including genes involved in synaptic functions (DOCK4 and MAGI2) and neuronal signaling (CCK, UGDH, and SLC01A2). Integrative druggability analyses identified 11 druggable candidate gene targets linked to pathways involved in mood disorders as well as known anti-hypertensive drugs. Our findings emphasize the importance of considering gene-DEPR interactions on BP, particularly in non-European populations. Our prioritized genes and druggable targets highlight biological pathways connecting mood disorders and hypertension and suggest opportunities for BP drug repurposing and risk factor prevention, especially in individuals with DEPR.

基因与环境的相互作用可能增强我们对血压生物学的理解。我们对BP性状与基因抑郁症状(DEPR)相互作用的多种群全基因组关联研究进行了荟萃分析。我们的研究包括来自67个队列和4种人口背景(非洲人(5%)、亚洲人(7%)、欧洲人(85%)和西班牙人(3%)的564,680名成年人。我们发现了7个以前未报道的BP基因座,显示基因与depr相互作用。这些基因座与神经发生(TGFA, CASP3)、脂质代谢(ACSL1)、神经元凋亡(CASP3)和突触活性(CNTN6, DBI)有关。我们还发现了在9个已知的血压位点上基因- depr相互作用的证据,进一步表明情绪障碍和血压调节之间存在联系。在鉴定的16个基因座中,有11个基因座来自非欧洲人群。Post-GWAS分析对36个基因进行了优先排序,包括与突触功能相关的基因(DOCK4、MAGI2)和神经元信号传导相关的基因(CCK、UGDH、SLC01A2)。综合药物分析确定了11个与情绪障碍和已知抗高血压药物相关的可药物候选基因靶点。我们的研究结果强调了考虑基因- depr相互作用对BP的重要性,特别是在非欧洲人群中。我们的优先基因和药物靶点突出了连接情绪障碍和高血压的生物学途径,并为BP药物再利用和风险因素预防提供了机会,特别是在DEPR患者中。
{"title":"Large-scale blood pressure GWAS accounting for gene-depression interactions in 564,680 individuals from diverse populations.","authors":"Songmi Lee, Clint L Miller, Amy R Bentley, Michael R Brown, Pavithra Nagarajan, Raymond Noordam, John L Morrison, Karen Schwander, Kenneth Westerman, Minjung Kho, Aldi T Kraja, Paul S de Vries, Farah Ammous, Hughes Aschard, Traci M Bartz, Anh Do, Charles T Dupont, Mary F Feitosa, Valborg Gudmundsdottir, Xiuqing Guo, Sarah E Harris, Keiko Hikino, Zhijie Huang, Christophe Lefevre, Leo-Pekka Lyytikäinen, Yuri Milaneschi, Giuseppe Giovanni Nardone, Aurora Santin, Helena Schmidt, Botong Shen, Tamar Sofer, Quan Sun, Ye An Tan, Jingxian Tang, Sébastien Thériault, Peter J van der Most, Erin B Ware, Stefan Weiss, Wang Ya Xing, Chenglong Yu, Wei Zhao, Md Abu Yusuf Ansari, Pramod Anugu, John R Attia, Lydia A Bazzano, Joshua C Bis, Max Breyer, Brian Cade, Guanjie Chen, Stacey Collins, Janie Corley, Gail Davies, Marcus Dörr, Jiawen Du, Todd L Edwards, Tariq Faquih, Jessica D Faul, Alison E Fohner, Amanda M Fretts, Srushti Gangireddy, Adam Gepner, MariaElisa Graff, Edith Hofer, Georg Homuth, Michelle M Hood, Xu Jie, Mika Kähönen, Sharon L R Kardia, Carrie A Karvonen-Gutierrez, Lenore J Launer, Daniel Levy, Maitreiyi Maheshwari, Lisa W Martin, Koichi Matsuda, John J McNeil, Ilja M Nolte, Tomo Okochi, Laura M Raffield, Olli T Raitakari, Lorenz Risch, Martin Risch, Ana Diez Roux, Edward A Ruiz-Narvaez, Tom C Russ, Takeo Saito, Pamela J Schreiner, Rodney J Scott, James Shikany, Jennifer A Smith, Harold Snieder, Beatrice Spedicati, E Shyong Tai, Adele M Taylor, Kent D Taylor, Paola Tesolin, Rob M van Dam, Rujia Wang, Wei Wenbin, Tian Xie, Jie Yao, Kristin L Young, Ruiyuan Zhang, Alan B Zonderman, Maria Pina Concas, David Conen, Simon R Cox, Michele K Evans, Ervin R Fox, Lisa de Las Fuentes, Ayush Giri, Giorgia Girotto, Hans J Grabe, Charles Gu, Vilmundur Gudnason, Sioban D Harlow, Elizabeth Holliday, Jonas B Jost, Paul Lacaze, Seunggeun Lee, Terho Lehtimäki, Changwei Li, Ching-Ti Liu, Alanna C Morrison, Kari E North, Brenda W J H Penninx, Patricia A Peyser, Michael M Province, Bruce M Psaty, Susan Redline, Frits R Rosendaal, Charles N Rotimi, Jerome I Rotter, Reinhold Schmidt, Xueling Sim, Chikashi Terao, David R Weir, Xiaofeng Zhu, Nora Franceschini, Jeffrey R O'Connell, Cashell E Jaquish, Heming Wang, Alisa Manning, Patricia B Munroe, Dabeeru C Rao, Han Chen, W James Gauderman, Laura J Bierut, Thomas W Winkler, Myriam Fornage","doi":"10.1016/j.xhgg.2026.100566","DOIUrl":"10.1016/j.xhgg.2026.100566","url":null,"abstract":"<p><p>Gene-environment interactions may enhance our understanding of blood pressure (BP) biology. We conducted a meta-analysis of multi-population genome-wide association studies (GWASs) of BP traits accounting for gene-depressive symptomatology (DEPR) interactions. Our study included 564,680 adults from 67 cohorts and four population backgrounds: African (5%), Asian (7%), European (85%), and Hispanic (3%). We discovered seven previously unreported BP loci showing gene-DEPR interaction. These loci mapped to genes implicated in neurogenesis (TGFA and CASP3), lipid metabolism (ACSL1), neuronal apoptosis (CASP3), and synaptic activity (CNTN6 and DBI). We also showed evidence for gene-DEPR interaction at nine known BP loci, further suggesting links between mood disturbance and BP regulation. Of the 16 identified loci, 11 were derived from non-European populations. Post-GWAS analyses prioritized 36 genes, including genes involved in synaptic functions (DOCK4 and MAGI2) and neuronal signaling (CCK, UGDH, and SLC01A2). Integrative druggability analyses identified 11 druggable candidate gene targets linked to pathways involved in mood disorders as well as known anti-hypertensive drugs. Our findings emphasize the importance of considering gene-DEPR interactions on BP, particularly in non-European populations. Our prioritized genes and druggable targets highlight biological pathways connecting mood disorders and hypertension and suggest opportunities for BP drug repurposing and risk factor prevention, especially in individuals with DEPR.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100566"},"PeriodicalIF":3.6,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12874302/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145949296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EEFSEC deficiency underlies a human selenopathy with primary neurodevelopmental origins via midbrain-hindbrain hypoplasia. EEFSEC缺乏是由中脑-后脑发育不全引起的原发性神经发育起源的人硒病的基础。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2026-01-09 DOI: 10.1016/j.xhgg.2026.100563
Zhiyi Xia, Hui Liu, Pengbo Guo, Chongfen Chen, Lili Ge, Longfei Tang, Yaodong Zhang, Yanli Ma

Bi-allelic mutations in EEFSEC, a key factor in selenoprotein synthesis, cause a severe human selenopathy characterized by developmental delay, spasticity, and profound cerebellar atrophy. While previous studies in invertebrate models framed this condition as an early-onset neurodegenerative disorder, the contribution of primary developmental defects to the severe brain malformations in patients has remained a critical unanswered question. Here, we address this gap using a zebrafish model of EEFSEC deficiency. We discovered that loss of eefsec function does not impair global somatic growth but instead causes specific and significant hypoplasia of the midbrain and hindbrain-the embryonic precursors to the human cerebellum and brain stem. These structural defects directly correlate with robust behavioral impairments, including diminished locomotion and blunted escape responses, mirroring the severe motor dysfunction in patients. Critically, our findings provide the in vivo evidence from a vertebrate model that this disorder involves a primary neurodevelopmental defect, which underlies the severe brain malformations and creates a structurally vulnerable nervous system. This establishes a developmental basis for understanding this condition. We propose that this initial failure in brain construction, which we term a developmental selenopathy, creates a structurally vulnerable nervous system, providing a plausible mechanistic explanation for the human phenotype and proposing a framework for understanding this devastating condition.

EEFSEC是硒蛋白合成的关键因子,其双等位基因突变可导致严重的人硒病,其特征是发育迟缓、痉挛和严重的小脑萎缩。虽然以前在无脊椎动物模型中的研究将这种情况定义为早发性神经退行性疾病,但原发性发育缺陷对患者严重脑畸形的贡献仍然是一个关键的未解问题。在这里,我们使用eesec缺陷的斑马鱼模型来解决这一差距。我们发现,eefsec功能的丧失并不会损害整体的体细胞生长,而是导致中脑和后脑(人类小脑和脑干的胚胎前体)特异性和显著的发育不全。这些结构缺陷与严重的行为障碍直接相关,包括运动能力下降和逃避反应迟钝,反映了患者严重的运动功能障碍。重要的是,我们的研究结果提供了脊椎动物模型的体内证据,表明这种疾病涉及原发性神经发育缺陷,这是严重脑畸形的基础,并造成结构上脆弱的神经系统。这为理解这种情况建立了一个发展基础。我们提出,这种大脑构造的初始失败,我们称之为“发育性硒病”,造成了一个结构上脆弱的神经系统,为人类表型提供了一个合理的机制解释,并提出了一个理解这种毁灭性疾病的框架。
{"title":"EEFSEC deficiency underlies a human selenopathy with primary neurodevelopmental origins via midbrain-hindbrain hypoplasia.","authors":"Zhiyi Xia, Hui Liu, Pengbo Guo, Chongfen Chen, Lili Ge, Longfei Tang, Yaodong Zhang, Yanli Ma","doi":"10.1016/j.xhgg.2026.100563","DOIUrl":"10.1016/j.xhgg.2026.100563","url":null,"abstract":"<p><p>Bi-allelic mutations in EEFSEC, a key factor in selenoprotein synthesis, cause a severe human selenopathy characterized by developmental delay, spasticity, and profound cerebellar atrophy. While previous studies in invertebrate models framed this condition as an early-onset neurodegenerative disorder, the contribution of primary developmental defects to the severe brain malformations in patients has remained a critical unanswered question. Here, we address this gap using a zebrafish model of EEFSEC deficiency. We discovered that loss of eefsec function does not impair global somatic growth but instead causes specific and significant hypoplasia of the midbrain and hindbrain-the embryonic precursors to the human cerebellum and brain stem. These structural defects directly correlate with robust behavioral impairments, including diminished locomotion and blunted escape responses, mirroring the severe motor dysfunction in patients. Critically, our findings provide the in vivo evidence from a vertebrate model that this disorder involves a primary neurodevelopmental defect, which underlies the severe brain malformations and creates a structurally vulnerable nervous system. This establishes a developmental basis for understanding this condition. We propose that this initial failure in brain construction, which we term a developmental selenopathy, creates a structurally vulnerable nervous system, providing a plausible mechanistic explanation for the human phenotype and proposing a framework for understanding this devastating condition.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100563"},"PeriodicalIF":3.6,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12861234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145946054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating genetic ancestry inference from single-cell transcriptomic datasets. 评估单细胞转录组数据集的遗传祖先推断。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2026-01-08 DOI: 10.1016/j.xhgg.2026.100564
Jianing Yao, Steven Gazal

Characterizing the ancestry of donors in single-cell transcriptomic studies is crucial to ensure genetic homogeneity, reduce biases in analyses, identify ancestry-specific regulatory mechanisms and their downstream roles in disease, and ensure that existing datasets are representative of human genetic diversity. While these datasets are now widely available, information on the ancestry of donors is often missing, hindering further analysis. Here, we propose a framework to evaluate methods for inferring genetic ancestry from genetic polymorphisms detected in single-cell sequencing reads. We demonstrate that widely used tools (e.g., ADMIXTURE) provide accurate inference of genetic ancestry and admixture proportions, despite the limited number of genetic polymorphisms identified and imperfect variant calling from sequencing reads. We infer genetic ancestry for 401 donors from 10 Human Cell Atlas datasets and report a high proportion of donors of European ancestry in this resource. For researchers generating single-cell transcriptomic datasets, we recommend reporting genetic ancestry inference for all donors and generating datasets that represent diverse ancestries.

在单细胞转录组学研究中描述供体祖先的特征对于确保遗传同质性、减少分析中的偏差、确定特定祖先的调节机制及其在疾病中的下游作用、确保现有数据集代表人类遗传多样性至关重要。虽然这些数据集现已广泛可用,但关于献血者祖先的信息往往缺失,阻碍了进一步的分析。在这里,我们提出了一个框架来评估从单细胞测序读取中检测到的遗传多态性推断遗传祖先的方法。我们证明了广泛使用的工具(例如admix)提供了遗传祖先和混合比例的准确推断,尽管从测序读取中发现的遗传多态性数量有限,变体调用也不完美。我们从10个人类细胞图谱数据集中推断出401个供体的遗传祖先,并在该资源中报告了高比例的欧洲血统供体。对于生成单细胞转录组数据集的研究人员,我们建议报告所有供体的遗传祖先推断,并生成代表不同祖先的数据集。
{"title":"Evaluating genetic ancestry inference from single-cell transcriptomic datasets.","authors":"Jianing Yao, Steven Gazal","doi":"10.1016/j.xhgg.2026.100564","DOIUrl":"10.1016/j.xhgg.2026.100564","url":null,"abstract":"<p><p>Characterizing the ancestry of donors in single-cell transcriptomic studies is crucial to ensure genetic homogeneity, reduce biases in analyses, identify ancestry-specific regulatory mechanisms and their downstream roles in disease, and ensure that existing datasets are representative of human genetic diversity. While these datasets are now widely available, information on the ancestry of donors is often missing, hindering further analysis. Here, we propose a framework to evaluate methods for inferring genetic ancestry from genetic polymorphisms detected in single-cell sequencing reads. We demonstrate that widely used tools (e.g., ADMIXTURE) provide accurate inference of genetic ancestry and admixture proportions, despite the limited number of genetic polymorphisms identified and imperfect variant calling from sequencing reads. We infer genetic ancestry for 401 donors from 10 Human Cell Atlas datasets and report a high proportion of donors of European ancestry in this resource. For researchers generating single-cell transcriptomic datasets, we recommend reporting genetic ancestry inference for all donors and generating datasets that represent diverse ancestries.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100564"},"PeriodicalIF":3.6,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12860357/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145935601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recessive AARS1 variants perturb human and mouse development. 隐性AARS1变异干扰人类和小鼠的发育。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2026-01-08 DOI: 10.1016/j.xhgg.2026.100565
Jennifer L Watts, Nicole Costantino, Ammar Husami, Thamara Dayarathna, Logan Willeke, Loren D M Peña, Elizabeth Seiwert, Donald L Gilbert, Rolf W Stottmann

Humans with pathogenic loss of function alanyl-tRNA synthetase 1 (AARS1) variants have a range of congenital brain phenotypes, including a high prevalence of microcephaly. The molecular mechanisms for this are unclear but zebrafish mutants in aars1 have reduced neurogenesis and increased apoptosis. Here, we report two individuals with compound heterozygous AARS1 variants. We created two mouse models to study the role of Aars1 in embryonic brain development. We provide evidence from these mouse models and in vitro splicing assays that both human AARS1 alleles are pathogenic. Mice homozygous for either a missense allele or an indel allele are both lethal very early in embryonic development. Aars1G80S/WT heterozygous animals show reduced Purkinje cell immunoreactivity at 8 months of age but no gross morphological cerebellar phenotypes or impaired performance in a motor coordination assay. We conclude these are pathogenic alleles in AARS1 but lethality in mice preclude a detailed study of neural development.

患有致病性功能丧失alanyl-tRNA合成酶1 (AARS1)变异的人具有一系列先天性脑表型,包括小头畸形的高患病率。其分子机制尚不清楚,但斑马鱼aar1突变体减少了神经发生并增加了细胞凋亡。在这里,我们报告了两个具有复合杂合AARS1变异的个体。我们建立了两个小鼠模型来研究Aars1在胚胎大脑发育中的作用。我们从这些小鼠模型和体外剪接实验中提供证据,证明人类AARS1等位基因都是致病的。错义等位基因或非义等位基因纯合的小鼠在胚胎发育的早期都是致命的。Aars1G80S/wt杂合动物在8个月大时表现出浦肯野细胞免疫反应性降低,但在运动协调试验中没有明显的小脑形态表型或表现受损。我们得出结论,这些等位基因在AARS1中是致病性的,但在小鼠中的致病性妨碍了对神经发育的详细研究。
{"title":"Recessive AARS1 variants perturb human and mouse development.","authors":"Jennifer L Watts, Nicole Costantino, Ammar Husami, Thamara Dayarathna, Logan Willeke, Loren D M Peña, Elizabeth Seiwert, Donald L Gilbert, Rolf W Stottmann","doi":"10.1016/j.xhgg.2026.100565","DOIUrl":"10.1016/j.xhgg.2026.100565","url":null,"abstract":"<p><p>Humans with pathogenic loss of function alanyl-tRNA synthetase 1 (AARS1) variants have a range of congenital brain phenotypes, including a high prevalence of microcephaly. The molecular mechanisms for this are unclear but zebrafish mutants in aars1 have reduced neurogenesis and increased apoptosis. Here, we report two individuals with compound heterozygous AARS1 variants. We created two mouse models to study the role of Aars1 in embryonic brain development. We provide evidence from these mouse models and in vitro splicing assays that both human AARS1 alleles are pathogenic. Mice homozygous for either a missense allele or an indel allele are both lethal very early in embryonic development. Aars1<sup>G80S/WT</sup> heterozygous animals show reduced Purkinje cell immunoreactivity at 8 months of age but no gross morphological cerebellar phenotypes or impaired performance in a motor coordination assay. We conclude these are pathogenic alleles in AARS1 but lethality in mice preclude a detailed study of neural development.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100565"},"PeriodicalIF":3.6,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12859473/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145935607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurate and scalable genome-wide ancestry estimation using haplotype clusters. 准确和可扩展的全基因组祖先估计使用单倍型集群。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2026-01-08 DOI: 10.1016/j.xhgg.2026.100561
Jonas Meisner

Unsupervised genome-wide ancestry estimation in unrelated individuals has been a staple in population and medical genetics for decades, and its importance continues to grow with the increasing number of large genetic cohorts of mixed ancestries. We propose an extension to the hapla framework that scales model-based ancestry estimation to unprecedented sample sizes by leveraging inferred haplotype clusters from phased genotype data. Our haplotype cluster-based approach is approximately 5× and 20× faster than the fastest model-free and model-based SNP-based approaches, respectively, for unsupervised genome-wide ancestry estimation on the harmonized Human Genome Diversity Project and 1000 Genomes Project dataset. Furthermore, we demonstrate that it is the most accurate method to date in an extensive simulation study, across a range of sample sizes from thousands to hundreds of thousands of individuals. Our accurate ancestry estimates can help reduce health disparities and accelerate precision medicine efforts in the growing number of biobanks globally.

几十年来,非亲属个体的无监督全基因组祖先估计一直是人口和医学遗传学的主要内容,随着混合祖先的大型遗传队列数量的增加,其重要性继续增长。我们提出了对hapla框架的扩展,通过利用从分阶段基因型数据推断的单倍型集群,将基于模型的祖先估计扩展到前所未有的样本量。在协调人类基因组多样性计划和1000基因组计划数据集上,我们基于单倍型聚类的方法分别比最快的无模型和基于模型的snp方法快5倍和20倍。此外,我们证明,在广泛的模拟研究中,这是迄今为止最准确的方法,样本量从数千到数十万不等。我们准确的祖先估计可以帮助减少健康差距,并加速全球越来越多的生物银行的精准医疗工作。
{"title":"Accurate and scalable genome-wide ancestry estimation using haplotype clusters.","authors":"Jonas Meisner","doi":"10.1016/j.xhgg.2026.100561","DOIUrl":"10.1016/j.xhgg.2026.100561","url":null,"abstract":"<p><p>Unsupervised genome-wide ancestry estimation in unrelated individuals has been a staple in population and medical genetics for decades, and its importance continues to grow with the increasing number of large genetic cohorts of mixed ancestries. We propose an extension to the hapla framework that scales model-based ancestry estimation to unprecedented sample sizes by leveraging inferred haplotype clusters from phased genotype data. Our haplotype cluster-based approach is approximately 5× and 20× faster than the fastest model-free and model-based SNP-based approaches, respectively, for unsupervised genome-wide ancestry estimation on the harmonized Human Genome Diversity Project and 1000 Genomes Project dataset. Furthermore, we demonstrate that it is the most accurate method to date in an extensive simulation study, across a range of sample sizes from thousands to hundreds of thousands of individuals. Our accurate ancestry estimates can help reduce health disparities and accelerate precision medicine efforts in the growing number of biobanks globally.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100561"},"PeriodicalIF":3.6,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12859494/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145946042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transcriptome-wide association study of cardiovascular outcomes in chronic kidney disease: The chronic renal insufficiency cohort. 慢性肾脏疾病心血管结局的转录组关联研究:慢性肾功能不全队列
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2026-01-08 DOI: 10.1016/j.xhgg.2026.100562
Bridget M Lin, Jia Wen, Andrea R V R Horimoto, Lingbo Zhou, Shuai Huang, Mirela Dobre, Alan S Go, Yun Li, Nora Franceschini

Transcriptome-wide association studies (TWASs) are powerful for identifying gene-trait associations by integrating gene expression and genome-wide association data, but findings may be impacted by the choice of gene expression reference. We performed TWAS of cardiovascular outcomes using multi-tissue and ancestry-matched gene expression references. We used data from the Chronic Renal Insufficiency Cohort Study for participants of African (AFR, n = 1,512) and European (EUR, n = 2,067) ancestry and three outcomes: all-cause stroke, coronary heart disease, and heart failure. We performed TWASs using EUR and AFR predicted gene expression reference panels and multi-tissue TWAS by integrating gene expression from 10 GTEx selected tissues. TWAS identified KDELR2 associated with heart failure in AFR participants using matched AFR reference panel (p = 4.7 × 10-6), although findings were near significant using the EUR mismatched reference panel (p = 5.6 × 10-6). PSMC1 was associated with coronary heart disease in TWAS of CRIC EUR using AFR reference panel, and this gene was not present in the EUR-trained gene expression model. Multi-tissue TWASs identified KHDRBS2 significantly associated with all-cause stroke in CRIC AFR participants (p = 4.0 × 10-6). Variants near KDELR2 have been associated with coronary artery disease, which is a main cause of heart failure, while KHDRBS2 has been associated with cardiovascular risk factors in genome-wide association studies. Our findings highlight differences in gene discovery for TWAS of cardiovascular disease applied to high-risk participants based on participant ancestry and gene expression reference panels, and gains to identify genes compared with traditional genome-wide association approaches.

转录组全关联研究(Transcriptome-wide association studies, TWAS)通过整合基因表达和全基因组关联数据来识别基因-性状关联,但结果可能受到基因表达参考选择的影响。我们使用多组织和祖先匹配的基因表达参考对心血管结果进行了TWAS。我们使用了来自慢性肾功能不全队列研究的数据,包括非洲(AFR, n = 1512)和欧洲(EUR, n = 2067)血统的参与者,以及三种结局:全因中风、冠心病和心力衰竭。我们使用EUR和AFR预测基因表达参考面板进行TWAS,并通过整合来自10个GTEx选择组织的基因表达进行多组织TWAS。TWAS使用匹配的AFR参考组确定了KDELR2与AFR参与者的心力衰竭相关(p = 4.7 x 10-6),尽管使用EUR不匹配参考组的发现接近显著(p = 5.6 x 10-6)。使用AFR参考面板,在CRIC - EUR的TWAS中,PSMC1与冠心病相关,而该基因在EUR训练的基因表达模型中不存在。多组织TWAS鉴定出KHDRBS2与CRIC AFR参与者的全因卒中显著相关(p=4.0 x 10-6)。KDELR2附近的变异与冠状动脉疾病相关,这是心力衰竭的主要原因,而KHDRBS2在全基因组关联研究中与心血管危险因素相关。我们的研究结果强调了基于参与者祖先和基因表达参考面板的高危参与者在心血管疾病TWAS基因发现方面的差异,以及与传统全基因组关联方法相比,在识别基因方面的收获。
{"title":"Transcriptome-wide association study of cardiovascular outcomes in chronic kidney disease: The chronic renal insufficiency cohort.","authors":"Bridget M Lin, Jia Wen, Andrea R V R Horimoto, Lingbo Zhou, Shuai Huang, Mirela Dobre, Alan S Go, Yun Li, Nora Franceschini","doi":"10.1016/j.xhgg.2026.100562","DOIUrl":"10.1016/j.xhgg.2026.100562","url":null,"abstract":"<p><p>Transcriptome-wide association studies (TWASs) are powerful for identifying gene-trait associations by integrating gene expression and genome-wide association data, but findings may be impacted by the choice of gene expression reference. We performed TWAS of cardiovascular outcomes using multi-tissue and ancestry-matched gene expression references. We used data from the Chronic Renal Insufficiency Cohort Study for participants of African (AFR, n = 1,512) and European (EUR, n = 2,067) ancestry and three outcomes: all-cause stroke, coronary heart disease, and heart failure. We performed TWASs using EUR and AFR predicted gene expression reference panels and multi-tissue TWAS by integrating gene expression from 10 GTEx selected tissues. TWAS identified KDELR2 associated with heart failure in AFR participants using matched AFR reference panel (p = 4.7 × 10<sup>-6</sup>), although findings were near significant using the EUR mismatched reference panel (p = 5.6 × 10<sup>-6</sup>). PSMC1 was associated with coronary heart disease in TWAS of CRIC EUR using AFR reference panel, and this gene was not present in the EUR-trained gene expression model. Multi-tissue TWASs identified KHDRBS2 significantly associated with all-cause stroke in CRIC AFR participants (p = 4.0 × 10<sup>-6</sup>). Variants near KDELR2 have been associated with coronary artery disease, which is a main cause of heart failure, while KHDRBS2 has been associated with cardiovascular risk factors in genome-wide association studies. Our findings highlight differences in gene discovery for TWAS of cardiovascular disease applied to high-risk participants based on participant ancestry and gene expression reference panels, and gains to identify genes compared with traditional genome-wide association approaches.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100562"},"PeriodicalIF":3.6,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12857338/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145935579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterozygous KRT32 variant is responsible for autosomal dominant loose anagen hair syndrome. 杂合子KRT32变异是常染色体显性脱发综合征的原因。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2025-10-09 Epub Date: 2025-08-14 DOI: 10.1016/j.xhgg.2025.100495
Marcelo Melo, Elizabeth Phillippi, Thomas Moninger, Lisa J Stille, Kya Foxx, Benjamin Darbro, Kelly N Messingham, Edward A Sander, Hatem El-Shanti

Loose anagen hair syndrome is a form of childhood-onset non-scarring alopecia marked by easily and painlessly plucking terminal hair during its active growth, or anagen, phase. It is believed to result from poor hair shaft anchoring within the follicle due to premature keratinization. Our study identified a plausibly pathogenic variant in KRT32 (c.296C>T; p.Thr99Ile) that co-segregates with the phenotype in a large family. This study aimed to explore the role of KRT32, previously unassociated with loose anagen hair, in hair anchorage and assess the functional impact of its p.Thr99Ile variant. We hypothesized that the p.Thr99Ile variant reduces the binding affinity of KRT32 to KRT82, disrupting the intermediate filament structure in the hair shaft cuticle and leading to weak anagen hair anchorage. To test this hypothesis, we conducted a protein-protein interaction assay using far-western blotting and performed in silico intermediate filament network segmentation analysis on high-resolution fluorescent microscopy images. Our results showed a decreased binding affinity of KRT32Thr99Ile to KRT82 when compared to KRT32WT. There were significant differences in segment count and filament thickness, as measured by brightness, between the KRT32Thr99Ile and the KRT32WT. We conclude that the c.296C>T variant of KRT32 is associated with loose anagen hair phenotype.

毛发疏松综合征是一种儿童期发病的非瘢痕性脱发,其特征是在毛发活跃生长-毛发生长阶段容易无痛地拔毛,据信是由于过早角质化导致毛干在毛囊内固定不良造成的。我们的研究确定了KRT32 (c.296C>T;p.Thr99Ile)在一个大家族中与表现型共分离。本研究旨在探索KRT32在头发锚定中的作用,并评估其p.s thr99ile变体的功能影响。我们假设p.s thr99ile变异降低了KRT32与KRT82的结合亲和力,破坏了毛干角质层的中间细丝结构,导致毛发的弱锚定。为了验证这一假设,我们使用远西印迹法进行了蛋白质相互作用分析,并对高分辨率荧光显微镜图像进行了硅中间丝网络分割分析。我们的研究结果表明,与KRT32WT相比,KRT32Thr99Ile与KRT82的结合亲和力降低。KRT32Thr99Ile和KRT32WT在灯节数和灯丝厚度(以亮度衡量)上存在显著差异。我们得出结论,KRT32的c.296C >t变异与疏松的毛发表型有关。
{"title":"Heterozygous KRT32 variant is responsible for autosomal dominant loose anagen hair syndrome.","authors":"Marcelo Melo, Elizabeth Phillippi, Thomas Moninger, Lisa J Stille, Kya Foxx, Benjamin Darbro, Kelly N Messingham, Edward A Sander, Hatem El-Shanti","doi":"10.1016/j.xhgg.2025.100495","DOIUrl":"10.1016/j.xhgg.2025.100495","url":null,"abstract":"<p><p>Loose anagen hair syndrome is a form of childhood-onset non-scarring alopecia marked by easily and painlessly plucking terminal hair during its active growth, or anagen, phase. It is believed to result from poor hair shaft anchoring within the follicle due to premature keratinization. Our study identified a plausibly pathogenic variant in KRT32 (c.296C>T; p.Thr99Ile) that co-segregates with the phenotype in a large family. This study aimed to explore the role of KRT32, previously unassociated with loose anagen hair, in hair anchorage and assess the functional impact of its p.Thr99Ile variant. We hypothesized that the p.Thr99Ile variant reduces the binding affinity of KRT32 to KRT82, disrupting the intermediate filament structure in the hair shaft cuticle and leading to weak anagen hair anchorage. To test this hypothesis, we conducted a protein-protein interaction assay using far-western blotting and performed in silico intermediate filament network segmentation analysis on high-resolution fluorescent microscopy images. Our results showed a decreased binding affinity of KRT32<sup>Thr99Ile</sup> to KRT82 when compared to KRT32<sup>WT</sup>. There were significant differences in segment count and filament thickness, as measured by brightness, between the KRT32<sup>Thr99Ile</sup> and the KRT32<sup>WT</sup>. We conclude that the c.296C>T variant of KRT32 is associated with loose anagen hair phenotype.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100495"},"PeriodicalIF":3.6,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12409965/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144856591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Early-onset multivalvular disease caused by a missense variant in lamin A/C. 由层粘连蛋白a /C错义变异引起的早发性多瓣疾病
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2025-10-09 Epub Date: 2025-08-08 DOI: 10.1016/j.xhgg.2025.100491
Alexandre Janin, Nathalie Gaudreault, Victoria Saavedra Armero, Zhonglin Li, Ran Xu, Dominique K Boudreau, Lily Frenette, Julien Ternacle, Danielle Tardif, Sébastien Thériault, Philippe Pibarot, Patrick Mathieu, Christian Steinberg, Yohan Bossé

Lamins A/C, coded by LMNA gene, are crucial for nuclear architecture preservation. Pathogenic LMNA variants cause a wide range of inherited diseases called "laminopathies". A subgroup is referred to "progeroid syndromes" characterized by premature aging and other manifestations including cardiac valve abnormalities. Atypical phenotypes, generally less severe, have also been reported. We report the case of a 26-year-old male with calcific tricuspid aortic and mitral valve diseases. His father was diagnosed with severe aortic valve stenosis and mitral annulus calcification at the age of 38. The goal of this study was to identify the putative variant causing this non-syndromic multivalvular disease. Known disease-causing variants in NOTCH1, FLNA, and DCHS1 were first excluded by Sanger sequencing. Whole-exome sequencing was then performed in five family members. A LMNA variant (p.Glu262Val) was identified with in silico evidences of pathogenicity (CADD [combined annotation dependent depletion] = 33). Cells transfected with the cDNA construct harboring p.Glu262Val were characterized by abnormal nuclear morphology. Along with a literature review, the variant was classified as likely pathogenic. Elucidating the mechanism by which LMNA p.Glu262Val specifically affects cardiac heart valves is likely to provide insight about the pathogenesis of Mendelian forms of valvular heart diseases and may help guide the development of therapies.

由LMNA基因编码的层粘胶蛋白A/C对核结构保存起着至关重要的作用。致病性LMNA变异引起广泛的遗传性疾病,称为“层板病”。一个亚组被称为“类早衰综合征”,其特征是过早衰老和其他表现,包括心脏瓣膜异常。非典型表型,通常不太严重,也有报道。我们报告一例26岁男性钙化三尖瓣主动脉和二尖瓣疾病。他的父亲在38岁时被诊断出患有严重的主动脉瓣狭窄和二尖瓣环钙化。本研究的目的是确定引起这种非综合征性多瓣疾病的推定变异。已知的NOTCH1、FLNA和DCHS1致病变异首先通过Sanger测序排除。然后对五个家庭成员进行全外显子组测序。一个LMNA变异(p.g ul262val)被鉴定出具有致病性的计算机证据(CADD=33)。携带p.g ul262val的cDNA构建体转染细胞后,细胞核形态发生异常。与文献综述一起,该变异被归类为可能致病。阐明LMNA p.g ul262val特异性影响心脏瓣膜的机制,可能有助于了解孟德尔型瓣膜性心脏病的发病机制,并有助于指导治疗方法的发展。
{"title":"Early-onset multivalvular disease caused by a missense variant in lamin A/C.","authors":"Alexandre Janin, Nathalie Gaudreault, Victoria Saavedra Armero, Zhonglin Li, Ran Xu, Dominique K Boudreau, Lily Frenette, Julien Ternacle, Danielle Tardif, Sébastien Thériault, Philippe Pibarot, Patrick Mathieu, Christian Steinberg, Yohan Bossé","doi":"10.1016/j.xhgg.2025.100491","DOIUrl":"10.1016/j.xhgg.2025.100491","url":null,"abstract":"<p><p>Lamins A/C, coded by LMNA gene, are crucial for nuclear architecture preservation. Pathogenic LMNA variants cause a wide range of inherited diseases called \"laminopathies\". A subgroup is referred to \"progeroid syndromes\" characterized by premature aging and other manifestations including cardiac valve abnormalities. Atypical phenotypes, generally less severe, have also been reported. We report the case of a 26-year-old male with calcific tricuspid aortic and mitral valve diseases. His father was diagnosed with severe aortic valve stenosis and mitral annulus calcification at the age of 38. The goal of this study was to identify the putative variant causing this non-syndromic multivalvular disease. Known disease-causing variants in NOTCH1, FLNA, and DCHS1 were first excluded by Sanger sequencing. Whole-exome sequencing was then performed in five family members. A LMNA variant (p.Glu262Val) was identified with in silico evidences of pathogenicity (CADD [combined annotation dependent depletion] = 33). Cells transfected with the cDNA construct harboring p.Glu262Val were characterized by abnormal nuclear morphology. Along with a literature review, the variant was classified as likely pathogenic. Elucidating the mechanism by which LMNA p.Glu262Val specifically affects cardiac heart valves is likely to provide insight about the pathogenesis of Mendelian forms of valvular heart diseases and may help guide the development of therapies.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100491"},"PeriodicalIF":3.6,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12398237/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144812528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variational autoencoder-based model improves polygenic prediction in blood cell traits. 基于变分自编码器的模型改进血细胞性状的多基因预测。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2025-10-09 Epub Date: 2025-08-08 DOI: 10.1016/j.xhgg.2025.100490
Xiaoqi Li, Elena Kharitonova, Minxing Pang, Jia Wen, Laura Y Zhou, Laura Raffield, Haibo Zhou, Huaxiu Yao, Can Chen, Yun Li, Quan Sun

Genetic prediction of complex traits, enabled by large-scale genomic studies, has created new measures to understand individual genetic predisposition. Polygenic risk scores (PRSs) offer a way to aggregate information across the genome, enabling personalized risk prediction for complex traits and diseases. However, conventional PRS calculation methods that rely on linear models are limited in their ability to capture complex patterns and interaction effects in high-dimensional genomic data. In this study, we seek to improve the predictive power of PRS through applying advanced deep learning techniques. We show that the variational autoencoder-based model for PRS construction (VAE-PRS) outperforms currently state-of-the-art methods for biobank-level data in 14 out of 16 blood cell traits, while being computationally efficient. Through comprehensive experiments, we found that the VAE-PRS model offers the ability to capture interaction effects in high-dimensional data and shows robust performance across different pre-screened variant sets. Furthermore, VAE-PRS is easily interpretable via assessing the contribution of each individual marker to the final prediction score through the Shapley additive explanations method, providing potential new insights in identifying trait-associated genetic variants. In summary, VAE-PRS presents a measure to genetic risk prediction for blood cell traits by harnessing the power of deep learning methods given appropriate training sample size, which could further facilitate the development of personalized medicine and genetic research.

大规模基因组研究使复杂性状的遗传预测成为可能,为了解个体遗传倾向创造了新的方法。多基因风险评分(PRS)提供了一种收集基因组信息的方法,使复杂性状和疾病的个性化风险预测成为可能。然而,传统的依赖线性模型的PRS计算方法在捕获高维基因组数据中的复杂模式和相互作用效应的能力方面受到限制。在本研究中,我们试图通过应用先进的深度学习技术来提高PRS的预测能力。我们表明,基于变分自动编码器的PRS构建模型(VAE-PRS)在16种血细胞特征中的14种中优于目前最先进的生物库级数据方法,同时具有计算效率。通过综合实验,我们发现VAE-PRS模型能够捕获高维数据中的交互效应,并在不同的预筛选变体集上显示出稳健的性能。此外,通过SHapley加性解释(SHAP)方法评估每个个体标记对最终预测分数的贡献,可以很容易地解释VAE-PRS,为识别性状相关的遗传变异提供了潜在的新见解。综上所述,VAE-PRS通过适当的训练样本量,利用深度学习方法的力量,为血细胞特征的遗传风险预测提供了一种措施,可以进一步促进个性化医疗和基因研究的发展。
{"title":"Variational autoencoder-based model improves polygenic prediction in blood cell traits.","authors":"Xiaoqi Li, Elena Kharitonova, Minxing Pang, Jia Wen, Laura Y Zhou, Laura Raffield, Haibo Zhou, Huaxiu Yao, Can Chen, Yun Li, Quan Sun","doi":"10.1016/j.xhgg.2025.100490","DOIUrl":"10.1016/j.xhgg.2025.100490","url":null,"abstract":"<p><p>Genetic prediction of complex traits, enabled by large-scale genomic studies, has created new measures to understand individual genetic predisposition. Polygenic risk scores (PRSs) offer a way to aggregate information across the genome, enabling personalized risk prediction for complex traits and diseases. However, conventional PRS calculation methods that rely on linear models are limited in their ability to capture complex patterns and interaction effects in high-dimensional genomic data. In this study, we seek to improve the predictive power of PRS through applying advanced deep learning techniques. We show that the variational autoencoder-based model for PRS construction (VAE-PRS) outperforms currently state-of-the-art methods for biobank-level data in 14 out of 16 blood cell traits, while being computationally efficient. Through comprehensive experiments, we found that the VAE-PRS model offers the ability to capture interaction effects in high-dimensional data and shows robust performance across different pre-screened variant sets. Furthermore, VAE-PRS is easily interpretable via assessing the contribution of each individual marker to the final prediction score through the Shapley additive explanations method, providing potential new insights in identifying trait-associated genetic variants. In summary, VAE-PRS presents a measure to genetic risk prediction for blood cell traits by harnessing the power of deep learning methods given appropriate training sample size, which could further facilitate the development of personalized medicine and genetic research.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100490"},"PeriodicalIF":3.6,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12398231/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144812538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging global genetics resources to enhance polygenic prediction across ancestrally diverse populations. 利用全球遗传资源加强祖先多样性人群的多基因预测。
IF 3.6 Q2 GENETICS & HEREDITY Pub Date : 2025-10-09 Epub Date: 2025-07-18 DOI: 10.1016/j.xhgg.2025.100482
Oliver Pain

Genome-wide association studies (GWASs) from multiple ancestral populations are increasingly available, offering opportunities to improve the accuracy and equity of polygenic scores (PGSs). Several methods now aim to leverage multiple GWAS sources, but predictive performance and computational efficiency remain unclear, particularly when individual-level tuning data are unavailable. This study evaluates a comprehensive set of PGS methods across African (AFR), East Asian (EAS), and European (EUR) ancestries for 10 complex traits, using summary statistics from the Ugandan Genome Resource, Biobank Japan, UK Biobank, and the Million Veteran Program. Single-source PGSs were derived using methods including DBSLMM, lassosum, LDpred2, MegaPRS, pT + clump, PRS-CS, QuickPRS, and SBayesRC. Multi-source approaches included PRS-CSx, TL-PRS, X-Wing, and combinations of independently optimized single-source scores. All methods were restricted to HapMap3 variants and used linkage disequilibrium reference panels matching the GWAS super population. A key contribution is a novel application of the LEOPARD method to estimate optimal linear combinations of population-specific PGSs using only summary statistics. Analyses were implemented using the open-source GenoPred pipeline. In AFR and EAS populations, PGS combining ancestry-aligned and European GWASs outperformed single-source models. Linear combinations of independently optimized scores consistently outperformed current jointly optimized multi-source methods, while being substantially more computationally efficient. The LEOPARD extension offered a practical solution for tuning these combinations when only summary statistics were available, achieving performance comparable to tuning with individual-level data. These findings highlight a flexible and generalizable framework for multi-source PGS construction. The GenoPred pipeline supports more equitable, accurate, and accessible polygenic prediction.

来自多个祖先群体的全基因组关联研究(GWAS)越来越多,为提高多基因评分(PGS)的准确性和公平性提供了机会。现在有几种方法旨在利用多个GWAS源,但是预测性能和计算效率仍然不清楚,特别是在无法获得个人级别调优数据的情况下。本研究利用来自乌干达基因组资源、日本生物银行、英国生物银行和百万退伍军人计划的汇总统计数据,对非洲(AFR)、东亚(EAS)和欧洲(EUR)祖先的10个复杂性状的综合PGS方法进行了评估。采用DBSLMM、lassosum、LDpred2、MegaPRS、pT+ cluster、PRS-CS、QuickPRS、SBayesRC等方法推导单源PGS。多源方法包括PRS-CSx、TL-PRS、X-Wing以及独立优化的单源评分组合。所有方法均局限于HapMap3变异,并使用与GWAS超级群体匹配的连锁不平衡参考面板。一个关键贡献是LEOPARD方法的新应用,该方法仅使用汇总统计来估计种群特定PGS的最佳线性组合。分析是使用开源GenoPred管道实现的。在AFR和EAS人群中,结合祖先对齐和欧洲GWAS的PGS优于单一来源模型。独立优化分数的线性组合始终优于当前联合优化的多源方法,同时具有更高的计算效率。LEOPARD扩展提供了一个实用的解决方案,可以在只有汇总统计数据可用时调优这些组合,从而实现与使用个人级别数据调优相当的性能。这些发现强调了多源PGS构建的灵活和通用框架。GenoPred管道支持更公平、更准确和更容易获得的多基因预测。
{"title":"Leveraging global genetics resources to enhance polygenic prediction across ancestrally diverse populations.","authors":"Oliver Pain","doi":"10.1016/j.xhgg.2025.100482","DOIUrl":"10.1016/j.xhgg.2025.100482","url":null,"abstract":"<p><p>Genome-wide association studies (GWASs) from multiple ancestral populations are increasingly available, offering opportunities to improve the accuracy and equity of polygenic scores (PGSs). Several methods now aim to leverage multiple GWAS sources, but predictive performance and computational efficiency remain unclear, particularly when individual-level tuning data are unavailable. This study evaluates a comprehensive set of PGS methods across African (AFR), East Asian (EAS), and European (EUR) ancestries for 10 complex traits, using summary statistics from the Ugandan Genome Resource, Biobank Japan, UK Biobank, and the Million Veteran Program. Single-source PGSs were derived using methods including DBSLMM, lassosum, LDpred2, MegaPRS, pT + clump, PRS-CS, QuickPRS, and SBayesRC. Multi-source approaches included PRS-CSx, TL-PRS, X-Wing, and combinations of independently optimized single-source scores. All methods were restricted to HapMap3 variants and used linkage disequilibrium reference panels matching the GWAS super population. A key contribution is a novel application of the LEOPARD method to estimate optimal linear combinations of population-specific PGSs using only summary statistics. Analyses were implemented using the open-source GenoPred pipeline. In AFR and EAS populations, PGS combining ancestry-aligned and European GWASs outperformed single-source models. Linear combinations of independently optimized scores consistently outperformed current jointly optimized multi-source methods, while being substantially more computationally efficient. The LEOPARD extension offered a practical solution for tuning these combinations when only summary statistics were available, achieving performance comparable to tuning with individual-level data. These findings highlight a flexible and generalizable framework for multi-source PGS construction. The GenoPred pipeline supports more equitable, accurate, and accessible polygenic prediction.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100482"},"PeriodicalIF":3.6,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12536657/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144668690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
HGG Advances
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1