PSAP-Genomic-Regions: A Method Leveraging Population Data to Prioritize Coding and Non-Coding Variants in Whole Genome Sequencing for Rare Disease Diagnosis

IF 1.7 4区 医学 Q3 GENETICS & HEREDITY Genetic Epidemiology Pub Date : 2024-09-24 DOI:10.1002/gepi.22593
Marie-Sophie C. Ogloblinsky, Ozvan Bocher, Chaker Aloui, Anne-Louise Leutenegger, Ozan Ozisik, Anaïs Baudot, Elisabeth Tournier-Lasserve, Helen Castillo-Madeen, Daniel Lewinsohn, Donald F. Conrad, Emmanuelle Génin, Gaëlle Marenne
{"title":"PSAP-Genomic-Regions: A Method Leveraging Population Data to Prioritize Coding and Non-Coding Variants in Whole Genome Sequencing for Rare Disease Diagnosis","authors":"Marie-Sophie C. Ogloblinsky,&nbsp;Ozvan Bocher,&nbsp;Chaker Aloui,&nbsp;Anne-Louise Leutenegger,&nbsp;Ozan Ozisik,&nbsp;Anaïs Baudot,&nbsp;Elisabeth Tournier-Lasserve,&nbsp;Helen Castillo-Madeen,&nbsp;Daniel Lewinsohn,&nbsp;Donald F. Conrad,&nbsp;Emmanuelle Génin,&nbsp;Gaëlle Marenne","doi":"10.1002/gepi.22593","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The introduction of Next-Generation Sequencing technologies in the clinics has improved rare disease diagnosis. Nonetheless, for very heterogeneous or very rare diseases, more than half of cases still lack molecular diagnosis. Novel strategies are needed to prioritize variants within a single individual. The Population Sampling Probability (PSAP) method was developed to meet this aim but only for coding variants in exome data. Here, we propose an extension of the PSAP method to the non-coding genome called PSAP-genomic-regions. In this extension, instead of considering genes as testing units (PSAP-genes strategy), we use genomic regions defined over the whole genome that pinpoint potential functional constraints. We conceived an evaluation protocol for our method using artificially generated disease exomes and genomes, by inserting coding and non-coding pathogenic ClinVar variants in large data sets of exomes and genomes from the general population. PSAP-genomic-regions significantly improves the ranking of these variants compared to using a pathogenicity score alone. Using PSAP-genomic-regions, more than 50% of non-coding ClinVar variants were among the top 10 variants of the genome. On real sequencing data from six patients with Cerebral Small Vessel Disease and nine patients with male infertility, all causal variants were ranked in the top 100 variants with PSAP-genomic-regions. By revisiting the testing units used in the PSAP method to include non-coding variants, we have developed PSAP-genomic-regions, an efficient whole-genome prioritization tool which offers promising results for the diagnosis of unresolved rare diseases.</p></div>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22593","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

The introduction of Next-Generation Sequencing technologies in the clinics has improved rare disease diagnosis. Nonetheless, for very heterogeneous or very rare diseases, more than half of cases still lack molecular diagnosis. Novel strategies are needed to prioritize variants within a single individual. The Population Sampling Probability (PSAP) method was developed to meet this aim but only for coding variants in exome data. Here, we propose an extension of the PSAP method to the non-coding genome called PSAP-genomic-regions. In this extension, instead of considering genes as testing units (PSAP-genes strategy), we use genomic regions defined over the whole genome that pinpoint potential functional constraints. We conceived an evaluation protocol for our method using artificially generated disease exomes and genomes, by inserting coding and non-coding pathogenic ClinVar variants in large data sets of exomes and genomes from the general population. PSAP-genomic-regions significantly improves the ranking of these variants compared to using a pathogenicity score alone. Using PSAP-genomic-regions, more than 50% of non-coding ClinVar variants were among the top 10 variants of the genome. On real sequencing data from six patients with Cerebral Small Vessel Disease and nine patients with male infertility, all causal variants were ranked in the top 100 variants with PSAP-genomic-regions. By revisiting the testing units used in the PSAP method to include non-coding variants, we have developed PSAP-genomic-regions, an efficient whole-genome prioritization tool which offers promising results for the diagnosis of unresolved rare diseases.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
PSAP-Genomic-Regions:利用群体数据优先处理全基因组测序中编码和非编码变异以诊断罕见病的方法。
下一代测序技术在临床上的应用改善了罕见病的诊断。然而,对于非常异质性或非常罕见的疾病,仍有一半以上的病例缺乏分子诊断。我们需要新的策略来确定单个个体内变异的优先次序。为实现这一目标,我们开发了群体采样概率(PSAP)方法,但该方法仅适用于外显子组数据中的编码变异。在这里,我们提出将 PSAP 方法扩展到非编码基因组,称为 PSAP-基因组-区域。在这一扩展中,我们不再将基因作为测试单元(PSAP-基因策略),而是使用定义在整个基因组上的基因组区域,以精确定位潜在的功能限制。我们设想了一个评估方案,利用人工生成的疾病外显子组和基因组,将编码和非编码致病性 ClinVar 变异插入来自普通人群的外显子组和基因组的大型数据集中,对我们的方法进行评估。与单独使用致病性评分相比,PSAP-基因组-区域能显著提高这些变异体的排序。使用 PSAP 基因组区域,50% 以上的 ClinVar 非编码变异跻身基因组前 10 个变异之列。在六名脑小血管疾病患者和九名男性不育症患者的真实测序数据中,所有因果变异都进入了使用 PSAP 基因组区域的前 100 个变异之列。通过重新审视 PSAP 方法中使用的检测单元,将非编码变异纳入其中,我们开发出了 PSAP-基因组-区域这一高效的全基因组优先排序工具,为诊断未解决的罕见病提供了可喜的成果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genetic Epidemiology
Genetic Epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
4.40
自引率
9.50%
发文量
49
审稿时长
6-12 weeks
期刊介绍: Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.
期刊最新文献
Genetic Associations of Persistent Opioid Use After Surgery Point to OPRM1 but Not Other Opioid-Related Loci as the Main Driver of Opioid Use Disorder. Bayesian Effect Size Ranking to Prioritise Genetic Risk Variants in Common Diseases for Follow-Up Studies. Using Family History Data to Improve the Power of Association Studies: Application to Cancer in UK Biobank. Issue Information Issue Information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1