Open-Source Bioinformatic Pipeline to Improve PMS2 Genetic Testing Using Short-Read NGS Data

IF 3.4 3区 医学 Q1 PATHOLOGY Journal of Molecular Diagnostics Pub Date : 2024-08-01 DOI:10.1016/j.jmoldx.2024.05.005
{"title":"Open-Source Bioinformatic Pipeline to Improve PMS2 Genetic Testing Using Short-Read NGS Data","authors":"","doi":"10.1016/j.jmoldx.2024.05.005","DOIUrl":null,"url":null,"abstract":"<div><p>The molecular diagnosis of mismatch repair–deficient cancer syndromes is hampered by difficulties in sequencing the <em>PMS2</em> gene, mainly owing to the PMS2CL pseudogene. Next-generation sequencing short reads cannot be mapped unambiguously by standard pipelines, compromising variant calling accuracy. This study aimed to provide a refined bioinformatic pipeline for <em>PMS2</em> mutational analysis and explore <em>PMS2</em> germline pathogenic variant prevalence in an unselected hereditary cancer (HC) cohort. <em>PMS2</em> mutational analysis was optimized using two cohorts: 192 unselected HC patients for assessing the allelic ratio of paralogous sequence variants, and 13 samples enriched with <em>PMS2</em> (likely) pathogenic variants screened previously by long-range genomic DNA PCR amplification. Reads were forced to align with the <em>PMS2</em> reference sequence, except those corresponding to exon 11, where only those intersecting gene-specific invariant positions were considered. Afterward, the refined pipeline's accuracy was validated in a cohort of 40 patients and used to screen 5619 HC patients. Compared with our routine diagnostic pipeline, the PMS2_vaR pipeline showed increased technical sensitivity (0.853 to 0.956, respectively) in the validation cohort, identifying all previously <em>PMS2</em> pathogenic variants found by long-range genomic DNA PCR amplification. Fifteen HC cohort samples carried a pathogenic <em>PMS2</em> variant (15 of 5619; 0.285%), doubling the estimated prevalence in the general population. The refined open-source approach improved <em>PMS2</em> mutational analysis accuracy, allowing its inclusion in the routine next-generation sequencing pipeline streamlining <em>PMS2</em> screening.</p></div>","PeriodicalId":50128,"journal":{"name":"Journal of Molecular Diagnostics","volume":"26 8","pages":"Pages 727-738"},"PeriodicalIF":3.4000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1525157824001181/pdfft?md5=91683b4368460b158919e0c187d84829&pid=1-s2.0-S1525157824001181-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1525157824001181","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The molecular diagnosis of mismatch repair–deficient cancer syndromes is hampered by difficulties in sequencing the PMS2 gene, mainly owing to the PMS2CL pseudogene. Next-generation sequencing short reads cannot be mapped unambiguously by standard pipelines, compromising variant calling accuracy. This study aimed to provide a refined bioinformatic pipeline for PMS2 mutational analysis and explore PMS2 germline pathogenic variant prevalence in an unselected hereditary cancer (HC) cohort. PMS2 mutational analysis was optimized using two cohorts: 192 unselected HC patients for assessing the allelic ratio of paralogous sequence variants, and 13 samples enriched with PMS2 (likely) pathogenic variants screened previously by long-range genomic DNA PCR amplification. Reads were forced to align with the PMS2 reference sequence, except those corresponding to exon 11, where only those intersecting gene-specific invariant positions were considered. Afterward, the refined pipeline's accuracy was validated in a cohort of 40 patients and used to screen 5619 HC patients. Compared with our routine diagnostic pipeline, the PMS2_vaR pipeline showed increased technical sensitivity (0.853 to 0.956, respectively) in the validation cohort, identifying all previously PMS2 pathogenic variants found by long-range genomic DNA PCR amplification. Fifteen HC cohort samples carried a pathogenic PMS2 variant (15 of 5619; 0.285%), doubling the estimated prevalence in the general population. The refined open-source approach improved PMS2 mutational analysis accuracy, allowing its inclusion in the routine next-generation sequencing pipeline streamlining PMS2 screening.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用短读数 NGS 数据改进 PMS2 基因测试的开源生物信息学管道。
主要由于 PMS2CL 假基因导致的 PMS2 基因测序困难阻碍了 MMR 缺失癌症综合征的分子诊断。下一代测序(NGS)短读数无法通过标准管道进行明确映射,从而影响了变异调用的准确性。我们的目标是为 PMS2 基因突变分析提供一个完善的生物信息学管道,并在一个未选择的遗传性癌症(HC)队列中探索 PMS2 基因致病变体的流行率。利用两个队列对 PMS2 基因突变分析进行了优化:192例未经筛选的HC患者用于评估旁系序列变异的等位基因比例,13例样本富含PMS2(可能)致病变异,这些变异之前已通过长程gDNA PCR扩增(LR-PCR)进行了筛查。读数被强制与 PMS2 参考序列比对,但与第 11 号外显子对应的读数除外,只考虑与基因特异性不变位置相交的读数。之后,在 40 名患者的队列中验证了改进管道的准确性,并用于筛选 5619 名 HC 患者。与我们的常规诊断管道相比,PMS2_vaR 管道在验证队列中显示出更高的技术灵敏度(从 0.853 到 0.956),能识别出以前通过 LR-PCR 发现的所有 PMS2 致病变异。15个HC队列样本携带致病性PMS2变体(15/5619,0.285%),比一般人群的估计患病率高出一倍。改进后的开源方法提高了 PMS2 突变分析的准确性,使其能够纳入常规 NGS 管线,简化 PMS2 筛查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
8.10
自引率
2.40%
发文量
143
审稿时长
43 days
期刊介绍: The Journal of Molecular Diagnostics, the official publication of the Association for Molecular Pathology (AMP), co-owned by the American Society for Investigative Pathology (ASIP), seeks to publish high quality original papers on scientific advances in the translation and validation of molecular discoveries in medicine into the clinical diagnostic setting, and the description and application of technological advances in the field of molecular diagnostic medicine. The editors welcome for review articles that contain: novel discoveries or clinicopathologic correlations including studies in oncology, infectious diseases, inherited diseases, predisposition to disease, clinical informatics, or the description of polymorphisms linked to disease states or normal variations; the application of diagnostic methodologies in clinical trials; or the development of new or improved molecular methods which may be applied to diagnosis or monitoring of disease or disease predisposition.
期刊最新文献
Table of Contents Editorial Board Twenty-Five Years of Germline Genetic Testing and What May Lie Ahead Changes and Challenges in Molecular Diagnostics A New Serotyping Method of Streptococcus pneumoniae Based on CRISPR/Cas9–Targeted Sequencing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1