RNA-seq单核苷酸召唤基因型插入策略的比较分析。

IF 3.7 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY BMC Genomics Pub Date : 2025-03-13 DOI:10.1186/s12864-025-11411-5
Kaixuan Guo, Zhanming Zhong, Haonan Zeng, Changliang Zhang, Teddy Tinashe Chitotombe, Jinyan Teng, Yahui Gao, Zhe Zhang
{"title":"RNA-seq单核苷酸召唤基因型插入策略的比较分析。","authors":"Kaixuan Guo, Zhanming Zhong, Haonan Zeng, Changliang Zhang, Teddy Tinashe Chitotombe, Jinyan Teng, Yahui Gao, Zhe Zhang","doi":"10.1186/s12864-025-11411-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>RNA sequencing (RNA-seq) is a powerful tool for transcriptome profiling, enabling integrative studies of expression quantitative trait loci (eQTL). As it identifies fewer genetic variants than DNA sequencing (DNA-seq), reference panel-based genotype imputation is often required to enhance its utility.</p><p><strong>Results: </strong>This study evaluated the accuracy of genotype imputation using SNPs called from RNA-seq data (RNA-SNPs). SNP features from 6,567 RNA-seq samples across 28 pig tissues were used to mask whole genome sequencing (WGS) data, with the Pig Genomic Reference Panel (PGRP) serving as the reference panel. Three imputation software tools (i.e., Beagle, Minimac4, and Impute5) were employed to perform the imputation. The result showed that RNA-SNPs achieved higher imputation accuracy (CR: 0.895 ~ 0.933; r²: 0.745 ~ 0.817) than SNPs from GeneSeek Genomic Profiler Porcine SNP50 BeadChip (Chip-SNPs) (CR: 0.873 ~ 0.909; r²: 0.629 ~ 0.698), and lower accuracy in \"intergenic\" regions. After imputation, quality control (QC) by minor allele frequency (MAF) and imputation quality (DR²) could improve r² but reduce SNP retention. Among software, Minimac4 takes the least runtime in single-thread setting, while Beagle performed best in multi-thread setting and phasing. Impute5 takes up minimal memory usage but requires the maximum runtime. All tools demonstrated comparable global accuracy (CR: 0.906 ~ 0.917; r²: 0.780 ~ 0.787).</p><p><strong>Conclusions: </strong>This study offers practical guidance for conducting RNA-SNP imputation strategies in genome and transcriptome research.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"26 1","pages":"245"},"PeriodicalIF":3.7000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11907794/pdf/","citationCount":"0","resultStr":"{\"title\":\"Comparative analysis of genotype imputation strategies for SNPs calling from RNA-seq.\",\"authors\":\"Kaixuan Guo, Zhanming Zhong, Haonan Zeng, Changliang Zhang, Teddy Tinashe Chitotombe, Jinyan Teng, Yahui Gao, Zhe Zhang\",\"doi\":\"10.1186/s12864-025-11411-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>RNA sequencing (RNA-seq) is a powerful tool for transcriptome profiling, enabling integrative studies of expression quantitative trait loci (eQTL). As it identifies fewer genetic variants than DNA sequencing (DNA-seq), reference panel-based genotype imputation is often required to enhance its utility.</p><p><strong>Results: </strong>This study evaluated the accuracy of genotype imputation using SNPs called from RNA-seq data (RNA-SNPs). SNP features from 6,567 RNA-seq samples across 28 pig tissues were used to mask whole genome sequencing (WGS) data, with the Pig Genomic Reference Panel (PGRP) serving as the reference panel. Three imputation software tools (i.e., Beagle, Minimac4, and Impute5) were employed to perform the imputation. The result showed that RNA-SNPs achieved higher imputation accuracy (CR: 0.895 ~ 0.933; r²: 0.745 ~ 0.817) than SNPs from GeneSeek Genomic Profiler Porcine SNP50 BeadChip (Chip-SNPs) (CR: 0.873 ~ 0.909; r²: 0.629 ~ 0.698), and lower accuracy in \\\"intergenic\\\" regions. After imputation, quality control (QC) by minor allele frequency (MAF) and imputation quality (DR²) could improve r² but reduce SNP retention. Among software, Minimac4 takes the least runtime in single-thread setting, while Beagle performed best in multi-thread setting and phasing. Impute5 takes up minimal memory usage but requires the maximum runtime. All tools demonstrated comparable global accuracy (CR: 0.906 ~ 0.917; r²: 0.780 ~ 0.787).</p><p><strong>Conclusions: </strong>This study offers practical guidance for conducting RNA-SNP imputation strategies in genome and transcriptome research.</p>\",\"PeriodicalId\":9030,\"journal\":{\"name\":\"BMC Genomics\",\"volume\":\"26 1\",\"pages\":\"245\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11907794/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12864-025-11411-5\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-025-11411-5","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:RNA测序(RNA-seq)是一种强大的转录组分析工具,可以实现表达数量性状位点(eQTL)的综合研究。由于它识别的遗传变异比DNA测序(DNA-seq)少,因此通常需要基于参考面板的基因型插入来提高其实用性。结果:本研究评估了使用来自RNA-seq数据的snp (RNA-SNPs)进行基因型插入的准确性。来自28个猪组织的6567个RNA-seq样本的SNP特征被用来掩盖全基因组测序(WGS)数据,猪基因组参考面板(PGRP)作为参考面板。采用Beagle、Minimac4、Impute5三种插补软件工具进行插补。结果表明,rna - snp具有较高的归算精度(CR: 0.895 ~ 0.933;r²:0.745 ~ 0.817)比GeneSeek Genomic Profiler猪SNP50 BeadChip (Chip-SNPs)的SNPs (CR: 0.873 ~ 0.909;R²:0.629 ~ 0.698),而“基因间”区域的准确率较低。代入后,通过小等位基因频率(MAF)和代入质量(DR²)进行质量控制(QC)可以提高r²,但降低SNP保留率。在软件中,Minimac4在单线程设置下运行时间最少,Beagle在多线程设置和分阶段中运行时间最好。Impute5占用最小的内存使用,但需要最大的运行时间。所有工具均具有相当的全局精度(CR: 0.906 ~ 0.917;R²:0.780 ~ 0.787)。结论:本研究为RNA-SNP在基因组和转录组研究中的植入策略提供了实践指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comparative analysis of genotype imputation strategies for SNPs calling from RNA-seq.

Background: RNA sequencing (RNA-seq) is a powerful tool for transcriptome profiling, enabling integrative studies of expression quantitative trait loci (eQTL). As it identifies fewer genetic variants than DNA sequencing (DNA-seq), reference panel-based genotype imputation is often required to enhance its utility.

Results: This study evaluated the accuracy of genotype imputation using SNPs called from RNA-seq data (RNA-SNPs). SNP features from 6,567 RNA-seq samples across 28 pig tissues were used to mask whole genome sequencing (WGS) data, with the Pig Genomic Reference Panel (PGRP) serving as the reference panel. Three imputation software tools (i.e., Beagle, Minimac4, and Impute5) were employed to perform the imputation. The result showed that RNA-SNPs achieved higher imputation accuracy (CR: 0.895 ~ 0.933; r²: 0.745 ~ 0.817) than SNPs from GeneSeek Genomic Profiler Porcine SNP50 BeadChip (Chip-SNPs) (CR: 0.873 ~ 0.909; r²: 0.629 ~ 0.698), and lower accuracy in "intergenic" regions. After imputation, quality control (QC) by minor allele frequency (MAF) and imputation quality (DR²) could improve r² but reduce SNP retention. Among software, Minimac4 takes the least runtime in single-thread setting, while Beagle performed best in multi-thread setting and phasing. Impute5 takes up minimal memory usage but requires the maximum runtime. All tools demonstrated comparable global accuracy (CR: 0.906 ~ 0.917; r²: 0.780 ~ 0.787).

Conclusions: This study offers practical guidance for conducting RNA-SNP imputation strategies in genome and transcriptome research.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Genomics
BMC Genomics 生物-生物工程与应用微生物
CiteScore
7.40
自引率
4.50%
发文量
769
审稿时长
6.4 months
期刊介绍: BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics. BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
期刊最新文献
Benchmarking the impact of reference genome selection on taxonomic profiling accuracy. The interplay of specialized metabolites, antibiotic resistance, and virulence in Enterococcus faecium: in silico analysis of bacterial genomes. Unraveling GhMYB102: dual roles of Gh_A01G069800 in inhibiting anthocyanin biosynthesis and inducing drought tolerance in cotton. Jejunal and pancreatic transcriptomic adaptations underpin enhanced performance in broilers fed sugarcane bagasse-supplemented diets. Nanopore full-length sequencing reveals Nudt21 knockdown drives genome-wide 3'UTR shortening and transcriptome reprogramming in mouse hepatocytes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1