基于AluScan序列的拷贝数变异分析。

Journal of clinical bioinformatics Pub Date : 2014-12-05 eCollection Date: 2014-01-01 DOI:10.1186/s13336-014-0015-z
Jian-Feng Yang, Xiao-Fan Ding, Lei Chen, Wai-Kin Mat, Michelle Zhi Xu, Jin-Fei Chen, Jian-Min Wang, Lin Xu, Wai-Sang Poon, Ava Kwong, Gilberto Ka-Kit Leung, Tze-Ching Tan, Chi-Hung Yu, Yue-Bin Ke, Xin-Yun Xu, Xiao-Yan Ke, Ronald Cw Ma, Juliana Cn Chan, Wei-Qing Wan, Li-Wei Zhang, Yogesh Kumar, Shui-Ying Tsang, Shao Li, Hong-Yang Wang, Hong Xue
{"title":"基于AluScan序列的拷贝数变异分析。","authors":"Jian-Feng Yang,&nbsp;Xiao-Fan Ding,&nbsp;Lei Chen,&nbsp;Wai-Kin Mat,&nbsp;Michelle Zhi Xu,&nbsp;Jin-Fei Chen,&nbsp;Jian-Min Wang,&nbsp;Lin Xu,&nbsp;Wai-Sang Poon,&nbsp;Ava Kwong,&nbsp;Gilberto Ka-Kit Leung,&nbsp;Tze-Ching Tan,&nbsp;Chi-Hung Yu,&nbsp;Yue-Bin Ke,&nbsp;Xin-Yun Xu,&nbsp;Xiao-Yan Ke,&nbsp;Ronald Cw Ma,&nbsp;Juliana Cn Chan,&nbsp;Wei-Qing Wan,&nbsp;Li-Wei Zhang,&nbsp;Yogesh Kumar,&nbsp;Shui-Ying Tsang,&nbsp;Shao Li,&nbsp;Hong-Yang Wang,&nbsp;Hong Xue","doi":"10.1186/s13336-014-0015-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing.</p><p><strong>Results: </strong>In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained.</p><p><strong>Conclusions: </strong>The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.</p>","PeriodicalId":73663,"journal":{"name":"Journal of clinical bioinformatics","volume":"4 1","pages":"15"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13336-014-0015-z","citationCount":"12","resultStr":"{\"title\":\"Copy number variation analysis based on AluScan sequences.\",\"authors\":\"Jian-Feng Yang,&nbsp;Xiao-Fan Ding,&nbsp;Lei Chen,&nbsp;Wai-Kin Mat,&nbsp;Michelle Zhi Xu,&nbsp;Jin-Fei Chen,&nbsp;Jian-Min Wang,&nbsp;Lin Xu,&nbsp;Wai-Sang Poon,&nbsp;Ava Kwong,&nbsp;Gilberto Ka-Kit Leung,&nbsp;Tze-Ching Tan,&nbsp;Chi-Hung Yu,&nbsp;Yue-Bin Ke,&nbsp;Xin-Yun Xu,&nbsp;Xiao-Yan Ke,&nbsp;Ronald Cw Ma,&nbsp;Juliana Cn Chan,&nbsp;Wei-Qing Wan,&nbsp;Li-Wei Zhang,&nbsp;Yogesh Kumar,&nbsp;Shui-Ying Tsang,&nbsp;Shao Li,&nbsp;Hong-Yang Wang,&nbsp;Hong Xue\",\"doi\":\"10.1186/s13336-014-0015-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing.</p><p><strong>Results: </strong>In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained.</p><p><strong>Conclusions: </strong>The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.</p>\",\"PeriodicalId\":73663,\"journal\":{\"name\":\"Journal of clinical bioinformatics\",\"volume\":\"4 1\",\"pages\":\"15\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s13336-014-0015-z\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of clinical bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s13336-014-0015-z\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2014/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of clinical bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s13336-014-0015-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2014/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

背景:AluScan将基于多个方向相反的alu引物的inter-Alu PCR与下一代测序相结合,捕获大量的Alu-proximal基因组序列进行研究。它只需要亚微克数量的DNA,便于对大量样品进行检查。然而,由于AluScan数据的特殊性,直接使用全基因组测序(WGS)或外显子组测序设计的调用算法难以调用拷贝数变异(CNV)。结果:在本研究中,我们组装了一个AluScanCNV包,利用配对测试-对照样本之间或测试样本与参考样本构建的参考模板之间的读深比的ge加里-欣克利转换(GHT),从AluScan测序数据中高效调用CNV,调用本地化的CNV,然后使用类似gistics的算法识别循环CNV和循环二值分割(CBS),以揭示大的扩展CNV。为了评估从AluScan数据中调用的CNVs的效用,本研究分析了来自23个非癌症和38个癌症基因组的AluScan。分析的胶质瘤样本在染色体1p和9上产生了常见的延长拷贝数损失。此外,在1q和8q染色体拷贝数增益的显著富集方面,从肝癌样本中鉴定出的复发性体细胞CNVs与肝癌WGS中报道的相似。当通过基于相关性的机器学习选择能够区分肝癌和非肝癌样本的局部或复发性cnv特征时,实现了肝癌和非肝癌类别的高度精确分离。结论:从非癌组织和癌组织中获得的结果表明,AluScanCNV包可用于调用来自AluScan序列的定位、复发和扩展的cnv。此外,通过这种方法识别的局部和复发性cnv都可以进行机器学习选择,以产生能够区分肝癌和其他类型癌症的区分cnv特征。由于该方法适用于任何有或没有配对对照的人类DNA样本,因此它也可用于分析个体的体质CNVs。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Copy number variation analysis based on AluScan sequences.

Background: AluScan combines inter-Alu PCR using multiple Alu-based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu-proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing.

Results: In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained.

Conclusions: The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Clinical research informatics (CRI): overview over new tools and services First Clinical Research Informatics (CRI) Solutions Day: advanced IT support from EU projects for clinical trials Mobile eHealth solution (ePRO) EHR4CR local workbench TRANSFoRm Data quality tool
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1