Identity by descent genome segmentation based on single nucleotide polymorphism distributions.

T W Blackwell, E Rouchka, D J States
{"title":"Identity by descent genome segmentation based on single nucleotide polymorphism distributions.","authors":"T W Blackwell,&nbsp;E Rouchka,&nbsp;D J States","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>In the course of our efforts to build extended regions of human genomic sequence by assembling individual BAC sequences, we have encountered several instances where a region of the genome has been sequenced independently using reagents derived from two different individuals. Comparing these sequences allows us to analyze the frequency and distribution of single nucleotide polymorphisms (SNPs) in the human genome. The observed transition/transversion frequencies are consistent with a biological origin for the sequence discrepancies, and this suggests that the data produced by large sequencing centers are accurate enough to be used as the basis for SNP analysis. The observed distribution of single nucleotide polymorphisms in the human genome is not uniform. An apparent duplication in the human genome extending over more than 130 kb between chromosomes 1p34 and 16p13 is reported. Independently derived sequences covering these regions are more than 99.9% identical, indicating that this duplication event must have occurred quite recently. FISH mapping results reported by the relevant laboratories indicate that the human population may be polymorphic for this duplication. We present a population genetic theory for the expected distribution of SNPs and derive an algorithm for probabilistically segmenting genomic sequence into regions that are identical by descent (IBD) between two individuals based on this theory and the observed locations of polymorphisms. Based on these methods and a random mating model for the human population, estimates are made for the mutation rate in the human genome.</p>","PeriodicalId":79420,"journal":{"name":"Proceedings. International Conference on Intelligent Systems for Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Intelligent Systems for Molecular Biology","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the course of our efforts to build extended regions of human genomic sequence by assembling individual BAC sequences, we have encountered several instances where a region of the genome has been sequenced independently using reagents derived from two different individuals. Comparing these sequences allows us to analyze the frequency and distribution of single nucleotide polymorphisms (SNPs) in the human genome. The observed transition/transversion frequencies are consistent with a biological origin for the sequence discrepancies, and this suggests that the data produced by large sequencing centers are accurate enough to be used as the basis for SNP analysis. The observed distribution of single nucleotide polymorphisms in the human genome is not uniform. An apparent duplication in the human genome extending over more than 130 kb between chromosomes 1p34 and 16p13 is reported. Independently derived sequences covering these regions are more than 99.9% identical, indicating that this duplication event must have occurred quite recently. FISH mapping results reported by the relevant laboratories indicate that the human population may be polymorphic for this duplication. We present a population genetic theory for the expected distribution of SNPs and derive an algorithm for probabilistically segmenting genomic sequence into regions that are identical by descent (IBD) between two individuals based on this theory and the observed locations of polymorphisms. Based on these methods and a random mating model for the human population, estimates are made for the mutation rate in the human genome.

分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于单核苷酸多态性分布的血统基因组分割鉴定。
在我们努力通过组装个体BAC序列来构建人类基因组序列扩展区域的过程中,我们遇到了几个使用来自两个不同个体的试剂独立测序基因组区域的实例。比较这些序列使我们能够分析人类基因组中单核苷酸多态性(snp)的频率和分布。观察到的转换/翻转频率与序列差异的生物学起源一致,这表明大型测序中心产生的数据足够准确,可以用作SNP分析的基础。观察到的单核苷酸多态性在人类基因组中的分布并不均匀。据报道,人类基因组中1p34和16p13染色体之间存在明显的重复,长度超过130 kb。覆盖这些区域的独立衍生序列超过99.9%是相同的,这表明这种重复事件一定是最近发生的。相关实验室报告的FISH图谱结果表明,人类群体可能具有这种重复的多态性。我们提出了一种预期snp分布的群体遗传理论,并基于该理论和观察到的多态性位置,推导了一种将基因组序列概率分割为两个个体之间相同的区域(IBD)的算法。基于这些方法和人类群体的随机交配模型,对人类基因组的突变率进行了估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Reducing Mass Degeneracy in SAR by MS by Stable Isotopic Labeling Intelligent aids for parallel experiment planning and macromolecular crystallization. A practical algorithm for optimal inference of haplotypes from diploid populations. Analysis of yeast's ORF upstream regions by parallel processing, microarrays, and computational methods. Finding regulatory elements using joint likelihoods for sequence and expression profile data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1