Detection and Annotation of Unique Regions in Mammalian Genomes.

IF 2.1 3区 生物学 Q3 GENETICS & HEREDITY G3: Genes|Genomes|Genetics Pub Date : 2024-11-06 DOI:10.1093/g3journal/jkae257
Beatriz Vieira Mourato, Bernhard Haubold
{"title":"Detection and Annotation of Unique Regions in Mammalian Genomes.","authors":"Beatriz Vieira Mourato, Bernhard Haubold","doi":"10.1093/g3journal/jkae257","DOIUrl":null,"url":null,"abstract":"<p><p>Long unique genomic regions have been reported to be highly enriched for developmental genes in mice and humans. In this paper we identify unique genomic regions using an efficient method based on fast string matching. We quantify the resource consumption and accuracy of this method before applying it to the genomes of 18 mammals. We annotate their unique regions of at least 10 kb and find that they are strongly enriched for developmental genes across the board. We then investigated the subset of unique regions that lack annotations, which we call \"anonymous\". The longest anonymous unique region in the tasmanian devil spanned 83 kb and contained the gene encoding inositol polyphosphate-5-phosphatase A, which is an essential part of intracellular signaling. This discovery of an essential gene in a unique region implies that unique regions might be given priority when annotating mammalian genomes. Our documented pipeline for annotating unique regions in any mammalian genome is available from the repository github.com/evolbioinf/auger; additional data for this study is available from the dataverse at doi.org/10.17617/3.4IKQAG.</p>","PeriodicalId":12468,"journal":{"name":"G3: Genes|Genomes|Genetics","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"G3: Genes|Genomes|Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/g3journal/jkae257","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Long unique genomic regions have been reported to be highly enriched for developmental genes in mice and humans. In this paper we identify unique genomic regions using an efficient method based on fast string matching. We quantify the resource consumption and accuracy of this method before applying it to the genomes of 18 mammals. We annotate their unique regions of at least 10 kb and find that they are strongly enriched for developmental genes across the board. We then investigated the subset of unique regions that lack annotations, which we call "anonymous". The longest anonymous unique region in the tasmanian devil spanned 83 kb and contained the gene encoding inositol polyphosphate-5-phosphatase A, which is an essential part of intracellular signaling. This discovery of an essential gene in a unique region implies that unique regions might be given priority when annotating mammalian genomes. Our documented pipeline for annotating unique regions in any mammalian genome is available from the repository github.com/evolbioinf/auger; additional data for this study is available from the dataverse at doi.org/10.17617/3.4IKQAG.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
哺乳动物基因组中独特区域的检测与注释
据报道,在小鼠和人类中,长的独特基因组区域高度富集发育基因。在本文中,我们使用一种基于快速字符串匹配的高效方法来识别独特的基因组区域。在将该方法应用于 18 种哺乳动物的基因组之前,我们对其资源消耗和准确性进行了量化。我们对至少 10 kb 的独特区域进行了注释,发现这些区域强烈富集了所有发育基因。然后,我们研究了缺乏注释的独特区域子集,我们称其为 "匿名 "区域。塔斯马尼亚袋獾中最长的匿名独特区域横跨 83 kb,包含编码肌醇多磷酸-5-磷酸酶 A 的基因,该基因是细胞内信号转导的重要组成部分。在一个独特的区域发现了一个重要基因,这意味着在注释哺乳动物基因组时,可以优先考虑独特的区域。我们在任何哺乳动物基因组中注释独特区域的记录管道可从存储库 github.com/evolbioinf/auger 获取;本研究的其他数据可从 doi.org/10.17617/3.4IKQAG 的数据筐获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
G3: Genes|Genomes|Genetics
G3: Genes|Genomes|Genetics GENETICS & HEREDITY-
CiteScore
5.10
自引率
3.80%
发文量
305
审稿时长
3-8 weeks
期刊介绍: G3: Genes, Genomes, Genetics provides a forum for the publication of high‐quality foundational research, particularly research that generates useful genetic and genomic information such as genome maps, single gene studies, genome‐wide association and QTL studies, as well as genome reports, mutant screens, and advances in methods and technology. The Editorial Board of G3 believes that rapid dissemination of these data is the necessary foundation for analysis that leads to mechanistic insights. G3, published by the Genetics Society of America, meets the critical and growing need of the genetics community for rapid review and publication of important results in all areas of genetics. G3 offers the opportunity to publish the puzzling finding or to present unpublished results that may not have been submitted for review and publication due to a perceived lack of a potential high-impact finding. G3 has earned the DOAJ Seal, which is a mark of certification for open access journals, awarded by DOAJ to journals that achieve a high level of openness, adhere to Best Practice and high publishing standards.
期刊最新文献
A collection of split-Gal4 drivers targeting conserved signaling ligands in Drosophila. GenoTools: An Open-Source Python Package for Efficient Genotype Data Quality Control and Analysis. Testis- and ovary-expressed polo-like kinase transcripts and gene duplications affect male fertility when expressed in the Drosophila melanogaster germline. Transcriptome profiling of maize transcription factor mutants to probe gene regulatory network predictions. Comparative genomics reveals putative copper tolerance genes in a Fusarium oxysporum strain.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1