Emergence of novel domains in proteins.

IF 3.4 Q1 Agricultural and Biological Sciences BMC Evolutionary Biology Pub Date : 2013-02-20 DOI:10.1186/1471-2148-13-47
Macarena Toll-Riera, M Mar Albà
{"title":"Emergence of novel domains in proteins.","authors":"Macarena Toll-Riera,&nbsp;M Mar Albà","doi":"10.1186/1471-2148-13-47","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Proteins are composed of a combination of discrete, well-defined, sequence domains, associated with specific functions that have arisen at different times during evolutionary history. The emergence of novel domains is related to protein functional diversification and adaptation. But currently little is known about how novel domains arise and how they subsequently evolve.</p><p><strong>Results: </strong>To gain insights into the impact of recently emerged domains in protein evolution we have identified all human young protein domains that have emerged in approximately the past 550 million years. We have classified them into vertebrate-specific and mammalian-specific groups, and compared them to older domains. We have found 426 different annotated young domains, totalling 995 domain occurrences, which represent about 12.3% of all human domains. We have observed that 61.3% of them arose in newly formed genes, while the remaining 38.7% are found combined with older domains, and have very likely emerged in the context of a previously existing protein. Young domains are preferentially located at the N-terminus of the protein, indicating that, at least in vertebrates, novel functional sequences often emerge there. Furthermore, young domains show significantly higher non-synonymous to synonymous substitution rates than older domains using human and mouse orthologous sequence comparisons. This is also true when we compare young and old domains located in the same protein, suggesting that recently arisen domains tend to evolve in a less constrained manner than older domains.</p><p><strong>Conclusions: </strong>We conclude that proteins tend to gain domains over time, becoming progressively longer. We show that many proteins are made of domains of different age, and that the fastest evolving parts correspond to the domains that have been acquired more recently.</p>","PeriodicalId":9111,"journal":{"name":"BMC Evolutionary Biology","volume":"13 ","pages":"47"},"PeriodicalIF":3.4000,"publicationDate":"2013-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1471-2148-13-47","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Evolutionary Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/1471-2148-13-47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 1

Abstract

Background: Proteins are composed of a combination of discrete, well-defined, sequence domains, associated with specific functions that have arisen at different times during evolutionary history. The emergence of novel domains is related to protein functional diversification and adaptation. But currently little is known about how novel domains arise and how they subsequently evolve.

Results: To gain insights into the impact of recently emerged domains in protein evolution we have identified all human young protein domains that have emerged in approximately the past 550 million years. We have classified them into vertebrate-specific and mammalian-specific groups, and compared them to older domains. We have found 426 different annotated young domains, totalling 995 domain occurrences, which represent about 12.3% of all human domains. We have observed that 61.3% of them arose in newly formed genes, while the remaining 38.7% are found combined with older domains, and have very likely emerged in the context of a previously existing protein. Young domains are preferentially located at the N-terminus of the protein, indicating that, at least in vertebrates, novel functional sequences often emerge there. Furthermore, young domains show significantly higher non-synonymous to synonymous substitution rates than older domains using human and mouse orthologous sequence comparisons. This is also true when we compare young and old domains located in the same protein, suggesting that recently arisen domains tend to evolve in a less constrained manner than older domains.

Conclusions: We conclude that proteins tend to gain domains over time, becoming progressively longer. We show that many proteins are made of domains of different age, and that the fastest evolving parts correspond to the domains that have been acquired more recently.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
蛋白质新结构域的出现。
背景:蛋白质是由离散的、定义明确的序列结构域组合而成,这些结构域与进化历史中不同时期出现的特定功能相关。新结构域的出现与蛋白质功能的多样化和适应性有关。但是目前人们对新结构域是如何产生的以及它们随后是如何进化的知之甚少。结果:为了深入了解最近出现的结构域对蛋白质进化的影响,我们已经确定了大约在过去5.5亿年里出现的所有人类年轻蛋白质结构域。我们将它们分为脊椎动物和哺乳动物,并将它们与更早的领域进行了比较。我们发现了426个不同的注释年轻域,共有995个域出现,约占所有人类域的12.3%。我们观察到,其中61.3%出现在新形成的基因中,而剩下的38.7%是与较老的结构域结合在一起的,很可能是在先前存在的蛋白质中出现的。年轻结构域优先位于蛋白质的n端,这表明,至少在脊椎动物中,新的功能序列经常出现在那里。此外,使用人和小鼠的同源序列比较,年轻结构域显示出明显高于老结构域的非同义到同义替换率。当我们比较位于同一蛋白质中的年轻结构域和旧结构域时也是如此,这表明最近出现的结构域倾向于以比旧结构域更少的约束方式进化。结论:我们得出的结论是,随着时间的推移,蛋白质倾向于获得结构域,变得越来越长。我们发现许多蛋白质是由不同年龄的结构域组成的,而进化最快的部分与最近获得的结构域相对应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Evolutionary Biology
BMC Evolutionary Biology 生物-进化生物学
CiteScore
5.80
自引率
0.00%
发文量
0
审稿时长
6 months
期刊介绍: BMC Evolutionary Biology is an open access, peer-reviewed journal that considers articles on all aspects of molecular and non-molecular evolution of all organisms, as well as phylogenetics and palaeontology.
期刊最新文献
Correlation between bioluminescent blinks and swimming behavior in the splitfin flashlight fish Anomalops katoptron Home range of three turtle species in Central Yucatan. A comparative study Variation and plasticity in life-history traits and fitness of wild Arabidopsis thaliana populations are not related to their genotypic and ecological diversity Does local soil factor drive functional leaf trait variation? A test on Neilingding Island, South China Biodiversity and distribution patterns of blooming jellyfish in the Bohai Sea revealed by eDNA metabarcoding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1