Protein domains of low sequence complexity—dark matter of the proteome
Author: Steven L. McKnight
Journal: Genes & Development (Journal Article)
Published: 2024-03-19
DOI: 10.1101/gad.351465.123
URL: https://doi.org/10.1101/gad.351465.123
Citations: 0
Abstract
This perspective begins with a speculative consideration of the properties of the earliest proteins to appear during evolution. What did these primitive proteins look like, and how were they of benefit to early forms of life? I proceed to hypothesize that primitive proteins have been preserved through evolution and now serve diverse functions important to the dynamics of cell morphology and biological regulation. The primitive nature of these modern proteins is easy to spot. They are composed of a limited subset of the 20 amino acids used by traditionally evolved proteins and thus are of low sequence complexity. This chemical simplicity limits protein domains of low sequence complexity to forming only a crude and labile type of protein structure currently hidden from the computational powers of machine learning. I conclude by hypothesizing that this structural weakness represents the underlying virtue of proteins that, at least for the moment, constitute the dark matter of the proteome.
About the journal:
Genes & Development is a research journal published in association with The Genetics Society. It publishes high-quality research papers in the areas of molecular biology, molecular genetics, and related fields. The journal features various research formats including Research papers, short Research Communications, and Resource/Methodology papers.
Genes & Development is considered one of the top five research journals in molecular biology and genetics, with an Impact Factor of 12.89. It is ranked #2 among developmental biology research journals, #5 in genetics and heredity, and in the top 20 in cell biology (according to ISI Journal Citation Reports®, 2021).