Igor Fesenko, Harutyun Sahakyan, Rajat Dhyani, Svetlana A. Shabalina, Gisela Storz, Eugene V. Koonin
{"title":"The hidden bacterial microproteome","authors":"Igor Fesenko, Harutyun Sahakyan, Rajat Dhyani, Svetlana A. Shabalina, Gisela Storz, Eugene V. Koonin","doi":"10.1016/j.molcel.2025.01.025","DOIUrl":null,"url":null,"abstract":"Microproteins encoded by small open reading frames comprise the “dark matter” of proteomes. Although microproteins have been detected in diverse organisms from all three domains of life, many more remain to be identified, and only a few have been functionally characterized. In this comprehensive study of intergenic small open reading frames (ismORFs, 15–70 codons) in 5,668 bacterial genomes of the family <em>Enterobacteriaceae</em>, we identify 67,297 clusters of ismORFs subject to purifying selection. Expression of tagged <em>Escherichia coli</em> microproteins is detected for 11 of the 16 tested, validating the predictions. Although the ismORFs mainly code for hydrophobic, potentially transmembrane, unstructured, or minimally structured microproteins, some globular folds, oligomeric structures, and possible interactions with proteins encoded by neighboring genes are predicted. Complete information on the predicted microprotein families, including evidence of transcription and translation, and structure predictions are available as an easily searchable resource for investigation of microprotein functions.","PeriodicalId":18950,"journal":{"name":"Molecular Cell","volume":"1 1","pages":""},"PeriodicalIF":14.5000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Cell","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.molcel.2025.01.025","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Microproteins encoded by small open reading frames comprise the “dark matter” of proteomes. Although microproteins have been detected in diverse organisms from all three domains of life, many more remain to be identified, and only a few have been functionally characterized. In this comprehensive study of intergenic small open reading frames (ismORFs, 15–70 codons) in 5,668 bacterial genomes of the family Enterobacteriaceae, we identify 67,297 clusters of ismORFs subject to purifying selection. Expression of tagged Escherichia coli microproteins is detected for 11 of the 16 tested, validating the predictions. Although the ismORFs mainly code for hydrophobic, potentially transmembrane, unstructured, or minimally structured microproteins, some globular folds, oligomeric structures, and possible interactions with proteins encoded by neighboring genes are predicted. Complete information on the predicted microprotein families, including evidence of transcription and translation, and structure predictions are available as an easily searchable resource for investigation of microprotein functions.
期刊介绍:
Molecular Cell is a companion to Cell, the leading journal of biology and the highest-impact journal in the world. Launched in December 1997 and published monthly. Molecular Cell is dedicated to publishing cutting-edge research in molecular biology, focusing on fundamental cellular processes. The journal encompasses a wide range of topics, including DNA replication, recombination, and repair; Chromatin biology and genome organization; Transcription; RNA processing and decay; Non-coding RNA function; Translation; Protein folding, modification, and quality control; Signal transduction pathways; Cell cycle and checkpoints; Cell death; Autophagy; Metabolism.