{"title":"Decoding biology with massively parallel reporter assays and machine learning.","authors":"Alyssa La Fleur, Yongsheng Shi, Georg Seelig","doi":"10.1101/gad.351800.124","DOIUrl":null,"url":null,"abstract":"<p><p>Massively parallel reporter assays (MPRAs) are powerful tools for quantifying the impacts of sequence variation on gene expression. Reading out molecular phenotypes with sequencing enables interrogating the impact of sequence variation beyond genome scale. Machine learning models integrate and codify information learned from MPRAs and enable generalization by predicting sequences outside the training data set. Models can provide a quantitative understanding of <i>cis</i>-regulatory codes controlling gene expression, enable variant stratification, and guide the design of synthetic regulatory elements for applications from synthetic biology to mRNA and gene therapy. This review focuses on <i>cis</i>-regulatory MPRAs, particularly those that interrogate cotranscriptional and post-transcriptional processes: alternative splicing, cleavage and polyadenylation, translation, and mRNA decay.</p>","PeriodicalId":12591,"journal":{"name":"Genes & development","volume":" ","pages":"843-865"},"PeriodicalIF":7.5000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11535156/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genes & development","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1101/gad.351800.124","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Massively parallel reporter assays (MPRAs) are powerful tools for quantifying the impacts of sequence variation on gene expression. Reading out molecular phenotypes with sequencing enables interrogating the impact of sequence variation beyond genome scale. Machine learning models integrate and codify information learned from MPRAs and enable generalization by predicting sequences outside the training data set. Models can provide a quantitative understanding of cis-regulatory codes controlling gene expression, enable variant stratification, and guide the design of synthetic regulatory elements for applications from synthetic biology to mRNA and gene therapy. This review focuses on cis-regulatory MPRAs, particularly those that interrogate cotranscriptional and post-transcriptional processes: alternative splicing, cleavage and polyadenylation, translation, and mRNA decay.
期刊介绍:
Genes & Development is a research journal published in association with The Genetics Society. It publishes high-quality research papers in the areas of molecular biology, molecular genetics, and related fields. The journal features various research formats including Research papers, short Research Communications, and Resource/Methodology papers.
Genes & Development has gained recognition and is considered as one of the Top Five Research Journals in the field of Molecular Biology and Genetics. It has an impressive Impact Factor of 12.89. The journal is ranked #2 among Developmental Biology research journals, #5 in Genetics and Heredity, and is among the Top 20 in Cell Biology (according to ISI Journal Citation Reports®, 2021).