N6-methyladenosine (m6A) is the most prevalent chemical modification in eukaryotic mRNAs and plays key roles in diverse cellular processes. Precise localization of m6A sites is thus critical for characterizing the functional roles of m6A in various conditions and dissecting the mechanisms governing its deposition. Here, we design a combined framework of Transformer architecture and recurrent neural network, deepSRAMP, to identify m6A sites using sequence-based and genome-derived features. As a result, deepSRAMP achieves a notably enhanced performance compared to its predecessor, SRAMP, the most-used predictor in this field. Moreover, based on multiple benchmark datasets, deepSRAMP greatly outperforms other state-of-the-art m6A predictors, including WHISTLE and DeepPromise, with an average 16.1% and 18.3% increase in AUROC and a 43.9% and 46.4% increase in AUPRC. Finally, deepSRAMP can be successfully exploited on mammalian m6A epitranscriptome mapping under diverse cellular conditions and can potentially reveal differential m6A sites among transcript isoforms of individual genes.
{"title":"A combined deep learning framework for mammalian m6A site prediction.","authors":"Rui Fan, Chunmei Cui, Boming Kang, Zecheng Chang, Guoqing Wang, Qinghua Cui","doi":"10.1016/j.xgen.2024.100697","DOIUrl":"https://doi.org/10.1016/j.xgen.2024.100697","url":null,"abstract":"<p><p>N<sup>6</sup>-methyladenosine (m6A) is the most prevalent chemical modification in eukaryotic mRNAs and plays key roles in diverse cellular processes. Precise localization of m6A sites is thus critical for characterizing the functional roles of m6A in various conditions and dissecting the mechanisms governing its deposition. Here, we design a combined framework of Transformer architecture and recurrent neural network, deepSRAMP, to identify m6A sites using sequence-based and genome-derived features. As a result, deepSRAMP achieves a notably enhanced performance compared to its predecessor, SRAMP, the most-used predictor in this field. Moreover, based on multiple benchmark datasets, deepSRAMP greatly outperforms other state-of-the-art m6A predictors, including WHISTLE and DeepPromise, with an average 16.1% and 18.3% increase in AUROC and a 43.9% and 46.4% increase in AUPRC. Finally, deepSRAMP can be successfully exploited on mammalian m6A epitranscriptome mapping under diverse cellular conditions and can potentially reveal differential m6A sites among transcript isoforms of individual genes.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100697"},"PeriodicalIF":11.1,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142689870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13DOI: 10.1016/j.xgen.2024.100694
Jayne Hehir-Kwa
Jayne Hehir-Kwa is based at the Princess Máxima Center for Pediatric Oncology in the Netherlands and is an associate group leader within the Kemmeren group and the Big Data Core. Her work is focused on genomic and transcriptomic sequencing of pediatric cancer and the resulting analysis, storage, and management of this large volume of valuable patient data. In this issue of Cell Genomics, her team presents the research article "Complex structural variation is prevalent and highly pathogenic in pediatric solid tumors," which illustrates complex genomic rearrangements in five pediatric cancer types.
Jayne Hehir-Kwa 在荷兰的马克西马公主儿科肿瘤中心工作,是 Kemmeren 小组和大数据核心的副组长。她的工作重点是儿科癌症的基因组和转录组测序,以及由此产生的大量宝贵患者数据的分析、存储和管理。在本期《细胞基因组学》(Cell Genomics)杂志上,她的团队发表了研究文章《复杂结构变异在小儿实体瘤中普遍存在并具有高度致病性》(Complex structural variation is prevalent and highly pathogenic in pediatric solid tumors),文章展示了五种小儿癌症类型的复杂基因组重排。
{"title":"Meet the author: Jayne Hehir-Kwa.","authors":"Jayne Hehir-Kwa","doi":"10.1016/j.xgen.2024.100694","DOIUrl":"10.1016/j.xgen.2024.100694","url":null,"abstract":"<p><p>Jayne Hehir-Kwa is based at the Princess Máxima Center for Pediatric Oncology in the Netherlands and is an associate group leader within the Kemmeren group and the Big Data Core. Her work is focused on genomic and transcriptomic sequencing of pediatric cancer and the resulting analysis, storage, and management of this large volume of valuable patient data. In this issue of Cell Genomics, her team presents the research article \"Complex structural variation is prevalent and highly pathogenic in pediatric solid tumors,\" which illustrates complex genomic rearrangements in five pediatric cancer types.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"4 11","pages":"100694"},"PeriodicalIF":11.1,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-10-11DOI: 10.1016/j.xgen.2024.100671
Joshua S Weinstock, Maya M Arce, Jacob W Freimer, Mineto Ota, Alexander Marson, Alexis Battle, Jonathan K Pritchard
The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation has been challenging due to small effects, but experimental perturbations offer a complementary approach. Using CRISPR, we knocked out 84 genes in primary CD4+ T cells, targeting inborn error of immunity (IEI) disease transcription factors (TFs) and TFs without immune disease association. We developed a novel gene network inference method called linear latent causal Bayes (LLCB) to estimate the network from perturbation data and observed 211 regulatory connections between genes. We characterized programs affected by the TFs, which we associated with immune genome-wide association study (GWAS) genes, finding that JAK-STAT family members are regulated by KMT2A, an epigenetic regulator. These analyses reveal the trans-regulatory cascades linking GWAS genes to signaling pathways.
{"title":"Gene regulatory network inference from CRISPR perturbations in primary CD4<sup>+</sup> T cells elucidates the genomic basis of immune disease.","authors":"Joshua S Weinstock, Maya M Arce, Jacob W Freimer, Mineto Ota, Alexander Marson, Alexis Battle, Jonathan K Pritchard","doi":"10.1016/j.xgen.2024.100671","DOIUrl":"10.1016/j.xgen.2024.100671","url":null,"abstract":"<p><p>The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation has been challenging due to small effects, but experimental perturbations offer a complementary approach. Using CRISPR, we knocked out 84 genes in primary CD4<sup>+</sup> T cells, targeting inborn error of immunity (IEI) disease transcription factors (TFs) and TFs without immune disease association. We developed a novel gene network inference method called linear latent causal Bayes (LLCB) to estimate the network from perturbation data and observed 211 regulatory connections between genes. We characterized programs affected by the TFs, which we associated with immune genome-wide association study (GWAS) genes, finding that JAK-STAT family members are regulated by KMT2A, an epigenetic regulator. These analyses reveal the trans-regulatory cascades linking GWAS genes to signaling pathways.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100671"},"PeriodicalIF":3.5,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-10-21DOI: 10.1016/j.xgen.2024.100680
Alyssa R Holman, Shaina Tran, Eugin Destici, Elie N Farah, Ting Li, Aileena C Nelson, Adam J Engler, Neil C Chi
Illuminating the precise stepwise genetic programs directing cardiac development provides insights into the mechanisms of congenital heart disease and strategies for cardiac regenerative therapies. Here, we integrate in vitro and in vivo human single-cell multi-omic studies with high-throughput functional genomic screening to reveal dynamic, cardiac-specific gene regulatory networks (GRNs) and transcriptional regulators during human cardiomyocyte development. Interrogating developmental trajectories reconstructed from single-cell data unexpectedly reveal divergent cardiomyocyte lineages with distinct gene programs based on developmental signaling pathways. High-throughput functional genomic screens identify key transcription factors from inferred GRNs that are functionally relevant for cardiomyocyte lineages derived from each pathway. Notably, we discover a critical heat shock transcription factor 1 (HSF1)-mediated cardiometabolic GRN controlling cardiac mitochondrial/metabolic function and cell survival, also observed in fetal human cardiomyocytes. Overall, these multi-modal genomic studies enable the systematic discovery and validation of coordinated GRNs and transcriptional regulators controlling the development of distinct human cardiomyocyte populations.
{"title":"Single-cell multi-modal integrative analyses highlight functional dynamic gene regulatory networks directing human cardiac development.","authors":"Alyssa R Holman, Shaina Tran, Eugin Destici, Elie N Farah, Ting Li, Aileena C Nelson, Adam J Engler, Neil C Chi","doi":"10.1016/j.xgen.2024.100680","DOIUrl":"10.1016/j.xgen.2024.100680","url":null,"abstract":"<p><p>Illuminating the precise stepwise genetic programs directing cardiac development provides insights into the mechanisms of congenital heart disease and strategies for cardiac regenerative therapies. Here, we integrate in vitro and in vivo human single-cell multi-omic studies with high-throughput functional genomic screening to reveal dynamic, cardiac-specific gene regulatory networks (GRNs) and transcriptional regulators during human cardiomyocyte development. Interrogating developmental trajectories reconstructed from single-cell data unexpectedly reveal divergent cardiomyocyte lineages with distinct gene programs based on developmental signaling pathways. High-throughput functional genomic screens identify key transcription factors from inferred GRNs that are functionally relevant for cardiomyocyte lineages derived from each pathway. Notably, we discover a critical heat shock transcription factor 1 (HSF1)-mediated cardiometabolic GRN controlling cardiac mitochondrial/metabolic function and cell survival, also observed in fetal human cardiomyocytes. Overall, these multi-modal genomic studies enable the systematic discovery and validation of coordinated GRNs and transcriptional regulators controlling the development of distinct human cardiomyocyte populations.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100680"},"PeriodicalIF":3.5,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-10-31DOI: 10.1016/j.xgen.2024.100692
Victor Borda, Douglas P Loesch, Bing Guo, Roland Laboulaye, Diego Veliz-Otani, Jennifer N French, Thiago Peixoto Leal, Stephanie M Gogarten, Sunday Ikpe, Mateus H Gouveia, Marla Mendes, Gonçalo R Abecasis, Isabela Alvim, Carlos E Arboleda-Bustos, Gonzalo Arboleda, Humberto Arboleda, Mauricio L Barreto, Lucas Barwick, Marcos A Bezzera, John Blangero, Vanderci Borges, Omar Caceres, Jianwen Cai, Pedro Chana-Cuevas, Zhanghua Chen, Brian Custer, Michael Dean, Carla Dinardo, Igor Domingos, Ravindranath Duggirala, Elena Dieguez, Willian Fernandez, Henrique B Ferraz, Frank Gilliland, Heinner Guio, Bernardo Horta, Joanne E Curran, Jill M Johnsen, Robert C Kaplan, Shannon Kelly, Eimear E Kenny, Barbara A Konkle, Charles Kooperberg, Andres Lescano, M Fernanda Lima-Costa, Ruth J F Loos, Ani Manichaikul, Deborah A Meyers, Michel S Naslavsky, Deborah A Nickerson, Kari E North, Carlos Padilla, Michael Preuss, Victor Raggio, Alexander P Reiner, Stephen S Rich, Carlos R Rieder, Michiel Rienstra, Jerome I Rotter, Tatjana Rundek, Ralph L Sacco, Cesar Sanchez, Vijay G Sankaran, Bruno Lopes Santos-Lobato, Artur Francisco Schumacher-Schuh, Marilia O Scliar, Edwin K Silverman, Tamar Sofer, Jessica Lasky-Su, Vitor Tumas, Scott T Weiss, Ignacio F Mata, Ryan D Hernandez, Eduardo Tarazona-Santos, Timothy D O'Connor
Latin Americans are underrepresented in genetic studies, increasing disparities in personalized genomic medicine. Despite available genetic data from thousands of Latin Americans, accessing and navigating the bureaucratic hurdles for consent or access remains challenging. To address this, we introduce the Genetics of Latin American Diversity (GLAD) Project, compiling genome-wide information from 53,738 Latin Americans across 39 studies representing 46 geographical regions. Through GLAD, we identified heterogeneous ancestry composition and recent gene flow across the Americas. Additionally, we developed GLAD-match, a simulated annealing-based algorithm, to match the genetic background of external samples to our database, sharing summary statistics (i.e., allele and haplotype frequencies) without transferring individual-level genotypes. Finally, we demonstrate the potential of GLAD as a critical resource for evaluating statistical genetic software in the presence of admixture. By providing this resource, we promote genomic research in Latin Americans and contribute to the promises of personalized medicine to more people.
{"title":"Genetics of Latin American Diversity Project: Insights into population genetics and association studies in admixed groups in the Americas.","authors":"Victor Borda, Douglas P Loesch, Bing Guo, Roland Laboulaye, Diego Veliz-Otani, Jennifer N French, Thiago Peixoto Leal, Stephanie M Gogarten, Sunday Ikpe, Mateus H Gouveia, Marla Mendes, Gonçalo R Abecasis, Isabela Alvim, Carlos E Arboleda-Bustos, Gonzalo Arboleda, Humberto Arboleda, Mauricio L Barreto, Lucas Barwick, Marcos A Bezzera, John Blangero, Vanderci Borges, Omar Caceres, Jianwen Cai, Pedro Chana-Cuevas, Zhanghua Chen, Brian Custer, Michael Dean, Carla Dinardo, Igor Domingos, Ravindranath Duggirala, Elena Dieguez, Willian Fernandez, Henrique B Ferraz, Frank Gilliland, Heinner Guio, Bernardo Horta, Joanne E Curran, Jill M Johnsen, Robert C Kaplan, Shannon Kelly, Eimear E Kenny, Barbara A Konkle, Charles Kooperberg, Andres Lescano, M Fernanda Lima-Costa, Ruth J F Loos, Ani Manichaikul, Deborah A Meyers, Michel S Naslavsky, Deborah A Nickerson, Kari E North, Carlos Padilla, Michael Preuss, Victor Raggio, Alexander P Reiner, Stephen S Rich, Carlos R Rieder, Michiel Rienstra, Jerome I Rotter, Tatjana Rundek, Ralph L Sacco, Cesar Sanchez, Vijay G Sankaran, Bruno Lopes Santos-Lobato, Artur Francisco Schumacher-Schuh, Marilia O Scliar, Edwin K Silverman, Tamar Sofer, Jessica Lasky-Su, Vitor Tumas, Scott T Weiss, Ignacio F Mata, Ryan D Hernandez, Eduardo Tarazona-Santos, Timothy D O'Connor","doi":"10.1016/j.xgen.2024.100692","DOIUrl":"10.1016/j.xgen.2024.100692","url":null,"abstract":"<p><p>Latin Americans are underrepresented in genetic studies, increasing disparities in personalized genomic medicine. Despite available genetic data from thousands of Latin Americans, accessing and navigating the bureaucratic hurdles for consent or access remains challenging. To address this, we introduce the Genetics of Latin American Diversity (GLAD) Project, compiling genome-wide information from 53,738 Latin Americans across 39 studies representing 46 geographical regions. Through GLAD, we identified heterogeneous ancestry composition and recent gene flow across the Americas. Additionally, we developed GLAD-match, a simulated annealing-based algorithm, to match the genetic background of external samples to our database, sharing summary statistics (i.e., allele and haplotype frequencies) without transferring individual-level genotypes. Finally, we demonstrate the potential of GLAD as a critical resource for evaluating statistical genetic software in the presence of admixture. By providing this resource, we promote genomic research in Latin Americans and contribute to the promises of personalized medicine to more people.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100692"},"PeriodicalIF":3.5,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142565444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-11-06DOI: 10.1016/j.xgen.2024.100693
Neha Rohatgi, Jean-Philippe Fortin, Ted Lau, Yi Ying, Yue Zhang, Bettina L Lee, Michael R Costa, Rohit Reja
The CRISPR interference (CRISPRi) system is a powerful tool for selectively and efficiently silencing genes in functional genomics research applications. However, its off-target activity has not been systematically investigated. Here, we utilized a genome-wide CRISPRi-Cas9 single-guide RNA (sgRNA) library to investigate the presence of off-target activity and its effects on gene expression. Our findings suggest that off-target effects in CRISPRi are quite pervasive and have direct and indirect impacts on gene expression. Most of the identified off-targets can be accounted for by complementarity of the protospacer adjacent motif (PAM)-proximal genomic sequence with the 3' half of the sgRNA spacer sequence, the seed sequence. We also report that while the stability of off-target binding is primarily driven by the PAM-proximal seed sequences, variations in the length of these seed sequences and the degree of mismatch tolerance at various positions can differ across different sgRNAs.
{"title":"Seed sequences mediate off-target activity in the CRISPR-interference system.","authors":"Neha Rohatgi, Jean-Philippe Fortin, Ted Lau, Yi Ying, Yue Zhang, Bettina L Lee, Michael R Costa, Rohit Reja","doi":"10.1016/j.xgen.2024.100693","DOIUrl":"10.1016/j.xgen.2024.100693","url":null,"abstract":"<p><p>The CRISPR interference (CRISPRi) system is a powerful tool for selectively and efficiently silencing genes in functional genomics research applications. However, its off-target activity has not been systematically investigated. Here, we utilized a genome-wide CRISPRi-Cas9 single-guide RNA (sgRNA) library to investigate the presence of off-target activity and its effects on gene expression. Our findings suggest that off-target effects in CRISPRi are quite pervasive and have direct and indirect impacts on gene expression. Most of the identified off-targets can be accounted for by complementarity of the protospacer adjacent motif (PAM)-proximal genomic sequence with the 3' half of the sgRNA spacer sequence, the seed sequence. We also report that while the stability of off-target binding is primarily driven by the PAM-proximal seed sequences, variations in the length of these seed sequences and the degree of mismatch tolerance at various positions can differ across different sgRNAs.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100693"},"PeriodicalIF":3.5,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13DOI: 10.1016/j.xgen.2024.100695
Lucy G Dornan, Jeremy C Simpson
Establishing the subcellular distribution of all proteins encoded by the human genome remains a key objective of life science research. This is particularly important in the context of proteins that, through genetic sequencing of patients, have been identified as containing missense mutations. A recent publication in Cell1 highlights the prominence of protein mislocalization as a hallmark of dysfunctional proteins. The use of high-content subcellular phenotypic screens and allied technology by Lacoste and colleagues has enormous potential to change the landscape of how we approach both diagnostic and therapeutic decisions.
{"title":"Lost in translation: Illuminating protein mislocalization through high-content screening microscopy.","authors":"Lucy G Dornan, Jeremy C Simpson","doi":"10.1016/j.xgen.2024.100695","DOIUrl":"10.1016/j.xgen.2024.100695","url":null,"abstract":"<p><p>Establishing the subcellular distribution of all proteins encoded by the human genome remains a key objective of life science research. This is particularly important in the context of proteins that, through genetic sequencing of patients, have been identified as containing missense mutations. A recent publication in Cell<sup>1</sup> highlights the prominence of protein mislocalization as a hallmark of dysfunctional proteins. The use of high-content subcellular phenotypic screens and allied technology by Lacoste and colleagues has enormous potential to change the landscape of how we approach both diagnostic and therapeutic decisions.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"4 11","pages":"100695"},"PeriodicalIF":11.1,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-11-01DOI: 10.1016/j.xgen.2024.100691
Liujia Qian, Rui Sun, Ruedi Aebersold, Peter Bühlmann, Chris Sander, Tiannan Guo
The insufficient availability of comprehensive protein-level perturbation data is impeding the widespread adoption of systems biology. In this perspective, we introduce the rationale, essentiality, and practicality of perturbation proteomics. Biological systems are perturbed with diverse biological, chemical, and/or physical factors, followed by proteomic measurements at various levels, including changes in protein expression and turnover, post-translational modifications, protein interactions, transport, and localization, along with phenotypic data. Computational models, employing traditional machine learning or deep learning, identify or predict perturbation responses, mechanisms of action, and protein functions, aiding in therapy selection, compound design, and efficient experiment design. We propose to outline a generic PMMP (perturbation, measurement, modeling to prediction) pipeline and build foundation models or other suitable mathematical models based on large-scale perturbation proteomic data. Finally, we contrast modeling between artificially and naturally perturbed systems and highlight the importance of perturbation proteomics for advancing our understanding and predictive modeling of biological systems.
{"title":"AI-empowered perturbation proteomics for complex biological systems.","authors":"Liujia Qian, Rui Sun, Ruedi Aebersold, Peter Bühlmann, Chris Sander, Tiannan Guo","doi":"10.1016/j.xgen.2024.100691","DOIUrl":"10.1016/j.xgen.2024.100691","url":null,"abstract":"<p><p>The insufficient availability of comprehensive protein-level perturbation data is impeding the widespread adoption of systems biology. In this perspective, we introduce the rationale, essentiality, and practicality of perturbation proteomics. Biological systems are perturbed with diverse biological, chemical, and/or physical factors, followed by proteomic measurements at various levels, including changes in protein expression and turnover, post-translational modifications, protein interactions, transport, and localization, along with phenotypic data. Computational models, employing traditional machine learning or deep learning, identify or predict perturbation responses, mechanisms of action, and protein functions, aiding in therapy selection, compound design, and efficient experiment design. We propose to outline a generic PMMP (perturbation, measurement, modeling to prediction) pipeline and build foundation models or other suitable mathematical models based on large-scale perturbation proteomic data. Finally, we contrast modeling between artificially and naturally perturbed systems and highlight the importance of perturbation proteomics for advancing our understanding and predictive modeling of biological systems.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100691"},"PeriodicalIF":11.1,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142565443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-10-21DOI: 10.1016/j.xgen.2024.100679
Sara Saez-Atienzar, Cleide Dos Santos Souza, Ruth Chia, Selina N Beal, Ileana Lorenzini, Ruili Huang, Jennifer Levy, Camelia Burciu, Jinhui Ding, J Raphael Gibbs, Ashley Jones, Ramita Dewan, Viviana Pensato, Silvia Peverelli, Lucia Corrado, Joke J F A van Vugt, Wouter van Rheenen, Ceren Tunca, Elif Bayraktar, Menghang Xia, Alfredo Iacoangeli, Aleksey Shatunov, Cinzia Tiloca, Nicola Ticozzi, Federico Verde, Letizia Mazzini, Kevin Kenna, Ahmad Al Khleifat, Sarah Opie-Martin, Flavia Raggi, Massimiliano Filosto, Stefano Cotti Piccinelli, Alessandro Padovani, Stella Gagliardi, Maurizio Inghilleri, Alessandra Ferlini, Rosario Vasta, Andrea Calvo, Cristina Moglia, Antonio Canosa, Umberto Manera, Maurizio Grassano, Jessica Mandrioli, Gabriele Mora, Christian Lunetta, Raffaella Tanel, Francesca Trojsi, Patrizio Cardinali, Salvatore Gallone, Maura Brunetti, Daniela Galimberti, Maria Serpente, Chiara Fenoglio, Elio Scarpini, Giacomo P Comi, Stefania Corti, Roberto Del Bo, Mauro Ceroni, Giuseppe Lauria Pinter, Franco Taroni, Eleonora Dalla Bella, Enrica Bersano, Charles J Curtis, Sang Hyuck Lee, Raymond Chung, Hamel Patel, Karen E Morrison, Johnathan Cooper-Knock, Pamela J Shaw, Gerome Breen, Richard J B Dobson, Clifton L Dalgard, Sonja W Scholz, Ammar Al-Chalabi, Leonard H van den Berg, Russell McLaughlin, Orla Hardiman, Cristina Cereda, Gianni Sorarù, Sandra D'Alfonso, Siddharthan Chandran, Suvankar Pal, Antonia Ratti, Cinzia Gellera, Kory Johnson, Tara Doucet-O'Hare, Nicholas Pasternack, Tongguang Wang, Avindra Nath, Gabriele Siciliano, Vincenzo Silani, Ayşe Nazlı Başak, Jan H Veldink, William Camu, Jonathan D Glass, John E Landers, Adriano Chiò, Rita Sattler, Christopher E Shaw, Laura Ferraiuolo, Isabella Fogh, Bryan J Traynor
Repeat expansions in the C9orf72 gene are the most common genetic cause of (ALS) and frontotemporal dementia (FTD). Like other genetic forms of neurodegeneration, pinpointing the precise mechanism(s) by which this mutation leads to neuronal death remains elusive, and this lack of knowledge hampers the development of therapy for C9orf72-related disease. We used an agnostic approach based on genomic data (n = 41,273 ALS and healthy samples, and n = 1,516 C9orf72 carriers) to overcome these bottlenecks. Our drug-repurposing screen, based on gene- and expression-pattern matching and information about the genetic variants influencing onset age among C9orf72 carriers, identified acamprosate, a γ-aminobutyric acid analog, as a potentially repurposable treatment for patients carrying C9orf72 repeat expansions. We validated its neuroprotective effect in cell models and showed comparable efficacy to riluzole, the current standard of care. Our work highlights the potential value of genomics in repurposing drugs in situations where the underlying pathomechanisms are inherently complex. VIDEO ABSTRACT.
C9orf72 基因的重复扩增是 ALS 和额颞叶痴呆症(FTD)最常见的遗传病因。与其他神经变性的遗传形式一样,确定这种突变导致神经元死亡的确切机制仍是一个难题,这种知识的缺乏阻碍了 C9orf72 相关疾病疗法的开发。我们采用了一种基于基因组数据(n = 41,273 ALS 和健康样本,n = 1,516 C9orf72 携带者)的不可知论方法来克服这些瓶颈。根据基因和表达模式匹配以及影响 C9orf72 携带者发病年龄的基因变异信息,我们进行了药物再利用筛选,发现γ-氨基丁酸类似物阿坎酸(acamprosate)是一种可用于携带 C9orf72 重复扩增患者的潜在再利用疗法。我们在细胞模型中验证了它的神经保护作用,其疗效与目前的标准疗法利鲁唑相当。我们的工作凸显了基因组学在潜在病理机制固有复杂的情况下重新设计药物用途的潜在价值。视频摘要。
{"title":"Mechanism-free repurposing of drugs for C9orf72-related ALS/FTD using large-scale genomic data.","authors":"Sara Saez-Atienzar, Cleide Dos Santos Souza, Ruth Chia, Selina N Beal, Ileana Lorenzini, Ruili Huang, Jennifer Levy, Camelia Burciu, Jinhui Ding, J Raphael Gibbs, Ashley Jones, Ramita Dewan, Viviana Pensato, Silvia Peverelli, Lucia Corrado, Joke J F A van Vugt, Wouter van Rheenen, Ceren Tunca, Elif Bayraktar, Menghang Xia, Alfredo Iacoangeli, Aleksey Shatunov, Cinzia Tiloca, Nicola Ticozzi, Federico Verde, Letizia Mazzini, Kevin Kenna, Ahmad Al Khleifat, Sarah Opie-Martin, Flavia Raggi, Massimiliano Filosto, Stefano Cotti Piccinelli, Alessandro Padovani, Stella Gagliardi, Maurizio Inghilleri, Alessandra Ferlini, Rosario Vasta, Andrea Calvo, Cristina Moglia, Antonio Canosa, Umberto Manera, Maurizio Grassano, Jessica Mandrioli, Gabriele Mora, Christian Lunetta, Raffaella Tanel, Francesca Trojsi, Patrizio Cardinali, Salvatore Gallone, Maura Brunetti, Daniela Galimberti, Maria Serpente, Chiara Fenoglio, Elio Scarpini, Giacomo P Comi, Stefania Corti, Roberto Del Bo, Mauro Ceroni, Giuseppe Lauria Pinter, Franco Taroni, Eleonora Dalla Bella, Enrica Bersano, Charles J Curtis, Sang Hyuck Lee, Raymond Chung, Hamel Patel, Karen E Morrison, Johnathan Cooper-Knock, Pamela J Shaw, Gerome Breen, Richard J B Dobson, Clifton L Dalgard, Sonja W Scholz, Ammar Al-Chalabi, Leonard H van den Berg, Russell McLaughlin, Orla Hardiman, Cristina Cereda, Gianni Sorarù, Sandra D'Alfonso, Siddharthan Chandran, Suvankar Pal, Antonia Ratti, Cinzia Gellera, Kory Johnson, Tara Doucet-O'Hare, Nicholas Pasternack, Tongguang Wang, Avindra Nath, Gabriele Siciliano, Vincenzo Silani, Ayşe Nazlı Başak, Jan H Veldink, William Camu, Jonathan D Glass, John E Landers, Adriano Chiò, Rita Sattler, Christopher E Shaw, Laura Ferraiuolo, Isabella Fogh, Bryan J Traynor","doi":"10.1016/j.xgen.2024.100679","DOIUrl":"10.1016/j.xgen.2024.100679","url":null,"abstract":"<p><p>Repeat expansions in the C9orf72 gene are the most common genetic cause of (ALS) and frontotemporal dementia (FTD). Like other genetic forms of neurodegeneration, pinpointing the precise mechanism(s) by which this mutation leads to neuronal death remains elusive, and this lack of knowledge hampers the development of therapy for C9orf72-related disease. We used an agnostic approach based on genomic data (n = 41,273 ALS and healthy samples, and n = 1,516 C9orf72 carriers) to overcome these bottlenecks. Our drug-repurposing screen, based on gene- and expression-pattern matching and information about the genetic variants influencing onset age among C9orf72 carriers, identified acamprosate, a γ-aminobutyric acid analog, as a potentially repurposable treatment for patients carrying C9orf72 repeat expansions. We validated its neuroprotective effect in cell models and showed comparable efficacy to riluzole, the current standard of care. Our work highlights the potential value of genomics in repurposing drugs in situations where the underlying pathomechanisms are inherently complex. VIDEO ABSTRACT.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100679"},"PeriodicalIF":3.5,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13Epub Date: 2024-10-14DOI: 10.1016/j.xgen.2024.100672
Jessica L Zhou, Karthik Guruvayurappan, Shushan Toneyan, Hsiuyi V Chen, Aaron R Chen, Peter Koo, Graham McVicker
A single gene may have multiple enhancers, but how they work in concert to regulate transcription is poorly understood. To analyze enhancer interactions throughout the genome, we developed a generalized linear modeling framework, GLiMMIRS, for interrogating enhancer effects from single-cell CRISPR experiments. We applied GLiMMIRS to a published dataset and tested for interactions between 46,166 enhancer pairs and corresponding genes, including 264 "high-confidence" enhancer pairs. We found that enhancer effects combine multiplicatively but with limited evidence for further interactions. Only 31 enhancer pairs exhibited significant interactions (false discovery rate <0.1), none of which came from the high-confidence set, and 20 were driven by outlier expression values. Additional analyses of a second CRISPR dataset and in silico enhancer perturbations with Enformer both support a multiplicative model of enhancer effects without interactions. Altogether, our results indicate that enhancer interactions are uncommon or have small effects that are difficult to detect.
{"title":"Analysis of single-cell CRISPR perturbations indicates that enhancers predominantly act multiplicatively.","authors":"Jessica L Zhou, Karthik Guruvayurappan, Shushan Toneyan, Hsiuyi V Chen, Aaron R Chen, Peter Koo, Graham McVicker","doi":"10.1016/j.xgen.2024.100672","DOIUrl":"10.1016/j.xgen.2024.100672","url":null,"abstract":"<p><p>A single gene may have multiple enhancers, but how they work in concert to regulate transcription is poorly understood. To analyze enhancer interactions throughout the genome, we developed a generalized linear modeling framework, GLiMMIRS, for interrogating enhancer effects from single-cell CRISPR experiments. We applied GLiMMIRS to a published dataset and tested for interactions between 46,166 enhancer pairs and corresponding genes, including 264 \"high-confidence\" enhancer pairs. We found that enhancer effects combine multiplicatively but with limited evidence for further interactions. Only 31 enhancer pairs exhibited significant interactions (false discovery rate <0.1), none of which came from the high-confidence set, and 20 were driven by outlier expression values. Additional analyses of a second CRISPR dataset and in silico enhancer perturbations with Enformer both support a multiplicative model of enhancer effects without interactions. Altogether, our results indicate that enhancer interactions are uncommon or have small effects that are difficult to detect.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100672"},"PeriodicalIF":3.5,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}