CRISPR-GEM: A Novel Machine Learning Model for CRISPR Genetic Target Discovery and Evaluation

IF 3.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS ACS Synthetic Biology Pub Date : 2024-10-07 DOI:10.1021/acssynbio.4c0047310.1021/acssynbio.4c00473
Joshua P. Graham, Yu Zhang, Lifang He and Tomas Gonzalez-Fernandez*, 
{"title":"CRISPR-GEM: A Novel Machine Learning Model for CRISPR Genetic Target Discovery and Evaluation","authors":"Joshua P. Graham,&nbsp;Yu Zhang,&nbsp;Lifang He and Tomas Gonzalez-Fernandez*,&nbsp;","doi":"10.1021/acssynbio.4c0047310.1021/acssynbio.4c00473","DOIUrl":null,"url":null,"abstract":"<p >CRISPR gene editing strategies are shaping cell therapies through precise and tunable control over gene expression. However, limitations in safely delivering high quantities of CRISPR machinery demand careful target gene selection to achieve reliable therapeutic effects. Informed target gene selection requires a thorough understanding of the involvement of target genes in gene regulatory networks (GRNs) and thus their impact on cell phenotype. Effective decoding of these complex networks has been achieved using machine learning models, but current techniques are limited to single cell types and focus mainly on transcription factors, limiting their applicability to CRISPR strategies. To address this, we present CRISPR-GEM, a multilayer perceptron (MLP) based synthetic GRN constructed to accurately predict the downstream effects of CRISPR gene editing. First, input and output nodes are identified as differentially expressed genes between defined experimental and target cell/tissue types, respectively. Then, MLP training learns regulatory relationships in a black-box approach allowing accurate prediction of output gene expression using only input gene expression. Finally, CRISPR-mimetic perturbations are made to each input gene individually, and the resulting model predictions are compared to those for the target group to score and assess each input gene as a CRISPR candidate. The top scoring genes provided by CRISPR-GEM therefore best modulate experimental group GRNs to motivate transcriptomic shifts toward a target group phenotype. This machine learning model is the first of its kind for predicting optimal CRISPR target genes and serves as a powerful tool for enhanced CRISPR strategies across a range of cell therapies.</p>","PeriodicalId":26,"journal":{"name":"ACS Synthetic Biology","volume":null,"pages":null},"PeriodicalIF":3.7000,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acssynbio.4c00473","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Synthetic Biology","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssynbio.4c00473","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

CRISPR gene editing strategies are shaping cell therapies through precise and tunable control over gene expression. However, limitations in safely delivering high quantities of CRISPR machinery demand careful target gene selection to achieve reliable therapeutic effects. Informed target gene selection requires a thorough understanding of the involvement of target genes in gene regulatory networks (GRNs) and thus their impact on cell phenotype. Effective decoding of these complex networks has been achieved using machine learning models, but current techniques are limited to single cell types and focus mainly on transcription factors, limiting their applicability to CRISPR strategies. To address this, we present CRISPR-GEM, a multilayer perceptron (MLP) based synthetic GRN constructed to accurately predict the downstream effects of CRISPR gene editing. First, input and output nodes are identified as differentially expressed genes between defined experimental and target cell/tissue types, respectively. Then, MLP training learns regulatory relationships in a black-box approach allowing accurate prediction of output gene expression using only input gene expression. Finally, CRISPR-mimetic perturbations are made to each input gene individually, and the resulting model predictions are compared to those for the target group to score and assess each input gene as a CRISPR candidate. The top scoring genes provided by CRISPR-GEM therefore best modulate experimental group GRNs to motivate transcriptomic shifts toward a target group phenotype. This machine learning model is the first of its kind for predicting optimal CRISPR target genes and serves as a powerful tool for enhanced CRISPR strategies across a range of cell therapies.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CRISPR-GEM:用于发现和评估 CRISPR 基因靶标的新型机器学习模型
CRISPR 基因编辑策略通过对基因表达的精确和可调控制,正在塑造细胞疗法。然而,由于安全输送大量 CRISPR 机器的局限性,需要仔细选择靶基因,以获得可靠的治疗效果。知情的靶基因选择需要全面了解靶基因在基因调控网络(GRN)中的参与情况及其对细胞表型的影响。利用机器学习模型已经实现了对这些复杂网络的有效解码,但目前的技术仅限于单一细胞类型,而且主要侧重于转录因子,限制了它们对 CRISPR 策略的适用性。为了解决这个问题,我们提出了 CRISPR-GEM,这是一种基于多层感知器(MLP)的合成 GRN,用于准确预测 CRISPR 基因编辑的下游效应。首先,输入和输出节点分别被识别为定义的实验和目标细胞/组织类型之间的差异表达基因。然后,通过 MLP 训练学习黑箱方法中的调控关系,从而仅使用输入基因表达量就能准确预测输出基因表达量。最后,对每个输入基因分别进行 CRISPR 模拟扰动,并将模型预测结果与目标组的预测结果进行比较,从而对每个输入基因作为 CRISPR 候选基因进行评分和评估。因此,CRISPR-GEM 提供的得分最高的基因能最好地调节实验组 GRN,促使转录组向目标组表型转变。这种机器学习模型是首个用于预测最佳 CRISPR 目标基因的同类模型,是在一系列细胞疗法中增强 CRISPR 策略的有力工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
8.00
自引率
10.60%
发文量
380
审稿时长
6-12 weeks
期刊介绍: The journal is particularly interested in studies on the design and synthesis of new genetic circuits and gene products; computational methods in the design of systems; and integrative applied approaches to understanding disease and metabolism. Topics may include, but are not limited to: Design and optimization of genetic systems Genetic circuit design and their principles for their organization into programs Computational methods to aid the design of genetic systems Experimental methods to quantify genetic parts, circuits, and metabolic fluxes Genetic parts libraries: their creation, analysis, and ontological representation Protein engineering including computational design Metabolic engineering and cellular manufacturing, including biomass conversion Natural product access, engineering, and production Creative and innovative applications of cellular programming Medical applications, tissue engineering, and the programming of therapeutic cells Minimal cell design and construction Genomics and genome replacement strategies Viral engineering Automated and robotic assembly platforms for synthetic biology DNA synthesis methodologies Metagenomics and synthetic metagenomic analysis Bioinformatics applied to gene discovery, chemoinformatics, and pathway construction Gene optimization Methods for genome-scale measurements of transcription and metabolomics Systems biology and methods to integrate multiple data sources in vitro and cell-free synthetic biology and molecular programming Nucleic acid engineering.
期刊最新文献
Issue Publication Information Issue Editorial Masthead Ribozyme-Mediated Gene-Fragment Complementation for Nondestructive Reporting of DNA Transfer within Soil. Macroscopic Assembly of Materials with Engineered Bacterial Spores via Coiled-Coil Interaction. A High-Throughput Screen for Antiproliferative Peptides in Mammalian Cells Identifies Key Transcription Factor Families.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1