Chen Qi, Xuechun Shen, Baitao Li, Chuan Liu, Lei Huang, Hongxia Lan, Donglong Chen, Yuan Jiang, Dan Wang
{"title":"PAMPHLET:PAM 预测同源逻辑增强工具包,用于在 CRISPR-Cas 系统中精确预测 PAM。","authors":"Chen Qi, Xuechun Shen, Baitao Li, Chuan Liu, Lei Huang, Hongxia Lan, Donglong Chen, Yuan Jiang, Dan Wang","doi":"10.1016/j.jgg.2024.10.014","DOIUrl":null,"url":null,"abstract":"<p><p>The CRISPR-Cas technology has revolutionized our ability to understand and engineer organisms, evolving from a singular Cas9 model to a diverse CRISPR toolbox. A critical bottleneck in developing new Cas proteins is identifying protospacer adjacent motif (PAM) sequences. Due to the limitations of experimental methods, bioinformatics approaches have become essential. However, existing PAM prediction programs are limited by the small number of spacers in CRISPR-Cas systems, resulting in low accuracy. To address this, we develop PAMPHLET, a novel pipeline that uses homology searches to identify additional spacers, significantly increasing the number of spacers up to 18-fold. PAMPHLET is validated on 20 CRISPR-Cas systems and successfully predicts PAM sequences for 18 protospacers. These predictions are further validated using the DocMF platform, which characterizes protein-DNA recognition patterns via next-generation sequencing. The high consistency between PAMPHLET predictions and DocMF results for novel Cas proteins demonstrates potential of PAMPHLET to enhance PAM sequence prediction accuracy, expedite the discovery process, and accelerate the development of CRISPR tools.</p>","PeriodicalId":54825,"journal":{"name":"Journal of Genetics and Genomics","volume":" ","pages":""},"PeriodicalIF":6.6000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PAMPHLET: PAM Prediction HomoLogous-Enhancement Toolkit for precise PAM prediction in CRISPR-Cas systems.\",\"authors\":\"Chen Qi, Xuechun Shen, Baitao Li, Chuan Liu, Lei Huang, Hongxia Lan, Donglong Chen, Yuan Jiang, Dan Wang\",\"doi\":\"10.1016/j.jgg.2024.10.014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The CRISPR-Cas technology has revolutionized our ability to understand and engineer organisms, evolving from a singular Cas9 model to a diverse CRISPR toolbox. A critical bottleneck in developing new Cas proteins is identifying protospacer adjacent motif (PAM) sequences. Due to the limitations of experimental methods, bioinformatics approaches have become essential. However, existing PAM prediction programs are limited by the small number of spacers in CRISPR-Cas systems, resulting in low accuracy. To address this, we develop PAMPHLET, a novel pipeline that uses homology searches to identify additional spacers, significantly increasing the number of spacers up to 18-fold. PAMPHLET is validated on 20 CRISPR-Cas systems and successfully predicts PAM sequences for 18 protospacers. These predictions are further validated using the DocMF platform, which characterizes protein-DNA recognition patterns via next-generation sequencing. The high consistency between PAMPHLET predictions and DocMF results for novel Cas proteins demonstrates potential of PAMPHLET to enhance PAM sequence prediction accuracy, expedite the discovery process, and accelerate the development of CRISPR tools.</p>\",\"PeriodicalId\":54825,\"journal\":{\"name\":\"Journal of Genetics and Genomics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2024-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Genetics and Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jgg.2024.10.014\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Genetics and Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.jgg.2024.10.014","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
PAMPHLET: PAM Prediction HomoLogous-Enhancement Toolkit for precise PAM prediction in CRISPR-Cas systems.
The CRISPR-Cas technology has revolutionized our ability to understand and engineer organisms, evolving from a singular Cas9 model to a diverse CRISPR toolbox. A critical bottleneck in developing new Cas proteins is identifying protospacer adjacent motif (PAM) sequences. Due to the limitations of experimental methods, bioinformatics approaches have become essential. However, existing PAM prediction programs are limited by the small number of spacers in CRISPR-Cas systems, resulting in low accuracy. To address this, we develop PAMPHLET, a novel pipeline that uses homology searches to identify additional spacers, significantly increasing the number of spacers up to 18-fold. PAMPHLET is validated on 20 CRISPR-Cas systems and successfully predicts PAM sequences for 18 protospacers. These predictions are further validated using the DocMF platform, which characterizes protein-DNA recognition patterns via next-generation sequencing. The high consistency between PAMPHLET predictions and DocMF results for novel Cas proteins demonstrates potential of PAMPHLET to enhance PAM sequence prediction accuracy, expedite the discovery process, and accelerate the development of CRISPR tools.
期刊介绍:
The Journal of Genetics and Genomics (JGG, formerly known as Acta Genetica Sinica ) is an international journal publishing peer-reviewed articles of novel and significant discoveries in the fields of genetics and genomics. Topics of particular interest include but are not limited to molecular genetics, developmental genetics, cytogenetics, epigenetics, medical genetics, population and evolutionary genetics, genomics and functional genomics as well as bioinformatics and computational biology.