Syn-CpG-Spacer: A Panel web app for synonymous recoding of viral genomes with CpG dinucleotides

Aleksander Sulkowski, Clément Bouton, Chad Swanson
{"title":"Syn-CpG-Spacer: A Panel web app for synonymous recoding of viral genomes with CpG dinucleotides","authors":"Aleksander Sulkowski, Clément Bouton, Chad Swanson","doi":"10.21105/joss.06332","DOIUrl":null,"url":null,"abstract":"Summary Vertebrate genomes contain lower than expected frequencies of the CpG dinucleotide. Consequently, many vertebrate viruses have evolved to mimic this composition, possibly in order to evade host antiviral defences (Greenbaum et al., 2008). For example, the antiviral protein ZAP binds CpGs in viral single stranded RNA with specific spacing requirements (Gonçalves-Carneiro et al., 2022), though CpGs are also likely depleted in viral genomes due to other selective pressures (Forni et al., 2023). Increasing CpG abundance by synonymous recoding could facilitate attenuation of viruses without compromising their epitope antigenicity by changing non-CpG codons to alternatives containing CpG without changing the overall amino acid sequence (Gonçalves-Carneiro et al., 2022; Le Nouën et al., 2019; Sharp et al., 2023). There are three ways CpGs can be synonymously introduced in codons: at positions 1-2 for arginine (e.g. AGA → CGA), 2-3 for several amino acids (e.g. ACA → ACG), or in a 3-1 split configuration, if a subsequent codon begins with a G (e.g. ATA-GCA → ATC-GCA). Syn-CpG-Spacer is a Python progressive web app (PWA) (MDN Web Docs, 2023) made with the Panel library (Panel Development Team, 2024) that allows for consistent recoding of viral sequences and applying biologically relevant constraints. These include setting a minimum gap between CpG’s, optimising for an average CpG gap, protecting cis-acting regulatory signals from modification, and modulating the A-content in the overall sequence. The app features a sequence viewer made with the Bokeh library (Bokeh Development Team, 2024) that highlights CpG dinucleotides, allowing for efficient analysis of the resulting distribution of CpGs. This is complemented by a statistical data table. Utilising Biopython (Cock et al., 2009) modules, the user can load their sequence as a FASTA file and download the outputs as an alignment in the same format. As a PWA running on Pyodide (The Pyodide development team, 2023), the code is only executed in the user’s browser and they can install the app onto their machine for offline use.","PeriodicalId":16635,"journal":{"name":"Journal of open source software","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of open source software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21105/joss.06332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Summary Vertebrate genomes contain lower than expected frequencies of the CpG dinucleotide. Consequently, many vertebrate viruses have evolved to mimic this composition, possibly in order to evade host antiviral defences (Greenbaum et al., 2008). For example, the antiviral protein ZAP binds CpGs in viral single stranded RNA with specific spacing requirements (Gonçalves-Carneiro et al., 2022), though CpGs are also likely depleted in viral genomes due to other selective pressures (Forni et al., 2023). Increasing CpG abundance by synonymous recoding could facilitate attenuation of viruses without compromising their epitope antigenicity by changing non-CpG codons to alternatives containing CpG without changing the overall amino acid sequence (Gonçalves-Carneiro et al., 2022; Le Nouën et al., 2019; Sharp et al., 2023). There are three ways CpGs can be synonymously introduced in codons: at positions 1-2 for arginine (e.g. AGA → CGA), 2-3 for several amino acids (e.g. ACA → ACG), or in a 3-1 split configuration, if a subsequent codon begins with a G (e.g. ATA-GCA → ATC-GCA). Syn-CpG-Spacer is a Python progressive web app (PWA) (MDN Web Docs, 2023) made with the Panel library (Panel Development Team, 2024) that allows for consistent recoding of viral sequences and applying biologically relevant constraints. These include setting a minimum gap between CpG’s, optimising for an average CpG gap, protecting cis-acting regulatory signals from modification, and modulating the A-content in the overall sequence. The app features a sequence viewer made with the Bokeh library (Bokeh Development Team, 2024) that highlights CpG dinucleotides, allowing for efficient analysis of the resulting distribution of CpGs. This is complemented by a statistical data table. Utilising Biopython (Cock et al., 2009) modules, the user can load their sequence as a FASTA file and download the outputs as an alignment in the same format. As a PWA running on Pyodide (The Pyodide development team, 2023), the code is only executed in the user’s browser and they can install the app onto their machine for offline use.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Syn-CpG-Spacer:使用 CpG 二核苷酸对病毒基因组进行同义重编码的面板网络应用程序
摘要 脊椎动物基因组中 CpG 二核苷酸的频率低于预期。因此,许多脊椎动物病毒在进化过程中模仿了这种成分,可能是为了躲避宿主的抗病毒防御(Greenbaum 等人,2008 年)。例如,抗病毒蛋白 ZAP 与病毒单链 RNA 中具有特定间距要求的 CpGs 结合(Gonçalves-Carneiro 等人,2022 年),尽管由于其他选择性压力,CpGs 也可能在病毒基因组中被耗尽(Forni 等人,2023 年)。通过同义重编码增加 CpG 丰度,可以在不改变整体氨基酸序列的情况下,将非 CpG 密码子改变为含有 CpG 的替代密码子,从而促进病毒的衰减,同时不影响其表位抗原性(Gonçalves-Carneiro 等人,2022 年;Le Nouën 等人,2019 年;Sharp 等人,2023 年)。在密码子中引入 CpGs 的同义词有三种方式:1-2 位为精氨酸(如 AGA → CGA),2-3 位为几个氨基酸(如 ACA → ACG),或者 3-1 分裂配置,如果后面的密码子以 G 开头(如 ATA-GCA → ATC-GCA)。Syn-CpG-Spacer 是一个 Python 渐进式网络应用程序(PWA)(MDN Web Docs,2023 年),使用 Panel 库(Panel Development Team,2024 年)制作,可以对病毒序列进行一致的重新编码,并应用生物相关的限制条件。这些约束包括设置 CpG 之间的最小间隙、优化平均 CpG 间隙、保护顺式调控信号不被修改,以及调节整个序列中的 A 含量。该应用程序的特点是使用 Bokeh 库(Bokeh 开发团队,2024 年)制作的序列查看器,可突出显示 CpG 二核苷酸,从而有效分析由此产生的 CpG 分布。此外还有一个统计数据表作为补充。利用 Biopython(Cock 等人,2009 年)模块,用户可以将其序列加载为 FASTA 文件,并下载相同格式的比对结果。作为在 Pyodide(Pyodide 开发团队,2023 年)上运行的 PWA,代码只在用户的浏览器中执行,用户可以将应用程序安装到自己的机器上,以便离线使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
MousebreedeR: A novel software to assist in the design of breeding schema for complex genotypes of experimental organisms OpenTerrace: A fast, flexible and extendable Python framework for thermal energy storage packed bed simulations TrackSegNet: a tool for trajectory segmentation into diffusive states using supervised deep learning omni-fig: Unleashing Project Configuration and Organization in Python PowerAPI: A Python framework for building software-defined power meters
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1