Designing for durability: new tools to build stable, non-repetitive DNA.

Synthetic biology (Oxford, England) | IF 2.6 | Q2, Biochemical Research Methods | Pub Date: 2020-08-19 | eCollection Date: 2020-01-01 | DOI: 10.1093/synbio/ysaa016
Pablo Cárdenas
Synthetic biology (Oxford, England), vol. 5, no. 1, ysaa016. Publication type: Journal Article.
Citations: 2

Abstract

The survival of genetic information hinges on identifying repetition. Genomes are repaired by mechanisms such as homologous recombination, in which matching DNA sequences are used as a template to replace missing information. This strategy works provided sequences in the genome are mostly unique. While sequence diversity has kept genomes stable enough to replicate for millions of years, it poses a problem for those trying to engineer DNA (1). After all, one of the central tenets of synthetic biology is the reutilization of standard parts. How, then, can we design stable, non-repetitive genetic systems with a limited toolkit of synthetic parts?

Researchers in Howard Salis’s lab at Pennsylvania State University set out to address this challenge through the Non-Repetitive Parts Calculator (NRPC), a set of new algorithms described in a recent publication by Hossain et al. (2) and available online (https://salislab.net/software/). As the name implies, NRPC builds collections of biological parts containing minimal repetitive sequences, where the repetitiveness of a collection is defined by Lmax, the length of the longest repeat shared between its parts.

Collections can be created using two different modes. The ‘Finder’ mode determines the largest subset of non-repetitive elements within any given database of parts, given a maximum Lmax set by the user. The sheer number of possible subsets to evaluate can make this computationally impractical for large libraries. The authors solve this problem by representing parts as nodes on a graph and improving on existing algorithms in graph theory to efficiently maximize the number of disconnected components. The ‘Maker’ mode creates a new library of non-repetitive parts within the design constraints set by the user, which may include a degenerate DNA sequence or RNA structure template and a set Lmax value.
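The graph formulation of the ‘Finder’ mode can be illustrated with a minimal sketch. It is not the algorithm published by Hossain et al.: it assumes that two parts are linked whenever they share any substring of length Lmax + 1, it uses a plain greedy heuristic (repeatedly dropping the most-connected part) instead of the authors' improved graph algorithm, and it ignores reverse complements. All function names are hypothetical.

```python
from itertools import combinations


def kmers(seq: str, k: int) -> set[str]:
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def has_internal_repeat(seq: str, k: int) -> bool:
    """True if any length-k substring occurs more than once inside seq."""
    return len(kmers(seq, k)) < max(len(seq) - k + 1, 0)


def find_nonrepetitive_subset(parts: list[str], lmax: int) -> list[str]:
    """Greedy sketch: keep parts so that no repeat longer than lmax remains."""
    k = lmax + 1  # assumption: a shared (lmax + 1)-mer violates the constraint
    # Drop exact duplicates and parts that already repeat within themselves.
    candidates = [p for p in dict.fromkeys(parts) if not has_internal_repeat(p, k)]
    kmer_sets = {p: kmers(p, k) for p in candidates}

    # Parts are nodes; an edge joins two parts that share at least one k-mer.
    neighbors: dict[str, set[str]] = {p: set() for p in candidates}
    for a, b in combinations(candidates, 2):
        if kmer_sets[a] & kmer_sets[b]:
            neighbors[a].add(b)
            neighbors[b].add(a)

    # Remove the most-connected part until no edges are left.
    while True:
        worst = max(neighbors, key=lambda p: len(neighbors[p]), default=None)
        if worst is None or not neighbors[worst]:
            break
        for other in neighbors.pop(worst):
            neighbors[other].discard(worst)
    return list(neighbors)


if __name__ == "__main__":
    toy_parts = ["ACGTACGA", "TTACGTCC", "GGGCCCAA", "CATTGGCA"]
    print(find_nonrepetitive_subset(toy_parts, lmax=3))
```

The greedy removal does not guarantee the largest possible subset; it only guarantees that the parts it returns share no repeat longer than Lmax, which is the property the sketch is meant to show.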
In ‘Maker’ mode, all possible sequences are represented as a decision tree and hash tables are used to store and check for occurrences of sub-sequences within parts. Hossain et al. tested their new ‘Maker’ algorithm by generating libraries of 4350 synthetic, non-repetitive bacterial promoters and 1722 yeast promoters, designed to have a wide range of transcription rates. The authors validated each library’s predicted transcriptional behavior by assembling and characterizing every promoter through next-generation DNA and RNA sequencing in Escherichia coli and Saccharomyces cerevisiae. The increased stability of NRPC designs was demonstrated in E. coli by comparing versions of a construct with either repetitive or non-repetitive promoters. The former rapidly lost fluorescence and DNA content while the latter remained stable. Finally, the authors applied regression models and neural networks developed elsewhere (3) to explain and predict the strength of the synthetic promoters they created.

This work can have tremendous, immediate impact in two ways. Not only did Hossain et al. produce vast libraries of bacterial and yeast promoters with known expression profiles and improved compatibility, but they also published software for researchers to design their own stable libraries for many different applications. This raises the question of what threshold of repetitiveness, whether measured as Lmax or with another metric, should be used in a given organismic context. Regardless, NRPC is noteworthy for tackling a pervasive problem in synthetic biology, one seemingly at odds with the principles of the field.
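The ‘Maker’ idea of walking a decision tree while checking sub-sequences against a hash table can be sketched as follows. This is an illustration under stated assumptions, not the NRPC implementation: it handles only a degenerate DNA template (no RNA structure constraint), treats any shared substring of length Lmax + 1 as a violation, ignores reverse complements, and uses a simple depth-first search with backtracking. The function names and the example template, loosely modelled on a bacterial promoter with two fixed hexamers, are hypothetical.

```python
import random

# IUPAC degenerate nucleotide codes and the bases each one allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def make_part(template, k, used, rng):
    """Depth-first walk of the decision tree for one part.

    Returns (sequence, new k-mers) or None if no valid sequence exists.
    """
    seq = []       # bases chosen so far
    local = set()  # k-mers introduced by this part so far

    def extend(pos):
        if pos == len(template):
            return True
        choices = list(IUPAC[template[pos]])
        rng.shuffle(choices)
        for base in choices:
            seq.append(base)
            kmer = "".join(seq[-k:]) if len(seq) >= k else None
            # Hash-set lookups reject any k-mer already seen anywhere.
            if kmer is None or (kmer not in used and kmer not in local):
                if kmer is not None:
                    local.add(kmer)
                if extend(pos + 1):
                    return True
                if kmer is not None:
                    local.discard(kmer)  # backtrack
            seq.pop()
        return False

    return ("".join(seq), local) if extend(0) else None


def make_library(template, lmax, n_parts, seed=0):
    """Build up to n_parts sequences that share no repeat longer than lmax."""
    k = lmax + 1  # assumption: a shared (lmax + 1)-mer violates the constraint
    rng = random.Random(seed)
    used = set()  # every k-mer already present in the library
    library = []
    for _ in range(n_parts):
        result = make_part(template, k, used, rng)
        if result is None:
            break  # the template cannot yield another non-repetitive part
        seq, new_kmers = result
        used |= new_kmers
        library.append(seq)
    return library


if __name__ == "__main__":
    template = "TTGACA" + "N" * 17 + "TATAAT"
    for part in make_library(template, lmax=10, n_parts=5, seed=1):
        print(part)
```

Because the search backtracks, it can become slow once a template is nearly exhausted; the sketch is only meant for small examples such as the one in the usage block.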