Repetitive DNA sequences, satellite DNAs (satDNAs) and transposable elements (TEs) are essential components of the genome landscape, with many different roles in genome function and evolution. Despite significant advances in sequencing technologies and bioinformatics tools, detection and classification of repetitive sequences can still be an obstacle to the analysis of genomic repeats. Here, we summarize how specificities in repetitive DNA organizational patterns can lead to an inability to classify (and study) a significant fraction of bivalve mollusk repetitive sequences. We suggest that the main reasons for this inability are: the predominant association of satDNA arrays with Helitron/Helentron TEs; the existence of many complex loci; and the unusual, highly scattered organization of short satDNA arrays or single monomers across the whole genome. The specificities of bivalve genomes confirm the need for introducing diverse organisms as models in order to understand all aspects of repetitive DNA biology. It is expected that further development of sequencing techniques and synergy among different bioinformatics tools and databases will enable quick and unambiguous characterization and classification of repetitive DNA sequences in assembled genomes.
{"title":"Classification Problems of Repetitive DNA Sequences","authors":"Eva Šatović-Vukšić, M. Plohl","doi":"10.3390/dna1020009","DOIUrl":"https://doi.org/10.3390/dna1020009","url":null,"abstract":"Repetitive DNA sequences, satellite DNAs (satDNAs) and transposable elements (TEs) are essential components of the genome landscape, with many different roles in genome function and evolution. Despite significant advances in sequencing technologies and bioinformatics tools, detection and classification of repetitive sequences can still be an obstacle to the analysis of genomic repeats. Here, we summarize how specificities in repetitive DNA organizational patterns can lead to an inability to classify (and study) a significant fraction of bivalve mollusk repetitive sequences. We suggest that the main reasons for this inability are: the predominant association of satDNA arrays with Helitron/Helentron TEs; the existence of many complex loci; and the unusual, highly scattered organization of short satDNA arrays or single monomers across the whole genome. The specificities of bivalve genomes confirm the need for introducing diverse organisms as models in order to understand all aspects of repetitive DNA biology. It is expected that further development of sequencing techniques and synergy among different bioinformatics tools and databases will enable quick and unambiguous characterization and classification of repetitive DNA sequences in assembled genomes.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43800369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sandra Eloisa Bülau, Rafael Kretschmer, I. O. Furo, E. D. de Oliveira, T. R. D. de Freitas
Karyotypic analyses have several applications in studies of chromosome organization, evolution, and cytotaxonomy. They are also essential to genome assembly projects. Here, we present for the first time the karyotype description of the endangered species yellow cardinal, Gubernatrix cristata (Passeriformes, Thraupidae), using conventional staining with Giemsa and 18S rDNA probes. This species has 78 chromosomes, with 12 pairs of macrochromosomes and 27 microchromosome pairs. The 18S rDNA clusters were found in four microchromosomes. Our results revealed that G. cristata has a typical avian karyotype (approximately 80 chromosomes). However, G. cristata has an apomorphic state in relation to the 18S rDNA distribution since the ancestral condition corresponds to only two microchromosomes with these sequences. Probably, duplications and translocations were responsible for increasing the number of 18S rDNA clusters in G. cristata. The results were compared and discussed with respect to other Thraupidae and Passeriformes members. Considering the globally threatened status of G. cristata, we believe that its karyotype description could be a starting point for future cytogenetics and sequencing projects.
{"title":"Karyotype Organization of the Endangered Species Yellow Cardinal (Gubernatrix cristata)","authors":"Sandra Eloisa Bülau, Rafael Kretschmer, I. O. Furo, E. D. de Oliveira, T. R. D. de Freitas","doi":"10.3390/dna1020008","DOIUrl":"https://doi.org/10.3390/dna1020008","url":null,"abstract":"Karyotypic analyses have several applications in studies of chromosome organization, evolution, and cytotaxonomy. They are also essential to genome assembly projects. Here, we present for the first time the karyotype description of the endangered species yellow cardinal, Gubernatrix cristata (Passeriformes, Thraupidae), using conventional staining with Giemsa and 18S rDNA probes. This species has 78 chromosomes, with 12 pairs of macrochromosomes and 27 microchromosome pairs. The 18S rDNA clusters were found in four microchromosomes. Our results revealed that G. cristata has a typical avian karyotype (approximately 80 chromosomes). However, G. cristata has an apomorphic state in relation to the 18S rDNA distribution since the ancestral condition corresponds to only two microchromosomes with these sequences. Probably, duplications and translocations were responsible for increasing the number of 18S rDNA clusters in G. cristata. The results were compared and discussed with respect to other Thraupidae and Passeriformes members. Considering the globally threatened status of G. cristata, we believe that its karyotype description could be a starting point for future cytogenetics and sequencing projects.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44173115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nicole M. Lewis, C. Rathje, C. Canedo-Ribeiro, Lisa M. Bosman, Lucas G. Kiazim, Rebecca L. Jennings, R. O’Connor, G. Silvestri, D. Griffin
Pigs (Sus scrofa) have vast economic importance, with pork accounting for over 30% of the global meat consumption. Chromosomal abnormalities, and in particular reciprocal translocations (RTs), are an important cause of hypoprolificacy (litter size reduction) in pigs. However, these do not necessarily present with a recognizable phenotype and may cause significant economic losses for breeders when undetected. Here, we present a reappraisal of the incidence of RTs across several European pig herds, using contemporary methodology, as well as an analysis modelling the economic impact of these abnormalities. Molecular cytogenetic investigation was completed by karyotyping and/or multiprobe FISH (fluorescence in situ hybridisation) between 2016–2021, testing 2673 animals. We identified 19 types of chromosome abnormalities, the prevalence of these errors in the database was 9.1%, and the estimated incidence of de novo errors was 0.90%. Financial modelling across different scenarios revealed the potential economic impact of an undetected RT, ranging from £69,802 for an individual affected terminal boar in a commercial farm selling weaned pigs, to £51,215,378 for a genetics company with an undetected RT in a dam line boar used in a nucleus farm. Moreover, the added benefits of screening by FISH instead of karyotyping were estimated, providing a strong case for proactive screening by this approach.
{"title":"Incidence, Reproductive Outcome, and Economic Impact of Reciprocal Translocations in the Domestic Pig","authors":"Nicole M. Lewis, C. Rathje, C. Canedo-Ribeiro, Lisa M. Bosman, Lucas G. Kiazim, Rebecca L. Jennings, R. O’Connor, G. Silvestri, D. Griffin","doi":"10.3390/dna1020007","DOIUrl":"https://doi.org/10.3390/dna1020007","url":null,"abstract":"Pigs (Sus scrofa) have vast economic importance, with pork accounting for over 30% of the global meat consumption. Chromosomal abnormalities, and in particular reciprocal translocations (RTs), are an important cause of hypoprolificacy (litter size reduction) in pigs. However, these do not necessarily present with a recognizable phenotype and may cause significant economic losses for breeders when undetected. Here, we present a reappraisal of the incidence of RTs across several European pig herds, using contemporary methodology, as well as an analysis modelling the economic impact of these abnormalities. Molecular cytogenetic investigation was completed by karyotyping and/or multiprobe FISH (fluorescence in situ hybridisation) between 2016–2021, testing 2673 animals. We identified 19 types of chromosome abnormalities, the prevalence of these errors in the database was 9.1%, and the estimated incidence of de novo errors was 0.90%. Financial modelling across different scenarios revealed the potential economic impact of an undetected RT, ranging from £69,802 for an individual affected terminal boar in a commercial farm selling weaned pigs, to £51,215,378 for a genetics company with an undetected RT in a dam line boar used in a nucleus farm. Moreover, the added benefits of screening by FISH instead of karyotyping were estimated, providing a strong case for proactive screening by this approach.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45156598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shayer Mahmood Ibney Alam, Tulyawat Prasongmaneerut, D. Gleeson, A. Georges, S. Sarre, K. Srikulnath, T. Ezaz
Sex-determination mechanisms and sex chromosomes are known to vary among reptile species and, in a few celebrated examples, within populations of the same species. The oriental garden lizard, Calotes versicolor, is one of the most intriguing species in this regard, exhibiting evidence of multiple sex-determination modes within a single species. One possible explanation for this unusual distribution is that in C. versicolor, different modes of sex determination are confined to a particular population or a species within a cryptic species complex. Here, we report on a population genetic analysis using SNP data from a methylation-sensitive DArT sequencing analysis and mitochondrial DNA data obtained from samples collected from six locations: three from Bangladesh and three from Thailand. Our aim was to determine whether C. versicolor is best described as a single species with multiple lineages or as multiple species, as well as if its sex-determination mechanisms vary within or between species. We present evidence that the latter possibility is the case and that C. versicolor comprises a complex of cryptic species. We also identify sex-linked markers within these species and use them to identify modes of sex determination. Overall, our results suggest that different sex-determination modes have evolved among closely related species and within populations of Agamid lizards.
{"title":"Sex-Determination Mechanisms among Populations within Cryptic Species Complex of Calotes (Squamata: Agamidae: Draconinae)","authors":"Shayer Mahmood Ibney Alam, Tulyawat Prasongmaneerut, D. Gleeson, A. Georges, S. Sarre, K. Srikulnath, T. Ezaz","doi":"10.3390/dna1020006","DOIUrl":"https://doi.org/10.3390/dna1020006","url":null,"abstract":"Sex-determination mechanisms and sex chromosomes are known to vary among reptile species and, in a few celebrated examples, within populations of the same species. The oriental garden lizard, Calotes versicolor, is one of the most intriguing species in this regard, exhibiting evidence of multiple sex-determination modes within a single species. One possible explanation for this unusual distribution is that in C. versicolor, different modes of sex determination are confined to a particular population or a species within a cryptic species complex. Here, we report on a population genetic analysis using SNP data from a methylation-sensitive DArT sequencing analysis and mitochondrial DNA data obtained from samples collected from six locations: three from Bangladesh and three from Thailand. Our aim was to determine whether C. versicolor is best described as a single species with multiple lineages or as multiple species, as well as if its sex-determination mechanisms vary within or between species. We present evidence that the latter possibility is the case and that C. versicolor comprises a complex of cryptic species. We also identify sex-linked markers within these species and use them to identify modes of sex determination. Overall, our results suggest that different sex-determination modes have evolved among closely related species and within populations of Agamid lizards.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47204541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aninda Sundar Dey, Chayan Bhattacharya, Y. Guan, B. Jha, Mridul Mukherji
In the mammalian genome, cytosine methylation predominantly occurs at CpG sites. In addition, a number of recent studies have uncovered extensive C5 cytosine methylation (5mC) at non-CpG (5mCpH, where H = A/C/T) sites. Little is known about the enzyme responsible for active demethylation of 5mCpH sites. Using a very sensitive and quantitative LC–MS/MS method, we demonstrate that the human TET2, an iron (II)- and 2OG-dependent dioxygenase, which is a frequently mutated gene in several myeloid malignancies, as well as in a number of other types of cancers, can oxidize 5mCpH sites in double-stranded DNA in vitro. Similar to oxidation of 5mCpG, oxidation of 5mC at CpH sites produces 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxycytosine (5caC) bases in DNA. After 5mCpG, which is the most preferred substrate, TET2 prefers 5mCpC as a substrate, followed by 5mCpA and then 5mCpT. Since the TDG/BER pathway and deformylation or decarboxylation of 5fC or 5caC, respectively, can convert 5fCpH and 5caCpH to an unmodified cytosine base in DNA, our results suggest a novel demethylation pathway of 5mCpH sites initiated by TET2 dioxygenase.
{"title":"Demethylation of Non-CpG Sites in DNA Is Initiated by TET2 5-Methylcytosine Dioxygenase","authors":"Aninda Sundar Dey, Chayan Bhattacharya, Y. Guan, B. Jha, Mridul Mukherji","doi":"10.3390/dna1010004","DOIUrl":"https://doi.org/10.3390/dna1010004","url":null,"abstract":"In the mammalian genome, cytosine methylation predominantly occurs at CpG sites. In addition, a number of recent studies have uncovered extensive C5 cytosine methylation (5mC) at non-CpG (5mCpH, where H = A/C/T) sites. Little is known about the enzyme responsible for active demethylation of 5mCpH sites. Using a very sensitive and quantitative LC–MS/MS method, we demonstrate that the human TET2, an iron (II)- and 2OG-dependent dioxygenase, which is a frequently mutated gene in several myeloid malignancies, as well as in a number of other types of cancers, can oxidize 5mCpH sites in double-stranded DNA in vitro. Similar to oxidation of 5mCpG, oxidation of 5mC at CpH sites produces 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxycytosine (5caC) bases in DNA. After 5mCpG, which is the most preferred substrate, TET2 prefers 5mCpC as a substrate, followed by 5mCpA and then 5mCpT. Since the TDG/BER pathway and deformylation or decarboxylation of 5fC or 5caC, respectively, can convert 5fCpH and 5caCpH to an unmodified cytosine base in DNA, our results suggest a novel demethylation pathway of 5mCpH sites initiated by TET2 dioxygenase.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43457067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Brian M. Wendel, Adrián J. Hernández, C. T. Courcelle, J. Courcelle
In Escherichia coli, several enzymes have been identified that participate in completing replication on the chromosome, including RecG, SbcCD, ExoI, and RecBCD. However, other enzymes are likely to be involved and the precise enzymatic mechanism by which this reaction occurs remains unknown. Two steps predicted to be necessary to complete replication are removal of Okazaki RNA fragments and ligation of the nascent strands at convergent replication forks. E. coli encodes two RNases that remove RNA-DNA hybrids, rnhA and rnhB, as well as two ligases, ligA and ligB. Here, we used replication profiling to show that rnhA and ligA, encoding RNase HI and Ligase A, participate in the completion reaction. Deletion of rnhA impaired the ability to complete replication and resulted in over-replication in the terminus region. It additionally suppressed initiation events from oriC, suggesting a role for the enzyme in oriC-dependent initiation, as has been suggested previously. We also show that a temperature-sensitive mutation in Ligase A led to over-replication at sites where replication completes, and that degradation at these sites occurred upon shifting to the nonpermissive temperature. Deletion of rnhB or ligB did not affect the growth or profile of replication on the genome.
{"title":"Ligase A and RNase HI Participate in Completing Replication on the Chromosome in Escherichia coli","authors":"Brian M. Wendel, Adrián J. Hernández, C. T. Courcelle, J. Courcelle","doi":"10.3390/dna1010003","DOIUrl":"https://doi.org/10.3390/dna1010003","url":null,"abstract":"In Escherichia coli, several enzymes have been identified that participate in completing replication on the chromosome, including RecG, SbcCD, ExoI, and RecBCD. However, other enzymes are likely to be involved and the precise enzymatic mechanism by which this reaction occurs remains unknown. Two steps predicted to be necessary to complete replication are removal of Okazaki RNA fragments and ligation of the nascent strands at convergent replication forks. E. coli encodes two RNases that remove RNA-DNA hybrids, rnhA and rnhB, as well as two ligases, ligA and ligB. Here, we used replication profiling to show that rnhA and ligA, encoding RNase HI and Ligase A, participate in the completion reaction. Deletion of rnhA impaired the ability to complete replication and resulted in over-replication in the terminus region. It additionally suppressed initiation events from oriC, suggesting a role for the enzyme in oriC-dependent initiation, as has been suggested previously. We also show that a temperature-sensitive mutation in Ligase A led to over-replication at sites where replication completes, and that degradation at these sites occurred upon shifting to the nonpermissive temperature. Deletion of rnhB or ligB did not affect the growth or profile of replication on the genome.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43115214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pre-metaphase stretch is a term first coined by the preeminent cell biologist Sally Hughes-Schrader in 1950 to describe an elongation of prometaphase chromosomes observed in the primary spermatocytes of phasmid insects and praying mantids. Research from many groups since Hughes-Schrader’s initial observation has revealed reasons for both how and why chromosomes might elongate prior to metaphase. In this review, we describe Hughes-Schrader’s initial findings and discuss how recent work illuminates and provides some mechanistic explanation for this long-ago observed phenomenon.
{"title":"The Pre-Metaphase Stretch: A Re-Examination","authors":"Megan A. Czekalski, L. Paliulis","doi":"10.3390/dna1010002","DOIUrl":"https://doi.org/10.3390/dna1010002","url":null,"abstract":"Pre-metaphase stretch is a term first coined by the preeminent cell biologist Sally Hughes-Schrader in 1950 to describe an elongation of prometaphase chromosomes observed in the primary spermatocytes of phasmid insects and praying mantids. Research from many groups since Hughes-Schrader’s initial observation has revealed reasons for both how and why chromosomes might elongate prior to metaphase. In this review, we describe Hughes-Schrader’s initial findings and discuss how recent work illuminates and provides some mechanistic explanation for this long-ago observed phenomenon.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3390/dna1010002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46482487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We are very proud to launch this new and exciting international peer-reviewed open access journal [...]
我们非常自豪地推出这本新的、令人兴奋的国际同行评议开放获取期刊[…]
{"title":"Welcome to DNA—An Open Access Journal","authors":"D. Griffin","doi":"10.3390/DNA1010001","DOIUrl":"https://doi.org/10.3390/DNA1010001","url":null,"abstract":"We are very proud to launch this new and exciting international peer-reviewed open access journal [...]","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3390/DNA1010001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41890162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The thermodynamic binding networks (TBN) model was recently developed as a tool for studying engineered molecular systems. The TBN model allows one to reason about their behavior through a simplified abstraction that ignores details about molecular composition, focusing on two key determinants of a system's energetics common to any chemical substrate: how many molecular bonds are formed, and how many separate complexes exist in the system. We formulate as an integer program the NP-hard problem of computing stable configurations of a TBN (a.k.a., minimum energy: those that maximize the number of bonds and complexes). We provide open-source software that solves these formulations, and give empirical evidence that this approach enables dramatically faster computation of TBN stable configurations than previous approaches based on SAT solvers. Our setup can also reason about TBNs in which some molecules have unbounded counts. These improvements in turn allow us to efficiently automate verification of desired properties of practical TBNs. Finally, we show that the TBN's Graver basis (a kind of certificate of optimality in integer programming) has a natural interpretation as the "fundamental components" out of which locally minimal energy configurations are composed. This characterization helps verify correctness of not only stable configurations, but entire "kinetic pathways" in a TBN.
{"title":"Computing Properties of Thermodynamic Binding Networks: An Integer Programming Approach","authors":"David Haley, David Doty","doi":"10.4230/LIPIcs.DNA.27.2","DOIUrl":"https://doi.org/10.4230/LIPIcs.DNA.27.2","url":null,"abstract":"The thermodynamic binding networks (TBN) model was recently developed as a tool for studying engineered molecular systems. The TBN model allows one to reason about their behavior through a simplified abstraction that ignores details about molecular composition, focusing on two key determinants of a system's energetics common to any chemical substrate: how many molecular bonds are formed, and how many separate complexes exist in the system. We formulate as an integer program the NP-hard problem of computing stable configurations of a TBN (a.k.a., minimum energy: those that maximize the number of bonds and complexes). We provide open-source software that solves these formulations, and give empirical evidence that this approach enables dramatically faster computation of TBN stable configurations than previous approaches based on SAT solvers. Our setup can also reason about TBNs in which some molecules have unbounded counts. These improvements in turn allow us to efficiently automate verification of desired properties of practical TBNs. Finally, we show that the TBN's Graver basis (a kind of certificate of optimality in integer programming) has a natural interpretation as the \"fundamental components\" out of which locally minimal energy configurations are composed. This characterization helps verify correctness of not only stable configurations, but entire \"kinetic pathways\" in a TBN.","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41304342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-09-01DOI: 10.4230/LIPIcs.DNA.2020.8
Yuta Matsumura, I. Kawamata, S. Murata
The problem of finding the smallest DNA tile set that self-assembles into a desired pattern or shape is a research focus that has been investigated by many researchers. In this paper, we take a polyomino, which is a non-square element composed of several connected square units, as an element of assembly and consider the design problem of the minimal set of polyominoes that self-assembles into a desired shape. We developed a self-assembly simulator of polyominoes based on the agent-based Monte Carlo method, in which the potential energy among the polyominoes is evaluated and the simulation state is updated toward the direction to decrease the total potential. Aggregated polyominoes are represented as an agent, which can move, merge, and split during the simulation. In order to search the minimal set of polyominoes, two-step evaluation strategy is adopted, because of enormous search space including many parameters such as the shape, the size, and the glue types attached to the polyominoes. The feasibility of the proposed method is shown through three examples with different size and complexity. 2012 ACM Subject Classification Applied computing → Systems biology; Applied computing → Chemistry; Hardware → Biology-related information processing
{"title":"Design Automation of Polyomino Set That Self-Assembles into a Desired Shape","authors":"Yuta Matsumura, I. Kawamata, S. Murata","doi":"10.4230/LIPIcs.DNA.2020.8","DOIUrl":"https://doi.org/10.4230/LIPIcs.DNA.2020.8","url":null,"abstract":"The problem of finding the smallest DNA tile set that self-assembles into a desired pattern or shape is a research focus that has been investigated by many researchers. In this paper, we take a polyomino, which is a non-square element composed of several connected square units, as an element of assembly and consider the design problem of the minimal set of polyominoes that self-assembles into a desired shape. We developed a self-assembly simulator of polyominoes based on the agent-based Monte Carlo method, in which the potential energy among the polyominoes is evaluated and the simulation state is updated toward the direction to decrease the total potential. Aggregated polyominoes are represented as an agent, which can move, merge, and split during the simulation. In order to search the minimal set of polyominoes, two-step evaluation strategy is adopted, because of enormous search space including many parameters such as the shape, the size, and the glue types attached to the polyominoes. The feasibility of the proposed method is shown through three examples with different size and complexity. 2012 ACM Subject Classification Applied computing → Systems biology; Applied computing → Chemistry; Hardware → Biology-related information processing","PeriodicalId":77708,"journal":{"name":"DNA (Mary Ann Liebert, Inc.)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48330247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}