Matthias Riediger, Miguel A Hernández-Prieto, Kuo Song, Wolfgang R Hess, Matthias E Futschik
The Ferric uptake regulator (Fur) is crucial to both pathogenic and non-pathogenic bacteria for the maintenance of iron homeostasis as well as the defence against reactive oxygen species. Based on datasets from the genome-wide mapping of transcriptional start sites and transcriptome data, we identified a high confidence regulon controlled by Fur for the model cyanobacterium Synechocystis sp. PCC 6803 and its close relative, strain 6714, based on the conserved strong iron starvation response and Fur-binding site occurrence. This regulon comprises 33 protein-coding genes and the sRNA IsaR1 that are under the control of 16 or 14 individual promoters in strains 6803 and 6714, respectively. The associated gene functions are mostly restricted to transporters and enzymes involved in the uptake and storage of iron ions, with few exceptions or unknown functional relevance. Within the isiABC operon, we identified a previously neglected gene encoding a small cysteine-rich protein, which we suggest calling, IsiE. The regulation of iron uptake, storage, and utilization ultimately results from the interplay between the Fur regulon, several other transcription factors, the FtsH3 protease, and the sRNA IsaR1.
{"title":"Genome-wide identification and characterization of Fur-binding sites in the cyanobacteria Synechocystis sp. PCC 6803 and PCC 6714.","authors":"Matthias Riediger, Miguel A Hernández-Prieto, Kuo Song, Wolfgang R Hess, Matthias E Futschik","doi":"10.1093/dnares/dsab023","DOIUrl":"https://doi.org/10.1093/dnares/dsab023","url":null,"abstract":"<p><p>The Ferric uptake regulator (Fur) is crucial to both pathogenic and non-pathogenic bacteria for the maintenance of iron homeostasis as well as the defence against reactive oxygen species. Based on datasets from the genome-wide mapping of transcriptional start sites and transcriptome data, we identified a high confidence regulon controlled by Fur for the model cyanobacterium Synechocystis sp. PCC 6803 and its close relative, strain 6714, based on the conserved strong iron starvation response and Fur-binding site occurrence. This regulon comprises 33 protein-coding genes and the sRNA IsaR1 that are under the control of 16 or 14 individual promoters in strains 6803 and 6714, respectively. The associated gene functions are mostly restricted to transporters and enzymes involved in the uptake and storage of iron ions, with few exceptions or unknown functional relevance. Within the isiABC operon, we identified a previously neglected gene encoding a small cysteine-rich protein, which we suggest calling, IsiE. The regulation of iron uptake, storage, and utilization ultimately results from the interplay between the Fur regulon, several other transcription factors, the FtsH3 protease, and the sRNA IsaR1.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 6","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/87/61/dsab023.PMC8634477.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39537705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Increasing genome data are coming out. Genome size estimation plays an essential role in guiding genome assembly. Several months ago, other researchers were the first to publish a draft genome of the red gromwell (i.e. Lithospermum erythrorhizon). However, we considered that the genome size they estimated and assembled was incorrect. This study meticulously estimated the L. erythrorhizon genome size to should be ∼708.74 Mb and further provided a reliable genome version (size ≈ 693.34 Mb; contigN50 length ≈ 238.08 Kb) to support our objection. Furthermore, according to our genome, we identified a gene family of the alkannin/shikonin O-acyltransferases (i.e. AAT/SAT) that catalysed enantiomer-specific acylations in the alkannin/shikonin biosynthesis (a characteristic metabolic pathway in L. erythrorhizon's roots) and further explored its evolutionary process. The results indicated that the existing AAT/SAT were not generated from only one round of gene duplication but three rounds; after different rounds of gene duplication, the existing AAT/SAT and their recent ancestors were under positive selection at different amino acid sites. These suggested that a combined power from gene duplication plus positive selection plausibly propelled AAT/SAT's functional differentiation in evolution.
越来越多的基因组数据出来了。基因组大小估计在指导基因组组装中起着至关重要的作用。几个月前,其他研究人员率先公布了红龙葵(即紫草)的基因组草图。然而,我们认为他们估计和组装的基因组大小是不正确的。本研究精心估计了红绿杆菌的基因组大小应为~ 708.74 Mb,并进一步提供了可靠的基因组版本(大小≈693.34 Mb;contigN50长度≈238.08 Kb)来支持我们的反对意见。此外,根据我们的基因组,我们确定了一个基因家族的alkins /shikonin o -酰基转移酶(即AAT/SAT),在alkins /shikonin生物合成中催化对构象特异性酰基化(L. erythrorhizon根的特征代谢途径),并进一步探索其进化过程。结果表明,现有的AAT/SAT不是由一轮基因复制产生的,而是由三轮基因复制产生的;经过多轮基因复制,现有的AAT/SAT及其近祖先在不同的氨基酸位点都处于正选择状态。这表明,基因复制和正选择的共同作用似乎推动了AAT/SAT在进化中的功能分化。
{"title":"Exploring the evolutionary process of alkannin/shikonin O-acyltransferases by a reliable Lithospermum erythrorhizon genome.","authors":"Chengyi Tang","doi":"10.1093/dnares/dsab015","DOIUrl":"https://doi.org/10.1093/dnares/dsab015","url":null,"abstract":"<p><p>Increasing genome data are coming out. Genome size estimation plays an essential role in guiding genome assembly. Several months ago, other researchers were the first to publish a draft genome of the red gromwell (i.e. Lithospermum erythrorhizon). However, we considered that the genome size they estimated and assembled was incorrect. This study meticulously estimated the L. erythrorhizon genome size to should be ∼708.74 Mb and further provided a reliable genome version (size ≈ 693.34 Mb; contigN50 length ≈ 238.08 Kb) to support our objection. Furthermore, according to our genome, we identified a gene family of the alkannin/shikonin O-acyltransferases (i.e. AAT/SAT) that catalysed enantiomer-specific acylations in the alkannin/shikonin biosynthesis (a characteristic metabolic pathway in L. erythrorhizon's roots) and further explored its evolutionary process. The results indicated that the existing AAT/SAT were not generated from only one round of gene duplication but three rounds; after different rounds of gene duplication, the existing AAT/SAT and their recent ancestors were under positive selection at different amino acid sites. These suggested that a combined power from gene duplication plus positive selection plausibly propelled AAT/SAT's functional differentiation in evolution.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 5","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8435551/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9231145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The crown-of-thorns starfish (COTS) is a coral predator that is widely distributed in Indo-Pacific Oceans. A previous phylogenetic study using partial mitochondrial sequences suggested that COTS had diverged into four distinct species, but a nuclear genome-based analysis to confirm this was not conducted. To address this, COTS species nuclear genome sequences were analysed here, sequencing Northern Indian Ocean (NIO) and Red Sea (RS) species genomes for the first time, followed by a comparative analysis with the Pacific Ocean (PO) species. Phylogenetic analysis and ADMIXTURE analysis revealed clear divergences between the three COTS species. Furthermore, within the PO species, the phylogenetic position of the Hawaiian sample was further away from the other Pacific-derived samples than expected based on the mitochondrial data, suggesting that it may be a PO subspecies. The pairwise sequentially Markovian coalescent model showed that the trajectories of the population size diverged by region during the Mid-Pleistocene transition when the sea-level was dramatically decreased, strongly suggesting that the three COTS species experienced allopatric speciation. Analysis of the orthologues indicated that there were remarkable genes with species-specific positive selection in the genomes of the PO and RS species, which suggested that there may be local adaptations in the COTS species.
{"title":"Elucidation of the speciation history of three sister species of crown-of-thorns starfish (Acanthaster spp.) based on genomic analysis.","authors":"Hideaki Yuasa, Rei Kajitani, Yuta Nakamura, Kazuki Takahashi, Miki Okuno, Fumiya Kobayashi, Takahiro Shinoda, Atsushi Toyoda, Yutaka Suzuki, Nalinee Thongtham, Zac Forsman, Omri Bronstein, Davide Seveso, Enrico Montalbetti, Coralie Taquet, Gal Eyal, Nina Yasuda, Takehiko Itoh","doi":"10.1093/dnares/dsab012","DOIUrl":"https://doi.org/10.1093/dnares/dsab012","url":null,"abstract":"<p><p>The crown-of-thorns starfish (COTS) is a coral predator that is widely distributed in Indo-Pacific Oceans. A previous phylogenetic study using partial mitochondrial sequences suggested that COTS had diverged into four distinct species, but a nuclear genome-based analysis to confirm this was not conducted. To address this, COTS species nuclear genome sequences were analysed here, sequencing Northern Indian Ocean (NIO) and Red Sea (RS) species genomes for the first time, followed by a comparative analysis with the Pacific Ocean (PO) species. Phylogenetic analysis and ADMIXTURE analysis revealed clear divergences between the three COTS species. Furthermore, within the PO species, the phylogenetic position of the Hawaiian sample was further away from the other Pacific-derived samples than expected based on the mitochondrial data, suggesting that it may be a PO subspecies. The pairwise sequentially Markovian coalescent model showed that the trajectories of the population size diverged by region during the Mid-Pleistocene transition when the sea-level was dramatically decreased, strongly suggesting that the three COTS species experienced allopatric speciation. Analysis of the orthologues indicated that there were remarkable genes with species-specific positive selection in the genomes of the PO and RS species, which suggested that there may be local adaptations in the COTS species.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 4","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8386664/pdf/dsab012.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39307244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Estrada Bonilla, Ana Rita Costa, Daan F van den Berg, Teunke van Rossum, Stefan Hagedoorn, Hielke Walinga, Minfeng Xiao, Wenchen Song, Pieter-Jan Haas, Franklin L Nobrega, Stan J J Brouns
Bacteriophages are an invaluable source of novel genetic diversity. Sequencing of phage genomes can reveal new proteins with potential uses as biotechnological and medical tools, and help unravel the diversity of biological mechanisms employed by phages to take over the host during viral infection. Aiming to expand the available collection of phage genomes, we have isolated, sequenced, and assembled the genome sequences of four phages that infect the clinical pathogen Klebsiella pneumoniae: vB_KpnP_FBKp16, vB_KpnP_FBKp27, vB_KpnM_FBKp34, and Jumbo phage vB_KpnM_FBKp24. The four phages show very low (0-13%) identity to genomic phage sequences deposited in the GenBank database. Three of the four phages encode tRNAs and have a GC content very dissimilar to that of the host. Importantly, the genome sequences of the phages reveal potentially novel DNA packaging mechanisms as well as distinct clades of tubulin spindle and nucleus shell proteins that some phages use to compartmentalize viral replication. Overall, this study contributes to uncovering previously unknown virus diversity, and provides novel candidates for phage therapy applications against antibiotic-resistant K. pneumoniae infections.
{"title":"Genomic characterization of four novel bacteriophages infecting the clinical pathogen Klebsiella pneumoniae.","authors":"Estrada Bonilla, Ana Rita Costa, Daan F van den Berg, Teunke van Rossum, Stefan Hagedoorn, Hielke Walinga, Minfeng Xiao, Wenchen Song, Pieter-Jan Haas, Franklin L Nobrega, Stan J J Brouns","doi":"10.1093/dnares/dsab013","DOIUrl":"https://doi.org/10.1093/dnares/dsab013","url":null,"abstract":"<p><p>Bacteriophages are an invaluable source of novel genetic diversity. Sequencing of phage genomes can reveal new proteins with potential uses as biotechnological and medical tools, and help unravel the diversity of biological mechanisms employed by phages to take over the host during viral infection. Aiming to expand the available collection of phage genomes, we have isolated, sequenced, and assembled the genome sequences of four phages that infect the clinical pathogen Klebsiella pneumoniae: vB_KpnP_FBKp16, vB_KpnP_FBKp27, vB_KpnM_FBKp34, and Jumbo phage vB_KpnM_FBKp24. The four phages show very low (0-13%) identity to genomic phage sequences deposited in the GenBank database. Three of the four phages encode tRNAs and have a GC content very dissimilar to that of the host. Importantly, the genome sequences of the phages reveal potentially novel DNA packaging mechanisms as well as distinct clades of tubulin spindle and nucleus shell proteins that some phages use to compartmentalize viral replication. Overall, this study contributes to uncovering previously unknown virus diversity, and provides novel candidates for phage therapy applications against antibiotic-resistant K. pneumoniae infections.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 4","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/4b/6f/dsab013.PMC8386662.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39309520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sai Reddy Achakkagari, Helen H Tai, Charlotte Davidson, Hielke De Jong, Martina V Strömvik
The potato mitogenome is complex and to understand various biological functions and nuclear-cytoplasmic interactions, it is important to characterize its gene content and structure. In this study, the complete mitogenome sequences of nine diploid potato clones along with a diploid Solanum okadae clone were characterized. Each mitogenome was assembled and annotated from Pacific Biosciences (PacBio) long reads and 10X genomics short reads. The results show that each mitogenome consists of multiple circular molecules with similar structure and gene organization, though two groups (clones 07506-01, DW84-1457, 08675-21 and H412-1 in one group, and clones W5281-2, 12625-02, 12120-03 and 11379-03 in another group) could be distinguished, and two mitogenomes (clone 10908-06 and OKA15) were not consistent with those or with each other. Significant differences in the repeat structure of the 10 mitogenomes were found, as was recombination events leading to multiple sub-genomic circles. Comparison between individual molecules revealed a translocation of ∼774 bp region located between a short repeat of 40 bp in molecule 3 of each mitogenome, and an insertion of the same in molecule 2 of the 10908-06 mitogenome. Finally, phylogenetic analyses revealed a close relationship between the mitogenomes of these clones and previously published potato mitogenomes.
{"title":"The complete mitogenome assemblies of 10 diploid potato clones reveal recombination and overlapping variants.","authors":"Sai Reddy Achakkagari, Helen H Tai, Charlotte Davidson, Hielke De Jong, Martina V Strömvik","doi":"10.1093/dnares/dsab009","DOIUrl":"https://doi.org/10.1093/dnares/dsab009","url":null,"abstract":"<p><p>The potato mitogenome is complex and to understand various biological functions and nuclear-cytoplasmic interactions, it is important to characterize its gene content and structure. In this study, the complete mitogenome sequences of nine diploid potato clones along with a diploid Solanum okadae clone were characterized. Each mitogenome was assembled and annotated from Pacific Biosciences (PacBio) long reads and 10X genomics short reads. The results show that each mitogenome consists of multiple circular molecules with similar structure and gene organization, though two groups (clones 07506-01, DW84-1457, 08675-21 and H412-1 in one group, and clones W5281-2, 12625-02, 12120-03 and 11379-03 in another group) could be distinguished, and two mitogenomes (clone 10908-06 and OKA15) were not consistent with those or with each other. Significant differences in the repeat structure of the 10 mitogenomes were found, as was recombination events leading to multiple sub-genomic circles. Comparison between individual molecules revealed a translocation of ∼774 bp region located between a short repeat of 40 bp in molecule 3 of each mitogenome, and an insertion of the same in molecule 2 of the 10908-06 mitogenome. Finally, phylogenetic analyses revealed a close relationship between the mitogenomes of these clones and previously published potato mitogenomes.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 4","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8386665/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39178506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kazuhiro Sato, Fumitaka Abe, Martin Mascher, Georg Haberer, Heidrun Gundlach, Manuel Spannagl, Kenta Shirasawa, Sachiko Isobe
We have established a high-quality, chromosome-level genome assembly for the hexaploid common wheat cultivar 'Fielder', an American, soft, white, pastry-type wheat released in 1974 and known for its amenability to Agrobacterium tumefaciens-mediated transformation and genome editing. Accurate, long-read sequences were obtained using PacBio circular consensus sequencing with the HiFi approach. Sequence reads from 16 SMRT cells assembled using the hifiasm assembler produced assemblies with N50 greater than 20 Mb. We used the Omni-C chromosome conformation capture technique to order contigs into chromosome-level assemblies, resulting in 21 pseudomolecules with a cumulative size of 14.7 and 0.3 Gb of unanchored contigs. Mapping of published short reads from a transgenic wheat plant with an edited seed-dormancy gene, TaQsd1, identified four positions of transgene insertion into wheat chromosomes. Detection of guide RNA sequences in pseudomolecules provided candidates for off-target mutation induction. These results demonstrate the efficiency of chromosome-scale assembly using PacBio HiFi reads and their application in wheat genome-editing studies.
{"title":"Chromosome-scale genome assembly of the transformation-amenable common wheat cultivar 'Fielder'.","authors":"Kazuhiro Sato, Fumitaka Abe, Martin Mascher, Georg Haberer, Heidrun Gundlach, Manuel Spannagl, Kenta Shirasawa, Sachiko Isobe","doi":"10.1093/dnares/dsab008","DOIUrl":"https://doi.org/10.1093/dnares/dsab008","url":null,"abstract":"<p><p>We have established a high-quality, chromosome-level genome assembly for the hexaploid common wheat cultivar 'Fielder', an American, soft, white, pastry-type wheat released in 1974 and known for its amenability to Agrobacterium tumefaciens-mediated transformation and genome editing. Accurate, long-read sequences were obtained using PacBio circular consensus sequencing with the HiFi approach. Sequence reads from 16 SMRT cells assembled using the hifiasm assembler produced assemblies with N50 greater than 20 Mb. We used the Omni-C chromosome conformation capture technique to order contigs into chromosome-level assemblies, resulting in 21 pseudomolecules with a cumulative size of 14.7 and 0.3 Gb of unanchored contigs. Mapping of published short reads from a transgenic wheat plant with an edited seed-dormancy gene, TaQsd1, identified four positions of transgene insertion into wheat chromosomes. Detection of guide RNA sequences in pseudomolecules provided candidates for off-target mutation induction. These results demonstrate the efficiency of chromosome-scale assembly using PacBio HiFi reads and their application in wheat genome-editing studies.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 3","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8320877/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9545065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Myofibres (primary and secondary myofibre) are the basic structure of muscle and the determinant of muscle mass. To explore the skeletal muscle developmental processes from primary myofibres to secondary myofibres in pigs, we conducted an integrative three-dimensional structure of genome and transcriptomic characterization of longissimus dorsi muscle of pig from primary myofibre formation stage [embryonic Day 35 (E35)] to secondary myofibre formation stage (E80). In the hierarchical genomic structure, we found that 11.43% of genome switched compartment A/B status, 14.53% of topologically associating domains are changed intradomain interactions (D-scores) and 2,730 genes with differential promoter-enhancer interactions and (or) enhancer activity from E35 to E80. The alterations of genome architecture were found to correlate with expression of genes that play significant roles in neuromuscular junction, embryonic morphogenesis, skeletal muscle development or metabolism, typically, NEFL, MuSK, SLN, Mef2D and GCK. Significantly, Sox6 and MATN2 play important roles in the process of primary to secondary myofibres formation and increase the regulatory potential score and genes expression in it. In brief, we reveal the genomic reorganization from E35 to E80 and construct genome-wide high-resolution interaction maps that provide a resource for studying long-range control of gene expression from E35 to E80.
{"title":"Reorganization of chromatin architecture during prenatal development of porcine skeletal muscle.","authors":"Renqiang Yuan, Jiaman Zhang, Yujie Wang, Xingxing Zhu, Silu Hu, Jianhua Zeng, Feng Liang, Qianzi Tang, Yaosheng Chen, Luxi Chen, Wei Zhu, Mingzhou Li, Delin Mo","doi":"10.1093/dnares/dsab003","DOIUrl":"https://doi.org/10.1093/dnares/dsab003","url":null,"abstract":"<p><p>Myofibres (primary and secondary myofibre) are the basic structure of muscle and the determinant of muscle mass. To explore the skeletal muscle developmental processes from primary myofibres to secondary myofibres in pigs, we conducted an integrative three-dimensional structure of genome and transcriptomic characterization of longissimus dorsi muscle of pig from primary myofibre formation stage [embryonic Day 35 (E35)] to secondary myofibre formation stage (E80). In the hierarchical genomic structure, we found that 11.43% of genome switched compartment A/B status, 14.53% of topologically associating domains are changed intradomain interactions (D-scores) and 2,730 genes with differential promoter-enhancer interactions and (or) enhancer activity from E35 to E80. The alterations of genome architecture were found to correlate with expression of genes that play significant roles in neuromuscular junction, embryonic morphogenesis, skeletal muscle development or metabolism, typically, NEFL, MuSK, SLN, Mef2D and GCK. Significantly, Sox6 and MATN2 play important roles in the process of primary to secondary myofibres formation and increase the regulatory potential score and genes expression in it. In brief, we reveal the genomic reorganization from E35 to E80 and construct genome-wide high-resolution interaction maps that provide a resource for studying long-range control of gene expression from E35 to E80.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 2","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/dnares/dsab003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38997679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
André Gomes-Dos-Santos, Manuel Lopes-Lima, André M Machado, António Marcos Ramos, Ana Usié, Ivan N Bolotov, Ilya V Vikhrev, Sophie Breton, L Filipe C Castro, Rute R da Fonseca, Juergen Geist, Martin E Österling, Vincent Prié, Amílcar Teixeira, Han Ming Gan, Oleg Simakov, Elsa Froufe
Since historical times, the inherent human fascination with pearls turned the freshwater pearl mussel Margaritifera margaritifera (Linnaeus, 1758) into a highly valuable cultural and economic resource. Although pearl harvesting in M. margaritifera is nowadays residual, other human threats have aggravated the species conservation status, especially in Europe. This mussel presents a myriad of rare biological features, e.g. high longevity coupled with low senescence and Doubly Uniparental Inheritance of mitochondrial DNA, for which the underlying molecular mechanisms are poorly known. Here, the first draft genome assembly of M. margaritifera was produced using a combination of Illumina Paired-end and Mate-pair approaches. The genome assembly was 2.4 Gb long, possessing 105,185 scaffolds and a scaffold N50 length of 288,726 bp. The ab initio gene prediction allowed the identification of 35,119 protein-coding genes. This genome represents an essential resource for studying this species' unique biological and evolutionary features and ultimately will help to develop new tools to promote its conservation.
{"title":"The Crown Pearl: a draft genome assembly of the European freshwater pearl mussel Margaritifera margaritifera (Linnaeus, 1758).","authors":"André Gomes-Dos-Santos, Manuel Lopes-Lima, André M Machado, António Marcos Ramos, Ana Usié, Ivan N Bolotov, Ilya V Vikhrev, Sophie Breton, L Filipe C Castro, Rute R da Fonseca, Juergen Geist, Martin E Österling, Vincent Prié, Amílcar Teixeira, Han Ming Gan, Oleg Simakov, Elsa Froufe","doi":"10.1093/dnares/dsab002","DOIUrl":"https://doi.org/10.1093/dnares/dsab002","url":null,"abstract":"<p><p>Since historical times, the inherent human fascination with pearls turned the freshwater pearl mussel Margaritifera margaritifera (Linnaeus, 1758) into a highly valuable cultural and economic resource. Although pearl harvesting in M. margaritifera is nowadays residual, other human threats have aggravated the species conservation status, especially in Europe. This mussel presents a myriad of rare biological features, e.g. high longevity coupled with low senescence and Doubly Uniparental Inheritance of mitochondrial DNA, for which the underlying molecular mechanisms are poorly known. Here, the first draft genome assembly of M. margaritifera was produced using a combination of Illumina Paired-end and Mate-pair approaches. The genome assembly was 2.4 Gb long, possessing 105,185 scaffolds and a scaffold N50 length of 288,726 bp. The ab initio gene prediction allowed the identification of 35,119 protein-coding genes. This genome represents an essential resource for studying this species' unique biological and evolutionary features and ultimately will help to develop new tools to promote its conservation.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 2","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/dnares/dsab002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25507523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We analyzed the genome sequence of a Japanese pear (Pyrus pyrifolia) to facilitate its genetics and genomics as well as breeding programs, in which a variety 'Nijisseiki' with superior flesh texture has been used as a parent for most Japanese pear cultivars. De novo assembly of long sequence reads covered 136× of the Japanese pear genome and generated 503.9 Mb contigs consisting of 114 sequences with an N50 value of 7.6 Mb. Contigs were assigned to Japanese pear genetic maps to establish 17 chromosome-scale sequences. In total, 44,876 high-confidence protein-encoding genes were predicted, 84.3% of which were supported by predicted genes and transcriptome data from Japanese pear relatives. As expected, evidence of genome-wide duplication was observed, consistent with related species. This is the first chromosome-scale genome sequence analysis reported for Japanese pear, and this resource will support breeding programs and provide new insights into the physiology and evolutionary history of Japanese pear.
{"title":"Chromosome-scale genome assembly of Japanese pear (Pyrus pyrifolia) variety 'Nijisseiki'.","authors":"Kenta Shirasawa, Akihiro Itai, Sachiko Isobe","doi":"10.1093/dnares/dsab001","DOIUrl":"https://doi.org/10.1093/dnares/dsab001","url":null,"abstract":"<p><p>We analyzed the genome sequence of a Japanese pear (Pyrus pyrifolia) to facilitate its genetics and genomics as well as breeding programs, in which a variety 'Nijisseiki' with superior flesh texture has been used as a parent for most Japanese pear cultivars. De novo assembly of long sequence reads covered 136× of the Japanese pear genome and generated 503.9 Mb contigs consisting of 114 sequences with an N50 value of 7.6 Mb. Contigs were assigned to Japanese pear genetic maps to establish 17 chromosome-scale sequences. In total, 44,876 high-confidence protein-encoding genes were predicted, 84.3% of which were supported by predicted genes and transcriptome data from Japanese pear relatives. As expected, evidence of genome-wide duplication was observed, consistent with related species. This is the first chromosome-scale genome sequence analysis reported for Japanese pear, and this resource will support breeding programs and provide new insights into the physiology and evolutionary history of Japanese pear.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 2","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/dnares/dsab001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25411649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ancestral tomato species are known to possess genes that are valuable for improving traits in breeding. Here, we aimed to construct high-quality de novo genome assemblies of Solanum pimpinellifolium 'LA1670' and S. lycopersicum var. cerasiforme 'LA1673', originating from Peru. The Pacific Biosciences (PacBio) long-read sequences with 110× and 104× coverages were assembled and polished to generate 244 and 202 contigs spanning 808.8 Mbp for 'LA1670' and 804.5 Mbp for 'LA1673', respectively. After chromosome-level scaffolding with reference guiding, 14 scaffold sequences corresponding to 12 tomato chromosomes and 2 unassigned sequences were constructed. High-quality genome assemblies were confirmed using the Benchmarking Universal Single-Copy Orthologs and long terminal repeat assembly index. The protein-coding sequences were then predicted, and their transcriptomes were confirmed. The de novo assembled genomes of S. pimpinellifolium and S. lycopersicum var. cerasiforme were predicted to have 71,945 and 75,230 protein-coding genes, including 29,629 and 29,185 non-redundant genes, respectively, as supported by the transcriptome analysis results. The chromosome-level genome assemblies coupled with transcriptome data sets of the two accessions would be valuable for gaining insights into tomato domestication and understanding genome-scale breeding.
已知祖先番茄物种具有在育种中改善性状的有价值的基因。本研究旨在构建源自秘鲁的茄茄(Solanum pimpinellifolium) LA1670和番茄变种(S. lycopersicum var. cerasiformme) LA1673的高质量从头基因组组装。对覆盖110x和104x的PacBio (Pacific Biosciences)长读序列进行组装和优化,得到LA1670和LA1673的长读序列分别为244和202个,长度分别为808.8 Mbp和804.5 Mbp。在参考引导下,构建了12条番茄染色体对应的14条骨架序列和2条未分配序列。使用Benchmarking Universal Single-Copy Orthologs和长末端重复序列组装索引确认高质量的基因组组装。然后预测蛋白质编码序列,并确认其转录组。根据转录组分析结果,新组装的葡萄球菌(S. pimpinellifolium)和葡萄球菌(S. lycopersicum vars . cerasiformme)基因组分别含有71,945个和75,230个蛋白质编码基因,其中非冗余基因分别为29,629个和29,185个。染色体水平的基因组组装与两份材料的转录组数据集相结合,将为深入了解番茄驯化和基因组规模育种提供有价值的信息。
{"title":"De novo genome assembly of two tomato ancestors, Solanum pimpinellifolium and Solanum lycopersicum var. cerasiforme, by long-read sequencing.","authors":"Hitomi Takei, Kenta Shirasawa, Kosuke Kuwabara, Atsushi Toyoda, Yuma Matsuzawa, Shinji Iioka, Tohru Ariizumi","doi":"10.1093/dnares/dsaa029","DOIUrl":"https://doi.org/10.1093/dnares/dsaa029","url":null,"abstract":"<p><p>The ancestral tomato species are known to possess genes that are valuable for improving traits in breeding. Here, we aimed to construct high-quality de novo genome assemblies of Solanum pimpinellifolium 'LA1670' and S. lycopersicum var. cerasiforme 'LA1673', originating from Peru. The Pacific Biosciences (PacBio) long-read sequences with 110× and 104× coverages were assembled and polished to generate 244 and 202 contigs spanning 808.8 Mbp for 'LA1670' and 804.5 Mbp for 'LA1673', respectively. After chromosome-level scaffolding with reference guiding, 14 scaffold sequences corresponding to 12 tomato chromosomes and 2 unassigned sequences were constructed. High-quality genome assemblies were confirmed using the Benchmarking Universal Single-Copy Orthologs and long terminal repeat assembly index. The protein-coding sequences were then predicted, and their transcriptomes were confirmed. The de novo assembled genomes of S. pimpinellifolium and S. lycopersicum var. cerasiforme were predicted to have 71,945 and 75,230 protein-coding genes, including 29,629 and 29,185 non-redundant genes, respectively, as supported by the transcriptome analysis results. The chromosome-level genome assemblies coupled with transcriptome data sets of the two accessions would be valuable for gaining insights into tomato domestication and understanding genome-scale breeding.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/dnares/dsaa029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38777431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}