N. Urasaki, H. Takagi, S. Natsume, Aiko Uemura, Naoki Taniai, N. Miyagi, Mai Fukushima, Shouta Suzuki, K. Tarora, Moritoshi Tamaki, M. Sakamoto, R. Terauchi, H. Matsumura
Abstract Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant.
{"title":"Draft genome sequence of bitter gourd (Momordica charantia), a vegetable and medicinal plant in tropical and subtropical regions","authors":"N. Urasaki, H. Takagi, S. Natsume, Aiko Uemura, Naoki Taniai, N. Miyagi, Mai Fukushima, Shouta Suzuki, K. Tarora, Moritoshi Tamaki, M. Sakamoto, R. Terauchi, H. Matsumura","doi":"10.1093/dnares/dsw047","DOIUrl":"https://doi.org/10.1093/dnares/dsw047","url":null,"abstract":"Abstract Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"24 1","pages":"51 - 58"},"PeriodicalIF":0.0,"publicationDate":"2016-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85856001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Carnevali, Anastasia Conti, M. Pellegrini, G. Dieci
Abstract With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous family of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations.
{"title":"Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines","authors":"D. Carnevali, Anastasia Conti, M. Pellegrini, G. Dieci","doi":"10.1093/dnares/dsw048","DOIUrl":"https://doi.org/10.1093/dnares/dsw048","url":null,"abstract":"Abstract With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous family of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"111 1","pages":"59 - 69"},"PeriodicalIF":0.0,"publicationDate":"2016-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79333083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
V. Ruggieri, Irantzu Anzar, Andreu Paytuví, Roberta Calafiore, R. A. Cigliano, W. Sanseverino, A. Barone
Abstract The recent development of Sequence Capture methodology represents a powerful strategy for enhancing data generation to assess genetic variation of targeted genomic regions. Here, we present SUPER-CAP, a bioinformatics web tool aimed at handling Sequence Capture data, fine calculating the allele frequency of variations and building genotype-specific sequence of captured genes. The dataset used to develop this in silico strategy consists of 378 loci and related regulative regions in a collection of 44 tomato landraces. About 14,000 high-quality variants were identified. The high depth (>40×) of coverage and adopting the correct filtering criteria allowed identification of about 4,000 rare variants and 10 genes with a different copy number variation. We also show that the tool is capable to reconstruct genotype-specific sequences for each genotype by using the detected variants. This allows evaluating the combined effect of multiple variants in the same protein. The architecture and functionality of SUPER-CAP makes the software appropriate for a broad set of analyses including SNP discovery and mining. Its functionality, together with the capability to process large data sets and efficient detection of sequence variation, makes SUPER-CAP a valuable bioinformatics tool for genomics and breeding purposes.
{"title":"Exploiting the great potential of Sequence Capture data by a new tool, SUPER-CAP","authors":"V. Ruggieri, Irantzu Anzar, Andreu Paytuví, Roberta Calafiore, R. A. Cigliano, W. Sanseverino, A. Barone","doi":"10.1093/dnares/dsw050","DOIUrl":"https://doi.org/10.1093/dnares/dsw050","url":null,"abstract":"Abstract The recent development of Sequence Capture methodology represents a powerful strategy for enhancing data generation to assess genetic variation of targeted genomic regions. Here, we present SUPER-CAP, a bioinformatics web tool aimed at handling Sequence Capture data, fine calculating the allele frequency of variations and building genotype-specific sequence of captured genes. The dataset used to develop this in silico strategy consists of 378 loci and related regulative regions in a collection of 44 tomato landraces. About 14,000 high-quality variants were identified. The high depth (>40×) of coverage and adopting the correct filtering criteria allowed identification of about 4,000 rare variants and 10 genes with a different copy number variation. We also show that the tool is capable to reconstruct genotype-specific sequences for each genotype by using the detected variants. This allows evaluating the combined effect of multiple variants in the same protein. The architecture and functionality of SUPER-CAP makes the software appropriate for a broad set of analyses including SNP discovery and mining. Its functionality, together with the capability to process large data sets and efficient detection of sequence variation, makes SUPER-CAP a valuable bioinformatics tool for genomics and breeding purposes.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"47 1","pages":"81 - 91"},"PeriodicalIF":0.0,"publicationDate":"2016-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76170921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Fujita, Fusako Kitaura, Miyuki Yuno, Yutaka Suzuki, S. Sugano, H. Fujii
Chromosomal interactions regulate genome functions, such as transcription, via dynamic chromosomal organization in the nucleus. In this study, we identified genomic regions that physically bind to the promoter region of the Pax5 gene in the chicken B-cell line DT40, with the goal of obtaining mechanistic insight into transcriptional regulation through chromosomal interaction. Using insertional chromatin immunoprecipitation (iChIP) in combination with next-generation sequencing (NGS) (iChIP-Seq), we found that the Pax5 promoter bound to multiple genomic regions. The identified chromosomal interactions were independently confirmed by in vitro engineered DNA-binding molecule-mediated ChIP (in vitro enChIP) in combination with NGS (in vitro enChIP-Seq). Comparing chromosomal interactions in wild-type DT40 with those in a macrophage-like counterpart, we found that some of the identified chromosomal interactions were organized in a B cell–specific manner. In addition, deletion of a B cell–specific interacting genomic region in chromosome 11, which was marked by active enhancer histone modifications, resulted in moderate but significant down-regulation of Pax5 transcription. Together, these results suggested that Pax5 transcription in DT40 cells is regulated by inter-chromosomal interactions. Moreover, these analyses showed that iChIP-Seq and in vitro enChIP-Seq are useful for non-biased identification of functional genomic regions that physically interact with a locus of interest.
染色体相互作用通过细胞核内的动态染色体组织调节基因组功能,如转录。在本研究中,我们确定了鸡b细胞系DT40中与Pax5基因启动子区域物理结合的基因组区域,目的是通过染色体相互作用获得转录调控的机制。利用插入染色质免疫沉淀(iChIP)和下一代测序(iChIP- seq)技术,我们发现Pax5启动子与多个基因组区域结合。鉴定的染色体相互作用通过体外工程dna结合分子介导的ChIP (in vitro enChIP)联合NGS (in vitro enChIP- seq)独立证实。将野生型DT40的染色体相互作用与巨噬细胞样的染色体相互作用进行比较,我们发现一些已鉴定的染色体相互作用是以B细胞特异性方式组织的。此外,11号染色体上一个以活性增强子组蛋白修饰为标志的B细胞特异性相互作用基因组区域的缺失,导致Pax5转录适度但显著下调。总之,这些结果表明Pax5在DT40细胞中的转录受染色体间相互作用的调节。此外,这些分析表明,iChIP-Seq和体外enchhip - seq可用于无偏倚地鉴定与感兴趣位点物理相互作用的功能基因组区域。
{"title":"Locus-specific ChIP combined with NGS analysis reveals genomic regulatory regions that physically interact with the Pax5 promoter in a chicken B cell line","authors":"T. Fujita, Fusako Kitaura, Miyuki Yuno, Yutaka Suzuki, S. Sugano, H. Fujii","doi":"10.1101/089821","DOIUrl":"https://doi.org/10.1101/089821","url":null,"abstract":"Chromosomal interactions regulate genome functions, such as transcription, via dynamic chromosomal organization in the nucleus. In this study, we identified genomic regions that physically bind to the promoter region of the Pax5 gene in the chicken B-cell line DT40, with the goal of obtaining mechanistic insight into transcriptional regulation through chromosomal interaction. Using insertional chromatin immunoprecipitation (iChIP) in combination with next-generation sequencing (NGS) (iChIP-Seq), we found that the Pax5 promoter bound to multiple genomic regions. The identified chromosomal interactions were independently confirmed by in vitro engineered DNA-binding molecule-mediated ChIP (in vitro enChIP) in combination with NGS (in vitro enChIP-Seq). Comparing chromosomal interactions in wild-type DT40 with those in a macrophage-like counterpart, we found that some of the identified chromosomal interactions were organized in a B cell–specific manner. In addition, deletion of a B cell–specific interacting genomic region in chromosome 11, which was marked by active enhancer histone modifications, resulted in moderate but significant down-regulation of Pax5 transcription. Together, these results suggested that Pax5 transcription in DT40 cells is regulated by inter-chromosomal interactions. Moreover, these analyses showed that iChIP-Seq and in vitro enChIP-Seq are useful for non-biased identification of functional genomic regions that physically interact with a locus of interest.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"20 1","pages":"537 - 548"},"PeriodicalIF":0.0,"publicationDate":"2016-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88237193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Chua, M. Wise, Y. Khosravi, S. Seow, A. A. Amoyo, S. Pettersson, Fanny Peters, C. Tay, Timothy T. Perkins, M. Loke, B. Marshall, J. Vadivelu
Abstract Helicobacter pylori is a highly successful gastric pathogen. High genomic plasticity allows its adaptation to changing host environments. Complete genomes of H. pylori clinical isolate UM032 and its mice-adapted serial derivatives 298 and 299, generated using both PacBio RS and Illumina MiSeq sequencing technologies, were compared to identify novel elements responsible for host-adaptation. The acquisition of a jhp0562-like allele, which encodes for a galactosyltransferase, was identified in the mice-adapted strains. Our analysis implies a new β-1,4-galactosyltransferase role for this enzyme, essential for Ley antigen expression. Intragenomic recombination between babA and babB genes was also observed. Further, we expanded on the list of candidate genes whose expression patterns have been mediated by upstream homopolymer-length alterations to facilitate host adaption. Importantly, greater than four-fold reduction of mRNA levels was demonstrated in five genes. Among the down-regulated genes, three encode for outer membrane proteins, including BabA, BabB and HopD. As expected, a substantial reduction in BabA protein abundance was detected in mice-adapted strains 298 and 299 via Western analysis. Our results suggest that the expression of Ley antigen and reduced outer membrane protein expressions may facilitate H. pylori colonisation of mouse gastric epithelium.
{"title":"Quantum changes in Helicobacter pylori gene expression accompany host-adaptation","authors":"E. Chua, M. Wise, Y. Khosravi, S. Seow, A. A. Amoyo, S. Pettersson, Fanny Peters, C. Tay, Timothy T. Perkins, M. Loke, B. Marshall, J. Vadivelu","doi":"10.1093/dnares/dsw046","DOIUrl":"https://doi.org/10.1093/dnares/dsw046","url":null,"abstract":"Abstract Helicobacter pylori is a highly successful gastric pathogen. High genomic plasticity allows its adaptation to changing host environments. Complete genomes of H. pylori clinical isolate UM032 and its mice-adapted serial derivatives 298 and 299, generated using both PacBio RS and Illumina MiSeq sequencing technologies, were compared to identify novel elements responsible for host-adaptation. The acquisition of a jhp0562-like allele, which encodes for a galactosyltransferase, was identified in the mice-adapted strains. Our analysis implies a new β-1,4-galactosyltransferase role for this enzyme, essential for Ley antigen expression. Intragenomic recombination between babA and babB genes was also observed. Further, we expanded on the list of candidate genes whose expression patterns have been mediated by upstream homopolymer-length alterations to facilitate host adaption. Importantly, greater than four-fold reduction of mRNA levels was demonstrated in five genes. Among the down-regulated genes, three encode for outer membrane proteins, including BabA, BabB and HopD. As expected, a substantial reduction in BabA protein abundance was detected in mice-adapted strains 298 and 299 via Western analysis. Our results suggest that the expression of Ley antigen and reduced outer membrane protein expressions may facilitate H. pylori colonisation of mouse gastric epithelium.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"235 1","pages":"37 - 49"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89959113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Bonacina, N. Suárez, R. Hormigo, S. Fadda, Marcus Lechner, L. Saavedra
Abstract The study of enterococcal genomes has grown considerably in recent years. While special attention is paid to comparative genomic analysis among clinical relevant isolates, in this study we performed an exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential to be used as probiotics. Beyond common genetic features, we especially aimed to identify those that are specific to enterococcal strains isolated from a certain food-related source as well as features present in a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7 different species, were examined and compared. Their phylogenetic relationship was reconstructed based on orthologous proteins and whole genomes. Likewise, markers associated with a successful colonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clustered regularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features. At the same time, a search for antibiotic resistance genes was carried out, since they are of big concern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteome universally present among the genera and to determine that most of the accessory genes code for hypothetical proteins, providing reasonable hints to support their functional characterization.
{"title":"A genomic view of food-related and probiotic Enterococcus strains","authors":"J. Bonacina, N. Suárez, R. Hormigo, S. Fadda, Marcus Lechner, L. Saavedra","doi":"10.1093/dnares/dsw043","DOIUrl":"https://doi.org/10.1093/dnares/dsw043","url":null,"abstract":"Abstract The study of enterococcal genomes has grown considerably in recent years. While special attention is paid to comparative genomic analysis among clinical relevant isolates, in this study we performed an exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential to be used as probiotics. Beyond common genetic features, we especially aimed to identify those that are specific to enterococcal strains isolated from a certain food-related source as well as features present in a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7 different species, were examined and compared. Their phylogenetic relationship was reconstructed based on orthologous proteins and whole genomes. Likewise, markers associated with a successful colonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clustered regularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features. At the same time, a search for antibiotic resistance genes was carried out, since they are of big concern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteome universally present among the genera and to determine that most of the accessory genes code for hypothetical proteins, providing reasonable hints to support their functional characterization.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"5 1","pages":"11 - 24"},"PeriodicalIF":0.0,"publicationDate":"2016-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74371208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Y. Yasui, H. Hirakawa, M. Ueno, K. Matsui, T. Katsube-Tanaka, S. Yang, J. Aii, Shingo Sato, M. Mori
Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits.
{"title":"Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes","authors":"Y. Yasui, H. Hirakawa, M. Ueno, K. Matsui, T. Katsube-Tanaka, S. Yang, J. Aii, Shingo Sato, M. Mori","doi":"10.1093/dnares/dsw012","DOIUrl":"https://doi.org/10.1093/dnares/dsw012","url":null,"abstract":"Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"6 1","pages":"215 - 224"},"PeriodicalIF":0.0,"publicationDate":"2016-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90950195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Lloyd-Price, S. Startceva, Vinodh Kandavalli, Jerome G. Chandraseelan, Nadia S. M. Goncalves, Samuel M. D. Oliveira, A. Häkkinen, A. Ribeiro
We investigate the hypothesis that, in Escherichia coli, while the concentration of RNA polymerases differs in different growth conditions, the fraction of RNA polymerases free for transcription remains approximately constant within a certain range of these conditions. After establishing this, we apply a standard model-fitting procedure to fully characterize the in vivo kinetics of the rate-limiting steps in transcription initiation of the Plac/ara-1 promoter from distributions of intervals between transcription events in cells with different RNA polymerase concentrations. We find that, under full induction, the closed complex lasts ∼788 s while subsequent steps last ∼193 s, on average. We then establish that the closed complex formation usually occurs multiple times prior to each successful initiation event. Furthermore, the promoter intermittently switches to an inactive state that, on average, lasts ∼87 s. This is shown to arise from the intermittent repression of the promoter by LacI. The methods employed here should be of use to resolve the rate-limiting steps governing the in vivo dynamics of initiation of prokaryotic promoters, similar to established steady-state assays to resolve the in vitro dynamics.
{"title":"Dissecting the stochastic transcription initiation process in live Escherichia coli","authors":"J. Lloyd-Price, S. Startceva, Vinodh Kandavalli, Jerome G. Chandraseelan, Nadia S. M. Goncalves, Samuel M. D. Oliveira, A. Häkkinen, A. Ribeiro","doi":"10.1093/dnares/dsw009","DOIUrl":"https://doi.org/10.1093/dnares/dsw009","url":null,"abstract":"We investigate the hypothesis that, in Escherichia coli, while the concentration of RNA polymerases differs in different growth conditions, the fraction of RNA polymerases free for transcription remains approximately constant within a certain range of these conditions. After establishing this, we apply a standard model-fitting procedure to fully characterize the in vivo kinetics of the rate-limiting steps in transcription initiation of the Plac/ara-1 promoter from distributions of intervals between transcription events in cells with different RNA polymerase concentrations. We find that, under full induction, the closed complex lasts ∼788 s while subsequent steps last ∼193 s, on average. We then establish that the closed complex formation usually occurs multiple times prior to each successful initiation event. Furthermore, the promoter intermittently switches to an inactive state that, on average, lasts ∼87 s. This is shown to arise from the intermittent repression of the promoter by LacI. The methods employed here should be of use to resolve the rate-limiting steps governing the in vivo dynamics of initiation of prokaryotic promoters, similar to established steady-state assays to resolve the in vitro dynamics.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"10 1","pages":"203 - 214"},"PeriodicalIF":0.0,"publicationDate":"2016-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78415975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Nakahigashi, Yuki Takai, Michiko Kimura, Nozomi Abe, T. Nakayashiki, Yuh Shiwa, H. Yoshikawa, B. Wanner, Y. Ishihama, H. Mori
Tetracycline-inhibited ribosome profiling (TetRP) provides a powerful new experimental tool for comprehensive genome-wide identification of translation initiation sites in bacteria. We validated TetRP by confirming the translation start sites of protein-coding genes in accordance with the 2006 version of Escherichia coli K-12 annotation record (GenBank U00096.2) and found ∼150 new start sites within 60 nucleotides of the annotated site. This analysis revealed 72 per cent of the genes whose initiation site annotations were changed from the 2006 GenBank record to the newer 2014 annotation record (GenBank U00096.3), indicating a high sensitivity. Also, results from reporter fusion and proteomics of N-terminally enriched peptides showed high specificity of the TetRP results. In addition, we discovered over 300 translation start sites within non-coding, intergenic regions of the genome, using a threshold that retains ∼2,000 known coding genes. While some appear to correspond to pseudogenes, others may encode small peptides or have previously unforeseen roles. In summary, we showed that ribosome profiling upon translation inhibition by tetracycline offers a simple, reliable and comprehensive experimental tool for precise annotation of translation start sites of expressed genes in bacteria.
{"title":"Comprehensive identification of translation start sites by tetracycline-inhibited ribosome profiling","authors":"K. Nakahigashi, Yuki Takai, Michiko Kimura, Nozomi Abe, T. Nakayashiki, Yuh Shiwa, H. Yoshikawa, B. Wanner, Y. Ishihama, H. Mori","doi":"10.1093/dnares/dsw008","DOIUrl":"https://doi.org/10.1093/dnares/dsw008","url":null,"abstract":"Tetracycline-inhibited ribosome profiling (TetRP) provides a powerful new experimental tool for comprehensive genome-wide identification of translation initiation sites in bacteria. We validated TetRP by confirming the translation start sites of protein-coding genes in accordance with the 2006 version of Escherichia coli K-12 annotation record (GenBank U00096.2) and found ∼150 new start sites within 60 nucleotides of the annotated site. This analysis revealed 72 per cent of the genes whose initiation site annotations were changed from the 2006 GenBank record to the newer 2014 annotation record (GenBank U00096.3), indicating a high sensitivity. Also, results from reporter fusion and proteomics of N-terminally enriched peptides showed high specificity of the TetRP results. In addition, we discovered over 300 translation start sites within non-coding, intergenic regions of the genome, using a threshold that retains ∼2,000 known coding genes. While some appear to correspond to pseudogenes, others may encode small peptides or have previously unforeseen roles. In summary, we showed that ribosome profiling upon translation inhibition by tetracycline offers a simple, reliable and comprehensive experimental tool for precise annotation of translation start sites of expressed genes in bacteria.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"39 1","pages":"193 - 201"},"PeriodicalIF":0.0,"publicationDate":"2016-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81238213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Bodénès, É. Chancerel, F. Ehrenmann, A. Kremer, C. Plomion
We developed the densest single-nucleotide polymorphism (SNP)-based linkage genetic map to date for the genus Quercus. An 8k gene-based SNP array was used to genotype more than 1,000 full-sibs from two intraspecific and two interspecific full-sib families of Quercus petraea and Quercus robur. A high degree of collinearity was observed between the eight parental maps of the two species. A composite map was then established with 4,261 SNP markers spanning 742 cM over the 12 linkage groups (LGs) of the oak genome. Nine genomic regions from six LGs displayed highly significant distortions of segregation. Two main hypotheses concerning the mechanisms underlying segregation distortion are discussed: genetic load vs. reproductive barriers. Our findings suggest a predominance of pre-zygotic to post-zygotic barriers.
{"title":"High-density linkage mapping and distribution of segregation distortion regions in the oak genome","authors":"C. Bodénès, É. Chancerel, F. Ehrenmann, A. Kremer, C. Plomion","doi":"10.1093/dnares/dsw001","DOIUrl":"https://doi.org/10.1093/dnares/dsw001","url":null,"abstract":"We developed the densest single-nucleotide polymorphism (SNP)-based linkage genetic map to date for the genus Quercus. An 8k gene-based SNP array was used to genotype more than 1,000 full-sibs from two intraspecific and two interspecific full-sib families of Quercus petraea and Quercus robur. A high degree of collinearity was observed between the eight parental maps of the two species. A composite map was then established with 4,261 SNP markers spanning 742 cM over the 12 linkage groups (LGs) of the oak genome. Nine genomic regions from six LGs displayed highly significant distortions of segregation. Two main hypotheses concerning the mechanisms underlying segregation distortion are discussed: genetic load vs. reproductive barriers. Our findings suggest a predominance of pre-zygotic to post-zygotic barriers.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"47 1","pages":"115 - 124"},"PeriodicalIF":0.0,"publicationDate":"2016-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87058870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}