Liping Xu, Kaining Hu, Zhenqian Zhang, C. Guan, Song Chen, Wei Hua, Jiana Li, J. Wen, B. Yi, Jinxiong Shen, Chaozhi Ma, J. Tu, T. Fu
Flowering time adaptation is a major breeding goal in the allopolyploid species Brassica napus. To investigate the genetic architecture of flowering time, a genome-wide association study (GWAS) of flowering time was conducted with a diversity panel comprising 523 B. napus cultivars and inbred lines grown in eight different environments. Genotyping was performed with a Brassica 60K Illumina Infinium SNP array. A total of 41 single-nucleotide polymorphisms (SNPs) distributed on 14 chromosomes were found to be associated with flowering time, and 12 SNPs located in the confidence intervals of quantitative trait loci (QTL) identified in previous researches based on linkage analyses. Twenty-five candidate genes were orthologous to Arabidopsis thaliana flowering genes. To further our understanding of the genetic factors influencing flowering time in different environments, GWAS was performed on two derived traits, environment sensitivity and temperature sensitivity. The most significant SNPs were found near Bn-scaff_16362_1-p380982, just 13 kb away from BnaC09g41990D, which is orthologous to A. thaliana CONSTANS (CO), an important gene in the photoperiod flowering pathway. These results provide new insights into the genetic control of flowering time in B. napus and indicate that GWAS is an effective method by which to reveal natural variations of complex traits in B. napus.
{"title":"Genome-wide association study reveals the genetic architecture of flowering time in rapeseed (Brassica napus L.)","authors":"Liping Xu, Kaining Hu, Zhenqian Zhang, C. Guan, Song Chen, Wei Hua, Jiana Li, J. Wen, B. Yi, Jinxiong Shen, Chaozhi Ma, J. Tu, T. Fu","doi":"10.1093/dnares/dsv035","DOIUrl":"https://doi.org/10.1093/dnares/dsv035","url":null,"abstract":"Flowering time adaptation is a major breeding goal in the allopolyploid species Brassica napus. To investigate the genetic architecture of flowering time, a genome-wide association study (GWAS) of flowering time was conducted with a diversity panel comprising 523 B. napus cultivars and inbred lines grown in eight different environments. Genotyping was performed with a Brassica 60K Illumina Infinium SNP array. A total of 41 single-nucleotide polymorphisms (SNPs) distributed on 14 chromosomes were found to be associated with flowering time, and 12 SNPs located in the confidence intervals of quantitative trait loci (QTL) identified in previous researches based on linkage analyses. Twenty-five candidate genes were orthologous to Arabidopsis thaliana flowering genes. To further our understanding of the genetic factors influencing flowering time in different environments, GWAS was performed on two derived traits, environment sensitivity and temperature sensitivity. The most significant SNPs were found near Bn-scaff_16362_1-p380982, just 13 kb away from BnaC09g41990D, which is orthologous to A. thaliana CONSTANS (CO), an important gene in the photoperiod flowering pathway. These results provide new insights into the genetic control of flowering time in B. napus and indicate that GWAS is an effective method by which to reveal natural variations of complex traits in B. napus.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"5 1","pages":"43 - 52"},"PeriodicalIF":0.0,"publicationDate":"2015-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88097785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Naito, Yoshitoshi Ogura, T. Itoh, M. Shoji, M. Okamoto, Tetsuya Hayashi, K. Nakayama
Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species whose genome has also been sequenced, but not fully analysed yet. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organizations, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, involved in its extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and are specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria.
{"title":"The complete genome sequencing of Prevotella intermedia strain OMA14 and a subsequent fine-scale, intra-species genomic comparison reveal an unusual amplification of conjugative and mobile transposons and identify a novel Prevotella-lineage-specific repeat","authors":"M. Naito, Yoshitoshi Ogura, T. Itoh, M. Shoji, M. Okamoto, Tetsuya Hayashi, K. Nakayama","doi":"10.1093/dnares/dsv032","DOIUrl":"https://doi.org/10.1093/dnares/dsv032","url":null,"abstract":"Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species whose genome has also been sequenced, but not fully analysed yet. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organizations, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, involved in its extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and are specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"13 1","pages":"11 - 19"},"PeriodicalIF":0.0,"publicationDate":"2015-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83203244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The relationship between linkage disequilibrium (LD) and recombination fraction can be used to infer the pattern of genetic variation and evolutionary process in humans and other systems. We described a computational framework to construct a linkage–LD map from commonly used biallelic, single-nucleotide polymorphism (SNP) markers for outcrossing plants by which the decline of LD is visualized with genetic distance. The framework was derived from an open-pollinated (OP) design composed of plants randomly sampled from a natural population and seeds from each sampled plant, enabling simultaneous estimation of the LD in the natural population and recombination fraction due to allelic co-segregation during meiosis. We modified the framework to infer evolutionary pasts of natural populations using those marker types that are segregating in a dominant manner, given their role in creating and maintaining population genetic diversity. A sophisticated two-level EM algorithm was implemented to estimate and retrieve the missing information of segregation characterized by dominant-segregating markers such as single methylation polymorphisms. The model was applied to study the relationship between linkage and LD for a non-model outcrossing species, a gymnosperm species, Torreya grandis, naturally distributed in mountains of the southeastern China. The linkage–LD map constructed from various types of molecular markers opens a powerful gateway for studying the history of plant evolution.
{"title":"Constructing a linkage–linkage disequilibrium map using dominant-segregating markers","authors":"Xuli Zhu, Leiming Dong, Libo Jiang, Huan Li, Lidan Sun, Hui Zhang, Weiwu Yu, Haokai Liu, Wensheng Dai, Yanru Zeng, R. Wu","doi":"10.1093/dnares/dsv031","DOIUrl":"https://doi.org/10.1093/dnares/dsv031","url":null,"abstract":"The relationship between linkage disequilibrium (LD) and recombination fraction can be used to infer the pattern of genetic variation and evolutionary process in humans and other systems. We described a computational framework to construct a linkage–LD map from commonly used biallelic, single-nucleotide polymorphism (SNP) markers for outcrossing plants by which the decline of LD is visualized with genetic distance. The framework was derived from an open-pollinated (OP) design composed of plants randomly sampled from a natural population and seeds from each sampled plant, enabling simultaneous estimation of the LD in the natural population and recombination fraction due to allelic co-segregation during meiosis. We modified the framework to infer evolutionary pasts of natural populations using those marker types that are segregating in a dominant manner, given their role in creating and maintaining population genetic diversity. A sophisticated two-level EM algorithm was implemented to estimate and retrieve the missing information of segregation characterized by dominant-segregating markers such as single methylation polymorphisms. The model was applied to study the relationship between linkage and LD for a non-model outcrossing species, a gymnosperm species, Torreya grandis, naturally distributed in mountains of the southeastern China. The linkage–LD map constructed from various types of molecular markers opens a powerful gateway for studying the history of plant evolution.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"23 1","pages":"1 - 10"},"PeriodicalIF":0.0,"publicationDate":"2015-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90488708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jonghoon Lee, N. Izzah, Beom-Soon Choi, H. Joh, Sang-Choon Lee, Sampath Perumal, Joodeok Seo, Kyounggu Ahn, Eun Ju Jo, G. Choi, I. Nou, Yeisoo Yu, Tae-Jin Yang
Clubroot is a devastating disease caused by Plasmodiophora brassicae and results in severe losses of yield and quality in Brassica crops. Many clubroot resistance genes and markers are available in Brassica rapa but less is known in Brassica oleracea. Here, we applied the genotyping-by-sequencing (GBS) technique to construct a high-resolution genetic map and identify clubroot resistance (CR) genes. A total of 43,821 SNPs were identified using GBS data for two parental lines, one resistant and one susceptible lines to clubroot, and 18,187 of them showed >5× coverage in the GBS data. Among those, 4,103 were credibly genotyped for all 78 F2 individual plants. These markers were clustered into nine linkage groups spanning 879.9 cM with an average interval of 1.15 cM. Quantitative trait loci (QTLs) survey based on three rounds of clubroot resistance tests using F2 : 3 progenies revealed two and single major QTLs for Race 2 and Race 9 of P. brassicae, respectively. The QTLs show similar locations to the previously reported CR loci for Race 4 in B. oleracea but are in different positions from any of the CR loci found in B. rapa. We utilized two reference genome sequences in this study. The high-resolution genetic map developed herein allowed us to reposition 37 and 2 misanchored scaffolds in the 02–12 and TO1000DH genome sequences, respectively. Our data also support additional positioning of two unanchored 3.3 Mb scaffolds into the 02–12 genome sequence.
{"title":"Genotyping-by-sequencing map permits identification of clubroot resistance QTLs and revision of the reference genome assembly in cabbage (Brassica oleracea L.)","authors":"Jonghoon Lee, N. Izzah, Beom-Soon Choi, H. Joh, Sang-Choon Lee, Sampath Perumal, Joodeok Seo, Kyounggu Ahn, Eun Ju Jo, G. Choi, I. Nou, Yeisoo Yu, Tae-Jin Yang","doi":"10.1093/dnares/dsv034","DOIUrl":"https://doi.org/10.1093/dnares/dsv034","url":null,"abstract":"Clubroot is a devastating disease caused by Plasmodiophora brassicae and results in severe losses of yield and quality in Brassica crops. Many clubroot resistance genes and markers are available in Brassica rapa but less is known in Brassica oleracea. Here, we applied the genotyping-by-sequencing (GBS) technique to construct a high-resolution genetic map and identify clubroot resistance (CR) genes. A total of 43,821 SNPs were identified using GBS data for two parental lines, one resistant and one susceptible lines to clubroot, and 18,187 of them showed >5× coverage in the GBS data. Among those, 4,103 were credibly genotyped for all 78 F2 individual plants. These markers were clustered into nine linkage groups spanning 879.9 cM with an average interval of 1.15 cM. Quantitative trait loci (QTLs) survey based on three rounds of clubroot resistance tests using F2 : 3 progenies revealed two and single major QTLs for Race 2 and Race 9 of P. brassicae, respectively. The QTLs show similar locations to the previously reported CR loci for Race 4 in B. oleracea but are in different positions from any of the CR loci found in B. rapa. We utilized two reference genome sequences in this study. The high-resolution genetic map developed herein allowed us to reposition 37 and 2 misanchored scaffolds in the 02–12 and TO1000DH genome sequences, respectively. Our data also support additional positioning of two unanchored 3.3 Mb scaffolds into the 02–12 genome sequence.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"99 1","pages":"29 - 41"},"PeriodicalIF":0.0,"publicationDate":"2015-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82228267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kazuhiro Sato, Tsuyoshi Tanaka, S. Shigenobu, Yuka Motoi, Jianzhon Wu, T. Itoh
Full-length (FL) cDNA sequences provide the most reliable evidence for the presence of genes in genomes. In this report, detailed gene structures of barley, whole genome shotgun (WGS) and additional transcript data of the cultivar Haruna Nijo were quality controlled and compared with the published Morex genome information. Haruna Nijo scaffolds have longer total sequence length with much higher N50 and fewer sequences than those in Morex WGS contigs. The longer Haruna Nijo scaffolds provided efficient FLcDNA mapping, resulting in high coverage and detection of the transcription start sites. In combination with FLcDNAs and RNA-Seq data from four different tissue samples of Haruna Nijo, we identified 51,249 gene models on 30,606 loci. Overall sequence similarity between Haruna Nijo and Morex genome was 95.99%, while that of exon regions was higher (99.71%). These sequence and annotation data of Haruna Nijo are combined with Morex genome data and released from a genome browser. The genome sequence of Haruna Nijo may provide detailed gene structures in addition to the current Morex barley genome information.
{"title":"Improvement of barley genome annotations by deciphering the Haruna Nijo genome","authors":"Kazuhiro Sato, Tsuyoshi Tanaka, S. Shigenobu, Yuka Motoi, Jianzhon Wu, T. Itoh","doi":"10.1093/dnares/dsv033","DOIUrl":"https://doi.org/10.1093/dnares/dsv033","url":null,"abstract":"Full-length (FL) cDNA sequences provide the most reliable evidence for the presence of genes in genomes. In this report, detailed gene structures of barley, whole genome shotgun (WGS) and additional transcript data of the cultivar Haruna Nijo were quality controlled and compared with the published Morex genome information. Haruna Nijo scaffolds have longer total sequence length with much higher N50 and fewer sequences than those in Morex WGS contigs. The longer Haruna Nijo scaffolds provided efficient FLcDNA mapping, resulting in high coverage and detection of the transcription start sites. In combination with FLcDNAs and RNA-Seq data from four different tissue samples of Haruna Nijo, we identified 51,249 gene models on 30,606 loci. Overall sequence similarity between Haruna Nijo and Morex genome was 95.99%, while that of exon regions was higher (99.71%). These sequence and annotation data of Haruna Nijo are combined with Morex genome data and released from a genome browser. The genome sequence of Haruna Nijo may provide detailed gene structures in addition to the current Morex barley genome information.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"32 1","pages":"21 - 28"},"PeriodicalIF":0.0,"publicationDate":"2015-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83475250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Piovesan, M. Caracausi, Marco Ricci, P. Strippoli, L. Vitale, M. C. Pelleri
We have developed GeneBase, a full parser of the National Center for Biotechnology Information (NCBI) Gene database, which generates a fully structured local database with an intuitive user-friendly graphic interface for personal computers. Features of all the annotated eukaryotic genes are accessible through three main software tables, including for each entry details such as the gene summary, the gene exon/intron structure and the specific Gene Ontology attributions. The structuring of the data, the creation of additional calculation fields and the integration with nucleotide sequences allow users to make many types of comparisons and calculations that are useful for data retrieval and analysis. We provide an original example analysis of the existing introns across all the available species, through which the classic biological problem of the ‘minimal intron’ may find a solution using available data. Based on all currently available data, we can define the shortest known eukaryotic GT-AG intron length, setting the physical limit at the 30 base pair intron belonging to the human MST1L gene. This ‘model intron’ will shed light on the minimal requirement elements of recognition used for conventional splicing functioning. Remarkably, this size is indeed consistent with the sum of the splicing consensus sequence lengths.
{"title":"Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank","authors":"A. Piovesan, M. Caracausi, Marco Ricci, P. Strippoli, L. Vitale, M. C. Pelleri","doi":"10.1093/dnares/dsv028","DOIUrl":"https://doi.org/10.1093/dnares/dsv028","url":null,"abstract":"We have developed GeneBase, a full parser of the National Center for Biotechnology Information (NCBI) Gene database, which generates a fully structured local database with an intuitive user-friendly graphic interface for personal computers. Features of all the annotated eukaryotic genes are accessible through three main software tables, including for each entry details such as the gene summary, the gene exon/intron structure and the specific Gene Ontology attributions. The structuring of the data, the creation of additional calculation fields and the integration with nucleotide sequences allow users to make many types of comparisons and calculations that are useful for data retrieval and analysis. We provide an original example analysis of the existing introns across all the available species, through which the classic biological problem of the ‘minimal intron’ may find a solution using available data. Based on all currently available data, we can define the shortest known eukaryotic GT-AG intron length, setting the physical limit at the 30 base pair intron belonging to the human MST1L gene. This ‘model intron’ will shed light on the minimal requirement elements of recognition used for conventional splicing functioning. Remarkably, this size is indeed consistent with the sum of the splicing consensus sequence lengths.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"52 1","pages":"495 - 503"},"PeriodicalIF":0.0,"publicationDate":"2015-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85709469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Setsuko Shimada, Y. Makita, Tomoko Kuriyama-Kondou, M. Kawashima, Yoshiki Mochizuki, H. Hirakawa, Shusei Sato, Tetsuro Toyoda, M. Matsui
Sorghum bicolor is one of the most important crops for food and bioethanol production. Its small diploid genome and resistance to environmental stress make sorghum an attractive model for studying the functional genomics of the Saccharinae and other C4 grasses. We analyzed the domain-based functional annotation of the cDNAs using the gene ontology (GO) categories for molecular function to characterize all the genes cloned in the full-length cDNA library of sorghum. The sorghum cDNA library successfully captured a wide range of cDNA-encoded proteins with various functions. To characterize the protein function of newly identified cDNAs, a search of their deduced domains and comparative analyses in the Oryza sativa and Zea mays genomes were carried out. Furthermore, genes on the sense strand corresponding to antisense transcripts were classified based on the GO of molecular function. To add more information about these genes, we have analyzed the expression profiles using RNA-Seq of three tissues (spikelet, seed and stem) during the starch-filling phase. We performed functional analysis of tissue-specific genes and expression analysis of genes of starch biosynthesis enzymes. This functional analysis of sorghum full-length cDNAs and the transcriptome information will facilitate further analysis of the Saccharinae and grass families.
{"title":"Functional and expression analyses of transcripts based on full-length cDNAs of Sorghum bicolor","authors":"Setsuko Shimada, Y. Makita, Tomoko Kuriyama-Kondou, M. Kawashima, Yoshiki Mochizuki, H. Hirakawa, Shusei Sato, Tetsuro Toyoda, M. Matsui","doi":"10.1093/dnares/dsv030","DOIUrl":"https://doi.org/10.1093/dnares/dsv030","url":null,"abstract":"Sorghum bicolor is one of the most important crops for food and bioethanol production. Its small diploid genome and resistance to environmental stress make sorghum an attractive model for studying the functional genomics of the Saccharinae and other C4 grasses. We analyzed the domain-based functional annotation of the cDNAs using the gene ontology (GO) categories for molecular function to characterize all the genes cloned in the full-length cDNA library of sorghum. The sorghum cDNA library successfully captured a wide range of cDNA-encoded proteins with various functions. To characterize the protein function of newly identified cDNAs, a search of their deduced domains and comparative analyses in the Oryza sativa and Zea mays genomes were carried out. Furthermore, genes on the sense strand corresponding to antisense transcripts were classified based on the GO of molecular function. To add more information about these genes, we have analyzed the expression profiles using RNA-Seq of three tissues (spikelet, seed and stem) during the starch-filling phase. We performed functional analysis of tissue-specific genes and expression analysis of genes of starch biosynthesis enzymes. This functional analysis of sorghum full-length cDNAs and the transcriptome information will facilitate further analysis of the Saccharinae and grass families.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"43 1","pages":"485 - 493"},"PeriodicalIF":0.0,"publicationDate":"2015-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74362792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Codon bias is the non-random use of synonymous codons, a phenomenon that has been observed in species as diverse as bacteria, plants and mammals. The preferential use of particular synonymous codons may reflect neutral mechanisms (e.g. mutational bias, G|C-biased gene conversion, genetic drift) and/or selection for mRNA stability, translational efficiency and accuracy. The extent to which these different factors influence codon usage is unknown, so we dissected the contribution of mutational bias and selection towards codon bias in genes from 15 eudicots, 4 monocots and 2 mosses. We analysed the frequency of mononucleotides, dinucleotides and trinucleotides and investigated whether the compositional genomic background could account for the observed codon usage profiles. Neutral forces such as mutational pressure and G|C-biased gene conversion appeared to underlie most of the observed codon bias, although there was also evidence for the selection of optimal translational efficiency and mRNA folding. Our data confirmed the compositional differences between monocots and dicots, with the former featuring in general a lower background compositional bias but a higher overall codon bias.
{"title":"New insights into the interplay between codon bias determinants in plants","authors":"S. Camiolo, S. Melito, A. Porceddu","doi":"10.1093/dnares/dsv027","DOIUrl":"https://doi.org/10.1093/dnares/dsv027","url":null,"abstract":"Codon bias is the non-random use of synonymous codons, a phenomenon that has been observed in species as diverse as bacteria, plants and mammals. The preferential use of particular synonymous codons may reflect neutral mechanisms (e.g. mutational bias, G|C-biased gene conversion, genetic drift) and/or selection for mRNA stability, translational efficiency and accuracy. The extent to which these different factors influence codon usage is unknown, so we dissected the contribution of mutational bias and selection towards codon bias in genes from 15 eudicots, 4 monocots and 2 mosses. We analysed the frequency of mononucleotides, dinucleotides and trinucleotides and investigated whether the compositional genomic background could account for the observed codon usage profiles. Neutral forces such as mutational pressure and G|C-biased gene conversion appeared to underlie most of the observed codon bias, although there was also evidence for the selection of optimal translational efficiency and mRNA folding. Our data confirmed the compositional differences between monocots and dicots, with the former featuring in general a lower background compositional bias but a higher overall codon bias.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"8 1","pages":"461 - 470"},"PeriodicalIF":0.0,"publicationDate":"2015-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89782317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Takita, K. Kohda, Hajime Tomatsu, S. Hanano, Kanami Moriya, T. Hosouchi, N. Sakurai, Hideyuki Suzuki, A. Shinmyō, D. Shibata
Ligation, the joining of DNA fragments, is a fundamental procedure in molecular cloning and is indispensable to the production of genetically modified organisms that can be used for basic research, the applied bios-ciences, or both. Given that many genes cooperate in various pathways, incorporating multiple gene cassettes in tandem in a transgenic DNA construct for the purpose of genetic modification is often necessary when generating organisms that produce multiple foreign gene products. Here, we describe a novel method, designated PRESSO (precise sequential DNA ligation on a solid substrate), for the tandem ligation of multiple DNA fragments. We amplified donor DNA fragments with non-palindromic ends, and ligated the fragment to acceptor DNA fragments on solid beads. After the final donor DNA fragments, which included vector sequences, were joined to the construct that contained the array of fragments, the ligation product (the construct) was thereby released from the beads via digestion with a rare-cut meganuclease; the freed linear construct was circularized via an intra-molecular ligation. PRESSO allowed us to rapidly and efficiently join multiple genes in an optimized order and orientation. This method can overcome many technical challenges in functional genomics during the post-sequencing generation.
{"title":"Precise Sequential DNA Ligation on A Solid Substrate: Solid-Based Rapid Sequential Ligation of Multiple DNA Molecules","authors":"E. Takita, K. Kohda, Hajime Tomatsu, S. Hanano, Kanami Moriya, T. Hosouchi, N. Sakurai, Hideyuki Suzuki, A. Shinmyō, D. Shibata","doi":"10.1093/dnares/dsu037","DOIUrl":"https://doi.org/10.1093/dnares/dsu037","url":null,"abstract":"Ligation, the joining of DNA fragments, is a fundamental procedure in molecular cloning and is indispensable to the production of genetically modified organisms that can be used for basic research, the applied bios-ciences, or both. Given that many genes cooperate in various pathways, incorporating multiple gene cassettes in tandem in a transgenic DNA construct for the purpose of genetic modification is often necessary when generating organisms that produce multiple foreign gene products. Here, we describe a novel method, designated PRESSO (precise sequential DNA ligation on a solid substrate), for the tandem ligation of multiple DNA fragments. We amplified donor DNA fragments with non-palindromic ends, and ligated the fragment to acceptor DNA fragments on solid beads. After the final donor DNA fragments, which included vector sequences, were joined to the construct that contained the array of fragments, the ligation product (the construct) was thereby released from the beads via digestion with a rare-cut meganuclease; the freed linear construct was circularized via an intra-molecular ligation. PRESSO allowed us to rapidly and efficiently join multiple genes in an optimized order and orientation. This method can overcome many technical challenges in functional genomics during the post-sequencing generation.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"19 1","pages":"727 - 727"},"PeriodicalIF":0.0,"publicationDate":"2014-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89515497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
There was an error in the first three rows of Table 1 for the column B. oleracea Repeat number: the 3 should be 12. The corrected part of the table is given below. Table 1. Number, repeat number and total repeat length of the mono-to hexanucleotide repeats or motifs of microsatellites in the assembled genomic sequences of B. rapa, B. oleracea and B. napus Motif B. rapa B. oleracea which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
在表1的前三行b列中有一个错误。重复数:3应该是12。表格的更正部分如下表1。B. rapa, B. oleracea和B. napus Motif B. rapa B. oleracea基因组序列中单至六核苷酸重复序列或微卫星序列的重复数、重复数和总重复长度,该序列允许在任何介质中不受限制地重复使用、分发和复制,前提是原始作品被适当引用。
{"title":"Genome-Wide Microsatellite Characterization and Marker Development in the Sequenced Brassica Crop Species","authors":"Jiaqin Shi, Shunmou Huang, Jiepeng Zhan, Jingyin Yu, Xinfa Wang, Wei Hua, Shengyi Liu, Guihua Liu, Hanzhong Wang","doi":"10.1093/dnares/dsu011","DOIUrl":"https://doi.org/10.1093/dnares/dsu011","url":null,"abstract":"There was an error in the first three rows of Table 1 for the column B. oleracea Repeat number: the 3 should be 12. The corrected part of the table is given below. Table 1. Number, repeat number and total repeat length of the mono-to hexanucleotide repeats or motifs of microsatellites in the assembled genomic sequences of B. rapa, B. oleracea and B. napus Motif B. rapa B. oleracea which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"401 1","pages":"229 - 229"},"PeriodicalIF":0.0,"publicationDate":"2014-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79888042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}