In this study, we modified the multiple alignment method based on the generation of random position weight matrices (RPWM) and used it to search for tandem repeats (TRs) in the Capsicum annuum genome. The application of the modified (m)RPWM method, which considers the correlation of adjusting nucleotides, resulted in the identification of 908,072 TR regions with repeat lengths from 2 to 200 bp in the C. annuum genome, where they occupied ~29%. The most common TRs were 2 and 3 bp long followed by those of 21, 4, and 15 bp. We performed clustering analysis of TRs with repeat lengths of 2 and 21 bp and created position-weight matrices (PWMs) for each group; these templates could be used to search for TRs of a given length in any nucleotide sequence. All detected TRs can be accessed through publicly available database (http : //victoria.biengi.ac.ru/capsicum_tr/). Comparison of mRPWM with other TR search methods such as Tandem Repeat Finder, T-REKS, and XSTREAM indicated that mRPWM could detect significantly more TRs at similar false discovery rates, indicating its superior performance. The developed mRPWM method can be successfully applied to the identification of highly divergent TRs, which is important for functional analysis of genomes and evolutionary studies.
{"title":"Detection of tandem repeats in the Capsicum annuum genome.","authors":"Valentina Rudenko, Eugene Korotkov","doi":"10.1093/dnares/dsad007","DOIUrl":"https://doi.org/10.1093/dnares/dsad007","url":null,"abstract":"<p><p>In this study, we modified the multiple alignment method based on the generation of random position weight matrices (RPWM) and used it to search for tandem repeats (TRs) in the Capsicum annuum genome. The application of the modified (m)RPWM method, which considers the correlation of adjusting nucleotides, resulted in the identification of 908,072 TR regions with repeat lengths from 2 to 200 bp in the C. annuum genome, where they occupied ~29%. The most common TRs were 2 and 3 bp long followed by those of 21, 4, and 15 bp. We performed clustering analysis of TRs with repeat lengths of 2 and 21 bp and created position-weight matrices (PWMs) for each group; these templates could be used to search for TRs of a given length in any nucleotide sequence. All detected TRs can be accessed through publicly available database (http : //victoria.biengi.ac.ru/capsicum_tr/). Comparison of mRPWM with other TR search methods such as Tandem Repeat Finder, T-REKS, and XSTREAM indicated that mRPWM could detect significantly more TRs at similar false discovery rates, indicating its superior performance. The developed mRPWM method can be successfully applied to the identification of highly divergent TRs, which is important for functional analysis of genomes and evolutionary studies.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":" ","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10211496/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9528783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Here, we report the first telomere-to-telomere genome assembly of matsutake (Tricholoma matsutake), which consists of 13 sequences (spanning 161.0 Mb) and a 76 kb circular mitochondrial genome. All the 13 sequences were supported with telomeric repeats at the ends. GC-rich regions are located at the middle of the sequences and are enriched with long interspersed nuclear elements (LINEs). Repetitive sequences including long-terminal repeats (LTRs) and LINEs occupy 71.6% of the genome. A total of 21,887 potential protein-coding genes were predicted. The genomic data reported in this study served not only matsutake gene sequences but also genome structures and intergenic sequences. The information gained would be a great reference for exploring the genetics, genomics, and evolutionary study of matsutake in the future, and ultimately facilitate the conservation of this vulnerable genetic resource.
{"title":"Telomere-to-telomere genome assembly of matsutake (Tricholoma matsutake).","authors":"Hiroyuki Kurokochi, Naoyuki Tajima, Mitsuhiko P Sato, Kazutoshi Yoshitake, Shuichi Asakawa, Sachiko Isobe, Kenta Shirasawa","doi":"10.1093/dnares/dsad006","DOIUrl":"https://doi.org/10.1093/dnares/dsad006","url":null,"abstract":"<p><p>Here, we report the first telomere-to-telomere genome assembly of matsutake (Tricholoma matsutake), which consists of 13 sequences (spanning 161.0 Mb) and a 76 kb circular mitochondrial genome. All the 13 sequences were supported with telomeric repeats at the ends. GC-rich regions are located at the middle of the sequences and are enriched with long interspersed nuclear elements (LINEs). Repetitive sequences including long-terminal repeats (LTRs) and LINEs occupy 71.6% of the genome. A total of 21,887 potential protein-coding genes were predicted. The genomic data reported in this study served not only matsutake gene sequences but also genome structures and intergenic sequences. The information gained would be a great reference for exploring the genetics, genomics, and evolutionary study of matsutake in the future, and ultimately facilitate the conservation of this vulnerable genetic resource.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":" ","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10211490/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9523501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Phrynocephalus forsythii is a viviparous sand lizard that is endemic to the Tarim Basin with a broad altitudinal range of 872-3,100 m. Such variation in altitude and ecological variables can offer an opportunity to uncover genetic mechanisms of ectothermic adaptation to extreme environments at high- and low-altitude. Furthermore, the evolutionary relationship of karyotype with two different chromosome numbers (2n = 46 or 2n = 48) in the Chinese Phrynocephalus is unclear. In this study, a chromosome-level reference genome of P. forsythii was assembled. The genome assembly size was 1.82 Gb with a contig N50 length of 46.22 Mb, 20,194 protein-coding genes were predicted and 95.50% of these genes were annotated in functional public databases. After cluster contigs into chromosome level using Hi-C paired-end reads, we found that two chromosomes of P. forsythii were originated from one ancestral chromosome of species with 46 chromosomes. Comparative genomic analysis revealed that numerous characteristics associated with high- or low-altitude adaptation, including energy metabolism pathways, hypoxic adaptation, and immune, exhibit rapid changes or show signals of positive selection in the P. forsythii genome. This genome provides an excellent genome resource for the study of the karyotype evolution and ecological genomics of Phrynocephalus.
{"title":"Chromosome-level genome assembly of Phrynocephalus forsythii using third-generation DNA sequencing and Hi-C analysis.","authors":"Yue Qi, Wei Zhao, Yangyang Zhao, Chenkai Niu, Shuhui Cao, Yingmei Zhang","doi":"10.1093/dnares/dsad003","DOIUrl":"https://doi.org/10.1093/dnares/dsad003","url":null,"abstract":"<p><p>Phrynocephalus forsythii is a viviparous sand lizard that is endemic to the Tarim Basin with a broad altitudinal range of 872-3,100 m. Such variation in altitude and ecological variables can offer an opportunity to uncover genetic mechanisms of ectothermic adaptation to extreme environments at high- and low-altitude. Furthermore, the evolutionary relationship of karyotype with two different chromosome numbers (2n = 46 or 2n = 48) in the Chinese Phrynocephalus is unclear. In this study, a chromosome-level reference genome of P. forsythii was assembled. The genome assembly size was 1.82 Gb with a contig N50 length of 46.22 Mb, 20,194 protein-coding genes were predicted and 95.50% of these genes were annotated in functional public databases. After cluster contigs into chromosome level using Hi-C paired-end reads, we found that two chromosomes of P. forsythii were originated from one ancestral chromosome of species with 46 chromosomes. Comparative genomic analysis revealed that numerous characteristics associated with high- or low-altitude adaptation, including energy metabolism pathways, hypoxic adaptation, and immune, exhibit rapid changes or show signals of positive selection in the P. forsythii genome. This genome provides an excellent genome resource for the study of the karyotype evolution and ecological genomics of Phrynocephalus.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 2","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10113879/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9330727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
To adapt to high-altitude habitats, many alpine plants develop self-compatible breeding systems from outcrossing. The genetic bases for this shift and the resulting demographic consequences remain largely unexplored. Here, we present a high-quality, chromosome-level genome assembly of the monotypic and endangered alpine perennial Przewalskia tangutica (Solanaceae) occurring on the Qinghai-Tibet Plateau (QTP). Our assembled genome is approximately 3 Gb, with a contig N50 size of 17 Mb, and we identified one lineage-specific whole-genome duplication. We found that the gametophytic self-incompatibility (GSI) syntenic locus to the other obligate outcrossing Solanaceae species was broken by the inserted the long terminal repeats, and changes in the flower-specific expression of the homologous genes, and the linked GSI genes in this species. Such changes may have led to its self-compatibility. We identified three deeply diverged lineages in the central distribution of this species, and the gene flow between them was weak but continuous. All three lineages diverged and decreased their population sizes since the largest glaciations occurred in the QTP approximately 720-500 thousand years ago. In addition, we identified one obvious hybrid population between two lineages, suggesting that genetic exchanges between and within lineages still occur. Our results provide insights into evolutionary adaptation through facultative self-pollination and demographic consequences of this alpine rare species in arid habitats.
{"title":"The genome sequence and demographic history of Przewalskia tangutica (Solanaceae), an endangered alpine plant on the Qinghai-Tibet Plateau.","authors":"Ying Wu, Jiao Yang, Yongzhi Yang, Jianquan Liu","doi":"10.1093/dnares/dsad005","DOIUrl":"https://doi.org/10.1093/dnares/dsad005","url":null,"abstract":"<p><p>To adapt to high-altitude habitats, many alpine plants develop self-compatible breeding systems from outcrossing. The genetic bases for this shift and the resulting demographic consequences remain largely unexplored. Here, we present a high-quality, chromosome-level genome assembly of the monotypic and endangered alpine perennial Przewalskia tangutica (Solanaceae) occurring on the Qinghai-Tibet Plateau (QTP). Our assembled genome is approximately 3 Gb, with a contig N50 size of 17 Mb, and we identified one lineage-specific whole-genome duplication. We found that the gametophytic self-incompatibility (GSI) syntenic locus to the other obligate outcrossing Solanaceae species was broken by the inserted the long terminal repeats, and changes in the flower-specific expression of the homologous genes, and the linked GSI genes in this species. Such changes may have led to its self-compatibility. We identified three deeply diverged lineages in the central distribution of this species, and the gene flow between them was weak but continuous. All three lineages diverged and decreased their population sizes since the largest glaciations occurred in the QTP approximately 720-500 thousand years ago. In addition, we identified one obvious hybrid population between two lineages, suggesting that genetic exchanges between and within lineages still occur. Our results provide insights into evolutionary adaptation through facultative self-pollination and demographic consequences of this alpine rare species in arid habitats.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 2","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10119639/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9338183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tetraena mongolica is an endangered xerophytic shrub with high ecological value for the restoration of desert vegetation because of its high tolerance to drought and heat stress. Here, we generated a high-quality chromosome-level reference genome of T. mongolica by combining PacBio HiFi data and Hi-C sequencing technologies, which was approximately 1.12 Gb (contig N50 of 25.5 Mb) in size and contained 61,888 protein-coding genes; repetitive sequences comprised 44.8% of the genome. This genome of T. mongolica is the first published genome sequence of a member of the order Zygophyllales. Genome analysis showed that T. mongolica has undergone a recent whole genome duplication event, and a recent burst of long terminal repeat insertions afterward, which may be responsible for its genome size expansion and drought adaptation. We also conducted searches for gene homologues and identified terpene synthase (TPS) gene families and candidate genes involved in triacylglycerol biosynthesis. The T. mongolica genome sequence could aid future studies aimed at functional gene identification, germplasm resource management, molecular breeding efforts, as well as evolutionary studies of Fabids and angiosperm taxa.
{"title":"Chromosome-level genome assembly of the endangered plant Tetraena mongolica.","authors":"Bingru Liu, Xiaoyu Zhao, Ziyin Wang, Huili Liu, Xueshuang Huang, Peng Yang","doi":"10.1093/dnares/dsad004","DOIUrl":"https://doi.org/10.1093/dnares/dsad004","url":null,"abstract":"<p><p>Tetraena mongolica is an endangered xerophytic shrub with high ecological value for the restoration of desert vegetation because of its high tolerance to drought and heat stress. Here, we generated a high-quality chromosome-level reference genome of T. mongolica by combining PacBio HiFi data and Hi-C sequencing technologies, which was approximately 1.12 Gb (contig N50 of 25.5 Mb) in size and contained 61,888 protein-coding genes; repetitive sequences comprised 44.8% of the genome. This genome of T. mongolica is the first published genome sequence of a member of the order Zygophyllales. Genome analysis showed that T. mongolica has undergone a recent whole genome duplication event, and a recent burst of long terminal repeat insertions afterward, which may be responsible for its genome size expansion and drought adaptation. We also conducted searches for gene homologues and identified terpene synthase (TPS) gene families and candidate genes involved in triacylglycerol biosynthesis. The T. mongolica genome sequence could aid future studies aimed at functional gene identification, germplasm resource management, molecular breeding efforts, as well as evolutionary studies of Fabids and angiosperm taxa.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 2","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10113878/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9329957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yu Ning, Yang Li, Shu Bin Dong, Hong Guo Yang, Chun Yi Li, Biao Xiong, Jun Yang, Yu Kun Hu, Xian Yun Mu, Xiao Fei Xia
Kobresia species are common in meadows on the Qinghai-Tibet Plateau. They are important food resources for local livestock, and serve a critical foundation for ecosystem integration. Genetic resources of Kobresia species are scarce. Here, we generated a chromosome-level genome assembly for K. myosuroides (Cyperaceae), using PacBio long-reads, Illumina short-reads, and Hi-C technology. The final assembly had a total size of 399.9 Mb with a contig N50 value of 11.9 Mb. The Hi-C result supported a 29 pseudomolecules model which was in consistent with cytological results. A total of 185.5 Mb (44.89% of the genome) transposable elements were detected, and 26,748 protein-coding genes were predicted. Comparative analysis revealed that Kobresia plants have experienced recent diversification events during the late Miocene to Pliocene. Karyotypes analysis indicated that the fission and fusion of chromosomes have been a major driver of speciation, which complied with the lack of whole-genome duplication (WGD) in K. myosuroides genome. Generally, this high-quality reference genome provides insights into the evolution of alpine sedges, and may be helpful to endemic forage improvement and alpine ecosystem preservation.
{"title":"The chromosome-scale genome of Kobresia myosuroides sheds light on karyotype evolution and recent diversification of a dominant herb group on the Qinghai-Tibet Plateau.","authors":"Yu Ning, Yang Li, Shu Bin Dong, Hong Guo Yang, Chun Yi Li, Biao Xiong, Jun Yang, Yu Kun Hu, Xian Yun Mu, Xiao Fei Xia","doi":"10.1093/dnares/dsac049","DOIUrl":"https://doi.org/10.1093/dnares/dsac049","url":null,"abstract":"<p><p>Kobresia species are common in meadows on the Qinghai-Tibet Plateau. They are important food resources for local livestock, and serve a critical foundation for ecosystem integration. Genetic resources of Kobresia species are scarce. Here, we generated a chromosome-level genome assembly for K. myosuroides (Cyperaceae), using PacBio long-reads, Illumina short-reads, and Hi-C technology. The final assembly had a total size of 399.9 Mb with a contig N50 value of 11.9 Mb. The Hi-C result supported a 29 pseudomolecules model which was in consistent with cytological results. A total of 185.5 Mb (44.89% of the genome) transposable elements were detected, and 26,748 protein-coding genes were predicted. Comparative analysis revealed that Kobresia plants have experienced recent diversification events during the late Miocene to Pliocene. Karyotypes analysis indicated that the fission and fusion of chromosomes have been a major driver of speciation, which complied with the lack of whole-genome duplication (WGD) in K. myosuroides genome. Generally, this high-quality reference genome provides insights into the evolution of alpine sedges, and may be helpful to endemic forage improvement and alpine ecosystem preservation.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9835760/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10537969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mycorrhizae are one of the most fundamental symbioses between plants and fungi, with ectomycorrhizae being the most widespread in boreal forest ecosystems. Ectomycorrhizal fungi are hypothesized to have evolved convergently from saprotrophic ancestors in several fungal clades, especially members of the subdivision Agaricomycotina. Studies on fungal genomes have identified several typical characteristics of mycorrhizal fungi, such as genome size expansion and decreases in plant cell-wall degrading enzymes (PCWDEs). However, genomic changes concerning the evolutionary transition to the ectomycorrhizal lifestyle are largely unknown. In this study, we sequenced the genome of Lyophyllum shimeji, an ectomycorrhizal fungus that is phylogenetically related to saprotrophic species and retains some saprotroph-like traits. We found that the genome of Ly. shimeji strain AT787 lacks both incremental increases in genome size and reduced numbers of PCWDEs. Our findings suggest that the previously reported common genomic traits of mycorrhizal fungi are not essential for the ectomycorrhizal lifestyle, but are a result of abolishing saprotrophic activity. Since Ly. shimeji is commercially consumed as an edible mushroom, the newly available genomic information may also impact research designed to enhance the cultivation of this mushroom.
{"title":"The genome of Lyophyllum shimeji provides insight into the initial evolution of ectomycorrhizal fungal genomes.","authors":"Yuuki Kobayashi, Tomoko F Shibata, Hideki Hirakawa, Tomoaki Nishiyama, Akiyoshi Yamada, Mitsuyasu Hasebe, Shuji Shigenobu, Masayoshi Kawaguchi","doi":"10.1093/dnares/dsac053","DOIUrl":"https://doi.org/10.1093/dnares/dsac053","url":null,"abstract":"<p><p>Mycorrhizae are one of the most fundamental symbioses between plants and fungi, with ectomycorrhizae being the most widespread in boreal forest ecosystems. Ectomycorrhizal fungi are hypothesized to have evolved convergently from saprotrophic ancestors in several fungal clades, especially members of the subdivision Agaricomycotina. Studies on fungal genomes have identified several typical characteristics of mycorrhizal fungi, such as genome size expansion and decreases in plant cell-wall degrading enzymes (PCWDEs). However, genomic changes concerning the evolutionary transition to the ectomycorrhizal lifestyle are largely unknown. In this study, we sequenced the genome of Lyophyllum shimeji, an ectomycorrhizal fungus that is phylogenetically related to saprotrophic species and retains some saprotroph-like traits. We found that the genome of Ly. shimeji strain AT787 lacks both incremental increases in genome size and reduced numbers of PCWDEs. Our findings suggest that the previously reported common genomic traits of mycorrhizal fungi are not essential for the ectomycorrhizal lifestyle, but are a result of abolishing saprotrophic activity. Since Ly. shimeji is commercially consumed as an edible mushroom, the newly available genomic information may also impact research designed to enhance the cultivation of this mushroom.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/24/22/dsac053.PMC9896470.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10649995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ladybird beetle Henosepilachna vigintioctomaculata is an economically significant oligophagous pest that induces damage to many Solanaceae crops. An increasing number of studies have examined the population and phenotype diversity of ladybird beetles. However, few comparative genome analyses of ladybird beetle species have been conducted. Here, we obtained a high-quality chromosome-level genome assembly of H. vigintioctomaculata using various sequencing technologies, and the chromosome-level genome assembly was ~581.63 Mb, with 11 chromosomes successfully assembled. The phylogenetic analysis showed that H. vigintioctomaculata is a more ancient lineage than the other three sequenced ladybird beetles, Harmonia axyridis, Propylea japonica, and Coccinella septempunctata. We also compared positively selected genes (PSGs), transposable elements (TEs) ratios and insertion times, and key gene families associated with environmental adaptation among these ladybird beetles. The pattern of TEs evolution of H. vigintioctomaculata differs from the other three ladybird beetles. The PSGs were associated with ladybird beetles development. However, the key gene families associated with environmental adaptation in ladybird beetles varied. Overall, the high-quality draft genome sequence of H. vigintioctomaculata provides a useful resource for studies of beetle biology, especially for the invasive biology of ladybird beetles.
{"title":"A chromosome-level genome assembly of the Henosepilachna vigintioctomaculata provides insights into the evolution of ladybird beetles.","authors":"Wenbo Zhu, Shengqi Chi, Yanchun Wang, Haorong Li, Zhongkai Wang, Songdong Gu, Ting Sun, Hui Xiang, Ping You, Yandong Ren","doi":"10.1093/dnares/dsad001","DOIUrl":"https://doi.org/10.1093/dnares/dsad001","url":null,"abstract":"<p><p>The ladybird beetle Henosepilachna vigintioctomaculata is an economically significant oligophagous pest that induces damage to many Solanaceae crops. An increasing number of studies have examined the population and phenotype diversity of ladybird beetles. However, few comparative genome analyses of ladybird beetle species have been conducted. Here, we obtained a high-quality chromosome-level genome assembly of H. vigintioctomaculata using various sequencing technologies, and the chromosome-level genome assembly was ~581.63 Mb, with 11 chromosomes successfully assembled. The phylogenetic analysis showed that H. vigintioctomaculata is a more ancient lineage than the other three sequenced ladybird beetles, Harmonia axyridis, Propylea japonica, and Coccinella septempunctata. We also compared positively selected genes (PSGs), transposable elements (TEs) ratios and insertion times, and key gene families associated with environmental adaptation among these ladybird beetles. The pattern of TEs evolution of H. vigintioctomaculata differs from the other three ladybird beetles. The PSGs were associated with ladybird beetles development. However, the key gene families associated with environmental adaptation in ladybird beetles varied. Overall, the high-quality draft genome sequence of H. vigintioctomaculata provides a useful resource for studies of beetle biology, especially for the invasive biology of ladybird beetles.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/4f/4d/dsad001.PMC9936504.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10807136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Perilla frutescens (Lamiaceae) is an important herbal plant with hundreds of bioactive chemicals, among which perillaldehyde and rosmarinic acid are the two major bioactive compounds in the plant. The leaves of red perilla are used as traditional Kampo medicine or food ingredients. However, the medicinal and nutritional uses of this plant could be improved by enhancing the production of valuable metabolites through the manipulation of key enzymes or regulatory genes using genome editing technology. Here, we generated a high-quality genome assembly of red perilla domesticated in Japan. A near-complete chromosome-level assembly of P. frutescens was generated contigs with N50 of 41.5 Mb from PacBio HiFi reads. 99.2% of the assembly was anchored into 20 pseudochromosomes, among which seven pseudochromosomes consisted of one contig, while the rest consisted of less than six contigs. Gene annotation and prediction of the sequences successfully predicted 86,258 gene models, including 76,825 protein-coding genes. Further analysis showed that potential targets of genome editing for the engineering of anthocyanin pathways in P. frutescens are located on the late-stage pathways. Overall, our genome assembly could serve as a valuable reference for selecting target genes for genome editing of P. frutescens.
{"title":"A highly contiguous genome assembly of red perilla (Perilla frutescens) domesticated in Japan.","authors":"Keita Tamura, Mika Sakamoto, Yasuhiro Tanizawa, Takako Mochizuki, Shuji Matsushita, Yoshihiro Kato, Takeshi Ishikawa, Keisuke Okuhara, Yasukazu Nakamura, Hidemasa Bono","doi":"10.1093/dnares/dsac044","DOIUrl":"10.1093/dnares/dsac044","url":null,"abstract":"<p><p>Perilla frutescens (Lamiaceae) is an important herbal plant with hundreds of bioactive chemicals, among which perillaldehyde and rosmarinic acid are the two major bioactive compounds in the plant. The leaves of red perilla are used as traditional Kampo medicine or food ingredients. However, the medicinal and nutritional uses of this plant could be improved by enhancing the production of valuable metabolites through the manipulation of key enzymes or regulatory genes using genome editing technology. Here, we generated a high-quality genome assembly of red perilla domesticated in Japan. A near-complete chromosome-level assembly of P. frutescens was generated contigs with N50 of 41.5 Mb from PacBio HiFi reads. 99.2% of the assembly was anchored into 20 pseudochromosomes, among which seven pseudochromosomes consisted of one contig, while the rest consisted of less than six contigs. Gene annotation and prediction of the sequences successfully predicted 86,258 gene models, including 76,825 protein-coding genes. Further analysis showed that potential targets of genome editing for the engineering of anthocyanin pathways in P. frutescens are located on the late-stage pathways. Overall, our genome assembly could serve as a valuable reference for selecting target genes for genome editing of P. frutescens.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"30 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9835750/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10593700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}