Xi Liu , Linlin Cai , Zhiming Zhou , Peiming Huang , Zhonglu Ren
{"title":"基于从头开始的转录组组装和注释预测双花鳞栉草中的环苷酸","authors":"Xi Liu , Linlin Cai , Zhiming Zhou , Peiming Huang , Zhonglu Ren","doi":"10.1016/j.jhip.2024.06.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>There is a scarcity of transcriptome sequencing data available for the <em>Leptopetalum biflorum</em>, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on <em>de novo</em> transcriptome assembly and make systematic prediction of cyclotides in <em>Leptopetalum biflorum</em>, to provide a reference for functional analysis of cyclotides.</p></div><div><h3>Methods</h3><p>In this study, we performed RNA-seq on roots, leaves, and flowers of <em>Leptopetalum biflorum</em> to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. <em>De novo</em> transcriptome assembly of <em>Leptopetalum biflorum</em> was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in <em>Leptopetalum biflorum</em> via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team.</p></div><div><h3>Results</h3><p>Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results.</p></div><div><h3>Conclusion</h3><p>In this study, we developed a <em>de novo</em> transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from <em>Leptopetalum biflorum</em>. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides.</p></div>","PeriodicalId":100787,"journal":{"name":"Journal of Holistic Integrative Pharmacy","volume":"5 2","pages":"Pages 103-112"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2707368824000323/pdfft?md5=685b54ead1fa41e2dca6c49fac4eb96e&pid=1-s2.0-S2707368824000323-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation\",\"authors\":\"Xi Liu , Linlin Cai , Zhiming Zhou , Peiming Huang , Zhonglu Ren\",\"doi\":\"10.1016/j.jhip.2024.06.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><p>There is a scarcity of transcriptome sequencing data available for the <em>Leptopetalum biflorum</em>, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on <em>de novo</em> transcriptome assembly and make systematic prediction of cyclotides in <em>Leptopetalum biflorum</em>, to provide a reference for functional analysis of cyclotides.</p></div><div><h3>Methods</h3><p>In this study, we performed RNA-seq on roots, leaves, and flowers of <em>Leptopetalum biflorum</em> to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. <em>De novo</em> transcriptome assembly of <em>Leptopetalum biflorum</em> was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in <em>Leptopetalum biflorum</em> via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team.</p></div><div><h3>Results</h3><p>Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results.</p></div><div><h3>Conclusion</h3><p>In this study, we developed a <em>de novo</em> transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from <em>Leptopetalum biflorum</em>. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides.</p></div>\",\"PeriodicalId\":100787,\"journal\":{\"name\":\"Journal of Holistic Integrative Pharmacy\",\"volume\":\"5 2\",\"pages\":\"Pages 103-112\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2707368824000323/pdfft?md5=685b54ead1fa41e2dca6c49fac4eb96e&pid=1-s2.0-S2707368824000323-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Holistic Integrative Pharmacy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2707368824000323\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Holistic Integrative Pharmacy","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2707368824000323","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
Objective
There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to provide a reference for functional analysis of cyclotides.
Methods
In this study, we performed RNA-seq on roots, leaves, and flowers of Leptopetalum biflorum to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. De novo transcriptome assembly of Leptopetalum biflorum was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in Leptopetalum biflorum via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team.
Results
Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results.
Conclusion
In this study, we developed a de novo transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from Leptopetalum biflorum. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides.