首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
TcEVdb: a database for T-cell-derived small extracellular vesicles from single-cell transcriptomes. TcEVdb:来自单细胞转录组的t细胞衍生的小细胞外囊泡数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-28 DOI: 10.1093/database/baaf012
Tao Luo, Wen-Kang Shen, Chu-Yu Zhang, Dan-Dan Song, Xiu-Qing Zhang, An-Yuan Guo, Qian Lei

T-Cell-derived extracellular vesicles (TcEVs) play key roles in immune regulation and tumor microenvironment modulation. However, the heterogeneity of TcEV remains poorly understood due to technical limitations of EV analysis and the lack of comprehensive data. To address this, we constructed TcEVdb, a comprehensive database that explores the expression and cluster of TcEV by the SEVtras method from T-cell single-cell RNA sequencing data. TcEVdb contains 277 265 EV droplets from 51 T-cell types across 221 samples from 21 projects, covering 9 tissue sources and 23 disease conditions. The database provides two main functional modules. The Browse module enables users to investigate EV secretion activity indices across samples, visualize TcEV clusters, analyze differentially expressed genes (DEGs) and pathway enrichment in TcEV subpopulations, and compare TcEV transcriptomes with their cellular origins. The Search module allows users to query specific genes across all datasets and visualize their expression distribution. Furthermore, our analysis of TcEV in diffuse large B-cell lymphoma revealed increased EV secretion in CD4+ T exhausted cells compared to healthy controls. Subsequent analyses identified distinct droplet clusters with differential expression genes, including clusters enriched for genes associated with cell motility and mitochondrial function. Overall, TcEVdb serves as a comprehensive resource for exploring the transcriptome of TcEV, which will contribute to advancements in EV-based diagnostics and therapeutics across a wide range of diseases. Database URL: https://guolab.wchscu.cn/TcEVdb.

t细胞来源的细胞外囊泡(tcev)在免疫调节和肿瘤微环境调节中发挥关键作用。然而,由于EV分析的技术限制和缺乏全面的数据,TcEV的异质性仍然知之甚少。为了解决这个问题,我们构建了TcEVdb,这是一个综合数据库,通过SEVtras方法从t细胞单细胞RNA测序数据中探索TcEV的表达和集群。TcEVdb包含来自21个项目221个样本的51种t细胞类型的277 265个EV液滴,涵盖9个组织来源和23种疾病。数据库主要提供两个功能模块。浏览模块使用户能够研究样本中的EV分泌活性指数,可视化TcEV簇,分析TcEV亚群中的差异表达基因(DEGs)和途径富集,并比较TcEV转录组与其细胞起源。Search模块允许用户查询所有数据集中的特定基因,并可视化其表达分布。此外,我们对弥漫性大b细胞淋巴瘤的TcEV分析显示,与健康对照相比,CD4+ T耗竭细胞的EV分泌增加。随后的分析发现,不同的液滴簇具有不同的表达基因,包括与细胞运动和线粒体功能相关的基因富集的簇。总的来说,TcEVdb是探索TcEV转录组的综合资源,这将有助于在广泛的疾病中推进基于ev的诊断和治疗。数据库地址:https://guolab.wchscu.cn/TcEVdb。
{"title":"TcEVdb: a database for T-cell-derived small extracellular vesicles from single-cell transcriptomes.","authors":"Tao Luo, Wen-Kang Shen, Chu-Yu Zhang, Dan-Dan Song, Xiu-Qing Zhang, An-Yuan Guo, Qian Lei","doi":"10.1093/database/baaf012","DOIUrl":"10.1093/database/baaf012","url":null,"abstract":"<p><p>T-Cell-derived extracellular vesicles (TcEVs) play key roles in immune regulation and tumor microenvironment modulation. However, the heterogeneity of TcEV remains poorly understood due to technical limitations of EV analysis and the lack of comprehensive data. To address this, we constructed TcEVdb, a comprehensive database that explores the expression and cluster of TcEV by the SEVtras method from T-cell single-cell RNA sequencing data. TcEVdb contains 277 265 EV droplets from 51 T-cell types across 221 samples from 21 projects, covering 9 tissue sources and 23 disease conditions. The database provides two main functional modules. The Browse module enables users to investigate EV secretion activity indices across samples, visualize TcEV clusters, analyze differentially expressed genes (DEGs) and pathway enrichment in TcEV subpopulations, and compare TcEV transcriptomes with their cellular origins. The Search module allows users to query specific genes across all datasets and visualize their expression distribution. Furthermore, our analysis of TcEV in diffuse large B-cell lymphoma revealed increased EV secretion in CD4+ T exhausted cells compared to healthy controls. Subsequent analyses identified distinct droplet clusters with differential expression genes, including clusters enriched for genes associated with cell motility and mitochondrial function. Overall, TcEVdb serves as a comprehensive resource for exploring the transcriptome of TcEV, which will contribute to advancements in EV-based diagnostics and therapeutics across a wide range of diseases. Database URL: https://guolab.wchscu.cn/TcEVdb.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11885782/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143555417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TcEVdb: a database for T-cell-derived small extracellular vesicles from single-cell transcriptomes. TcEVdb:来自单细胞转录组的t细胞衍生的小细胞外囊泡数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-28 DOI: 10.1093/database/baaf012
Tao Luo, Wen-Kang Shen, Chu-Yu Zhang, Dan-Dan Song, Xiu-Qing Zhang, An-Yuan Guo, Qian Lei

T-Cell-derived extracellular vesicles (TcEVs) play key roles in immune regulation and tumor microenvironment modulation. However, the heterogeneity of TcEV remains poorly understood due to technical limitations of EV analysis and the lack of comprehensive data. To address this, we constructed TcEVdb, a comprehensive database that explores the expression and cluster of TcEV by the SEVtras method from T-cell single-cell RNA sequencing data. TcEVdb contains 277 265 EV droplets from 51 T-cell types across 221 samples from 21 projects, covering 9 tissue sources and 23 disease conditions. The database provides two main functional modules. The Browse module enables users to investigate EV secretion activity indices across samples, visualize TcEV clusters, analyze differentially expressed genes (DEGs) and pathway enrichment in TcEV subpopulations, and compare TcEV transcriptomes with their cellular origins. The Search module allows users to query specific genes across all datasets and visualize their expression distribution. Furthermore, our analysis of TcEV in diffuse large B-cell lymphoma revealed increased EV secretion in CD4+ T exhausted cells compared to healthy controls. Subsequent analyses identified distinct droplet clusters with differential expression genes, including clusters enriched for genes associated with cell motility and mitochondrial function. Overall, TcEVdb serves as a comprehensive resource for exploring the transcriptome of TcEV, which will contribute to advancements in EV-based diagnostics and therapeutics across a wide range of diseases. Database URL: https://guolab.wchscu.cn/TcEVdb.

t细胞来源的细胞外囊泡(tcev)在免疫调节和肿瘤微环境调节中发挥关键作用。然而,由于EV分析的技术限制和缺乏全面的数据,TcEV的异质性仍然知之甚少。为了解决这个问题,我们构建了TcEVdb,这是一个综合数据库,通过SEVtras方法从t细胞单细胞RNA测序数据中探索TcEV的表达和集群。TcEVdb包含来自21个项目221个样本的51种t细胞类型的277 265个EV液滴,涵盖9个组织来源和23种疾病。数据库主要提供两个功能模块。浏览模块使用户能够研究样本中的EV分泌活性指数,可视化TcEV簇,分析TcEV亚群中的差异表达基因(DEGs)和途径富集,并比较TcEV转录组与其细胞起源。Search模块允许用户查询所有数据集中的特定基因,并可视化其表达分布。此外,我们对弥漫性大b细胞淋巴瘤的TcEV分析显示,与健康对照相比,CD4+ T耗竭细胞的EV分泌增加。随后的分析发现,不同的液滴簇具有不同的表达基因,包括与细胞运动和线粒体功能相关的基因富集的簇。总的来说,TcEVdb是探索TcEV转录组的综合资源,这将有助于在广泛的疾病中推进基于ev的诊断和治疗。数据库地址:https://guolab.wchscu.cn/TcEVdb。
{"title":"TcEVdb: a database for T-cell-derived small extracellular vesicles from single-cell transcriptomes.","authors":"Tao Luo, Wen-Kang Shen, Chu-Yu Zhang, Dan-Dan Song, Xiu-Qing Zhang, An-Yuan Guo, Qian Lei","doi":"10.1093/database/baaf012","DOIUrl":"https://doi.org/10.1093/database/baaf012","url":null,"abstract":"<p><p>T-Cell-derived extracellular vesicles (TcEVs) play key roles in immune regulation and tumor microenvironment modulation. However, the heterogeneity of TcEV remains poorly understood due to technical limitations of EV analysis and the lack of comprehensive data. To address this, we constructed TcEVdb, a comprehensive database that explores the expression and cluster of TcEV by the SEVtras method from T-cell single-cell RNA sequencing data. TcEVdb contains 277 265 EV droplets from 51 T-cell types across 221 samples from 21 projects, covering 9 tissue sources and 23 disease conditions. The database provides two main functional modules. The Browse module enables users to investigate EV secretion activity indices across samples, visualize TcEV clusters, analyze differentially expressed genes (DEGs) and pathway enrichment in TcEV subpopulations, and compare TcEV transcriptomes with their cellular origins. The Search module allows users to query specific genes across all datasets and visualize their expression distribution. Furthermore, our analysis of TcEV in diffuse large B-cell lymphoma revealed increased EV secretion in CD4+ T exhausted cells compared to healthy controls. Subsequent analyses identified distinct droplet clusters with differential expression genes, including clusters enriched for genes associated with cell motility and mitochondrial function. Overall, TcEVdb serves as a comprehensive resource for exploring the transcriptome of TcEV, which will contribute to advancements in EV-based diagnostics and therapeutics across a wide range of diseases. Database URL: https://guolab.wchscu.cn/TcEVdb.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MANUDB: database and application to retrieve and visualize mammalian NUMTs. 检索和可视化哺乳动物numt的数据库和应用程序。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-22 DOI: 10.1093/database/baaf009
Bálint Biró, Zoltán Gál, Zsófia Nagy, Juan Francisco Garcia, Tsend-Ayush Batbold, Orsolya Ivett Hoffmann

There is an ongoing genetic flow from the mitochondrial genome to the nuclear genome. The mitochondrial sequences that have integrated into the nuclear genome have been shown to be drivers of evolutionary processes and cancerous transformations. In addition to their fundamental biological importance, these sequences have significant consequences for genome assembly and phylogenetic and forensic analyses as well. Previously, our research group developed a computational pipeline that provides a uniform way of identifying these sequences in mammalian genomes. In this paper, we publish MANUDB-the MAmmalian NUclear mitochondrial sequences DataBase, which makes the results of our pipeline publicly accessible. With MANUDB one can retrieve and visualize mitochondrial genome fragments that have been integrated into the nuclear genome of mammalian species. Database URL: manudb.streamlit.app.

从线粒体基因组到核基因组有一个持续的遗传流动。已经整合到核基因组中的线粒体序列已被证明是进化过程和癌变的驱动因素。除了它们的基本生物学重要性外,这些序列对基因组组装、系统发育和法医分析也有重要的影响。以前,我们的研究小组开发了一种计算管道,提供了一种在哺乳动物基因组中识别这些序列的统一方法。在本文中,我们发布了manudb -哺乳动物核线粒体序列数据库,使我们的管道结果公开访问。使用MANUDB可以检索和可视化已经整合到哺乳动物物种核基因组中的线粒体基因组片段。数据库URL: manudb. streamlite .app。
{"title":"MANUDB: database and application to retrieve and visualize mammalian NUMTs.","authors":"Bálint Biró, Zoltán Gál, Zsófia Nagy, Juan Francisco Garcia, Tsend-Ayush Batbold, Orsolya Ivett Hoffmann","doi":"10.1093/database/baaf009","DOIUrl":"https://doi.org/10.1093/database/baaf009","url":null,"abstract":"<p><p>There is an ongoing genetic flow from the mitochondrial genome to the nuclear genome. The mitochondrial sequences that have integrated into the nuclear genome have been shown to be drivers of evolutionary processes and cancerous transformations. In addition to their fundamental biological importance, these sequences have significant consequences for genome assembly and phylogenetic and forensic analyses as well. Previously, our research group developed a computational pipeline that provides a uniform way of identifying these sequences in mammalian genomes. In this paper, we publish MANUDB-the MAmmalian NUclear mitochondrial sequences DataBase, which makes the results of our pipeline publicly accessible. With MANUDB one can retrieve and visualize mitochondrial genome fragments that have been integrated into the nuclear genome of mammalian species. Database URL: manudb.streamlit.app.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PotatoBSLnc: a curated repository of potato long noncoding RNAs in response to biotic stress. PotatoBSLnc:马铃薯长链非编码rna的资源库,以应对生物胁迫。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-22 DOI: 10.1093/database/baaf015
Pingping Huang, Weilin Cao, Zhaojun Li, Qingshuai Chen, Guangchao Wang, Bailing Zhou, Jihua Wang

The biotic stress significantly influences the production of potato (Solanum tuberosum L.) all over the world. Long noncoding RNAs (lncRNAs) play key roles in the plant response to environmental stressors. However, their roles in potato resistance to pathogens, insects, and other biotic stress are still unclear. The PotatoBSLnc is a database for the study of potato lncRNAs in response to major biotic stress. Here, we collected 364 RNA sequencing (RNA-seq) data derived from 12 kinds of biotic stresses in 26 cultivars and wild potatoes. PotatoBSLnc currently contains 18 636 lncRNAs and 44 263 mRNAs. In addition, to select the functional lncRNAs and mRNAs under different stresses, the differential expression analyses and the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses related to the cis/trans-targets of differentially expressed lncRNAs (DElncRNAs) and to the differentially expressed mRNAs (DEmRNAs) were also conducted. The database contains five modules: Home, Browse, Expression, Biotic stress, and Download. Among these, the "Browse" module can be used to search detailed information about RNA-seq data (disease, cultivator, organ types, treatment of samples, and others), the exon numbers, length, location, and sequence of each lncRNA/mRNA. The "Expression" module can be used to search the transcripts per million/raw count value of lncRNAs/mRNAs at different RNA-seq data. The "Biotic stress" module shows the results of differential expression analyses under each of the 12 biotic stresses, the cis/trans-targets of DElncRNAs, the GO and KEGG analysis results of DEmRNAs, and the targets of DElncRNAs. The PotatoBSLnc platform provides researchers with detailed information on potato lncRNAs and mRNAs under biotic stress, which can speed up the breeding of resistant varieties based on the molecular methods. Database URL: https://www.sdklab-biophysics-dzu.net/PotatoBSLnc.

生物胁迫对马铃薯(Solanum tuberosum L.)产量有显著影响。长链非编码rna (lncRNAs)在植物对环境胁迫的反应中起着关键作用。然而,它们在马铃薯抵抗病原体、昆虫和其他生物胁迫中的作用仍不清楚。PotatoBSLnc是一个研究马铃薯lncrna对主要生物胁迫反应的数据库。在此,我们收集了来自26个品种和野生马铃薯的12种生物胁迫的364个RNA测序(RNA-seq)数据。PotatoBSLnc目前包含18 636个lncrna和44 263个mrna。此外,为了选择不同胁迫下的功能性lncRNAs和mrna,我们还进行了差异表达分析,以及与差异表达lncRNAs (DElncRNAs)和差异表达mrna (DEmRNAs)的顺式/反式靶标相关的基因本体(GO)和京都基因与基因组百科全书(KEGG)分析。数据库包含五个模块:首页、浏览、表达、生物压力和下载。其中,“Browse”模块可用于查询RNA-seq数据的详细信息(疾病、培养、器官类型、样品处理等),以及每个lncRNA/mRNA的外显子数目、长度、位置和序列。“Expression”模块可用于查询不同RNA-seq数据下lncRNAs/ mrna的转录本/原始计数值。“生物胁迫”模块显示了12种生物胁迫下的差异表达分析结果、DElncRNAs的顺式/反式靶标、demrna的GO和KEGG分析结果以及DElncRNAs的靶标。PotatoBSLnc平台为研究人员提供了生物胁迫下马铃薯lncrna和mrna的详细信息,可以加快基于分子方法的抗性品种的选育。数据库地址:https://www.sdklab-biophysics-dzu.net/PotatoBSLnc。
{"title":"PotatoBSLnc: a curated repository of potato long noncoding RNAs in response to biotic stress.","authors":"Pingping Huang, Weilin Cao, Zhaojun Li, Qingshuai Chen, Guangchao Wang, Bailing Zhou, Jihua Wang","doi":"10.1093/database/baaf015","DOIUrl":"10.1093/database/baaf015","url":null,"abstract":"<p><p>The biotic stress significantly influences the production of potato (Solanum tuberosum L.) all over the world. Long noncoding RNAs (lncRNAs) play key roles in the plant response to environmental stressors. However, their roles in potato resistance to pathogens, insects, and other biotic stress are still unclear. The PotatoBSLnc is a database for the study of potato lncRNAs in response to major biotic stress. Here, we collected 364 RNA sequencing (RNA-seq) data derived from 12 kinds of biotic stresses in 26 cultivars and wild potatoes. PotatoBSLnc currently contains 18 636 lncRNAs and 44 263 mRNAs. In addition, to select the functional lncRNAs and mRNAs under different stresses, the differential expression analyses and the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses related to the cis/trans-targets of differentially expressed lncRNAs (DElncRNAs) and to the differentially expressed mRNAs (DEmRNAs) were also conducted. The database contains five modules: Home, Browse, Expression, Biotic stress, and Download. Among these, the \"Browse\" module can be used to search detailed information about RNA-seq data (disease, cultivator, organ types, treatment of samples, and others), the exon numbers, length, location, and sequence of each lncRNA/mRNA. The \"Expression\" module can be used to search the transcripts per million/raw count value of lncRNAs/mRNAs at different RNA-seq data. The \"Biotic stress\" module shows the results of differential expression analyses under each of the 12 biotic stresses, the cis/trans-targets of DElncRNAs, the GO and KEGG analysis results of DEmRNAs, and the targets of DElncRNAs. The PotatoBSLnc platform provides researchers with detailed information on potato lncRNAs and mRNAs under biotic stress, which can speed up the breeding of resistant varieties based on the molecular methods. Database URL: https://www.sdklab-biophysics-dzu.net/PotatoBSLnc.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11846501/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143476331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MANUDB: database and application to retrieve and visualize mammalian NUMTs. 检索和可视化哺乳动物numt的数据库和应用程序。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-22 DOI: 10.1093/database/baaf009
Bálint Biró, Zoltán Gál, Zsófia Nagy, Juan Francisco Garcia, Tsend-Ayush Batbold, Orsolya Ivett Hoffmann

There is an ongoing genetic flow from the mitochondrial genome to the nuclear genome. The mitochondrial sequences that have integrated into the nuclear genome have been shown to be drivers of evolutionary processes and cancerous transformations. In addition to their fundamental biological importance, these sequences have significant consequences for genome assembly and phylogenetic and forensic analyses as well. Previously, our research group developed a computational pipeline that provides a uniform way of identifying these sequences in mammalian genomes. In this paper, we publish MANUDB-the MAmmalian NUclear mitochondrial sequences DataBase, which makes the results of our pipeline publicly accessible. With MANUDB one can retrieve and visualize mitochondrial genome fragments that have been integrated into the nuclear genome of mammalian species. Database URL: manudb.streamlit.app.

从线粒体基因组到核基因组有一个持续的遗传流动。已经整合到核基因组中的线粒体序列已被证明是进化过程和癌变的驱动因素。除了它们的基本生物学重要性外,这些序列对基因组组装、系统发育和法医分析也有重要的影响。以前,我们的研究小组开发了一种计算管道,提供了一种在哺乳动物基因组中识别这些序列的统一方法。在本文中,我们发布了manudb -哺乳动物核线粒体序列数据库,使我们的管道结果公开访问。使用MANUDB可以检索和可视化已经整合到哺乳动物物种核基因组中的线粒体基因组片段。数据库URL: manudb. streamlite .app。
{"title":"MANUDB: database and application to retrieve and visualize mammalian NUMTs.","authors":"Bálint Biró, Zoltán Gál, Zsófia Nagy, Juan Francisco Garcia, Tsend-Ayush Batbold, Orsolya Ivett Hoffmann","doi":"10.1093/database/baaf009","DOIUrl":"10.1093/database/baaf009","url":null,"abstract":"<p><p>There is an ongoing genetic flow from the mitochondrial genome to the nuclear genome. The mitochondrial sequences that have integrated into the nuclear genome have been shown to be drivers of evolutionary processes and cancerous transformations. In addition to their fundamental biological importance, these sequences have significant consequences for genome assembly and phylogenetic and forensic analyses as well. Previously, our research group developed a computational pipeline that provides a uniform way of identifying these sequences in mammalian genomes. In this paper, we publish MANUDB-the MAmmalian NUclear mitochondrial sequences DataBase, which makes the results of our pipeline publicly accessible. With MANUDB one can retrieve and visualize mitochondrial genome fragments that have been integrated into the nuclear genome of mammalian species. Database URL: manudb.streamlit.app.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11845865/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143476330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PotatoBSLnc: a curated repository of potato long noncoding RNAs in response to biotic stress. PotatoBSLnc:马铃薯长链非编码rna的资源库,以应对生物胁迫。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-22 DOI: 10.1093/database/baaf015
Pingping Huang, Weilin Cao, Zhaojun Li, Qingshuai Chen, Guangchao Wang, Bailing Zhou, Jihua Wang

The biotic stress significantly influences the production of potato (Solanum tuberosum L.) all over the world. Long noncoding RNAs (lncRNAs) play key roles in the plant response to environmental stressors. However, their roles in potato resistance to pathogens, insects, and other biotic stress are still unclear. The PotatoBSLnc is a database for the study of potato lncRNAs in response to major biotic stress. Here, we collected 364 RNA sequencing (RNA-seq) data derived from 12 kinds of biotic stresses in 26 cultivars and wild potatoes. PotatoBSLnc currently contains 18 636 lncRNAs and 44 263 mRNAs. In addition, to select the functional lncRNAs and mRNAs under different stresses, the differential expression analyses and the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses related to the cis/trans-targets of differentially expressed lncRNAs (DElncRNAs) and to the differentially expressed mRNAs (DEmRNAs) were also conducted. The database contains five modules: Home, Browse, Expression, Biotic stress, and Download. Among these, the "Browse" module can be used to search detailed information about RNA-seq data (disease, cultivator, organ types, treatment of samples, and others), the exon numbers, length, location, and sequence of each lncRNA/mRNA. The "Expression" module can be used to search the transcripts per million/raw count value of lncRNAs/mRNAs at different RNA-seq data. The "Biotic stress" module shows the results of differential expression analyses under each of the 12 biotic stresses, the cis/trans-targets of DElncRNAs, the GO and KEGG analysis results of DEmRNAs, and the targets of DElncRNAs. The PotatoBSLnc platform provides researchers with detailed information on potato lncRNAs and mRNAs under biotic stress, which can speed up the breeding of resistant varieties based on the molecular methods. Database URL: https://www.sdklab-biophysics-dzu.net/PotatoBSLnc.

生物胁迫对马铃薯(Solanum tuberosum L.)产量有显著影响。长链非编码rna (lncRNAs)在植物对环境胁迫的反应中起着关键作用。然而,它们在马铃薯抵抗病原体、昆虫和其他生物胁迫中的作用仍不清楚。PotatoBSLnc是一个研究马铃薯lncrna对主要生物胁迫反应的数据库。在此,我们收集了来自26个品种和野生马铃薯的12种生物胁迫的364个RNA测序(RNA-seq)数据。PotatoBSLnc目前包含18 636个lncrna和44 263个mrna。此外,为了选择不同胁迫下的功能性lncRNAs和mrna,我们还进行了差异表达分析,以及与差异表达lncRNAs (DElncRNAs)和差异表达mrna (DEmRNAs)的顺式/反式靶标相关的基因本体(GO)和京都基因与基因组百科全书(KEGG)分析。数据库包含五个模块:首页、浏览、表达、生物压力和下载。其中,“Browse”模块可用于查询RNA-seq数据的详细信息(疾病、培养、器官类型、样品处理等),以及每个lncRNA/mRNA的外显子数目、长度、位置和序列。“Expression”模块可用于查询不同RNA-seq数据下lncRNAs/ mrna的转录本/原始计数值。“生物胁迫”模块显示了12种生物胁迫下的差异表达分析结果、DElncRNAs的顺式/反式靶标、demrna的GO和KEGG分析结果以及DElncRNAs的靶标。PotatoBSLnc平台为研究人员提供了生物胁迫下马铃薯lncrna和mrna的详细信息,可以加快基于分子方法的抗性品种的选育。数据库地址:https://www.sdklab-biophysics-dzu.net/PotatoBSLnc。
{"title":"PotatoBSLnc: a curated repository of potato long noncoding RNAs in response to biotic stress.","authors":"Pingping Huang, Weilin Cao, Zhaojun Li, Qingshuai Chen, Guangchao Wang, Bailing Zhou, Jihua Wang","doi":"10.1093/database/baaf015","DOIUrl":"https://doi.org/10.1093/database/baaf015","url":null,"abstract":"<p><p>The biotic stress significantly influences the production of potato (Solanum tuberosum L.) all over the world. Long noncoding RNAs (lncRNAs) play key roles in the plant response to environmental stressors. However, their roles in potato resistance to pathogens, insects, and other biotic stress are still unclear. The PotatoBSLnc is a database for the study of potato lncRNAs in response to major biotic stress. Here, we collected 364 RNA sequencing (RNA-seq) data derived from 12 kinds of biotic stresses in 26 cultivars and wild potatoes. PotatoBSLnc currently contains 18 636 lncRNAs and 44 263 mRNAs. In addition, to select the functional lncRNAs and mRNAs under different stresses, the differential expression analyses and the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses related to the cis/trans-targets of differentially expressed lncRNAs (DElncRNAs) and to the differentially expressed mRNAs (DEmRNAs) were also conducted. The database contains five modules: Home, Browse, Expression, Biotic stress, and Download. Among these, the \"Browse\" module can be used to search detailed information about RNA-seq data (disease, cultivator, organ types, treatment of samples, and others), the exon numbers, length, location, and sequence of each lncRNA/mRNA. The \"Expression\" module can be used to search the transcripts per million/raw count value of lncRNAs/mRNAs at different RNA-seq data. The \"Biotic stress\" module shows the results of differential expression analyses under each of the 12 biotic stresses, the cis/trans-targets of DElncRNAs, the GO and KEGG analysis results of DEmRNAs, and the targets of DElncRNAs. The PotatoBSLnc platform provides researchers with detailed information on potato lncRNAs and mRNAs under biotic stress, which can speed up the breeding of resistant varieties based on the molecular methods. Database URL: https://www.sdklab-biophysics-dzu.net/PotatoBSLnc.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database. 将来自PubTator的人工智能文本挖掘集成到比较毒物基因组学数据库的手动管理工作流程中。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-21 DOI: 10.1093/database/baaf013
Thomas C Wiegers, Allan Peter Davis, Jolene Wiegers, Daniela Sciaky, Fern Barkalow, Brent Wyatt, Melissa Strong, Roy McMorran, Sakib Abrar, Carolyn J Mattingly

The Comparative Toxicogenomics Database (CTD) is a manually curated knowledge- and discovery-base that seeks to advance understanding about the relationship between environmental exposures and human health. CTD's manual curation process extracts from the biomedical literature molecular relationships between chemicals/drugs, genes/proteins, phenotypes, diseases, anatomical terms, and species. These relationships are organized in a highly systematic way in order to make them not only informative but also scientifically computational, enabling inferential hypotheses to be formed to address gaps in understanding. Integral to CTD's functionality is the use of structured, hierarchical ontologies and controlled vocabularies to describe these molecular relationships. Normalizing text (i.e. translating raw text from the literature into these controlled vocabularies) can be a time-consuming process for biocurators. To facilitate the normalization process and improve the efficiency with which our scientists curate the literature, CTD evaluated and integrated into the curation process PubTator 3.0, a state-of-the-art, AI-powered resource which extracts and normalizes from the literature many of the key biomedical concepts CTD curates. Here, we describe CTD's long-standing history with Natural Language Processing (NLP), how this history helped form our objectives for NLP integration, the evaluation of PubTator against our objectives, and the integration of PubTator into CTD's curation workflow. Database URL: https://ctdbase.org.

比较毒物基因组学数据库(CTD)是一个人工管理的知识和发现基础,旨在促进对环境暴露与人类健康之间关系的理解。CTD的人工整理过程从生物医学文献中提取化学物质/药物、基因/蛋白质、表型、疾病、解剖术语和物种之间的分子关系。这些关系以一种高度系统的方式组织起来,以便使它们不仅具有信息性,而且具有科学计算性,从而形成推理假设,以解决理解上的差距。CTD功能的一部分是使用结构化的、分层的本体和受控词汇表来描述这些分子关系。规范化文本(即将原始文本从文献翻译成这些受控词汇表)对于生物馆长来说可能是一个耗时的过程。为了促进规范化过程并提高我们的科学家整理文献的效率,CTD评估了PubTator 3.0,并将其整合到整理过程中。PubTator 3.0是一种最先进的人工智能资源,可以从CTD整理的文献中提取许多关键的生物医学概念并进行规范化。在这里,我们描述了CTD与自然语言处理(NLP)的长期历史,这段历史如何帮助形成我们的NLP集成目标,根据我们的目标评估PubTator,以及将PubTator集成到CTD的策展工作流程中。数据库地址:https://ctdbase.org。
{"title":"Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database.","authors":"Thomas C Wiegers, Allan Peter Davis, Jolene Wiegers, Daniela Sciaky, Fern Barkalow, Brent Wyatt, Melissa Strong, Roy McMorran, Sakib Abrar, Carolyn J Mattingly","doi":"10.1093/database/baaf013","DOIUrl":"10.1093/database/baaf013","url":null,"abstract":"<p><p>The Comparative Toxicogenomics Database (CTD) is a manually curated knowledge- and discovery-base that seeks to advance understanding about the relationship between environmental exposures and human health. CTD's manual curation process extracts from the biomedical literature molecular relationships between chemicals/drugs, genes/proteins, phenotypes, diseases, anatomical terms, and species. These relationships are organized in a highly systematic way in order to make them not only informative but also scientifically computational, enabling inferential hypotheses to be formed to address gaps in understanding. Integral to CTD's functionality is the use of structured, hierarchical ontologies and controlled vocabularies to describe these molecular relationships. Normalizing text (i.e. translating raw text from the literature into these controlled vocabularies) can be a time-consuming process for biocurators. To facilitate the normalization process and improve the efficiency with which our scientists curate the literature, CTD evaluated and integrated into the curation process PubTator 3.0, a state-of-the-art, AI-powered resource which extracts and normalizes from the literature many of the key biomedical concepts CTD curates. Here, we describe CTD's long-standing history with Natural Language Processing (NLP), how this history helped form our objectives for NLP integration, the evaluation of PubTator against our objectives, and the integration of PubTator into CTD's curation workflow. Database URL: https://ctdbase.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844237/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143472396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database. 将来自PubTator的人工智能文本挖掘集成到比较毒物基因组学数据库的手动管理工作流程中。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-21 DOI: 10.1093/database/baaf013
Thomas C Wiegers, Allan Peter Davis, Jolene Wiegers, Daniela Sciaky, Fern Barkalow, Brent Wyatt, Melissa Strong, Roy McMorran, Sakib Abrar, Carolyn J Mattingly

The Comparative Toxicogenomics Database (CTD) is a manually curated knowledge- and discovery-base that seeks to advance understanding about the relationship between environmental exposures and human health. CTD's manual curation process extracts from the biomedical literature molecular relationships between chemicals/drugs, genes/proteins, phenotypes, diseases, anatomical terms, and species. These relationships are organized in a highly systematic way in order to make them not only informative but also scientifically computational, enabling inferential hypotheses to be formed to address gaps in understanding. Integral to CTD's functionality is the use of structured, hierarchical ontologies and controlled vocabularies to describe these molecular relationships. Normalizing text (i.e. translating raw text from the literature into these controlled vocabularies) can be a time-consuming process for biocurators. To facilitate the normalization process and improve the efficiency with which our scientists curate the literature, CTD evaluated and integrated into the curation process PubTator 3.0, a state-of-the-art, AI-powered resource which extracts and normalizes from the literature many of the key biomedical concepts CTD curates. Here, we describe CTD's long-standing history with Natural Language Processing (NLP), how this history helped form our objectives for NLP integration, the evaluation of PubTator against our objectives, and the integration of PubTator into CTD's curation workflow. Database URL: https://ctdbase.org.

比较毒物基因组学数据库(CTD)是一个人工管理的知识和发现基础,旨在促进对环境暴露与人类健康之间关系的理解。CTD的人工整理过程从生物医学文献中提取化学物质/药物、基因/蛋白质、表型、疾病、解剖术语和物种之间的分子关系。这些关系以一种高度系统的方式组织起来,以便使它们不仅具有信息性,而且具有科学计算性,从而形成推理假设,以解决理解上的差距。CTD功能的一部分是使用结构化的、分层的本体和受控词汇表来描述这些分子关系。规范化文本(即将原始文本从文献翻译成这些受控词汇表)对于生物馆长来说可能是一个耗时的过程。为了促进规范化过程并提高我们的科学家整理文献的效率,CTD评估了PubTator 3.0,并将其整合到整理过程中。PubTator 3.0是一种最先进的人工智能资源,可以从CTD整理的文献中提取许多关键的生物医学概念并进行规范化。在这里,我们描述了CTD与自然语言处理(NLP)的长期历史,这段历史如何帮助形成我们的NLP集成目标,根据我们的目标评估PubTator,以及将PubTator集成到CTD的策展工作流程中。数据库地址:https://ctdbase.org。
{"title":"Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database.","authors":"Thomas C Wiegers, Allan Peter Davis, Jolene Wiegers, Daniela Sciaky, Fern Barkalow, Brent Wyatt, Melissa Strong, Roy McMorran, Sakib Abrar, Carolyn J Mattingly","doi":"10.1093/database/baaf013","DOIUrl":"https://doi.org/10.1093/database/baaf013","url":null,"abstract":"<p><p>The Comparative Toxicogenomics Database (CTD) is a manually curated knowledge- and discovery-base that seeks to advance understanding about the relationship between environmental exposures and human health. CTD's manual curation process extracts from the biomedical literature molecular relationships between chemicals/drugs, genes/proteins, phenotypes, diseases, anatomical terms, and species. These relationships are organized in a highly systematic way in order to make them not only informative but also scientifically computational, enabling inferential hypotheses to be formed to address gaps in understanding. Integral to CTD's functionality is the use of structured, hierarchical ontologies and controlled vocabularies to describe these molecular relationships. Normalizing text (i.e. translating raw text from the literature into these controlled vocabularies) can be a time-consuming process for biocurators. To facilitate the normalization process and improve the efficiency with which our scientists curate the literature, CTD evaluated and integrated into the curation process PubTator 3.0, a state-of-the-art, AI-powered resource which extracts and normalizes from the literature many of the key biomedical concepts CTD curates. Here, we describe CTD's long-standing history with Natural Language Processing (NLP), how this history helped form our objectives for NLP integration, the evaluation of PubTator against our objectives, and the integration of PubTator into CTD's curation workflow. Database URL: https://ctdbase.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LICEDB: light industrial core enzyme database for industrial applications and AI enzyme design. LICEDB:用于工业应用和AI酶设计的轻工业核心酶数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-19 DOI: 10.1093/database/baaf001
Lei Gong, Fufeng Liu, Chuanxi Zhang, Yongfan Ming, Yulan Mou, ZhaoTing Yuan, Haiming Jiang, Bei Gao, Fuping Lu, Lujia Zhang

Enzymes, serving as eco-friendly catalysts, are progressively supplanting traditional chemical catalysts in light industry sectors such as feed, papermaking, textiles, detergents, leather, and sugar production. Despite this advancement, the variability in the performance of natural enzymes and the fragmentation and diversity of existing data formats pose significant challenges to researchers. Furthermore, AI-driven enzyme design is limited by the quality and quantity of available data. To address these issues, we introduce the light industrial core enzyme database (LICEDB), the first database dedicated exclusively to managing and standardizing enzymes for light industry applications. LICEDB, with its integrated modules for data retrieval, similarity analysis, and structural analysis, will enhance the efficient industrial application of enzymes and strengthen AI-driven predictive research, thereby advancing data sharing and utilization in the field of enzyme innovation. Database URL: http://lujialab.org.cn/on-line-databases/.

酶作为环保催化剂,在饲料、造纸、纺织、洗涤剂、皮革和制糖等轻工业领域正逐步取代传统的化学催化剂。尽管取得了这一进展,但天然酶性能的可变性以及现有数据格式的碎片化和多样性给研究人员带来了重大挑战。此外,人工智能驱动的酶设计受到可用数据的质量和数量的限制。为了解决这些问题,我们推出了轻工业核心酶数据库(LICEDB),这是第一个专门用于管理和标准化轻工业应用酶的数据库。LICEDB集成了数据检索、相似度分析和结构分析等模块,将提高酶的高效工业应用,加强ai驱动的预测研究,从而促进酶创新领域的数据共享和利用。数据库地址:http://lujialab.org.cn/on-line-databases/。
{"title":"LICEDB: light industrial core enzyme database for industrial applications and AI enzyme design.","authors":"Lei Gong, Fufeng Liu, Chuanxi Zhang, Yongfan Ming, Yulan Mou, ZhaoTing Yuan, Haiming Jiang, Bei Gao, Fuping Lu, Lujia Zhang","doi":"10.1093/database/baaf001","DOIUrl":"10.1093/database/baaf001","url":null,"abstract":"<p><p>Enzymes, serving as eco-friendly catalysts, are progressively supplanting traditional chemical catalysts in light industry sectors such as feed, papermaking, textiles, detergents, leather, and sugar production. Despite this advancement, the variability in the performance of natural enzymes and the fragmentation and diversity of existing data formats pose significant challenges to researchers. Furthermore, AI-driven enzyme design is limited by the quality and quantity of available data. To address these issues, we introduce the light industrial core enzyme database (LICEDB), the first database dedicated exclusively to managing and standardizing enzymes for light industry applications. LICEDB, with its integrated modules for data retrieval, similarity analysis, and structural analysis, will enhance the efficient industrial application of enzymes and strengthen AI-driven predictive research, thereby advancing data sharing and utilization in the field of enzyme innovation. Database URL: http://lujialab.org.cn/on-line-databases/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11842304/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143467293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LICEDB: light industrial core enzyme database for industrial applications and AI enzyme design. LICEDB:用于工业应用和AI酶设计的轻工业核心酶数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-02-19 DOI: 10.1093/database/baaf001
Lei Gong, Fufeng Liu, Chuanxi Zhang, Yongfan Ming, Yulan Mou, ZhaoTing Yuan, Haiming Jiang, Bei Gao, Fuping Lu, Lujia Zhang

Enzymes, serving as eco-friendly catalysts, are progressively supplanting traditional chemical catalysts in light industry sectors such as feed, papermaking, textiles, detergents, leather, and sugar production. Despite this advancement, the variability in the performance of natural enzymes and the fragmentation and diversity of existing data formats pose significant challenges to researchers. Furthermore, AI-driven enzyme design is limited by the quality and quantity of available data. To address these issues, we introduce the light industrial core enzyme database (LICEDB), the first database dedicated exclusively to managing and standardizing enzymes for light industry applications. LICEDB, with its integrated modules for data retrieval, similarity analysis, and structural analysis, will enhance the efficient industrial application of enzymes and strengthen AI-driven predictive research, thereby advancing data sharing and utilization in the field of enzyme innovation. Database URL: http://lujialab.org.cn/on-line-databases/.

酶作为环保催化剂,在饲料、造纸、纺织、洗涤剂、皮革和制糖等轻工业领域正逐步取代传统的化学催化剂。尽管取得了这一进展,但天然酶性能的可变性以及现有数据格式的碎片化和多样性给研究人员带来了重大挑战。此外,人工智能驱动的酶设计受到可用数据的质量和数量的限制。为了解决这些问题,我们推出了轻工业核心酶数据库(LICEDB),这是第一个专门用于管理和标准化轻工业应用酶的数据库。LICEDB集成了数据检索、相似度分析和结构分析等模块,将提高酶的高效工业应用,加强ai驱动的预测研究,从而促进酶创新领域的数据共享和利用。数据库地址:http://lujialab.org.cn/on-line-databases/。
{"title":"LICEDB: light industrial core enzyme database for industrial applications and AI enzyme design.","authors":"Lei Gong, Fufeng Liu, Chuanxi Zhang, Yongfan Ming, Yulan Mou, ZhaoTing Yuan, Haiming Jiang, Bei Gao, Fuping Lu, Lujia Zhang","doi":"10.1093/database/baaf001","DOIUrl":"https://doi.org/10.1093/database/baaf001","url":null,"abstract":"<p><p>Enzymes, serving as eco-friendly catalysts, are progressively supplanting traditional chemical catalysts in light industry sectors such as feed, papermaking, textiles, detergents, leather, and sugar production. Despite this advancement, the variability in the performance of natural enzymes and the fragmentation and diversity of existing data formats pose significant challenges to researchers. Furthermore, AI-driven enzyme design is limited by the quality and quantity of available data. To address these issues, we introduce the light industrial core enzyme database (LICEDB), the first database dedicated exclusively to managing and standardizing enzymes for light industry applications. LICEDB, with its integrated modules for data retrieval, similarity analysis, and structural analysis, will enhance the efficient industrial application of enzymes and strengthen AI-driven predictive research, thereby advancing data sharing and utilization in the field of enzyme innovation. Database URL: http://lujialab.org.cn/on-line-databases/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1