Pub Date : 2024-07-02DOI: 10.1093/database/baae056
Rayapadi G Swetha, Benita S Arakal, Santhosh Rajendran, K Sekar, David E Whitworth, Sudha Ramaiah, Philip E James, Paul G Livingstone, Anand Anbarasu
Myxobacteria are predatory bacteria with antimicrobial activity, utilizing complex mechanisms to kill their prey and assimilate their macromolecules. Having large genomes encoding hundreds of secondary metabolites, hydrolytic enzymes and antimicrobial peptides, these organisms are widely studied for their antibiotic potential. MyxoPortal is a comprehensive genomic database hosting 262 genomes of myxobacterial strains. Datasets included provide genome annotations with gene locations, functions, amino acids and nucleotide sequences, allowing analysis of evolutionary and taxonomical relationships between strains and genes. Biosynthetic gene clusters are identified by AntiSMASH, and dbAMP-generated antimicrobial peptide sequences are included as a resource for novel antimicrobial discoveries, while curated datasets of CRISPR/Cas genes, regulatory protein sequences, and phage associated genes give useful insights into each strain's biological properties. MyxoPortal is an intuitive open-source database that brings together application-oriented genomic features that can be used in taxonomy, evolution, predation and antimicrobial research. MyxoPortal can be accessed at http://dicsoft1.physics.iisc.ac.in/MyxoPortal/. Database URL: http://dicsoft1.physics.iisc.ac.in/MyxoPortal/. Graphical Abstract.
{"title":"MyxoPortal: a database of myxobacterial genomic features.","authors":"Rayapadi G Swetha, Benita S Arakal, Santhosh Rajendran, K Sekar, David E Whitworth, Sudha Ramaiah, Philip E James, Paul G Livingstone, Anand Anbarasu","doi":"10.1093/database/baae056","DOIUrl":"10.1093/database/baae056","url":null,"abstract":"<p><p>Myxobacteria are predatory bacteria with antimicrobial activity, utilizing complex mechanisms to kill their prey and assimilate their macromolecules. Having large genomes encoding hundreds of secondary metabolites, hydrolytic enzymes and antimicrobial peptides, these organisms are widely studied for their antibiotic potential. MyxoPortal is a comprehensive genomic database hosting 262 genomes of myxobacterial strains. Datasets included provide genome annotations with gene locations, functions, amino acids and nucleotide sequences, allowing analysis of evolutionary and taxonomical relationships between strains and genes. Biosynthetic gene clusters are identified by AntiSMASH, and dbAMP-generated antimicrobial peptide sequences are included as a resource for novel antimicrobial discoveries, while curated datasets of CRISPR/Cas genes, regulatory protein sequences, and phage associated genes give useful insights into each strain's biological properties. MyxoPortal is an intuitive open-source database that brings together application-oriented genomic features that can be used in taxonomy, evolution, predation and antimicrobial research. MyxoPortal can be accessed at http://dicsoft1.physics.iisc.ac.in/MyxoPortal/. Database URL: http://dicsoft1.physics.iisc.ac.in/MyxoPortal/. Graphical Abstract.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11219305/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141491208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-29DOI: 10.1093/database/baae053
Jinfu Peng, Jiacai Yi, Guoping Yang, Zhijun Huang, Dongsheng Cao
Drug transporters, integral membrane proteins found throughout the human body, play critical roles in physiological and biochemical processes through interactions with ligands, such as substrates and inhibitors. The extensive and disparate data on drug transporters complicate understanding their complex relationships with ligands. To address this challenge, it is essential to gather and summarize information on drug transporters, inhibitors and substrates, and simultaneously develop a comprehensive and user-friendly database. Current online resources often provide fragmented information and have limited coverage of drug transporter substrates and inhibitors, highlighting the need for a specialized, comprehensive and openly accessible database. ISTransbase addresses this gap by amassing a substantial amount of data from literature, government documents and open databases. It includes 16 528 inhibitors and 4465 substrates of 163 drug transporters from 18 different species, resulting in a total of 93 841 inhibitor records and 51 053 substrate records. ISTransbase provides detailed insights into drug transporters and their inhibitors/substrates, encompassing transporter and molecule structure, transporter function and distribution, as well as experimental methods and results from transport or inhibition experiments. Furthermore, ISTransbase offers three search strategies that allow users to retrieve drugs and transporters based on multiple selectable constraints, as well as perform checks for drug-drug interactions. Users can also browse and download data. In summary, ISTransbase (https://istransbase.scbdd.com/) serves as a valuable resource for accurately and efficiently accessing information on drug transporter inhibitors and substrates, aiding researchers in exploring drug transporter mechanisms and assisting clinicians in mitigating adverse drug reactions Database URL: https://istransbase.scbdd.com/.
{"title":"ISTransbase: an online database for inhibitor and substrate of drug transporters.","authors":"Jinfu Peng, Jiacai Yi, Guoping Yang, Zhijun Huang, Dongsheng Cao","doi":"10.1093/database/baae053","DOIUrl":"https://doi.org/10.1093/database/baae053","url":null,"abstract":"<p><p>Drug transporters, integral membrane proteins found throughout the human body, play critical roles in physiological and biochemical processes through interactions with ligands, such as substrates and inhibitors. The extensive and disparate data on drug transporters complicate understanding their complex relationships with ligands. To address this challenge, it is essential to gather and summarize information on drug transporters, inhibitors and substrates, and simultaneously develop a comprehensive and user-friendly database. Current online resources often provide fragmented information and have limited coverage of drug transporter substrates and inhibitors, highlighting the need for a specialized, comprehensive and openly accessible database. ISTransbase addresses this gap by amassing a substantial amount of data from literature, government documents and open databases. It includes 16 528 inhibitors and 4465 substrates of 163 drug transporters from 18 different species, resulting in a total of 93 841 inhibitor records and 51 053 substrate records. ISTransbase provides detailed insights into drug transporters and their inhibitors/substrates, encompassing transporter and molecule structure, transporter function and distribution, as well as experimental methods and results from transport or inhibition experiments. Furthermore, ISTransbase offers three search strategies that allow users to retrieve drugs and transporters based on multiple selectable constraints, as well as perform checks for drug-drug interactions. Users can also browse and download data. In summary, ISTransbase (https://istransbase.scbdd.com/) serves as a valuable resource for accurately and efficiently accessing information on drug transporter inhibitors and substrates, aiding researchers in exploring drug transporter mechanisms and assisting clinicians in mitigating adverse drug reactions Database URL: https://istransbase.scbdd.com/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11214160/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141466807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-25DOI: 10.1093/database/baae042
Yichao Zhao, Ju Xiang, Xingyuan Shi, Pengzhen Jia, Yan Zhang, Min Li
Major depressive disorder (MDD) is a pressing global health issue. Its pathogenesis remains elusive, but numerous studies have revealed its intricate associations with various biological factors. Consequently, there is an urgent need for a comprehensive multi-omics resource to help researchers in conducting multi-omics data analysis for MDD. To address this issue, we constructed the MDDOmics database (Major Depressive Disorder Omics, (https://www.csuligroup.com/MDDOmics/), which integrates an extensive collection of published multi-omics data related to MDD. The database contains 41 222 entries of MDD research results and several original datasets, including Single Nucleotide Polymorphisms, genes, non-coding RNAs, DNA methylations, metabolites and proteins, and offers various interfaces for searching and visualization. We also provide extensive downstream analyses of the collected MDD data, including differential analysis, enrichment analysis and disease-gene prediction. Moreover, the database also incorporates multi-omics data for bipolar disorder, schizophrenia and anxiety disorder, due to the challenge in differentiating MDD from similar psychiatric disorders. In conclusion, by leveraging the rich content and online interfaces from MDDOmics, researchers can conduct more comprehensive analyses of MDD and its similar disorders from various perspectives, thereby gaining a deeper understanding of potential MDD biomarkers and intricate disease pathogenesis. Database URL: https://www.csuligroup.com/MDDOmics/.
{"title":"MDDOmics: multi-omics resource of major depressive disorder.","authors":"Yichao Zhao, Ju Xiang, Xingyuan Shi, Pengzhen Jia, Yan Zhang, Min Li","doi":"10.1093/database/baae042","DOIUrl":"10.1093/database/baae042","url":null,"abstract":"<p><p>Major depressive disorder (MDD) is a pressing global health issue. Its pathogenesis remains elusive, but numerous studies have revealed its intricate associations with various biological factors. Consequently, there is an urgent need for a comprehensive multi-omics resource to help researchers in conducting multi-omics data analysis for MDD. To address this issue, we constructed the MDDOmics database (Major Depressive Disorder Omics, (https://www.csuligroup.com/MDDOmics/), which integrates an extensive collection of published multi-omics data related to MDD. The database contains 41 222 entries of MDD research results and several original datasets, including Single Nucleotide Polymorphisms, genes, non-coding RNAs, DNA methylations, metabolites and proteins, and offers various interfaces for searching and visualization. We also provide extensive downstream analyses of the collected MDD data, including differential analysis, enrichment analysis and disease-gene prediction. Moreover, the database also incorporates multi-omics data for bipolar disorder, schizophrenia and anxiety disorder, due to the challenge in differentiating MDD from similar psychiatric disorders. In conclusion, by leveraging the rich content and online interfaces from MDDOmics, researchers can conduct more comprehensive analyses of MDD and its similar disorders from various perspectives, thereby gaining a deeper understanding of potential MDD biomarkers and intricate disease pathogenesis. Database URL: https://www.csuligroup.com/MDDOmics/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11197964/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141450008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-24DOI: 10.1093/database/baae059
{"title":"Correction to: An interactive web application for exploring systemic lupus erythematosus blood transcriptomic diversity.","authors":"","doi":"10.1093/database/baae059","DOIUrl":"10.1093/database/baae059","url":null,"abstract":"","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11197958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141450007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Transcription regulation in multicellular species is mediated by modular transcription factor (TF) binding site combinations termed cis-regulatory modules (CRMs). Such CRM-mediated transcription regulation determines the gene expression patterns during development. Biologists frequently investigate CRM transcription regulation on gene expressions. However, the knowledge of the target genes and regulatory TFs participating in the CRMs under study is mostly fragmentary throughout the literature. Researchers need to afford tremendous human resources to fully surf through the articles deposited in biomedical literature databases in order to obtain the information. Although several novel text-mining systems are now available for literature triaging, these tools do not specifically focus on CRM-related literature prescreening, failing to correctly extract the information of the CRM target genes and regulatory TFs from the literature. For this reason, we constructed a supportive auto-literature prescreener called Drosophila Modular transcription-regulation Literature Screener (DMLS) that achieves the following: (i) prescreens articles describing experiments on modular transcription regulation, (ii) identifies the described target genes and TFs of the CRMs under study for each modular transcription-regulation-describing article and (iii) features an automated and extendable pipeline to perform the task. We demonstrated that the final performance of DMLS in extracting the described target gene and regulatory TF lists of CRMs under study for given articles achieved test macro area under the ROC curve (auROC) = 89.7% and area under the precision-recall curve (auPRC) = 77.6%, outperforming the intuitive gene name-occurrence-counting method by at least 19.9% in auROC and 30.5% in auPRC. The web service and the command line versions of DMLS are available at https://cobis.bme.ncku.edu.tw/DMLS/ and https://github.com/cobisLab/DMLS/, respectively. Database Tool URL: https://cobis.bme.ncku.edu.tw/DMLS/.
{"title":"DMLS: an automated pipeline to extract the Drosophila modular transcription regulators and targets from massive literature articles.","authors":"Tzu-Hsien Yang, Yu-Huai Yu, Sheng-Hang Wu, Fang-Yuan Chang, Hsiu-Chun Tsai, Ya-Chiao Yang","doi":"10.1093/database/baae049","DOIUrl":"10.1093/database/baae049","url":null,"abstract":"<p><p>Transcription regulation in multicellular species is mediated by modular transcription factor (TF) binding site combinations termed cis-regulatory modules (CRMs). Such CRM-mediated transcription regulation determines the gene expression patterns during development. Biologists frequently investigate CRM transcription regulation on gene expressions. However, the knowledge of the target genes and regulatory TFs participating in the CRMs under study is mostly fragmentary throughout the literature. Researchers need to afford tremendous human resources to fully surf through the articles deposited in biomedical literature databases in order to obtain the information. Although several novel text-mining systems are now available for literature triaging, these tools do not specifically focus on CRM-related literature prescreening, failing to correctly extract the information of the CRM target genes and regulatory TFs from the literature. For this reason, we constructed a supportive auto-literature prescreener called Drosophila Modular transcription-regulation Literature Screener (DMLS) that achieves the following: (i) prescreens articles describing experiments on modular transcription regulation, (ii) identifies the described target genes and TFs of the CRMs under study for each modular transcription-regulation-describing article and (iii) features an automated and extendable pipeline to perform the task. We demonstrated that the final performance of DMLS in extracting the described target gene and regulatory TF lists of CRMs under study for given articles achieved test macro area under the ROC curve (auROC) = 89.7% and area under the precision-recall curve (auPRC) = 77.6%, outperforming the intuitive gene name-occurrence-counting method by at least 19.9% in auROC and 30.5% in auPRC. The web service and the command line versions of DMLS are available at https://cobis.bme.ncku.edu.tw/DMLS/ and https://github.com/cobisLab/DMLS/, respectively. Database Tool URL: https://cobis.bme.ncku.edu.tw/DMLS/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":"0"},"PeriodicalIF":3.4,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11188685/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141431635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-19DOI: 10.1093/database/baae050
{"title":"Correction to: A Terpenoids Database with the Chemical Content as A Novel Agronomic Trait.","authors":"","doi":"10.1093/database/baae050","DOIUrl":"10.1093/database/baae050","url":null,"abstract":"","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11186476/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141426574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-12DOI: 10.1093/database/baae048
Chong Peng, Xiaofeng Liu, Xiangbo Meng, Congge Chen, Xinming Wu, Lin Bai, Fuping Lu, Fufeng Liu
Alzheimer's disease (AD) is a universal neurodegenerative disease with the feature of progressive dementia. Currently, there are only seven Food and Drug Administration-approved drugs for the treatment of AD, which merely offer temporary relief from symptom deterioration without reversing the underlying disease process. The identification of inhibitors capable of interacting with proteins associated with AD plays a pivotal role in the development of effective therapeutic interventions. However, a vast number of such inhibitors are dispersed throughout numerous published articles, rendering it inconvenient for researchers to explore potential drug candidates for AD. In light of this, we have manually compiled inhibitors targeting proteins associated with AD and constructed a comprehensive database known as IPAD-DB (Inhibitors of Proteins associated with Alzheimer's Disease Database). The curated inhibitors within this database encompass a diverse range of compounds, including natural compounds, synthetic compounds, drugs, natural extracts and nano-inhibitors. To date, the database has compiled >4800 entries, each representing a correspondent relationship between an inhibitor and its target protein. IPAD-DB offers a user-friendly interface that facilitates browsing, searching and downloading of its records. We firmly believe that IPAD-DB represents a valuable resource for screening potential AD drug candidates and investigating the underlying mechanisms of this debilitating disease. Access to IPAD-DB is freely available at http://www.lamee.cn/ipad-db/ and is compatible with all major web browsers. Database URL: http://www.lamee.cn/ipad-db/.
阿尔茨海默病(AD)是一种以进行性痴呆为特征的神经退行性疾病。目前,只有七种治疗阿尔茨海默病的药物获得了美国食品和药物管理局的批准,这些药物只能暂时缓解症状的恶化,却无法逆转潜在的疾病进程。找到能够与注意力缺失症相关蛋白相互作用的抑制剂对开发有效的治疗干预措施起着关键作用。然而,大量此类抑制剂分散在众多已发表的文章中,给研究人员探索治疗 AD 的潜在候选药物带来了不便。有鉴于此,我们手动汇编了针对与阿尔茨海默病相关蛋白的抑制剂,并构建了一个名为 IPAD-DB(阿尔茨海默病相关蛋白抑制剂数据库)的综合数据库。该数据库中的抑制剂种类繁多,包括天然化合物、合成化合物、药物、天然提取物和纳米抑制剂。迄今为止,该数据库已收录了超过 4800 个条目,每个条目都代表了抑制剂与其靶蛋白之间的对应关系。IPAD-DB 提供友好的用户界面,便于浏览、搜索和下载记录。我们坚信,IPAD-DB 是筛选潜在的 AD 候选药物和研究这种使人衰弱的疾病潜在机制的宝贵资源。访问 IPAD-DB 的免费网址是 http://www.lamee.cn/ipad-db/,并与所有主要网络浏览器兼容。数据库网址:http://www.lamee.cn/ipad-db/。
{"title":"IPAD-DB: a manually curated database for experimentally verified inhibitors of proteins associated with Alzheimer's disease.","authors":"Chong Peng, Xiaofeng Liu, Xiangbo Meng, Congge Chen, Xinming Wu, Lin Bai, Fuping Lu, Fufeng Liu","doi":"10.1093/database/baae048","DOIUrl":"10.1093/database/baae048","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a universal neurodegenerative disease with the feature of progressive dementia. Currently, there are only seven Food and Drug Administration-approved drugs for the treatment of AD, which merely offer temporary relief from symptom deterioration without reversing the underlying disease process. The identification of inhibitors capable of interacting with proteins associated with AD plays a pivotal role in the development of effective therapeutic interventions. However, a vast number of such inhibitors are dispersed throughout numerous published articles, rendering it inconvenient for researchers to explore potential drug candidates for AD. In light of this, we have manually compiled inhibitors targeting proteins associated with AD and constructed a comprehensive database known as IPAD-DB (Inhibitors of Proteins associated with Alzheimer's Disease Database). The curated inhibitors within this database encompass a diverse range of compounds, including natural compounds, synthetic compounds, drugs, natural extracts and nano-inhibitors. To date, the database has compiled >4800 entries, each representing a correspondent relationship between an inhibitor and its target protein. IPAD-DB offers a user-friendly interface that facilitates browsing, searching and downloading of its records. We firmly believe that IPAD-DB represents a valuable resource for screening potential AD drug candidates and investigating the underlying mechanisms of this debilitating disease. Access to IPAD-DB is freely available at http://www.lamee.cn/ipad-db/ and is compatible with all major web browsers. Database URL: http://www.lamee.cn/ipad-db/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168334/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141310308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-12DOI: 10.1093/database/baae043
Leho Tedersoo, Mahdieh S Hosseyni Moghaddam, Vladimir Mikryukov, Ali Hakimzadeh, Mohammad Bahram, R Henrik Nilsson, Iryna Yatsiuk, Stefan Geisen, Arne Schwelm, Kasia Piwosz, Marko Prous, Sirje Sildever, Dominika Chmolowska, Sonja Rueckert, Pavel Skaloud, Peeter Laas, Marco Tines, Jae-Ho Jung, Ji Hye Choi, Saad Alkahtani, Sten Anslan
Molecular identification of micro- and macroorganisms based on nuclear markers has revolutionized our understanding of their taxonomy, phylogeny and ecology. Today, research on the diversity of eukaryotes in global ecosystems heavily relies on nuclear ribosomal RNA (rRNA) markers. Here, we present the research community-curated reference database EUKARYOME for nuclear ribosomal 18S rRNA, internal transcribed spacer (ITS) and 28S rRNA markers for all eukaryotes, including metazoans (animals), protists, fungi and plants. It is particularly useful for the identification of arbuscular mycorrhizal fungi as it bridges the four commonly used molecular markers-ITS1, ITS2, 18S V4-V5 and 28S D1-D2 subregions. The key benefits of this database over other annotated reference sequence databases are that it is not restricted to certain taxonomic groups and it includes all rRNA markers. EUKARYOME also offers a number of reference long-read sequences that are derived from (meta)genomic and (meta)barcoding-a unique feature that can be used for taxonomic identification and chimera control of third-generation, long-read, high-throughput sequencing data. Taxonomic assignments of rRNA genes in the database are verified based on phylogenetic approaches. The reference datasets are available in multiple formats from the project homepage, http://www.eukaryome.org.
{"title":"EUKARYOME: the rRNA gene reference database for identification of all eukaryotes.","authors":"Leho Tedersoo, Mahdieh S Hosseyni Moghaddam, Vladimir Mikryukov, Ali Hakimzadeh, Mohammad Bahram, R Henrik Nilsson, Iryna Yatsiuk, Stefan Geisen, Arne Schwelm, Kasia Piwosz, Marko Prous, Sirje Sildever, Dominika Chmolowska, Sonja Rueckert, Pavel Skaloud, Peeter Laas, Marco Tines, Jae-Ho Jung, Ji Hye Choi, Saad Alkahtani, Sten Anslan","doi":"10.1093/database/baae043","DOIUrl":"10.1093/database/baae043","url":null,"abstract":"<p><p>Molecular identification of micro- and macroorganisms based on nuclear markers has revolutionized our understanding of their taxonomy, phylogeny and ecology. Today, research on the diversity of eukaryotes in global ecosystems heavily relies on nuclear ribosomal RNA (rRNA) markers. Here, we present the research community-curated reference database EUKARYOME for nuclear ribosomal 18S rRNA, internal transcribed spacer (ITS) and 28S rRNA markers for all eukaryotes, including metazoans (animals), protists, fungi and plants. It is particularly useful for the identification of arbuscular mycorrhizal fungi as it bridges the four commonly used molecular markers-ITS1, ITS2, 18S V4-V5 and 28S D1-D2 subregions. The key benefits of this database over other annotated reference sequence databases are that it is not restricted to certain taxonomic groups and it includes all rRNA markers. EUKARYOME also offers a number of reference long-read sequences that are derived from (meta)genomic and (meta)barcoding-a unique feature that can be used for taxonomic identification and chimera control of third-generation, long-read, high-throughput sequencing data. Taxonomic assignments of rRNA genes in the database are verified based on phylogenetic approaches. The reference datasets are available in multiple formats from the project homepage, http://www.eukaryome.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168333/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141310307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-07DOI: 10.1093/database/baae014
Daniel Faria, Patrícia Eugénio, Marta Contreiras Silva, Laura Balbi, Georges Bedran, Ashwin Adrian Kallor, Susana Nunes, Aleksander Palkowski, Michal Waleron, Javier A Alfaro, Catia Pesquita
The adaptive immune response plays a vital role in eliminating infected and aberrant cells from the body. This process hinges on the presentation of short peptides by major histocompatibility complex Class I molecules on the cell surface. Immunopeptidomics, the study of peptides displayed on cells, delves into the wide variety of these peptides. Understanding the mechanisms behind antigen processing and presentation is crucial for effectively evaluating cancer immunotherapies. As an emerging domain, immunopeptidomics currently lacks standardization-there is neither an established terminology nor formally defined semantics-a critical concern considering the complexity, heterogeneity, and growing volume of data involved in immunopeptidomics studies. Additionally, there is a disconnection between how the proteomics community delivers the information about antigen presentation and its uptake by the clinical genomics community. Considering the significant relevance of immunopeptidomics in cancer, this shortcoming must be addressed to bridge the gap between research and clinical practice. In this work, we detail the development of the ImmunoPeptidomics Ontology, ImPO, the first effort at standardizing the terminology and semantics in the domain. ImPO aims to encapsulate and systematize data generated by immunopeptidomics experimental processes and bioinformatics analysis. ImPO establishes cross-references to 24 relevant ontologies, including the National Cancer Institute Thesaurus, Mondo Disease Ontology, Logical Observation Identifier Names and Codes and Experimental Factor Ontology. Although ImPO was developed using expert knowledge to characterize a large and representative data collection, it may be readily used to encode other datasets within the domain. Ultimately, ImPO facilitates data integration and analysis, enabling querying, inference and knowledge generation and importantly bridging the gap between the clinical proteomics and genomics communities. As the field of immunogenomics uses protein-level immunopeptidomics data, we expect ImPO to play a key role in supporting a rich and standardized description of the large-scale data that emerging high-throughput technologies are expected to bring in the near future. Ontology URL: https://zenodo.org/record/10237571 Project GitHub: https://github.com/liseda-lab/ImPO/blob/main/ImPO.owl.
{"title":"The Immunopeptidomics Ontology (ImPO).","authors":"Daniel Faria, Patrícia Eugénio, Marta Contreiras Silva, Laura Balbi, Georges Bedran, Ashwin Adrian Kallor, Susana Nunes, Aleksander Palkowski, Michal Waleron, Javier A Alfaro, Catia Pesquita","doi":"10.1093/database/baae014","DOIUrl":"10.1093/database/baae014","url":null,"abstract":"<p><p>The adaptive immune response plays a vital role in eliminating infected and aberrant cells from the body. This process hinges on the presentation of short peptides by major histocompatibility complex Class I molecules on the cell surface. Immunopeptidomics, the study of peptides displayed on cells, delves into the wide variety of these peptides. Understanding the mechanisms behind antigen processing and presentation is crucial for effectively evaluating cancer immunotherapies. As an emerging domain, immunopeptidomics currently lacks standardization-there is neither an established terminology nor formally defined semantics-a critical concern considering the complexity, heterogeneity, and growing volume of data involved in immunopeptidomics studies. Additionally, there is a disconnection between how the proteomics community delivers the information about antigen presentation and its uptake by the clinical genomics community. Considering the significant relevance of immunopeptidomics in cancer, this shortcoming must be addressed to bridge the gap between research and clinical practice. In this work, we detail the development of the ImmunoPeptidomics Ontology, ImPO, the first effort at standardizing the terminology and semantics in the domain. ImPO aims to encapsulate and systematize data generated by immunopeptidomics experimental processes and bioinformatics analysis. ImPO establishes cross-references to 24 relevant ontologies, including the National Cancer Institute Thesaurus, Mondo Disease Ontology, Logical Observation Identifier Names and Codes and Experimental Factor Ontology. Although ImPO was developed using expert knowledge to characterize a large and representative data collection, it may be readily used to encode other datasets within the domain. Ultimately, ImPO facilitates data integration and analysis, enabling querying, inference and knowledge generation and importantly bridging the gap between the clinical proteomics and genomics communities. As the field of immunogenomics uses protein-level immunopeptidomics data, we expect ImPO to play a key role in supporting a rich and standardized description of the large-scale data that emerging high-throughput technologies are expected to bring in the near future. Ontology URL: https://zenodo.org/record/10237571 Project GitHub: https://github.com/liseda-lab/ImPO/blob/main/ImPO.owl.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11164101/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141300289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-06DOI: 10.1093/database/baae046
Shumin Ren, Lin Yang, Jiale Du, Mengqiao He, Bairong Shen
As a prospective payment method, diagnosis-related groups (DRGs)'s implementation has varying effects on different regions and adopt different case classification systems. Our goal is to build a structured public online knowledgebase describing the worldwide practice of DRGs, which includes systematic indicators for DRGs' performance assessment. Therefore, we manually collected the qualified literature from PUBMED and constructed DRGKB website. We divided the evaluation indicators into four categories, including (i) medical service quality; (ii) medical service efficiency; (iii) profitability and sustainability; (iv) case grouping ability. Then we carried out descriptive analysis and comprehensive scoring on outcome measurements performance, improvement strategy and specialty performance. At last, the DRGKB finally contains 297 entries. It was found that DRGs generally have a considerable impact on hospital operations, including average length of stay, medical quality and use of medical resources. At the same time, the current DRGs also have many deficiencies, including insufficient reimbursement rates and the ability to classify complex cases. We analyzed these underperforming parts by domain. In conclusion, this research innovatively constructed a knowledgebase to quantify the practice effects of DRGs, analyzed and visualized the development trends and area performance from a comprehensive perspective. This study provides a data-driven research paradigm for following DRGs-related work along with a proposed DRGs evolution model. Availability and implementation: DRGKB is freely available at http://www.sysbio.org.cn/drgkb/. Database URL: http://www.sysbio.org.cn/drgkb/.
{"title":"DRGKB: a knowledgebase of worldwide diagnosis-related groups' practices for comparison, evaluation and knowledge-guided application.","authors":"Shumin Ren, Lin Yang, Jiale Du, Mengqiao He, Bairong Shen","doi":"10.1093/database/baae046","DOIUrl":"10.1093/database/baae046","url":null,"abstract":"<p><p>As a prospective payment method, diagnosis-related groups (DRGs)'s implementation has varying effects on different regions and adopt different case classification systems. Our goal is to build a structured public online knowledgebase describing the worldwide practice of DRGs, which includes systematic indicators for DRGs' performance assessment. Therefore, we manually collected the qualified literature from PUBMED and constructed DRGKB website. We divided the evaluation indicators into four categories, including (i) medical service quality; (ii) medical service efficiency; (iii) profitability and sustainability; (iv) case grouping ability. Then we carried out descriptive analysis and comprehensive scoring on outcome measurements performance, improvement strategy and specialty performance. At last, the DRGKB finally contains 297 entries. It was found that DRGs generally have a considerable impact on hospital operations, including average length of stay, medical quality and use of medical resources. At the same time, the current DRGs also have many deficiencies, including insufficient reimbursement rates and the ability to classify complex cases. We analyzed these underperforming parts by domain. In conclusion, this research innovatively constructed a knowledgebase to quantify the practice effects of DRGs, analyzed and visualized the development trends and area performance from a comprehensive perspective. This study provides a data-driven research paradigm for following DRGs-related work along with a proposed DRGs evolution model. Availability and implementation: DRGKB is freely available at http://www.sysbio.org.cn/drgkb/. Database URL: http://www.sysbio.org.cn/drgkb/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11155695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141283277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}