首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
DUVEL: an active-learning annotated biomedical corpus for the recognition of oligogenic combinations. DUVEL:用于识别寡基因组合的主动学习注释生物医学语料库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-28 DOI: 10.1093/database/baae039
Charlotte Nachtegael, Jacopo De Stefani, Anthony Cnudde, Tom Lenaerts

While biomedical relation extraction (bioRE) datasets have been instrumental in the development of methods to support biocuration of single variants from texts, no datasets are currently available for the extraction of digenic or even oligogenic variant relations, despite the reports in literature that epistatic effects between combinations of variants in different loci (or genes) are important to understand disease etiologies. This work presents the creation of a unique dataset of oligogenic variant combinations, geared to train tools to help in the curation of scientific literature. To overcome the hurdles associated with the number of unlabelled instances and the cost of expertise, active learning (AL) was used to optimize the annotation, thus getting assistance in finding the most informative subset of samples to label. By pre-annotating 85 full-text articles containing the relevant relations from the Oligogenic Diseases Database (OLIDA) with PubTator, text fragments featuring potential digenic variant combinations, i.e. gene-variant-gene-variant, were extracted. The resulting fragments of texts were annotated with ALAMBIC, an AL-based annotation platform. The resulting dataset, called DUVEL, is used to fine-tune four state-of-the-art biomedical language models: BiomedBERT, BiomedBERT-large, BioLinkBERT and BioM-BERT. More than 500 000 text fragments were considered for annotation, finally resulting in a dataset with 8442 fragments, 794 of them being positive instances, covering 95% of the original annotated articles. When applied to gene-variant pair detection, BiomedBERT-large achieves the highest F1 score (0.84) after fine-tuning, demonstrating significant improvement compared to the non-fine-tuned model, underlining the relevance of the DUVEL dataset. This study shows how AL may play an important role in the creation of bioRE dataset relevant for biomedical curation applications. DUVEL provides a unique biomedical corpus focusing on 4-ary relations between two genes and two variants. It is made freely available for research on GitHub and Hugging Face. Database URL: https://huggingface.co/datasets/cnachteg/duvel or https://doi.org/10.57967/hf/1571.

生物医学关系提取(bioRE)数据集有助于开发支持从文本中提取单个变异体的生物化方法,但尽管有文献报道不同位点(或基因)变异体组合之间的表观效应对了解疾病病因很重要,但目前还没有数据集可用于提取双基因甚至寡基因变异体关系。这项工作展示了一个独特的寡源变异组合数据集的创建过程,该数据集旨在培训有助于科学文献整理的工具。为了克服与未标注实例数量和专业知识成本相关的障碍,我们采用了主动学习(AL)来优化标注,从而帮助找到信息量最大的标注样本子集。通过使用 PubTator 对包含寡核苷酸疾病数据库(OLIDA)中相关关系的 85 篇全文文章进行预标注,提取出具有潜在二基因变异组合(即基因-变异体-基因-变异体)特征的文本片段。由此产生的文本片段使用基于 AL 的注释平台 ALAMBIC 进行注释。得到的数据集称为 DUVEL,用于微调四种最先进的生物医学语言模型:BiomedBERT、BiomedBERT-large、BioLinkBERT 和 BioM-BERT。在标注过程中考虑了 500 000 多个文本片段,最终形成了一个包含 8442 个片段的数据集,其中 794 个为正例,覆盖了原始标注文章的 95%。在应用于基因变异对检测时,BiomedBERT-large 在微调后获得了最高的 F1 分数(0.84),与未微调的模型相比有了显著改善,突出了 DUVEL 数据集的相关性。这项研究显示了 AL 如何在创建生物RE 数据集的过程中发挥重要作用,使其适用于生物医学研究应用。DUVEL 提供了一个独特的生物医学语料库,侧重于两个基因和两个变体之间的 4ary 关系。该语料库在 GitHub 和 Hugging Face 上免费供研究使用。数据库网址:https://huggingface.co/datasets/cnachteg/duvel 或 https://doi.org/10.57967/hf/1571。
{"title":"DUVEL: an active-learning annotated biomedical corpus for the recognition of oligogenic combinations.","authors":"Charlotte Nachtegael, Jacopo De Stefani, Anthony Cnudde, Tom Lenaerts","doi":"10.1093/database/baae039","DOIUrl":"10.1093/database/baae039","url":null,"abstract":"<p><p>While biomedical relation extraction (bioRE) datasets have been instrumental in the development of methods to support biocuration of single variants from texts, no datasets are currently available for the extraction of digenic or even oligogenic variant relations, despite the reports in literature that epistatic effects between combinations of variants in different loci (or genes) are important to understand disease etiologies. This work presents the creation of a unique dataset of oligogenic variant combinations, geared to train tools to help in the curation of scientific literature. To overcome the hurdles associated with the number of unlabelled instances and the cost of expertise, active learning (AL) was used to optimize the annotation, thus getting assistance in finding the most informative subset of samples to label. By pre-annotating 85 full-text articles containing the relevant relations from the Oligogenic Diseases Database (OLIDA) with PubTator, text fragments featuring potential digenic variant combinations, i.e. gene-variant-gene-variant, were extracted. The resulting fragments of texts were annotated with ALAMBIC, an AL-based annotation platform. The resulting dataset, called DUVEL, is used to fine-tune four state-of-the-art biomedical language models: BiomedBERT, BiomedBERT-large, BioLinkBERT and BioM-BERT. More than 500 000 text fragments were considered for annotation, finally resulting in a dataset with 8442 fragments, 794 of them being positive instances, covering 95% of the original annotated articles. When applied to gene-variant pair detection, BiomedBERT-large achieves the highest F1 score (0.84) after fine-tuning, demonstrating significant improvement compared to the non-fine-tuned model, underlining the relevance of the DUVEL dataset. This study shows how AL may play an important role in the creation of bioRE dataset relevant for biomedical curation applications. DUVEL provides a unique biomedical corpus focusing on 4-ary relations between two genes and two variants. It is made freely available for research on GitHub and Hugging Face. Database URL: https://huggingface.co/datasets/cnachteg/duvel or https://doi.org/10.57967/hf/1571.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11131422/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141160776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PDB NextGen Archive: centralizing access to integrated annotations and enriched structural information by the Worldwide Protein Data Bank. PDB NextGen Archive:通过全球蛋白质数据库集中获取综合注释和丰富的结构信息。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-27 DOI: 10.1093/database/baae041
Preeti Choudhary, Zukang Feng, John Berrisford, Henry Chao, Yasuyo Ikegawa, Ezra Peisach, Dennis W Piehl, James Smith, Ahsan Tanweer, Mihaly Varadi, John D Westbrook, Jasmine Y Young, Ardan Patwardhan, Kyle L Morris, Jeffrey C Hoch, Genji Kurisu, Sameer Velankar, Stephen K Burley

The Protein Data Bank (PDB) is the global repository for public-domain experimentally determined 3D biomolecular structural information. The archival nature of the PDB presents certain challenges pertaining to updating or adding associated annotations from trusted external biodata resources. While each Worldwide PDB (wwPDB) partner has made best efforts to provide up-to-date external annotations, accessing and integrating information from disparate wwPDB data centers can be an involved process. To address this issue, the wwPDB has established the PDB Next Generation (or NextGen) Archive, developed to centralize and streamline access to enriched structural annotations from wwPDB partners and trusted external sources. At present, the NextGen Archive provides mappings between experimentally determined 3D structures of proteins and UniProt amino acid sequences, domain annotations from Pfam, SCOP2 and CATH databases and intra-molecular connectivity information. Since launch, the PDB NextGen Archive has seen substantial user engagement with over 3.5 million data file downloads, ensuring researchers have access to accurate, up-to-date and easily accessible structural annotations. Database URL: http://www.wwpdb.org/ftp/pdb-nextgen-archive-site.

蛋白质数据库(PDB)是公共领域实验确定的三维生物分子结构信息的全球存储库。PDB 的存档性质给从可信的外部生物数据资源更新或添加相关注释带来了一定的挑战。虽然全球生物数据库(wwPDB)的每个合作伙伴都尽最大努力提供最新的外部注释,但从不同的wwPDB数据中心访问和整合信息可能是一个复杂的过程。为了解决这个问题,wwPDB 建立了 PDB 下一代(或 NextGen)档案,以集中和简化对来自 wwPDB 合作伙伴和可信外部来源的丰富结构注释的访问。目前,NextGen 档案提供了实验确定的蛋白质三维结构与 UniProt 氨基酸序列之间的映射、来自 Pfam、SCOP2 和 CATH 数据库的结构域注释以及分子内连接性信息。自推出以来,PDB NextGen Archive 的用户参与度很高,数据文件下载量超过 350 万次,确保研究人员能够获得准确、最新且易于访问的结构注释。数据库网址:http://www.wwpdb.org/ftp/pdb-nextgen-archive-site.
{"title":"PDB NextGen Archive: centralizing access to integrated annotations and enriched structural information by the Worldwide Protein Data Bank.","authors":"Preeti Choudhary, Zukang Feng, John Berrisford, Henry Chao, Yasuyo Ikegawa, Ezra Peisach, Dennis W Piehl, James Smith, Ahsan Tanweer, Mihaly Varadi, John D Westbrook, Jasmine Y Young, Ardan Patwardhan, Kyle L Morris, Jeffrey C Hoch, Genji Kurisu, Sameer Velankar, Stephen K Burley","doi":"10.1093/database/baae041","DOIUrl":"10.1093/database/baae041","url":null,"abstract":"<p><p>The Protein Data Bank (PDB) is the global repository for public-domain experimentally determined 3D biomolecular structural information. The archival nature of the PDB presents certain challenges pertaining to updating or adding associated annotations from trusted external biodata resources. While each Worldwide PDB (wwPDB) partner has made best efforts to provide up-to-date external annotations, accessing and integrating information from disparate wwPDB data centers can be an involved process. To address this issue, the wwPDB has established the PDB Next Generation (or NextGen) Archive, developed to centralize and streamline access to enriched structural annotations from wwPDB partners and trusted external sources. At present, the NextGen Archive provides mappings between experimentally determined 3D structures of proteins and UniProt amino acid sequences, domain annotations from Pfam, SCOP2 and CATH databases and intra-molecular connectivity information. Since launch, the PDB NextGen Archive has seen substantial user engagement with over 3.5 million data file downloads, ensuring researchers have access to accurate, up-to-date and easily accessible structural annotations. Database URL: http://www.wwpdb.org/ftp/pdb-nextgen-archive-site.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11130521/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141158130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CoSFISH: a comprehensive reference database of COI and 18S rRNA barcodes for fish. CoSFISH:鱼类 COI 和 18S rRNA 条形码综合参考数据库。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-27 DOI: 10.1093/database/baae038
Yuanyuan Wang, Yexin Yang, Yi Liu, Chao Liu, Meng Xu, Miao Fang, Xidong Mu

Fish, being a crucial component of aquatic ecosystems, holds significant importance from both economic and ecological perspectives. However, the identification of fish at the species level remains challenging, and there is a lack of a taxonomically complete and comprehensive reference sequence database for fish. Therefore, we developed CoSFISH, an online fish database. Currently, the database contains 21 535 cytochrome oxidase I sequences and 1074 18S rRNA sequences of 21 589 species, belonging to 8 classes and 90 orders. We additionally incorporate online analysis tools to aid users in comparing, aligning and analyzing sequences, as well as designing primers. Users can upload their own data for analysis, in addition to using the data stored in the database directly. CoSFISH offers an extensive fish database and incorporates online analysis tools, making it a valuable resource for the study of fish diversity, phylogenetics and biological evolution. Database URL:  http://210.22.121.250:8888/CoSFISH/home/indexPage.

鱼类是水生生态系统的重要组成部分,在经济和生态方面都具有重要意义。然而,鱼类物种水平的鉴定仍然具有挑战性,而且缺乏一个在分类学上完整而全面的鱼类参考序列数据库。因此,我们开发了在线鱼类数据库 CoSFISH。目前,该数据库包含 21 535 个细胞色素氧化酶 I 序列和 1074 个 18S rRNA 序列,涉及 21 589 个物种,分属 8 类 90 目。此外,我们还提供在线分析工具,帮助用户比较、排列和分析序列以及设计引物。用户除了可以直接使用数据库中存储的数据外,还可以上传自己的数据进行分析。CoSFISH 提供了一个庞大的鱼类数据库,并集成了在线分析工具,是研究鱼类多样性、系统发生学和生物进化的宝贵资源。数据库网址:http://210.22.121.250:8888/CoSFISH/home/indexPage。
{"title":"CoSFISH: a comprehensive reference database of COI and 18S rRNA barcodes for fish.","authors":"Yuanyuan Wang, Yexin Yang, Yi Liu, Chao Liu, Meng Xu, Miao Fang, Xidong Mu","doi":"10.1093/database/baae038","DOIUrl":"10.1093/database/baae038","url":null,"abstract":"<p><p>Fish, being a crucial component of aquatic ecosystems, holds significant importance from both economic and ecological perspectives. However, the identification of fish at the species level remains challenging, and there is a lack of a taxonomically complete and comprehensive reference sequence database for fish. Therefore, we developed CoSFISH, an online fish database. Currently, the database contains 21 535 cytochrome oxidase I sequences and 1074 18S rRNA sequences of 21 589 species, belonging to 8 classes and 90 orders. We additionally incorporate online analysis tools to aid users in comparing, aligning and analyzing sequences, as well as designing primers. Users can upload their own data for analysis, in addition to using the data stored in the database directly. CoSFISH offers an extensive fish database and incorporates online analysis tools, making it a valuable resource for the study of fish diversity, phylogenetics and biological evolution. Database URL:  http://210.22.121.250:8888/CoSFISH/home/indexPage.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11130519/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141158172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSGD: a manually curated database of genomic, transcriptomic, proteomic and drug information for multiple sclerosis. MSGD:人工编辑的多发性硬化症基因组、转录组、蛋白质组和药物信息数据库。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-24 DOI: 10.1093/database/baae037
Tao Wu, Yaopan Hou, Guanghao Xin, Jingyan Niu, Shanshan Peng, Fanfan Xu, Ying Li, Yuling Chen, Yifangfei Yu, Huixue Zhang, Xiaotong Kong, Yuze Cao, Shangwei Ning, Lihua Wang, Junwei Hao

Multiple sclerosis (MS) is the most common inflammatory demyelinating disease of the central nervous system. 'Omics' technologies (genomics, transcriptomics, proteomics) and associated drug information have begun reshaping our understanding of multiple sclerosis. However, these data are scattered across numerous references, making them challenging to fully utilize. We manually mined and compiled these data within the Multiple Sclerosis Gene Database (MSGD) database, intending to continue updating it in the future. We screened 5485 publications and constructed the current version of MSGD. MSGD comprises 6255 entries, including 3274 variant entries, 1175 RNA entries, 418 protein entries, 313 knockout entries, 612 drug entries and 463 high-throughput entries. Each entry contains detailed information, such as species, disease type, detailed gene descriptions (such as official gene symbols), and original references. MSGD is freely accessible and provides a user-friendly web interface. Users can easily search for genes of interest, view their expression patterns and detailed information, manage gene sets and submit new MS-gene associations through the platform. The primary principle behind MSGD's design is to provide an exploratory platform, aiming to minimize filtration and interpretation barriers while ensuring highly accessible presentation of data. This initiative is expected to significantly assist researchers in deciphering gene mechanisms and improving the prevention, diagnosis and treatment of MS. Database URL: http://bio-bigdata.hrbmu.edu.cn/MSGD.

多发性硬化症(MS)是中枢神经系统最常见的炎症性脱髓鞘疾病。Omics "技术(基因组学、转录组学、蛋白质组学)和相关药物信息已开始重塑我们对多发性硬化症的认识。然而,这些数据分散在众多参考文献中,因此要充分利用它们具有挑战性。我们在多发性硬化基因数据库(MSGD)数据库中人工挖掘并编译了这些数据,并打算在未来继续更新。我们筛选了 5485 篇文献,构建了当前版本的 MSGD。MSGD 共有 6255 个条目,包括 3274 个变异条目、1175 个 RNA 条目、418 个蛋白质条目、313 个基因敲除条目、612 个药物条目和 463 个高通量条目。每个条目都包含详细信息,如物种、疾病类型、详细的基因描述(如官方基因符号)和原始参考文献。MSGD 可免费访问,并提供用户友好的网络界面。用户可以通过该平台轻松搜索感兴趣的基因,查看其表达模式和详细信息,管理基因集,并提交新的 MS 基因关联。MSGD 设计的主要原则是提供一个探索性平台,旨在最大限度地减少过滤和解释障碍,同时确保数据的高度可访问性。这一举措有望极大地帮助研究人员破译基因机制,改善多发性硬化症的预防、诊断和治疗。数据库网址:http://bio-bigdata.hrbmu.edu.cn/MSGD。
{"title":"MSGD: a manually curated database of genomic, transcriptomic, proteomic and drug information for multiple sclerosis.","authors":"Tao Wu, Yaopan Hou, Guanghao Xin, Jingyan Niu, Shanshan Peng, Fanfan Xu, Ying Li, Yuling Chen, Yifangfei Yu, Huixue Zhang, Xiaotong Kong, Yuze Cao, Shangwei Ning, Lihua Wang, Junwei Hao","doi":"10.1093/database/baae037","DOIUrl":"10.1093/database/baae037","url":null,"abstract":"<p><p>Multiple sclerosis (MS) is the most common inflammatory demyelinating disease of the central nervous system. 'Omics' technologies (genomics, transcriptomics, proteomics) and associated drug information have begun reshaping our understanding of multiple sclerosis. However, these data are scattered across numerous references, making them challenging to fully utilize. We manually mined and compiled these data within the Multiple Sclerosis Gene Database (MSGD) database, intending to continue updating it in the future. We screened 5485 publications and constructed the current version of MSGD. MSGD comprises 6255 entries, including 3274 variant entries, 1175 RNA entries, 418 protein entries, 313 knockout entries, 612 drug entries and 463 high-throughput entries. Each entry contains detailed information, such as species, disease type, detailed gene descriptions (such as official gene symbols), and original references. MSGD is freely accessible and provides a user-friendly web interface. Users can easily search for genes of interest, view their expression patterns and detailed information, manage gene sets and submit new MS-gene associations through the platform. The primary principle behind MSGD's design is to provide an exploratory platform, aiming to minimize filtration and interpretation barriers while ensuring highly accessible presentation of data. This initiative is expected to significantly assist researchers in deciphering gene mechanisms and improving the prevention, diagnosis and treatment of MS. Database URL: http://bio-bigdata.hrbmu.edu.cn/MSGD.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11126313/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141093077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A terpenoids database with the chemical content as a novel agronomic trait. 萜类化合物数据库,其化学成分是一种新的农艺性状。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-22 DOI: 10.1093/database/baae027
Wenqian Li, Yinliang Chen, Ruofei Yang, Zilong Hu, Shaozhong Wei, Sheng Hu, Xinjun Xiong, Meijuan Wang, Ammar Lubeiny, Xiaohua Li, Minglei Feng, Shuang Dong, Xinlu Xie, Chao Nie, Jingyi Zhang, Yunhao Luo, Yichen Zhou, Ruodi Liu, Jinhai Pan, De-Xin Kong, Xuebo Hu

Natural products play a pivotal role in drug discovery, and the richness of natural products, albeit significantly influenced by various environmental factors, is predominantly determined by intrinsic genetics of a series of enzymatic reactions and produced as secondary metabolites of organisms. Heretofore, few natural product-related databases take the chemical content into consideration as a prominent property. To gain unique insights into the quantitative diversity of natural products, we have developed the first TerPenoids database embedded with Content information (TPCN) with features such as compound browsing, structural search, scaffold analysis, similarity analysis and data download. This database can be accessed through a web-based computational toolkit available at http://www.tpcn.pro/. By conducting meticulous manual searches and analyzing over 10 000 reference papers, the TPCN database has successfully integrated 6383 terpenoids obtained from 1254 distinct plant species. The database encompasses exhaustive details including isolation parts, comprehensive molecule structures, chemical abstracts service registry number (CAS number) and 7508 content descriptions. The TPCN database accentuates both the qualitative and quantitative dimensions as invaluable phenotypic characteristics of natural products that have undergone genetic evolution. By acting as an indispensable criterion, the TPCN database facilitates the discovery of drug alternatives with high content and the selection of high-yield medicinal plant species or phylogenetic alternatives, thereby fostering sustainable, cost-effective and environmentally friendly drug discovery in pharmaceutical farming. Database URL: http://www.tpcn.pro/.

天然产物在药物发现中起着举足轻重的作用,天然产物的丰富性尽管受各种环境因素的影响很大,但主要是由一系列酶促反应的内在遗传决定的,并作为生物体的次级代谢产物产生。迄今为止,很少有与天然产物相关的数据库将化学成分作为一个突出的属性加以考虑。为了深入了解天然产物的定量多样性,我们开发了首个嵌入内容信息的 TerPenoids 数据库(TPCN),具有化合物浏览、结构搜索、支架分析、相似性分析和数据下载等功能。该数据库可通过 http://www.tpcn.pro/ 网站上的网络计算工具包访问。通过细致的人工搜索和对 10,000 多篇参考文献的分析,TPCN 数据库已成功整合了从 1254 种不同植物中提取的 6383 种萜类化合物。该数据库包含详尽的详细信息,包括分离部位、全面的分子结构、化学文摘服务登记号(CAS 号)和 7508 项内容描述。TPCN 数据库从定性和定量两个维度强调了经历了遗传进化的天然产品的宝贵表型特征。作为一个不可或缺的标准,TPCN 数据库有助于发现高含量的药物替代品,选择高产药用植物物种或系统发育替代品,从而促进医药农业中可持续、经济高效和环保的药物发现。数据库网址:http://www.tpcn.pro/。
{"title":"A terpenoids database with the chemical content as a novel agronomic trait.","authors":"Wenqian Li, Yinliang Chen, Ruofei Yang, Zilong Hu, Shaozhong Wei, Sheng Hu, Xinjun Xiong, Meijuan Wang, Ammar Lubeiny, Xiaohua Li, Minglei Feng, Shuang Dong, Xinlu Xie, Chao Nie, Jingyi Zhang, Yunhao Luo, Yichen Zhou, Ruodi Liu, Jinhai Pan, De-Xin Kong, Xuebo Hu","doi":"10.1093/database/baae027","DOIUrl":"10.1093/database/baae027","url":null,"abstract":"<p><p>Natural products play a pivotal role in drug discovery, and the richness of natural products, albeit significantly influenced by various environmental factors, is predominantly determined by intrinsic genetics of a series of enzymatic reactions and produced as secondary metabolites of organisms. Heretofore, few natural product-related databases take the chemical content into consideration as a prominent property. To gain unique insights into the quantitative diversity of natural products, we have developed the first TerPenoids database embedded with Content information (TPCN) with features such as compound browsing, structural search, scaffold analysis, similarity analysis and data download. This database can be accessed through a web-based computational toolkit available at http://www.tpcn.pro/. By conducting meticulous manual searches and analyzing over 10 000 reference papers, the TPCN database has successfully integrated 6383 terpenoids obtained from 1254 distinct plant species. The database encompasses exhaustive details including isolation parts, comprehensive molecule structures, chemical abstracts service registry number (CAS number) and 7508 content descriptions. The TPCN database accentuates both the qualitative and quantitative dimensions as invaluable phenotypic characteristics of natural products that have undergone genetic evolution. By acting as an indispensable criterion, the TPCN database facilitates the discovery of drug alternatives with high content and the selection of high-yield medicinal plant species or phylogenetic alternatives, thereby fostering sustainable, cost-effective and environmentally friendly drug discovery in pharmaceutical farming. Database URL: http://www.tpcn.pro/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11110934/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141080781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collecting and managing in situ banana genetic resources information (Musa spp.) using online resources and citizen science. 利用在线资源和公民科学就地收集和管理香蕉遗传资源信息(Musa spp.)
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-22 DOI: 10.1093/database/baae036
Christophe Jenny, Valentin Guignon, Felip Manyer I Ballester, Max Ruas, Mathieu Rouard

The Musa Germplasm Information System (MGIS) stands as a pivotal database for managing global banana genetic resources information. In our latest effort, we have expanded MGIS to incorporate in situ observations. We thus incorporated more than 3000 in situ observations from 133 countries primarily sourced from iNaturalist, GBIF, Flickr, Pl@ntNet, Google Street view and expert curation of the literature. This addition provides a more comprehensive and detailed view of banana diversity and its distribution. Additional graphical interfaces, supported by new Drupal modules, were developed, allowing users to compare banana accessions and explore them based on various filters including taxonomy and geographic location. The integrated maps present a unified view, showcasing both in situ observations and the collecting locations of accessions held in germplasm collections. This enhancement not only broadens the scope of MGIS but also promotes a collaborative and open approach in documenting banana diversity, to allow more effective conservation and use of banana germplasm. Furthermore, this work documents a citizen-science approach that could be relevant for other communities. Database URL: https://www.crop-diversity.org/mgis/musa-in-situ.

穆萨种质信息系统(MGIS)是管理全球香蕉遗传资源信息的重要数据库。在最近的工作中,我们对 MGIS 进行了扩展,纳入了原生境观测数据。因此,我们纳入了来自 133 个国家的 3000 多个原地观测数据,这些数据主要来自 iNaturalist、GBIF、Flickr、Pl@ntNet、谷歌街景以及专家对文献的整理。这一新增内容提供了更全面、更详细的香蕉多样性及其分布情况。在新的 Drupal 模块支持下,还开发了更多的图形界面,使用户能够根据分类和地理位置等各种筛选条件,比较和探索香蕉入选品种。综合地图提供了一个统一的视图,展示了原地观测结果和种质资源库中加入物的采集地点。这一改进不仅扩大了 MGIS 的范围,还促进了以协作和开放的方式记录香蕉多样性,从而更有效地保护和利用香蕉种质。此外,这项工作还记录了一种公民科学方法,可用于其他社区。数据库网址:https://www.crop-diversity.org/mgis/musa-in-situ。
{"title":"Collecting and managing in situ banana genetic resources information (Musa spp.) using online resources and citizen science.","authors":"Christophe Jenny, Valentin Guignon, Felip Manyer I Ballester, Max Ruas, Mathieu Rouard","doi":"10.1093/database/baae036","DOIUrl":"10.1093/database/baae036","url":null,"abstract":"<p><p>The Musa Germplasm Information System (MGIS) stands as a pivotal database for managing global banana genetic resources information. In our latest effort, we have expanded MGIS to incorporate in situ observations. We thus incorporated more than 3000 in situ observations from 133 countries primarily sourced from iNaturalist, GBIF, Flickr, Pl@ntNet, Google Street view and expert curation of the literature. This addition provides a more comprehensive and detailed view of banana diversity and its distribution. Additional graphical interfaces, supported by new Drupal modules, were developed, allowing users to compare banana accessions and explore them based on various filters including taxonomy and geographic location. The integrated maps present a unified view, showcasing both in situ observations and the collecting locations of accessions held in germplasm collections. This enhancement not only broadens the scope of MGIS but also promotes a collaborative and open approach in documenting banana diversity, to allow more effective conservation and use of banana germplasm. Furthermore, this work documents a citizen-science approach that could be relevant for other communities. Database URL: https://www.crop-diversity.org/mgis/musa-in-situ.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11110932/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141080783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CardioHotspots: a database of mutational hotspots for cardiac disorders. CardioHotspots:心脏疾病突变热点数据库。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-15 DOI: 10.1093/database/baae034
Alberto García S, Mireia Costa, Alba García-Zarzoso, Oscar Pastor

Mutational hotspots are DNA regions with an abnormally high frequency of genetic variants. Identifying whether a variant is located in a mutational hotspot is critical for determining the variant's role in disorder predisposition, development, and treatment response. Despite their significance, current databases on mutational hotspots are limited to the oncology domain. However, identifying mutational hotspots is critical for any disorder in which genetics plays a role. This is true for the world's leading cause of death: cardiac disorders. In this work, we present CardioHotspots, a literature-based database of manually curated hotspots for cardiac diseases. This is the only database we know of that provides high-quality and easily accessible information about hotspots associated with cardiac disorders. CardioHotspots is publicly accessible via a web-based platform (https://genomics-hub.pros.dsic.upv.es:3099/). Database URL: https://genomics-hub.pros.dsic.upv.es:3099/.

突变热点是基因变异频率异常高的 DNA 区域。确定一个变体是否位于突变热点,对于确定该变体在疾病易感性、发展和治疗反应中的作用至关重要。尽管突变热点非常重要,但目前有关突变热点的数据库仅限于肿瘤学领域。然而,对于遗传学起作用的任何疾病来说,识别突变热点都是至关重要的。对于世界头号死因--心脏疾病来说,情况也是如此。在这项工作中,我们介绍了 CardioHotspots,这是一个基于文献的、人工策划的心脏疾病热点数据库。据我们所知,这是唯一一个提供与心脏疾病相关热点的高质量且易于访问的数据库。CardioHotspots 可通过网络平台 (https://genomics-hub.pros.dsic.upv.es:3099/) 公开访问。数据库网址:https://genomics-hub.pros.dsic.upv.es:3099/。
{"title":"CardioHotspots: a database of mutational hotspots for cardiac disorders.","authors":"Alberto García S, Mireia Costa, Alba García-Zarzoso, Oscar Pastor","doi":"10.1093/database/baae034","DOIUrl":"10.1093/database/baae034","url":null,"abstract":"<p><p>Mutational hotspots are DNA regions with an abnormally high frequency of genetic variants. Identifying whether a variant is located in a mutational hotspot is critical for determining the variant's role in disorder predisposition, development, and treatment response. Despite their significance, current databases on mutational hotspots are limited to the oncology domain. However, identifying mutational hotspots is critical for any disorder in which genetics plays a role. This is true for the world's leading cause of death: cardiac disorders. In this work, we present CardioHotspots, a literature-based database of manually curated hotspots for cardiac diseases. This is the only database we know of that provides high-quality and easily accessible information about hotspots associated with cardiac disorders. CardioHotspots is publicly accessible via a web-based platform (https://genomics-hub.pros.dsic.upv.es:3099/). Database URL: https://genomics-hub.pros.dsic.upv.es:3099/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":"0"},"PeriodicalIF":5.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11096770/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140944457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PMBC: a manually curated database for prognostic markers of breast cancer. PMBC:人工编辑的乳腺癌预后标志物数据库。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-15 DOI: 10.1093/database/baae033
Jiabei Liu, Yiyi Yu, Mingyue Li, Yixuan Wu, Weijun Chen, Guanru Liu, Lingxian Liu, Jiechun Lin, Chujun Peng, Weijun Sun, Xiaoli Wu, Xin Chen

Breast cancer is notorious for its high mortality and heterogeneity, resulting in different therapeutic responses. Classical biomarkers have been identified and successfully commercially applied to predict the outcome of breast cancer patients. Accumulating biomarkers, including non-coding RNAs, have been reported as prognostic markers for breast cancer with the development of sequencing techniques. However, there are currently no databases dedicated to the curation and characterization of prognostic markers for breast cancer. Therefore, we constructed a curated database for prognostic markers of breast cancer (PMBC). PMBC consists of 1070 markers covering mRNAs, lncRNAs, miRNAs and circRNAs. These markers are enriched in various cancer- and epithelial-related functions including mitogen-activated protein kinases signaling. We mapped the prognostic markers into the ceRNA network from starBase. The lncRNA NEAT1 competes with 11 RNAs, including lncRNAs and mRNAs. The majority of the ceRNAs in ABAT belong to pseudogenes. The topology analysis of the ceRNA network reveals that known prognostic RNAs have higher closeness than random. Among all the biomarkers, prognostic lncRNAs have a higher degree, while prognostic mRNAs have significantly higher closeness than random RNAs. These results indicate that the lncRNAs play important roles in maintaining the interactions between lncRNAs and their ceRNAs, which might be used as a characteristic to prioritize prognostic lncRNAs based on the ceRNA network. PMBC renders a user-friendly interface and provides detailed information about individual prognostic markers, which will facilitate the precision treatment of breast cancer. PMBC is available at the following URL: http://www.pmbreastcancer.com/.

乳腺癌因其死亡率高和异质性导致不同的治疗反应而臭名昭著。经典的生物标志物已被发现并成功应用于商业领域,以预测乳腺癌患者的预后。随着测序技术的发展,包括非编码 RNA 在内的越来越多的生物标志物被报道为乳腺癌的预后标志物。然而,目前还没有专门用于整理和表征乳腺癌预后标志物的数据库。因此,我们构建了一个乳腺癌预后标志物的策划数据库(PMBC)。PMBC 由 1070 个标记物组成,涵盖 mRNA、lncRNA、miRNA 和 circRNA。这些标记物富含各种癌症和上皮相关功能,包括丝裂原活化蛋白激酶信号转导。我们将预后标志物映射到 starBase 的 ceRNA 网络中。lncRNA NEAT1与11种RNA(包括lncRNA和mRNA)竞争。ABAT中的大部分ceRNA属于假基因。ceRNA网络的拓扑分析表明,已知的预后RNA比随机RNA具有更高的亲缘关系。在所有生物标志物中,预后lncRNA的亲和度较高,而预后mRNA的亲和度明显高于随机RNA。这些结果表明,lncRNAs在维持lncRNAs与其ceRNAs之间的相互作用方面发挥着重要作用,这可以作为根据ceRNA网络优先选择预后lncRNAs的特征。PMBC 界面友好,可提供有关单个预后标志物的详细信息,有助于乳腺癌的精准治疗。PMBC的网址如下:http://www.pmbreastcancer.com/。
{"title":"PMBC: a manually curated database for prognostic markers of breast cancer.","authors":"Jiabei Liu, Yiyi Yu, Mingyue Li, Yixuan Wu, Weijun Chen, Guanru Liu, Lingxian Liu, Jiechun Lin, Chujun Peng, Weijun Sun, Xiaoli Wu, Xin Chen","doi":"10.1093/database/baae033","DOIUrl":"10.1093/database/baae033","url":null,"abstract":"<p><p>Breast cancer is notorious for its high mortality and heterogeneity, resulting in different therapeutic responses. Classical biomarkers have been identified and successfully commercially applied to predict the outcome of breast cancer patients. Accumulating biomarkers, including non-coding RNAs, have been reported as prognostic markers for breast cancer with the development of sequencing techniques. However, there are currently no databases dedicated to the curation and characterization of prognostic markers for breast cancer. Therefore, we constructed a curated database for prognostic markers of breast cancer (PMBC). PMBC consists of 1070 markers covering mRNAs, lncRNAs, miRNAs and circRNAs. These markers are enriched in various cancer- and epithelial-related functions including mitogen-activated protein kinases signaling. We mapped the prognostic markers into the ceRNA network from starBase. The lncRNA NEAT1 competes with 11 RNAs, including lncRNAs and mRNAs. The majority of the ceRNAs in ABAT belong to pseudogenes. The topology analysis of the ceRNA network reveals that known prognostic RNAs have higher closeness than random. Among all the biomarkers, prognostic lncRNAs have a higher degree, while prognostic mRNAs have significantly higher closeness than random RNAs. These results indicate that the lncRNAs play important roles in maintaining the interactions between lncRNAs and their ceRNAs, which might be used as a characteristic to prioritize prognostic lncRNAs based on the ceRNA network. PMBC renders a user-friendly interface and provides detailed information about individual prognostic markers, which will facilitate the precision treatment of breast cancer. PMBC is available at the following URL: http://www.pmbreastcancer.com/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11095525/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140944463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pathway-based, reaction-specific annotation of disease variants for elucidation of molecular phenotypes. 基于途径、反应特异性的疾病变异注释,以阐明分子表型。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-07 DOI: 10.1093/database/baae031
Marija Orlic-Milacic, Karen Rothfels, Lisa Matthews, Adam Wright, Bijay Jassal, Veronica Shamovsky, Quang Trinh, Marc E Gillespie, Cristoffer Sevilla, Krishna Tiwari, Eliot Ragueneau, Chuqiao Gong, Ralf Stephan, Bruce May, Robin Haw, Joel Weiser, Deidre Beavers, Patrick Conley, Henning Hermjakob, Lincoln D Stein, Peter D'Eustachio, Guanming Wu

Germline and somatic mutations can give rise to proteins with altered activity, including both gain and loss-of-function. The effects of these variants can be captured in disease-specific reactions and pathways that highlight the resulting changes to normal biology. A disease reaction is defined as an aberrant reaction in which a variant protein participates. A disease pathway is defined as a pathway that contains a disease reaction. Annotation of disease variants as participants of disease reactions and disease pathways can provide a standardized overview of molecular phenotypes of pathogenic variants that is amenable to computational mining and mathematical modeling. Reactome (https://reactome.org/), an open source, manually curated, peer-reviewed database of human biological pathways, in addition to providing annotations for >11 000 unique human proteins in the context of ∼15 000 wild-type reactions within more than 2000 wild-type pathways, also provides annotations for >4000 disease variants of close to 400 genes as participants of ∼800 disease reactions in the context of ∼400 disease pathways. Functional annotation of disease variants proceeds from normal gene functions, described in wild-type reactions and pathways, through disease variants whose divergence from normal molecular behaviors has been experimentally verified, to extrapolation from molecular phenotypes of characterized variants to variants of unknown significance using criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Reactome's data model enables mapping of disease variant datasets to specific disease reactions within disease pathways, providing a platform to infer pathway output impacts of numerous human disease variants and model organism orthologs, complementing computational predictions of variant pathogenicity. Database URL: https://reactome.org/.

基因突变和体细胞突变可导致蛋白质活性改变,包括功能获得和丧失。这些变异的影响可以通过疾病特异性反应和途径来捕捉,从而凸显正常生物学的变化。疾病反应是指变异蛋白参与的异常反应。疾病通路是指包含疾病反应的通路。将疾病变异体标注为疾病反应和疾病通路的参与者,可提供致病变异体分子表型的标准化概览,便于计算挖掘和数学建模。Reactome(https://reactome.org/)是一个开放源码、人工编辑、同行评议的人类生物通路数据库,它除了为2000多条野生型通路中15000个野生型反应背景下的>11000个独特人类蛋白质提供注释外,还为400多条疾病通路中800多个疾病反应背景下近400个基因的>4000个疾病变异体提供注释。疾病变异体的功能注释从野生型反应和通路中描述的正常基因功能开始,经过与正常分子行为不同的疾病变异体的实验验证,再根据美国医学遗传学和基因组学学院以及分子病理学协会的标准,从特征变异体的分子表型推断出意义不明的变异体。Reactome 的数据模型可将疾病变异数据集映射到疾病通路中的特定疾病反应,为推断众多人类疾病变异和模式生物直系同源物的通路输出影响提供了一个平台,补充了对变异致病性的计算预测。数据库网址:https://reactome.org/.
{"title":"Pathway-based, reaction-specific annotation of disease variants for elucidation of molecular phenotypes.","authors":"Marija Orlic-Milacic, Karen Rothfels, Lisa Matthews, Adam Wright, Bijay Jassal, Veronica Shamovsky, Quang Trinh, Marc E Gillespie, Cristoffer Sevilla, Krishna Tiwari, Eliot Ragueneau, Chuqiao Gong, Ralf Stephan, Bruce May, Robin Haw, Joel Weiser, Deidre Beavers, Patrick Conley, Henning Hermjakob, Lincoln D Stein, Peter D'Eustachio, Guanming Wu","doi":"10.1093/database/baae031","DOIUrl":"10.1093/database/baae031","url":null,"abstract":"<p><p>Germline and somatic mutations can give rise to proteins with altered activity, including both gain and loss-of-function. The effects of these variants can be captured in disease-specific reactions and pathways that highlight the resulting changes to normal biology. A disease reaction is defined as an aberrant reaction in which a variant protein participates. A disease pathway is defined as a pathway that contains a disease reaction. Annotation of disease variants as participants of disease reactions and disease pathways can provide a standardized overview of molecular phenotypes of pathogenic variants that is amenable to computational mining and mathematical modeling. Reactome (https://reactome.org/), an open source, manually curated, peer-reviewed database of human biological pathways, in addition to providing annotations for >11 000 unique human proteins in the context of ∼15 000 wild-type reactions within more than 2000 wild-type pathways, also provides annotations for >4000 disease variants of close to 400 genes as participants of ∼800 disease reactions in the context of ∼400 disease pathways. Functional annotation of disease variants proceeds from normal gene functions, described in wild-type reactions and pathways, through disease variants whose divergence from normal molecular behaviors has been experimentally verified, to extrapolation from molecular phenotypes of characterized variants to variants of unknown significance using criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Reactome's data model enables mapping of disease variant datasets to specific disease reactions within disease pathways, providing a platform to infer pathway output impacts of numerous human disease variants and model organism orthologs, complementing computational predictions of variant pathogenicity. Database URL: https://reactome.org/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184451/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140876125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: ESKtides: a comprehensive database and mining method for ESKAPE phage-derived antimicrobial peptides. 更正:ESKtides:ESKAPE噬菌体衍生抗菌肽的综合数据库和挖掘方法。
IF 5.8 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-06 DOI: 10.1093/database/baae035
{"title":"Correction to: ESKtides: a comprehensive database and mining method for ESKAPE phage-derived antimicrobial peptides.","authors":"","doi":"10.1093/database/baae035","DOIUrl":"10.1093/database/baae035","url":null,"abstract":"","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":5.8,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184445/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140891796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1