Database: The Journal of Biological Databases and Curation最新文献_第10页

The TOXIN knowledge graph: supporting animal-free risk assessment of cosmetics. 毒素知识图谱：支持化妆品的无动物风险评估。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-28 DOI: 10.1093/database/baae121

Sara Sepehri, Anja Heymans, Dinja De Win, Jan Maushagen, Audrey Sanctorum, Christophe Debruyne, Robim M Rodrigues, Joery De Kock, Vera Rogiers, Olga De Troyer, Tamara Vanhaecke

The European Union's ban on animal testing for cosmetic products and their ingredients, combined with the lack of validated animal-free methods, poses challenges in evaluating their potential repeated-dose organ toxicity. To address this, innovative strategies like Next-Generation Risk Assessment (NGRA) are being explored, integrating historical animal data with new mechanistic insights from non-animal New Approach Methodologies (NAMs). This paper introduces the TOXIN knowledge graph (TOXIN KG), a tool designed to retrieve toxicological information on cosmetic ingredients, with a focus on liver-related data. TOXIN KG uses graph-structured semantic technology and integrates toxicological data through ontologies, ensuring interoperable representation. The primary data source is safety information on cosmetic ingredients from scientific opinions issued by the Scientific Committee on Consumer Safety between 2009 and 2019. The ToxRTool automates the reliability assessment of toxicity studies, while the Simplified Molecular Input Line Entry System (SMILES) notation standardizes chemical identification, enabling in silico prediction of repeated-dose toxicity via the implementation of the Organization for Economic Co-operation and Development Quantitative Structure-Activity Relationship Toolbox (OECD QSAR Toolbox). The ToXic Process Ontology, enriched with relevant biological repositories, is employed to represent toxicological concepts systematically. Search filters allow the identification of cosmetic compounds potentially linked to liver toxicity. Data visualization is achieved through Ontodia, a JavaScript library. TOXIN KG, filled with information for 88 cosmetic ingredients, allowed us to identify 53 compounds affecting at least one liver toxicity parameter in a 90-day repeated-dose animal study. For one compound, we illustrate how TOXIN KG links this observation to hepatic cholestasis as an adverse outcome. In an ab initio NGRA context, follow-up in vitro studies using human-based NAMs would be necessary to understand the compound's biological activity and the molecular mechanism leading to the adverse effect. In summary, TOXIN KG emerges as a valuable tool for advancing the reusability of cosmetics safety data, providing knowledge in support of NAM-based hazard and risk assessments. Database URL: https://toxin-search.netlify.app/.

欧盟禁止对化妆品及其成分进行动物实验，再加上缺乏经过验证的无动物实验方法，这给评估化妆品潜在的重复给药器官毒性带来了挑战。为了解决这个问题，人们正在探索下一代风险评估（NGRA）等创新策略，将历史动物数据与来自非动物新方法方法论（NAMs）的新机制见解相结合。本文介绍了毒素知识图谱（TOXIN KG），一个用于检索化妆品成分毒理学信息的工具，重点是肝脏相关数据。毒素KG使用图结构语义技术，并通过本体集成毒理学数据，确保可互操作表示。主要数据来源是2009年至2019年消费者安全科学委员会发布的科学意见中关于化妆品成分的安全信息。ToxRTool自动化了毒性研究的可靠性评估，而简化分子输入线输入系统（SMILES）符号标准化了化学鉴定，通过实施经济合作与发展组织定量结构-活性关系工具箱（OECD QSAR工具箱），实现了重复剂量毒性的计算机预测。毒性过程本体，丰富了相关的生物资源库，被用来系统地表示毒理学概念。搜索过滤器允许识别可能与肝毒性有关的化妆品化合物。数据可视化是通过JavaScript库Ontodia实现的。毒素KG含有88种化妆品成分的信息，使我们能够在90天的重复给药动物研究中确定53种影响至少一种肝脏毒性参数的化合物。对于一种化合物，我们说明了毒素KG如何将这种观察与肝脏胆汁淤积作为不利结果联系起来。在从头开始的NGRA背景下，有必要使用基于人的NAMs进行后续的体外研究，以了解该化合物的生物活性和导致不良反应的分子机制。总之，毒素KG是促进化妆品安全数据可重复使用的宝贵工具，为支持基于nama的危害和风险评估提供了知识。数据库地址：https://toxin-search.netlify.app/。

{"title":"The TOXIN knowledge graph: supporting animal-free risk assessment of cosmetics.","authors":"Sara Sepehri, Anja Heymans, Dinja De Win, Jan Maushagen, Audrey Sanctorum, Christophe Debruyne, Robim M Rodrigues, Joery De Kock, Vera Rogiers, Olga De Troyer, Tamara Vanhaecke","doi":"10.1093/database/baae121","DOIUrl":"10.1093/database/baae121","url":null,"abstract":"The European Union's ban on animal testing for cosmetic products and their ingredients, combined with the lack of validated animal-free methods, poses challenges in evaluating their potential repeated-dose organ toxicity. To address this, innovative strategies like Next-Generation Risk Assessment (NGRA) are being explored, integrating historical animal data with new mechanistic insights from non-animal New Approach Methodologies (NAMs). This paper introduces the TOXIN knowledge graph (TOXIN KG), a tool designed to retrieve toxicological information on cosmetic ingredients, with a focus on liver-related data. TOXIN KG uses graph-structured semantic technology and integrates toxicological data through ontologies, ensuring interoperable representation. The primary data source is safety information on cosmetic ingredients from scientific opinions issued by the Scientific Committee on Consumer Safety between 2009 and 2019. The ToxRTool automates the reliability assessment of toxicity studies, while the Simplified Molecular Input Line Entry System (SMILES) notation standardizes chemical identification, enabling in silico prediction of repeated-dose toxicity via the implementation of the Organization for Economic Co-operation and Development Quantitative Structure-Activity Relationship Toolbox (OECD QSAR Toolbox). The ToXic Process Ontology, enriched with relevant biological repositories, is employed to represent toxicological concepts systematically. Search filters allow the identification of cosmetic compounds potentially linked to liver toxicity. Data visualization is achieved through Ontodia, a JavaScript library. TOXIN KG, filled with information for 88 cosmetic ingredients, allowed us to identify 53 compounds affecting at least one liver toxicity parameter in a 90-day repeated-dose animal study. For one compound, we illustrate how TOXIN KG links this observation to hepatic cholestasis as an adverse outcome. In an ab initio NGRA context, follow-up in vitro studies using human-based NAMs would be necessary to understand the compound's biological activity and the molecular mechanism leading to the adverse effect. In summary, TOXIN KG emerges as a valuable tool for advancing the reusability of cosmetics safety data, providing knowledge in support of NAM-based hazard and risk assessments. Database URL: https://toxin-search.netlify.app/.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11776536/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143064250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The TOXIN knowledge graph: supporting animal-free risk assessment of cosmetics. 毒素知识图谱：支持化妆品的无动物风险评估。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-28 DOI: 10.1093/database/baae121

Sara Sepehri, Anja Heymans, Dinja De Win, Jan Maushagen, Audrey Sanctorum, Christophe Debruyne, Robim M Rodrigues, Joery De Kock, Vera Rogiers, Olga De Troyer, Tamara Vanhaecke

The European Union's ban on animal testing for cosmetic products and their ingredients, combined with the lack of validated animal-free methods, poses challenges in evaluating their potential repeated-dose organ toxicity. To address this, innovative strategies like Next-Generation Risk Assessment (NGRA) are being explored, integrating historical animal data with new mechanistic insights from non-animal New Approach Methodologies (NAMs). This paper introduces the TOXIN knowledge graph (TOXIN KG), a tool designed to retrieve toxicological information on cosmetic ingredients, with a focus on liver-related data. TOXIN KG uses graph-structured semantic technology and integrates toxicological data through ontologies, ensuring interoperable representation. The primary data source is safety information on cosmetic ingredients from scientific opinions issued by the Scientific Committee on Consumer Safety between 2009 and 2019. The ToxRTool automates the reliability assessment of toxicity studies, while the Simplified Molecular Input Line Entry System (SMILES) notation standardizes chemical identification, enabling in silico prediction of repeated-dose toxicity via the implementation of the Organization for Economic Co-operation and Development Quantitative Structure-Activity Relationship Toolbox (OECD QSAR Toolbox). The ToXic Process Ontology, enriched with relevant biological repositories, is employed to represent toxicological concepts systematically. Search filters allow the identification of cosmetic compounds potentially linked to liver toxicity. Data visualization is achieved through Ontodia, a JavaScript library. TOXIN KG, filled with information for 88 cosmetic ingredients, allowed us to identify 53 compounds affecting at least one liver toxicity parameter in a 90-day repeated-dose animal study. For one compound, we illustrate how TOXIN KG links this observation to hepatic cholestasis as an adverse outcome. In an ab initio NGRA context, follow-up in vitro studies using human-based NAMs would be necessary to understand the compound's biological activity and the molecular mechanism leading to the adverse effect. In summary, TOXIN KG emerges as a valuable tool for advancing the reusability of cosmetics safety data, providing knowledge in support of NAM-based hazard and risk assessments. Database URL: https://toxin-search.netlify.app/.

欧盟禁止对化妆品及其成分进行动物实验，再加上缺乏经过验证的无动物实验方法，这给评估化妆品潜在的重复给药器官毒性带来了挑战。为了解决这个问题，人们正在探索下一代风险评估（NGRA）等创新策略，将历史动物数据与来自非动物新方法方法论（NAMs）的新机制见解相结合。本文介绍了毒素知识图谱（TOXIN KG），一个用于检索化妆品成分毒理学信息的工具，重点是肝脏相关数据。毒素KG使用图结构语义技术，并通过本体集成毒理学数据，确保可互操作表示。主要数据来源是2009年至2019年消费者安全科学委员会发布的科学意见中关于化妆品成分的安全信息。ToxRTool自动化了毒性研究的可靠性评估，而简化分子输入线输入系统（SMILES）符号标准化了化学鉴定，通过实施经济合作与发展组织定量结构-活性关系工具箱（OECD QSAR工具箱），实现了重复剂量毒性的计算机预测。毒性过程本体，丰富了相关的生物资源库，被用来系统地表示毒理学概念。搜索过滤器允许识别可能与肝毒性有关的化妆品化合物。数据可视化是通过JavaScript库Ontodia实现的。毒素KG含有88种化妆品成分的信息，使我们能够在90天的重复给药动物研究中确定53种影响至少一种肝脏毒性参数的化合物。对于一种化合物，我们说明了毒素KG如何将这种观察与肝脏胆汁淤积作为不利结果联系起来。在从头开始的NGRA背景下，有必要使用基于人的NAMs进行后续的体外研究，以了解该化合物的生物活性和导致不良反应的分子机制。总之，毒素KG是促进化妆品安全数据可重复使用的宝贵工具，为支持基于nama的危害和风险评估提供了知识。数据库地址：https://toxin-search.netlify.app/。

{"title":"The TOXIN knowledge graph: supporting animal-free risk assessment of cosmetics.","authors":"Sara Sepehri, Anja Heymans, Dinja De Win, Jan Maushagen, Audrey Sanctorum, Christophe Debruyne, Robim M Rodrigues, Joery De Kock, Vera Rogiers, Olga De Troyer, Tamara Vanhaecke","doi":"10.1093/database/baae121","DOIUrl":"https://doi.org/10.1093/database/baae121","url":null,"abstract":"The European Union's ban on animal testing for cosmetic products and their ingredients, combined with the lack of validated animal-free methods, poses challenges in evaluating their potential repeated-dose organ toxicity. To address this, innovative strategies like Next-Generation Risk Assessment (NGRA) are being explored, integrating historical animal data with new mechanistic insights from non-animal New Approach Methodologies (NAMs). This paper introduces the TOXIN knowledge graph (TOXIN KG), a tool designed to retrieve toxicological information on cosmetic ingredients, with a focus on liver-related data. TOXIN KG uses graph-structured semantic technology and integrates toxicological data through ontologies, ensuring interoperable representation. The primary data source is safety information on cosmetic ingredients from scientific opinions issued by the Scientific Committee on Consumer Safety between 2009 and 2019. The ToxRTool automates the reliability assessment of toxicity studies, while the Simplified Molecular Input Line Entry System (SMILES) notation standardizes chemical identification, enabling in silico prediction of repeated-dose toxicity via the implementation of the Organization for Economic Co-operation and Development Quantitative Structure-Activity Relationship Toolbox (OECD QSAR Toolbox). The ToXic Process Ontology, enriched with relevant biological repositories, is employed to represent toxicological concepts systematically. Search filters allow the identification of cosmetic compounds potentially linked to liver toxicity. Data visualization is achieved through Ontodia, a JavaScript library. TOXIN KG, filled with information for 88 cosmetic ingredients, allowed us to identify 53 compounds affecting at least one liver toxicity parameter in a 90-day repeated-dose animal study. For one compound, we illustrate how TOXIN KG links this observation to hepatic cholestasis as an adverse outcome. In an ab initio NGRA context, follow-up in vitro studies using human-based NAMs would be necessary to understand the compound's biological activity and the molecular mechanism leading to the adverse effect. In summary, TOXIN KG emerges as a valuable tool for advancing the reusability of cosmetics safety data, providing knowledge in support of NAM-based hazard and risk assessments. Database URL: https://toxin-search.netlify.app/.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

BuffExDb: web-based tissue-specific gene expression resource for breeding and conservation programmes in Bubalus bubalis. Bubalus bubalis的组织特异性基因表达资源。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-24 DOI: 10.1093/database/baae128

Naina Kumari, Samir Kumar, Anupama Roy, Princy Saini, Sarika Jaiswal, Mir Asif Iquebal, Ulavappa B Angadi, Dinesh Kumar

Amidst the global challenge of extreme poverty, the livestock sector can significantly contribute to global sustainable development goals by enhancing resilience, smallholder productivity, and market participation. The Indian livestock sector is one of the largest in the world with a total livestock population of 535.82 million, ∼10.7% of the world's livestock population. Buffalo (Bubalus bubalis) holds significant importance in India and other Asian countries, notably contributing to their economies by surpassing cattle in milk production and providing various valuable products. The limited availability of genomic and transcriptomic resources for buffaloes hinders the efforts to enhance their traits for increased milk and meat production. To address this gap, this study adopted the state-of-the-art bioinformatics tools to analyse 2429 transcriptomes representing 438 BioSamples from 23 BioProjects obtained from a public domain database, representing 76 different types of tissues and cell types from all major organ systems in buffalo species (river and swamp). The outcome of this exhaustive genomic data led to the development of a relational buffalo expression database based on a three-tier architecture named as BuffExDb (http://46.202.167.198/buffex/). The user-friendliness and flexibilities in retrieval of tissue-specific genes (TSGs) and their functional annotation are the major characteristics of BuffExDb. This is the first of its kind that offers an effortlessly navigable and filterable database, enabling users to examine and visualize the expression levels of each tissue across multiple samples, simultaneously. It also provides the Tau score parameter for the identification of TSGs along with their essential roles in tissue development, maintenance, and function as observed through the enrichment test for gene ontologies. The exhaustive outcome of this work would pave the way for the biological, functional, and evolutionary studies for easy access. This prior information based on tissue-specific mechanisms can be used for future genomic research, especially in association studies in endeavour of enhanced buffalo breeding and conservation programmes. Database URL: http://46.202.167.198/buffex/.

在极端贫困的全球挑战中，畜牧业可以通过提高抵御力、小农生产力和市场参与度，为全球可持续发展目标做出重大贡献。印度畜牧业是世界上最大的畜牧业之一，牲畜总数为5.3582亿，占世界牲畜总数的10.7%。水牛（Bubalus bubalis）在印度和其他亚洲国家占有重要地位，特别是通过在牛奶产量方面超过牛，并提供各种有价值的产品，对他们的经济做出了贡献。水牛基因组和转录组学资源的有限性阻碍了提高其性状以增加牛奶和肉类产量的努力。为了解决这一差距，本研究采用了最先进的生物信息学工具，分析了来自公共领域数据库的23个生物项目的2429个转录组，代表了438个生物样本，代表了水牛物种（河流和沼泽）所有主要器官系统的76种不同类型的组织和细胞类型。这种详尽的基因组数据的结果导致了基于三层架构的关系水牛表达式数据库的开发，该数据库名为BuffExDb （http://46.202.167.198/buffex/）。在检索组织特异性基因（tsg）及其功能注释方面的易用性和灵活性是BuffExDb的主要特点。这是同类中第一个提供轻松导航和可过滤数据库的，使用户能够同时检查和可视化多个样本中每个组织的表达水平。它还提供了Tau评分参数，用于识别tsg及其在组织发育，维持和功能中的重要作用，通过基因本体的富集测试观察到。这项工作的详尽结果将为生物、功能和进化研究铺平道路。这种基于组织特异性机制的先验信息可用于未来的基因组研究，特别是在努力加强水牛繁殖和保护计划的关联研究中。数据库地址：http://46.202.167.198/buffex/。

{"title":"BuffExDb: web-based tissue-specific gene expression resource for breeding and conservation programmes in Bubalus bubalis.","authors":"Naina Kumari, Samir Kumar, Anupama Roy, Princy Saini, Sarika Jaiswal, Mir Asif Iquebal, Ulavappa B Angadi, Dinesh Kumar","doi":"10.1093/database/baae128","DOIUrl":"10.1093/database/baae128","url":null,"abstract":"Amidst the global challenge of extreme poverty, the livestock sector can significantly contribute to global sustainable development goals by enhancing resilience, smallholder productivity, and market participation. The Indian livestock sector is one of the largest in the world with a total livestock population of 535.82 million, ∼10.7% of the world's livestock population. Buffalo (Bubalus bubalis) holds significant importance in India and other Asian countries, notably contributing to their economies by surpassing cattle in milk production and providing various valuable products. The limited availability of genomic and transcriptomic resources for buffaloes hinders the efforts to enhance their traits for increased milk and meat production. To address this gap, this study adopted the state-of-the-art bioinformatics tools to analyse 2429 transcriptomes representing 438 BioSamples from 23 BioProjects obtained from a public domain database, representing 76 different types of tissues and cell types from all major organ systems in buffalo species (river and swamp). The outcome of this exhaustive genomic data led to the development of a relational buffalo expression database based on a three-tier architecture named as BuffExDb (http://46.202.167.198/buffex/). The user-friendliness and flexibilities in retrieval of tissue-specific genes (TSGs) and their functional annotation are the major characteristics of BuffExDb. This is the first of its kind that offers an effortlessly navigable and filterable database, enabling users to examine and visualize the expression levels of each tissue across multiple samples, simultaneously. It also provides the Tau score parameter for the identification of TSGs along with their essential roles in tissue development, maintenance, and function as observed through the enrichment test for gene ontologies. The exhaustive outcome of this work would pave the way for the biological, functional, and evolutionary studies for easy access. This prior information based on tissue-specific mechanisms can be used for future genomic research, especially in association studies in endeavour of enhanced buffalo breeding and conservation programmes. Database URL: http://46.202.167.198/buffex/.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11758923/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143032529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

BuffExDb: web-based tissue-specific gene expression resource for breeding and conservation programmes in Bubalus bubalis. Bubalus bubalis的组织特异性基因表达资源。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-24 DOI: 10.1093/database/baae128

Naina Kumari, Samir Kumar, Anupama Roy, Princy Saini, Sarika Jaiswal, Mir Asif Iquebal, Ulavappa B Angadi, Dinesh Kumar

Amidst the global challenge of extreme poverty, the livestock sector can significantly contribute to global sustainable development goals by enhancing resilience, smallholder productivity, and market participation. The Indian livestock sector is one of the largest in the world with a total livestock population of 535.82 million, ∼10.7% of the world's livestock population. Buffalo (Bubalus bubalis) holds significant importance in India and other Asian countries, notably contributing to their economies by surpassing cattle in milk production and providing various valuable products. The limited availability of genomic and transcriptomic resources for buffaloes hinders the efforts to enhance their traits for increased milk and meat production. To address this gap, this study adopted the state-of-the-art bioinformatics tools to analyse 2429 transcriptomes representing 438 BioSamples from 23 BioProjects obtained from a public domain database, representing 76 different types of tissues and cell types from all major organ systems in buffalo species (river and swamp). The outcome of this exhaustive genomic data led to the development of a relational buffalo expression database based on a three-tier architecture named as BuffExDb (http://46.202.167.198/buffex/). The user-friendliness and flexibilities in retrieval of tissue-specific genes (TSGs) and their functional annotation are the major characteristics of BuffExDb. This is the first of its kind that offers an effortlessly navigable and filterable database, enabling users to examine and visualize the expression levels of each tissue across multiple samples, simultaneously. It also provides the Tau score parameter for the identification of TSGs along with their essential roles in tissue development, maintenance, and function as observed through the enrichment test for gene ontologies. The exhaustive outcome of this work would pave the way for the biological, functional, and evolutionary studies for easy access. This prior information based on tissue-specific mechanisms can be used for future genomic research, especially in association studies in endeavour of enhanced buffalo breeding and conservation programmes. Database URL: http://46.202.167.198/buffex/.

在极端贫困的全球挑战中，畜牧业可以通过提高抵御力、小农生产力和市场参与度，为全球可持续发展目标做出重大贡献。印度畜牧业是世界上最大的畜牧业之一，牲畜总数为5.3582亿，占世界牲畜总数的10.7%。水牛（Bubalus bubalis）在印度和其他亚洲国家占有重要地位，特别是通过在牛奶产量方面超过牛，并提供各种有价值的产品，对他们的经济做出了贡献。水牛基因组和转录组学资源的有限性阻碍了提高其性状以增加牛奶和肉类产量的努力。为了解决这一差距，本研究采用了最先进的生物信息学工具，分析了来自公共领域数据库的23个生物项目的2429个转录组，代表了438个生物样本，代表了水牛物种（河流和沼泽）所有主要器官系统的76种不同类型的组织和细胞类型。这种详尽的基因组数据的结果导致了基于三层架构的关系水牛表达式数据库的开发，该数据库名为BuffExDb （http://46.202.167.198/buffex/）。在检索组织特异性基因（tsg）及其功能注释方面的易用性和灵活性是BuffExDb的主要特点。这是同类中第一个提供轻松导航和可过滤数据库的，使用户能够同时检查和可视化多个样本中每个组织的表达水平。它还提供了Tau评分参数，用于识别tsg及其在组织发育，维持和功能中的重要作用，通过基因本体的富集测试观察到。这项工作的详尽结果将为生物、功能和进化研究铺平道路。这种基于组织特异性机制的先验信息可用于未来的基因组研究，特别是在努力加强水牛繁殖和保护计划的关联研究中。数据库地址：http://46.202.167.198/buffex/。

{"title":"BuffExDb: web-based tissue-specific gene expression resource for breeding and conservation programmes in Bubalus bubalis.","authors":"Naina Kumari, Samir Kumar, Anupama Roy, Princy Saini, Sarika Jaiswal, Mir Asif Iquebal, Ulavappa B Angadi, Dinesh Kumar","doi":"10.1093/database/baae128","DOIUrl":"https://doi.org/10.1093/database/baae128","url":null,"abstract":"Amidst the global challenge of extreme poverty, the livestock sector can significantly contribute to global sustainable development goals by enhancing resilience, smallholder productivity, and market participation. The Indian livestock sector is one of the largest in the world with a total livestock population of 535.82 million, ∼10.7% of the world's livestock population. Buffalo (Bubalus bubalis) holds significant importance in India and other Asian countries, notably contributing to their economies by surpassing cattle in milk production and providing various valuable products. The limited availability of genomic and transcriptomic resources for buffaloes hinders the efforts to enhance their traits for increased milk and meat production. To address this gap, this study adopted the state-of-the-art bioinformatics tools to analyse 2429 transcriptomes representing 438 BioSamples from 23 BioProjects obtained from a public domain database, representing 76 different types of tissues and cell types from all major organ systems in buffalo species (river and swamp). The outcome of this exhaustive genomic data led to the development of a relational buffalo expression database based on a three-tier architecture named as BuffExDb (http://46.202.167.198/buffex/). The user-friendliness and flexibilities in retrieval of tissue-specific genes (TSGs) and their functional annotation are the major characteristics of BuffExDb. This is the first of its kind that offers an effortlessly navigable and filterable database, enabling users to examine and visualize the expression levels of each tissue across multiple samples, simultaneously. It also provides the Tau score parameter for the identification of TSGs along with their essential roles in tissue development, maintenance, and function as observed through the enrichment test for gene ontologies. The exhaustive outcome of this work would pave the way for the biological, functional, and evolutionary studies for easy access. This prior information based on tissue-specific mechanisms can be used for future genomic research, especially in association studies in endeavour of enhanced buffalo breeding and conservation programmes. Database URL: http://46.202.167.198/buffex/.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database. 标准化的管道支持并促进了大鼠基因组数据库中不同数据集的集成。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-22 DOI: 10.1093/database/baae132

Jennifer R Smith, Marek A Tutaj, Jyothi Thota, Logan Lamers, Adam C Gibson, Akhilanand Kundurthi, Varun Reddy Gollapally, Kent C Brodie, Stacy Zacher, Stanley J F Laulederkind, G Thomas Hayman, Shur-Jen Wang, Monika Tutaj, Mary L Kaldunski, Mahima Vedi, Wendy M Demos, Jeffrey L De Pons, Melinda R Dwinell, Anne E Kwitek

The Rat Genome Database (RGD) is a multispecies knowledgebase which integrates genetic, multiomic, phenotypic, and disease data across 10 mammalian species. To support cross-species, multiomics studies and to enhance and expand on data manually extracted from the biomedical literature by the RGD team of expert curators, RGD imports and integrates data from multiple sources. These include major databases and a substantial number of domain-specific resources, as well as direct submissions by individual researchers. The incorporation of these diverse datatypes is handled by a growing list of automated import, export, data processing, and quality control pipelines. This article outlines the development over time of a standardized infrastructure for automated RGD pipelines with a summary of key design decisions and a focus on lessons learned.

大鼠基因组数据库（RGD）是一个多物种知识库，集成了10个哺乳动物物种的遗传、多组学、表型和疾病数据。为了支持跨物种、多组学研究，并加强和扩展由RGD专家管理团队手动从生物医学文献中提取的数据，RGD导入并整合了来自多个来源的数据。这些包括主要数据库和大量特定领域的资源，以及个人研究人员直接提交的文件。这些不同数据类型的合并由越来越多的自动化导入、导出、数据处理和质量控制管道来处理。本文概述了自动化RGD管道标准化基础设施的开发过程，并总结了关键的设计决策和经验教训。

{"title":"Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database.","authors":"Jennifer R Smith, Marek A Tutaj, Jyothi Thota, Logan Lamers, Adam C Gibson, Akhilanand Kundurthi, Varun Reddy Gollapally, Kent C Brodie, Stacy Zacher, Stanley J F Laulederkind, G Thomas Hayman, Shur-Jen Wang, Monika Tutaj, Mary L Kaldunski, Mahima Vedi, Wendy M Demos, Jeffrey L De Pons, Melinda R Dwinell, Anne E Kwitek","doi":"10.1093/database/baae132","DOIUrl":"https://doi.org/10.1093/database/baae132","url":null,"abstract":"The Rat Genome Database (RGD) is a multispecies knowledgebase which integrates genetic, multiomic, phenotypic, and disease data across 10 mammalian species. To support cross-species, multiomics studies and to enhance and expand on data manually extracted from the biomedical literature by the RGD team of expert curators, RGD imports and integrates data from multiple sources. These include major databases and a substantial number of domain-specific resources, as well as direct submissions by individual researchers. The incorporation of these diverse datatypes is handled by a growing list of automated import, export, data processing, and quality control pipelines. This article outlines the development over time of a standardized infrastructure for automated RGD pipelines with a summary of key design decisions and a focus on lessons learned.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A change language for ontologies and knowledge graphs. 本体和知识图的变更语言。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-22 DOI: 10.1093/database/baae133

Harshad Hegde, Jennifer Vendetti, Damien Goutte-Gattat, J Harry Caufield, John B Graybeal, Nomi L Harris, Naouel Karam, Christian Kindermann, Nicolas Matentzoglu, James A Overton, Mark A Musen, Christopher J Mungall

Ontologies and knowledge graphs (KGs) are general-purpose computable representations of some domain, such as human anatomy, and are frequently a crucial part of modern information systems. Most of these structures change over time, incorporating new knowledge or information that was previously missing. Managing these changes is a challenge, both in terms of communicating changes to users and providing mechanisms to make it easier for multiple stakeholders to contribute. To fill that need, we have created KGCL, the Knowledge Graph Change Language (https://github.com/INCATools/kgcl), a standard data model for describing changes to KGs and ontologies at a high level, and an accompanying human-readable Controlled Natural Language (CNL). This language serves two purposes: a curator can use it to request desired changes, and it can also be used to describe changes that have already happened, corresponding to the concepts of "apply patch" and "diff" commonly used for managing changes in text documents and computer programs. Another key feature of KGCL is that descriptions are at a high enough level to be useful and understood by a variety of stakeholders-e.g. ontology edits can be specified by commands like "add synonym 'arm' to 'forelimb'" or "move 'Parkinson disease' under 'neurodegenerative disease'." We have also built a suite of tools for managing ontology changes. These include an automated agent that integrates with and monitors GitHub ontology repositories and applies any requested changes and a new component in the BioPortal ontology resource that allows users to make change requests directly from within the BioPortal user interface. Overall, the KGCL data model, its CNL, and associated tooling allow for easier management and processing of changes associated with the development of ontologies and KGs. Database URL: https://github.com/INCATools/kgcl.

本体和知识图（KGs）是某些领域的通用可计算表示，例如人体解剖学，并且通常是现代信息系统的关键部分。这些结构中的大多数随着时间的推移而变化，吸收了以前缺失的新知识或信息。管理这些更改是一项挑战，既要与用户沟通更改，又要提供使多个涉众更容易做出贡献的机制。为了满足这一需求，我们创建了KGCL，知识图谱变更语言（https://github.com/INCATools/kgcl），这是一种用于在高层次上描述知识图谱和本体变更的标准数据模型，以及伴随的人类可读的受控自然语言（CNL）。这种语言有两个用途：管理员可以用它来请求所需的更改，也可以用它来描述已经发生的更改，这与通常用于管理文本文档和计算机程序中的更改的“应用补丁”和“diff”概念相对应。KGCL的另一个关键特性是描述的层次足够高，可以被各种涉众使用和理解。本体编辑可以通过“将同义词‘手臂’添加到‘前肢’”或“将‘帕金森病’移到‘神经退行性疾病’下”等命令来指定。我们还构建了一套用于管理本体更改的工具。其中包括一个自动化代理，它集成并监视GitHub本体存储库，并应用任何请求的更改，以及biopportal本体资源中的一个新组件，该组件允许用户直接从biopportal用户界面中发出更改请求。总的来说，KGCL数据模型、它的CNL和相关的工具允许更容易地管理和处理与本体和KGs开发相关的更改。

{"title":"A change language for ontologies and knowledge graphs.","authors":"Harshad Hegde, Jennifer Vendetti, Damien Goutte-Gattat, J Harry Caufield, John B Graybeal, Nomi L Harris, Naouel Karam, Christian Kindermann, Nicolas Matentzoglu, James A Overton, Mark A Musen, Christopher J Mungall","doi":"10.1093/database/baae133","DOIUrl":"https://doi.org/10.1093/database/baae133","url":null,"abstract":"Ontologies and knowledge graphs (KGs) are general-purpose computable representations of some domain, such as human anatomy, and are frequently a crucial part of modern information systems. Most of these structures change over time, incorporating new knowledge or information that was previously missing. Managing these changes is a challenge, both in terms of communicating changes to users and providing mechanisms to make it easier for multiple stakeholders to contribute. To fill that need, we have created KGCL, the Knowledge Graph Change Language (https://github.com/INCATools/kgcl), a standard data model for describing changes to KGs and ontologies at a high level, and an accompanying human-readable Controlled Natural Language (CNL). This language serves two purposes: a curator can use it to request desired changes, and it can also be used to describe changes that have already happened, corresponding to the concepts of \"apply patch\" and \"diff\" commonly used for managing changes in text documents and computer programs. Another key feature of KGCL is that descriptions are at a high enough level to be useful and understood by a variety of stakeholders-e.g. ontology edits can be specified by commands like \"add synonym 'arm' to 'forelimb'\" or \"move 'Parkinson disease' under 'neurodegenerative disease'.\" We have also built a suite of tools for managing ontology changes. These include an automated agent that integrates with and monitors GitHub ontology repositories and applies any requested changes and a new component in the BioPortal ontology resource that allows users to make change requests directly from within the BioPortal user interface. Overall, the KGCL data model, its CNL, and associated tooling allow for easier management and processing of changes associated with the development of ontologies and KGs. Database URL: https://github.com/INCATools/kgcl.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A change language for ontologies and knowledge graphs. 本体和知识图的变更语言。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-22 DOI: 10.1093/database/baae133

Harshad Hegde, Jennifer Vendetti, Damien Goutte-Gattat, J Harry Caufield, John B Graybeal, Nomi L Harris, Naouel Karam, Christian Kindermann, Nicolas Matentzoglu, James A Overton, Mark A Musen, Christopher J Mungall

Ontologies and knowledge graphs (KGs) are general-purpose computable representations of some domain, such as human anatomy, and are frequently a crucial part of modern information systems. Most of these structures change over time, incorporating new knowledge or information that was previously missing. Managing these changes is a challenge, both in terms of communicating changes to users and providing mechanisms to make it easier for multiple stakeholders to contribute. To fill that need, we have created KGCL, the Knowledge Graph Change Language (https://github.com/INCATools/kgcl), a standard data model for describing changes to KGs and ontologies at a high level, and an accompanying human-readable Controlled Natural Language (CNL). This language serves two purposes: a curator can use it to request desired changes, and it can also be used to describe changes that have already happened, corresponding to the concepts of "apply patch" and "diff" commonly used for managing changes in text documents and computer programs. Another key feature of KGCL is that descriptions are at a high enough level to be useful and understood by a variety of stakeholders-e.g. ontology edits can be specified by commands like "add synonym 'arm' to 'forelimb'" or "move 'Parkinson disease' under 'neurodegenerative disease'." We have also built a suite of tools for managing ontology changes. These include an automated agent that integrates with and monitors GitHub ontology repositories and applies any requested changes and a new component in the BioPortal ontology resource that allows users to make change requests directly from within the BioPortal user interface. Overall, the KGCL data model, its CNL, and associated tooling allow for easier management and processing of changes associated with the development of ontologies and KGs. Database URL: https://github.com/INCATools/kgcl.

本体和知识图（KGs）是某些领域的通用可计算表示，例如人体解剖学，并且通常是现代信息系统的关键部分。这些结构中的大多数随着时间的推移而变化，吸收了以前缺失的新知识或信息。管理这些更改是一项挑战，既要与用户沟通更改，又要提供使多个涉众更容易做出贡献的机制。为了满足这一需求，我们创建了KGCL，知识图谱变更语言（https://github.com/INCATools/kgcl），这是一种用于在高层次上描述知识图谱和本体变更的标准数据模型，以及伴随的人类可读的受控自然语言（CNL）。这种语言有两个用途：管理员可以用它来请求所需的更改，也可以用它来描述已经发生的更改，这与通常用于管理文本文档和计算机程序中的更改的“应用补丁”和“diff”概念相对应。KGCL的另一个关键特性是描述的层次足够高，可以被各种涉众使用和理解。本体编辑可以通过“将同义词‘手臂’添加到‘前肢’”或“将‘帕金森病’移到‘神经退行性疾病’下”等命令来指定。我们还构建了一套用于管理本体更改的工具。其中包括一个自动化代理，它集成并监视GitHub本体存储库，并应用任何请求的更改，以及biopportal本体资源中的一个新组件，该组件允许用户直接从biopportal用户界面中发出更改请求。总的来说，KGCL数据模型、它的CNL和相关的工具允许更容易地管理和处理与本体和KGs开发相关的更改。

{"title":"A change language for ontologies and knowledge graphs.","authors":"Harshad Hegde, Jennifer Vendetti, Damien Goutte-Gattat, J Harry Caufield, John B Graybeal, Nomi L Harris, Naouel Karam, Christian Kindermann, Nicolas Matentzoglu, James A Overton, Mark A Musen, Christopher J Mungall","doi":"10.1093/database/baae133","DOIUrl":"10.1093/database/baae133","url":null,"abstract":"Ontologies and knowledge graphs (KGs) are general-purpose computable representations of some domain, such as human anatomy, and are frequently a crucial part of modern information systems. Most of these structures change over time, incorporating new knowledge or information that was previously missing. Managing these changes is a challenge, both in terms of communicating changes to users and providing mechanisms to make it easier for multiple stakeholders to contribute. To fill that need, we have created KGCL, the Knowledge Graph Change Language (https://github.com/INCATools/kgcl), a standard data model for describing changes to KGs and ontologies at a high level, and an accompanying human-readable Controlled Natural Language (CNL). This language serves two purposes: a curator can use it to request desired changes, and it can also be used to describe changes that have already happened, corresponding to the concepts of \"apply patch\" and \"diff\" commonly used for managing changes in text documents and computer programs. Another key feature of KGCL is that descriptions are at a high enough level to be useful and understood by a variety of stakeholders-e.g. ontology edits can be specified by commands like \"add synonym 'arm' to 'forelimb'\" or \"move 'Parkinson disease' under 'neurodegenerative disease'.\" We have also built a suite of tools for managing ontology changes. These include an automated agent that integrates with and monitors GitHub ontology repositories and applies any requested changes and a new component in the BioPortal ontology resource that allows users to make change requests directly from within the BioPortal user interface. Overall, the KGCL data model, its CNL, and associated tooling allow for easier management and processing of changes associated with the development of ontologies and KGs. Database URL: https://github.com/INCATools/kgcl.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11753292/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143022562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database. 标准化的管道支持并促进了大鼠基因组数据库中不同数据集的集成。

IF 3.4 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-22 DOI: 10.1093/database/baae132

Jennifer R Smith, Marek A Tutaj, Jyothi Thota, Logan Lamers, Adam C Gibson, Akhilanand Kundurthi, Varun Reddy Gollapally, Kent C Brodie, Stacy Zacher, Stanley J F Laulederkind, G Thomas Hayman, Shur-Jen Wang, Monika Tutaj, Mary L Kaldunski, Mahima Vedi, Wendy M Demos, Jeffrey L De Pons, Melinda R Dwinell, Anne E Kwitek

The Rat Genome Database (RGD) is a multispecies knowledgebase which integrates genetic, multiomic, phenotypic, and disease data across 10 mammalian species. To support cross-species, multiomics studies and to enhance and expand on data manually extracted from the biomedical literature by the RGD team of expert curators, RGD imports and integrates data from multiple sources. These include major databases and a substantial number of domain-specific resources, as well as direct submissions by individual researchers. The incorporation of these diverse datatypes is handled by a growing list of automated import, export, data processing, and quality control pipelines. This article outlines the development over time of a standardized infrastructure for automated RGD pipelines with a summary of key design decisions and a focus on lessons learned.

大鼠基因组数据库（RGD）是一个多物种知识库，集成了10个哺乳动物物种的遗传、多组学、表型和疾病数据。为了支持跨物种、多组学研究，并加强和扩展由RGD专家管理团队手动从生物医学文献中提取的数据，RGD导入并整合了来自多个来源的数据。这些包括主要数据库和大量特定领域的资源，以及个人研究人员直接提交的文件。这些不同数据类型的合并由越来越多的自动化导入、导出、数据处理和质量控制管道来处理。本文概述了自动化RGD管道标准化基础设施的开发过程，并总结了关键的设计决策和经验教训。

{"title":"Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database.","authors":"Jennifer R Smith, Marek A Tutaj, Jyothi Thota, Logan Lamers, Adam C Gibson, Akhilanand Kundurthi, Varun Reddy Gollapally, Kent C Brodie, Stacy Zacher, Stanley J F Laulederkind, G Thomas Hayman, Shur-Jen Wang, Monika Tutaj, Mary L Kaldunski, Mahima Vedi, Wendy M Demos, Jeffrey L De Pons, Melinda R Dwinell, Anne E Kwitek","doi":"10.1093/database/baae132","DOIUrl":"10.1093/database/baae132","url":null,"abstract":"The Rat Genome Database (RGD) is a multispecies knowledgebase which integrates genetic, multiomic, phenotypic, and disease data across 10 mammalian species. To support cross-species, multiomics studies and to enhance and expand on data manually extracted from the biomedical literature by the RGD team of expert curators, RGD imports and integrates data from multiple sources. These include major databases and a substantial number of domain-specific resources, as well as direct submissions by individual researchers. The incorporation of these diverse datatypes is handled by a growing list of automated import, export, data processing, and quality control pipelines. This article outlines the development over time of a standardized infrastructure for automated RGD pipelines with a summary of key design decisions and a focus on lessons learned.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11753291/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143022144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A database on the historical and current occurrences of snakes in Eswatini. 关于史瓦蒂尼蛇的历史和当前事件的数据库。

IF 3.6 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-18 DOI: 10.1093/database/baaf040

Ara Monadjem, Richard C Boycott, Thea Litscha-Koen, Adam Kane, Wisdom M Dlamini, Lindelwa Mmema, Katharine L Strutton, Zakhele Hlophe, Sara Padidar

Snakes are among the most difficult terrestrial vertebrates to survey, resulting in poor distributional information on most species. This database comprises of 3812 records of 58 species of snakes in 37 genera reported from within the boundaries of Eswatini. The data were compiled from multiple sources including museum specimens, iNaturalist records, literature records, and snake rescue operations. For each specimen reported in the database, we provide the scientific name, latitude and longitude coordinates, and location. Most records also have an associated date. This comprehensive database will be useful to biodiversity experts, conservationists, medical practitioners, researchers, and snake enthusiasts, especially for mapping and modelling snake distributions in the country. To allow easy viewing of the distribution of snakes in the country, we provide an online visualization tool, which should allow a greater number of non-scientists to utilize this database.

蛇是最难调查的陆地脊椎动物之一，导致大多数物种的分布信息很差。该数据库包括在斯瓦蒂尼境内报道的37属58种蛇的3812条记录。这些数据来自多种来源，包括博物馆标本、自然学家记录、文献记录和蛇救援行动。对于数据库中报告的每个标本，我们提供了学名、经纬度坐标和位置。大多数记录也有一个相关的日期。这个综合数据库将对生物多样性专家、自然资源保护者、医疗从业者、研究人员和蛇爱好者有用，特别是对绘制和模拟该国蛇的分布非常有用。为了方便地查看蛇在该国的分布，我们提供了一个在线可视化工具，这应该允许更多的非科学家使用这个数据库。

{"title":"A database on the historical and current occurrences of snakes in Eswatini.","authors":"Ara Monadjem, Richard C Boycott, Thea Litscha-Koen, Adam Kane, Wisdom M Dlamini, Lindelwa Mmema, Katharine L Strutton, Zakhele Hlophe, Sara Padidar","doi":"10.1093/database/baaf040","DOIUrl":"10.1093/database/baaf040","url":null,"abstract":"Snakes are among the most difficult terrestrial vertebrates to survey, resulting in poor distributional information on most species. This database comprises of 3812 records of 58 species of snakes in 37 genera reported from within the boundaries of Eswatini. The data were compiled from multiple sources including museum specimens, iNaturalist records, literature records, and snake rescue operations. For each specimen reported in the database, we provide the scientific name, latitude and longitude coordinates, and location. Most records also have an associated date. This comprehensive database will be useful to biodiversity experts, conservationists, medical practitioners, researchers, and snake enthusiasts, especially for mapping and modelling snake distributions in the country. To allow easy viewing of the distribution of snakes in the country, we provide an online visualization tool, which should allow a greater number of non-scientists to utilize this database.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":" ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462622/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144583337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Microbe Directory: a centralized database for biological interpretation of microbiome data. 微生物目录：对微生物组数据进行生物学解释的集中数据库。

IF 3.6 4区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Database: The Journal of Biological Databases and Curation

Pub Date : 2025-01-18 DOI: 10.1093/database/baaf060

Maria A Sierra, Krista Ryon, Mohith R Arikatla, Radwa Elshafey, Hardik Bhaskar, Jacqueline Proszynski, Chandrima Bhattacharya, Heba Shaaban, David C Danko, Pradeep Ambrose, Sarah A Spaulding, Maria Mercedes Zambrano, The Microbe Directory Consortium, Christopher E Mason

The Microbe Directory (TMD) is a centralized database of metadata for microbes from all domains that helps with the biological interpretation of metagenomic data. The database comprises phenotypical and ecological traits of microorganisms, which have been verified by independent manual annotations. This effort has been possible by the help of a community of volunteer students worldwide who were trained in manual curation of microbiology data. To summarize this information, we have built an interactive browser that makes the database accessible to everyone, including non-bioinformaticians. We used the TMD data to analyse microbiome samples from different projects such as MetaSUB, TARA Oceans, Human Microbiome Project, and Sponge Microbiome Project, showcasing the utility of TMD. Furthermore, we compare our microbial annotations with annotations collected by artificial intelligence (AI) and demonstrate that despite the high speed of AI in reviewing and collecting microbial data, annotation requires domain knowledge and therefore manual curation. Collectively, TMD provides a unique source of information that can help to interpret microbiome data and uncover biological associations. Database URL: www.themicrobedirectory.com/.

微生物目录（TMD）是一个集中的元数据数据库，包含来自所有领域的微生物，有助于对宏基因组数据进行生物学解释。该数据库包括微生物的表型和生态性状，这些性状已由独立的人工注释验证。这项工作是在世界各地的志愿者学生社区的帮助下实现的，他们接受过微生物数据手工管理方面的培训。为了总结这些信息，我们建立了一个交互式浏览器，使每个人都可以访问数据库，包括非生物信息学家。我们利用TMD数据分析了来自MetaSUB、TARA Oceans、Human microbiome Project和Sponge microbiome Project等不同项目的微生物组样本，展示了TMD的实用性。此外，我们将我们的微生物注释与人工智能（AI）收集的注释进行了比较，并证明尽管人工智能在审查和收集微生物数据方面速度很快，但注释需要领域知识，因此需要人工管理。总的来说，TMD提供了一个独特的信息来源，可以帮助解释微生物组数据并揭示生物学关联。数据库地址：www.themicrobedirectory.com/。

{"title":"The Microbe Directory: a centralized database for biological interpretation of microbiome data.","authors":"Maria A Sierra, Krista Ryon, Mohith R Arikatla, Radwa Elshafey, Hardik Bhaskar, Jacqueline Proszynski, Chandrima Bhattacharya, Heba Shaaban, David C Danko, Pradeep Ambrose, Sarah A Spaulding, Maria Mercedes Zambrano, The Microbe Directory Consortium, Christopher E Mason","doi":"10.1093/database/baaf060","DOIUrl":"10.1093/database/baaf060","url":null,"abstract":"The Microbe Directory (TMD) is a centralized database of metadata for microbes from all domains that helps with the biological interpretation of metagenomic data. The database comprises phenotypical and ecological traits of microorganisms, which have been verified by independent manual annotations. This effort has been possible by the help of a community of volunteer students worldwide who were trained in manual curation of microbiology data. To summarize this information, we have built an interactive browser that makes the database accessible to everyone, including non-bioinformaticians. We used the TMD data to analyse microbiome samples from different projects such as MetaSUB, TARA Oceans, Human Microbiome Project, and Sponge Microbiome Project, showcasing the utility of TMD. Furthermore, we compare our microbial annotations with annotations collected by artificial intelligence (AI) and demonstrate that despite the high speed of AI in reviewing and collecting microbial data, annotation requires domain knowledge and therefore manual curation. Collectively, TMD provides a unique source of information that can help to interpret microbiome data and uncover biological associations. Database URL: www.themicrobedirectory.com/.","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462379/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145136810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0