首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
Interactive tools for functional annotation of bacterial genomes. 细菌基因组功能注释互动工具。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-06 DOI: 10.1093/database/baae089
Morgan N Price, Adam P Arkin

Automated annotations of protein functions are error-prone because of our lack of knowledge of protein functions. For example, it is often impossible to predict the correct substrate for an enzyme or a transporter. Furthermore, much of the knowledge that we do have about the functions of proteins is missing from the underlying databases. We discuss how to use interactive tools to quickly find different kinds of information relevant to a protein's function. Many of these tools are available via PaperBLAST (http://papers.genomics.lbl.gov). Combining these tools often allows us to infer a protein's function. Ideally, accurate annotations would allow us to predict a bacterium's capabilities from its genome sequence, but in practice, this remains challenging. We describe interactive tools that infer potential capabilities from a genome sequence or that search a genome to find proteins that might perform a specific function of interest. Database URL: http://papers.genomics.lbl.gov.

由于我们对蛋白质功能缺乏了解,自动注释蛋白质功能容易出错。例如,通常无法预测酶或转运体的正确底物。此外,我们所掌握的有关蛋白质功能的大部分知识都是从基础数据库中丢失的。我们将讨论如何使用交互式工具快速查找与蛋白质功能相关的各种信息。其中许多工具可通过 PaperBLAST (http://papers.genomics.lbl.gov) 获得。结合这些工具,我们往往可以推断出蛋白质的功能。理想情况下,准确的注释能让我们根据细菌的基因组序列预测其功能,但在实践中,这仍然具有挑战性。我们将介绍一些交互式工具,这些工具可从基因组序列中推断出潜在的功能,或通过搜索基因组来发现可能具有特定功能的蛋白质。数据库网址:http://papers.genomics.lbl.gov。
{"title":"Interactive tools for functional annotation of bacterial genomes.","authors":"Morgan N Price, Adam P Arkin","doi":"10.1093/database/baae089","DOIUrl":"10.1093/database/baae089","url":null,"abstract":"<p><p>Automated annotations of protein functions are error-prone because of our lack of knowledge of protein functions. For example, it is often impossible to predict the correct substrate for an enzyme or a transporter. Furthermore, much of the knowledge that we do have about the functions of proteins is missing from the underlying databases. We discuss how to use interactive tools to quickly find different kinds of information relevant to a protein's function. Many of these tools are available via PaperBLAST (http://papers.genomics.lbl.gov). Combining these tools often allows us to infer a protein's function. Ideally, accurate annotations would allow us to predict a bacterium's capabilities from its genome sequence, but in practice, this remains challenging. We describe interactive tools that infer potential capabilities from a genome sequence or that search a genome to find proteins that might perform a specific function of interest. Database URL: http://papers.genomics.lbl.gov.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11378808/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142143039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Globin Network and adopting genomic variant database requirements for thalassemia. 全球球蛋白网络和采用地中海贫血基因组变异数据库要求。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-04 DOI: 10.1093/database/baae080
Hashim Halim-Fikri, Ninie Nadia Zulkipli, Hafiza Alauddin, Celeste Bento, Carsten W Lederer, Petros Kountouris, Marina Kleanthous, Yetti Hernaningsih, Meow-Keong Thong, Muhammad Hamdi Mahmood, Norafiza Mohd Yasin, Ezalia Esa, Jacques Elion, Domenico Coviello, Raja-Zahratul-Azma Raja-Sabudin, Ghada El-Kamah, John Burn, Narazah Mohd Yusoff, Raj Ramesar, Bin Alwi Zilfalil

Thalassemia is one of the most prevalent monogenic disorders in low- and middle-income countries (LMICs). There are an estimated 270 million carriers of hemoglobinopathies (abnormal hemoglobins and/or thalassemia) worldwide, necessitating global methods and solutions for effective and optimal therapy. LMICs are disproportionately impacted by thalassemia, and due to disparities in genomics awareness and diagnostic resources, certain LMICs lag behind high-income countries (HICs). This spurred the establishment of the Global Globin Network (GGN) in 2015 at UNESCO, Paris, as a project-wide endeavor within the Human Variome Project (HVP). Primarily aimed at enhancing thalassemia clinical services, research, and genomic diagnostic capabilities with a focus on LMIC needs, GGN aims to foster data collection in a shared database by all affected nations, thus improving data sharing and thalassemia management. In this paper, we propose a minimum requirement for establishing a genomic database in thalassemia based on the HVP database guidelines. We suggest using an existing platform recommended by HVP, the Leiden Open Variation Database (LOVD) (https://www.lovd.nl/). Adoption of our proposed criteria will assist in improving or supplementing the existing databases, allowing for better-quality services for individuals with thalassemia. Database URL: https://www.lovd.nl/.

地中海贫血症是中低收入国家(LMICs)最常见的单基因疾病之一。据估计,全球有 2.7 亿血红蛋白病(血红蛋白异常和/或地中海贫血)携带者,因此需要全球性的方法和解决方案来提供有效和最佳的治疗。低收入和中等收入国家受地中海贫血症的影响尤为严重,由于基因组学意识和诊断资源的差异,某些低收入和中等收入国家落后于高收入国家。这促使全球球蛋白网络(GGN)于 2015 年在巴黎教科文组织成立,作为人类变异组计划(HVP)内的一项全项目工作。GGN 的主要目的是加强地中海贫血的临床服务、研究和基因组诊断能力,重点关注低收入国家的需求,旨在促进所有受影响国家在共享数据库中收集数据,从而改善数据共享和地中海贫血管理。在本文中,我们根据 HVP 数据库指南提出了建立地中海贫血基因组数据库的最低要求。我们建议使用 HVP 推荐的现有平台,即莱顿开放变异数据库 (LOVD) (https://www.lovd.nl/)。采用我们建议的标准将有助于改进或补充现有数据库,从而为地中海贫血患者提供更高质量的服务。数据库网址:https://www.lovd.nl/。
{"title":"Global Globin Network and adopting genomic variant database requirements for thalassemia.","authors":"Hashim Halim-Fikri, Ninie Nadia Zulkipli, Hafiza Alauddin, Celeste Bento, Carsten W Lederer, Petros Kountouris, Marina Kleanthous, Yetti Hernaningsih, Meow-Keong Thong, Muhammad Hamdi Mahmood, Norafiza Mohd Yasin, Ezalia Esa, Jacques Elion, Domenico Coviello, Raja-Zahratul-Azma Raja-Sabudin, Ghada El-Kamah, John Burn, Narazah Mohd Yusoff, Raj Ramesar, Bin Alwi Zilfalil","doi":"10.1093/database/baae080","DOIUrl":"10.1093/database/baae080","url":null,"abstract":"<p><p>Thalassemia is one of the most prevalent monogenic disorders in low- and middle-income countries (LMICs). There are an estimated 270 million carriers of hemoglobinopathies (abnormal hemoglobins and/or thalassemia) worldwide, necessitating global methods and solutions for effective and optimal therapy. LMICs are disproportionately impacted by thalassemia, and due to disparities in genomics awareness and diagnostic resources, certain LMICs lag behind high-income countries (HICs). This spurred the establishment of the Global Globin Network (GGN) in 2015 at UNESCO, Paris, as a project-wide endeavor within the Human Variome Project (HVP). Primarily aimed at enhancing thalassemia clinical services, research, and genomic diagnostic capabilities with a focus on LMIC needs, GGN aims to foster data collection in a shared database by all affected nations, thus improving data sharing and thalassemia management. In this paper, we propose a minimum requirement for establishing a genomic database in thalassemia based on the HVP database guidelines. We suggest using an existing platform recommended by HVP, the Leiden Open Variation Database (LOVD) (https://www.lovd.nl/). Adoption of our proposed criteria will assist in improving or supplementing the existing databases, allowing for better-quality services for individuals with thalassemia. Database URL: https://www.lovd.nl/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11373567/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142132087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes. CuPCA:用于大规模杯突相关基因泛癌症关联分析的网络服务器。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-09-03 DOI: 10.1093/database/baae075
Yishu Xu, Zhenshu Ma, Yajie Wang, Long Zhang, Jiaming Ye, Yuan Chen, Zhengrong Yuan

Copper-induced cell death is a novel mechanism of cell death, which is defined as cuproptosis. The increasing level of copper can produce toxicity in cells and may cause the occurrence of cell death. Several previous studies have proved that cuproptosis has a tight association with various cancers. Thus, the discovery of relationships between cuproptosis-related genes (CRGs) and human cancers is of great importance. Pan-cancer analysis can efficiently help researchers find out the relationship between multiple cancers and target genes precisely and make various prognostic analyses on cancers and cancer patients. Pan-cancer web servers can provide researchers with direct results of pan-cancer prognostic analyses, which can greatly improve the efficiency of their work. However, to date, no web server provides pan-cancer analysis about CRGs. Therefore, we introduce the cuproptosis pan-cancer analysis database (CuPCA), the first database for various analysis results of CRGs through 33 cancer types. CuPCA is a user-friendly resource for cancer researchers to gain various prognostic analyses between cuproptosis and cancers. It provides single CRG pan-cancer analysis, multi-CRGs pan-cancer analysis, multi-CRlncRNA pan-cancer analysis, and mRNA-circRNA-lncRNA conjoint analysis. These analysis results can not only indicate the relationship between cancers and cuproptosis at both gene level and protein level, but also predict the conditions of different cancer patients, which include their clinical condition, survival condition, and their immunological condition. CuPCA procures the delivery of analyzed data to end users, which improves the efficiency of wide research as well as releases the value of data resources. Database URL: http://cupca.cn/.

铜诱导的细胞死亡是一种新的细胞死亡机制,被定义为杯突症。铜含量的增加会对细胞产生毒性,并可能导致细胞死亡。之前的一些研究已经证明,铜中毒与各种癌症有着密切的联系。因此,发现铜氧化相关基因(CRGs)与人类癌症之间的关系非常重要。泛癌症分析可以有效地帮助研究人员精确地发现多种癌症与靶基因之间的关系,并对癌症和癌症患者进行各种预后分析。泛癌网络服务器可以为研究人员提供直接的泛癌预后分析结果,从而大大提高他们的工作效率。然而,迄今为止,还没有一个网络服务器提供有关 CRG 的泛癌症分析。因此,我们引入了杯突病泛癌症分析数据库(CuPCA),这是首个包含 33 种癌症类型的 CRGs 各种分析结果的数据库。CuPCA 是癌症研究人员获得杯突与癌症之间各种预后分析的友好资源。它提供单CRG泛癌分析、多CRG泛癌分析、多CRlncRNA泛癌分析和mRNA-circRNA-lncRNA联合分析。这些分析结果不仅能在基因水平和蛋白质水平上指出癌症与杯突症的关系,还能预测不同癌症患者的病情,包括其临床状况、生存状况和免疫状况。CuPCA 将分析数据交付给最终用户,从而提高了广泛研究的效率,并释放了数据资源的价值。数据库网址:http://cupca.cn/.
{"title":"CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes.","authors":"Yishu Xu, Zhenshu Ma, Yajie Wang, Long Zhang, Jiaming Ye, Yuan Chen, Zhengrong Yuan","doi":"10.1093/database/baae075","DOIUrl":"10.1093/database/baae075","url":null,"abstract":"<p><p>Copper-induced cell death is a novel mechanism of cell death, which is defined as cuproptosis. The increasing level of copper can produce toxicity in cells and may cause the occurrence of cell death. Several previous studies have proved that cuproptosis has a tight association with various cancers. Thus, the discovery of relationships between cuproptosis-related genes (CRGs) and human cancers is of great importance. Pan-cancer analysis can efficiently help researchers find out the relationship between multiple cancers and target genes precisely and make various prognostic analyses on cancers and cancer patients. Pan-cancer web servers can provide researchers with direct results of pan-cancer prognostic analyses, which can greatly improve the efficiency of their work. However, to date, no web server provides pan-cancer analysis about CRGs. Therefore, we introduce the cuproptosis pan-cancer analysis database (CuPCA), the first database for various analysis results of CRGs through 33 cancer types. CuPCA is a user-friendly resource for cancer researchers to gain various prognostic analyses between cuproptosis and cancers. It provides single CRG pan-cancer analysis, multi-CRGs pan-cancer analysis, multi-CRlncRNA pan-cancer analysis, and mRNA-circRNA-lncRNA conjoint analysis. These analysis results can not only indicate the relationship between cancers and cuproptosis at both gene level and protein level, but also predict the conditions of different cancer patients, which include their clinical condition, survival condition, and their immunological condition. CuPCA procures the delivery of analyzed data to end users, which improves the efficiency of wide research as well as releases the value of data resources. Database URL: http://cupca.cn/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11373563/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142132086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CBGDA: a manually curated resource for gene-disease associations based on genome-wide CRISPR. CBGDA:基于全基因组 CRISPR 的人工编辑的基因-疾病关联资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-30 DOI: 10.1093/database/baae077
Qingsong Du, Zhiyu Zhang, Wanyi Yang, Xunyu Zhou, Nan Zhou, Chuanfang Wu, Jinku Bao

The field of understanding the association between genes and diseases is rapidly expanding, making it challenging for researchers to keep up with the influx of new publications and genetic datasets. Fortunately, there are now several regularly updated databases available that focus on cataloging gene-disease relationships. The development of the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas9 system has revolutionized the field of gene editing, providing a highly efficient, accurate, and reliable method for exploring gene-disease associations. However, currently, there is no resource specifically dedicated to collecting and integrating the latest experimentally supported gene-disease association data derived from genome-wide CRISPR screening. To address this gap, we have developed the CRISPR-Based Gene-Disease Associations (CBGDA) database, which includes over 200 manually curated gene-disease association data derived from genome-wide CRISPR screening studies. Through CBGDA, users can explore gene-disease association data derived from genome-wide CRISPR screening, gaining insights into the expression patterns of genes in different diseases, associated chemical data, and variant information. This provides a novel perspective on understanding the associations between genes and diseases. What is more, CBGDA integrates data from several other databases and resources, enhancing its comprehensiveness and utility. In summary, CBGDA offers a fresh perspective and comprehensive insights into the research on gene-disease associations. It fills the gap by providing a dedicated resource for accessing up-to-date, experimentally supported gene-disease association data derived from genome-wide CRISPR screening. Database URL: http://cbgda.zhounan.org/main.

了解基因与疾病之间关系的领域正在迅速扩大,这使得研究人员很难跟上大量新出版物和基因数据集的涌入。幸运的是,现在有几个定期更新的数据库专注于编目基因与疾病的关系。聚类正则间隔短码回文(CRISPR)-Cas9 系统的开发彻底改变了基因编辑领域,为探索基因与疾病的关联提供了一种高效、准确、可靠的方法。然而,目前还没有专门用于收集和整合从全基因组 CRISPR 筛选中获得的最新实验支持的基因-疾病关联数据的资源。为了填补这一空白,我们开发了基于CRISPR的基因-疾病关联(CBGDA)数据库,其中包括200多条从全基因组CRISPR筛选研究中获得的人工编辑的基因-疾病关联数据。通过 CBGDA,用户可以探索从全基因组 CRISPR 筛选中获得的基因-疾病关联数据,深入了解不同疾病中基因的表达模式、相关化学数据和变异信息。这为了解基因与疾病之间的关联提供了一个新的视角。此外,CBGDA 还整合了其他几个数据库和资源的数据,增强了其全面性和实用性。总之,CBGDA 为基因与疾病的关联研究提供了全新的视角和全面的见解。它提供了一个专门的资源,用于访问从全基因组 CRISPR 筛选中获得的最新的、有实验支持的基因-疾病关联数据,从而填补了这一空白。数据库网址:http://cbgda.zhounan.org/main。
{"title":"CBGDA: a manually curated resource for gene-disease associations based on genome-wide CRISPR.","authors":"Qingsong Du, Zhiyu Zhang, Wanyi Yang, Xunyu Zhou, Nan Zhou, Chuanfang Wu, Jinku Bao","doi":"10.1093/database/baae077","DOIUrl":"10.1093/database/baae077","url":null,"abstract":"<p><p>The field of understanding the association between genes and diseases is rapidly expanding, making it challenging for researchers to keep up with the influx of new publications and genetic datasets. Fortunately, there are now several regularly updated databases available that focus on cataloging gene-disease relationships. The development of the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas9 system has revolutionized the field of gene editing, providing a highly efficient, accurate, and reliable method for exploring gene-disease associations. However, currently, there is no resource specifically dedicated to collecting and integrating the latest experimentally supported gene-disease association data derived from genome-wide CRISPR screening. To address this gap, we have developed the CRISPR-Based Gene-Disease Associations (CBGDA) database, which includes over 200 manually curated gene-disease association data derived from genome-wide CRISPR screening studies. Through CBGDA, users can explore gene-disease association data derived from genome-wide CRISPR screening, gaining insights into the expression patterns of genes in different diseases, associated chemical data, and variant information. This provides a novel perspective on understanding the associations between genes and diseases. What is more, CBGDA integrates data from several other databases and resources, enhancing its comprehensiveness and utility. In summary, CBGDA offers a fresh perspective and comprehensive insights into the research on gene-disease associations. It fills the gap by providing a dedicated resource for accessing up-to-date, experimentally supported gene-disease association data derived from genome-wide CRISPR screening. Database URL: http://cbgda.zhounan.org/main.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363955/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Landscape of racial and ethnic health disparities in the All of Us Research Program. 我们所有人 "研究计划中的种族和民族健康差异情况。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-30 DOI: 10.1093/database/baae082
Vincent Lam, Shivam Sharma, John L Spouge, I King Jordan, Leonardo Mariño-Ramírez

The All of Us Research Program ("All of Us") is an initiative led by the National Institutes of Health whose goal is to advance research on personalized medicine and health equity through the collection of genetic, environmental, demographic, and health data from volunteer participants who reside in the USA. The program's emphasis on recruiting a diverse participant cohort makes "All of Us" an effective platform for investigating health disparities. In this work, we analyzed participant electronic health record (EHR) data to identify the diseases and disease categories in the "All of Us" cohort for which racial and ethnic prevalence disparities can be observed. In conjunction with these analyses, we developed the US Health Disparities Browser as an interactive web application that enables users to visualize differences in race- and ethnic-group-specific prevalence estimates for 1755 different diseases: https://usdisparities.biosci.gatech.edu/. The web application features a catalog of all diseases represented in the browser, which can be sorted by overall prevalence as well as the variance in prevalence across racial and ethnic groups. The analyses outlined here provide details on the nature and extent of racial and ethnic health disparities in the "All of Us" participant cohort, and the accompanying browser can serve as a resource through which researchers can explore these disparities Database URL: https://usdisparities.biosci.gatech.edu.

我们所有人 "研究计划("All of Us")是由美国国立卫生研究院(National Institutes of Health)领导的一项计划,其目标是通过收集居住在美国的志愿参与者的基因、环境、人口和健康数据,推动个性化医疗和健康公平方面的研究。该计划强调招募多样化的参与者群体,这使 "我们所有人 "成为调查健康差异的有效平台。在这项工作中,我们分析了参与者的电子健康记录(EHR)数据,以确定 "我们所有人 "队列中可观察到种族和民族流行率差异的疾病和疾病类别。结合这些分析,我们开发了 "美国健康差异浏览器"(US Health Disparities Browser)这一交互式网络应用程序,使用户能够直观地看到 1755 种不同疾病的种族和族裔群体患病率估计值的差异:https://usdisparities.biosci.gatech.edu/。该网络应用程序提供了浏览器中所有疾病的目录,可按总体患病率以及不同种族和族裔群体患病率的差异进行排序。本文概述的分析详细说明了 "我们所有人 "参与者队列中种族和民族健康差异的性质和程度,随附的浏览器可作为研究人员探索这些差异的资源 数据库网址:https://usdisparities.biosci.gatech.edu。
{"title":"Landscape of racial and ethnic health disparities in the All of Us Research Program.","authors":"Vincent Lam, Shivam Sharma, John L Spouge, I King Jordan, Leonardo Mariño-Ramírez","doi":"10.1093/database/baae082","DOIUrl":"10.1093/database/baae082","url":null,"abstract":"<p><p>The All of Us Research Program (\"All of Us\") is an initiative led by the National Institutes of Health whose goal is to advance research on personalized medicine and health equity through the collection of genetic, environmental, demographic, and health data from volunteer participants who reside in the USA. The program's emphasis on recruiting a diverse participant cohort makes \"All of Us\" an effective platform for investigating health disparities. In this work, we analyzed participant electronic health record (EHR) data to identify the diseases and disease categories in the \"All of Us\" cohort for which racial and ethnic prevalence disparities can be observed. In conjunction with these analyses, we developed the US Health Disparities Browser as an interactive web application that enables users to visualize differences in race- and ethnic-group-specific prevalence estimates for 1755 different diseases: https://usdisparities.biosci.gatech.edu/. The web application features a catalog of all diseases represented in the browser, which can be sorted by overall prevalence as well as the variance in prevalence across racial and ethnic groups. The analyses outlined here provide details on the nature and extent of racial and ethnic health disparities in the \"All of Us\" participant cohort, and the accompanying browser can serve as a resource through which researchers can explore these disparities Database URL: https://usdisparities.biosci.gatech.edu.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic extraction of transcriptional regulatory interactions of bacteria from biomedical literature using a BERT-based approach. 使用基于 BERT 的方法从生物医学文献中自动提取细菌的转录调控相互作用。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-30 DOI: 10.1093/database/baae094
Alfredo Varela-Vega, Ali-Berenice Posada-Reyes, Carlos-Francisco Méndez-Cruz

Transcriptional regulatory networks (TRNs) give a global view of the regulatory mechanisms of bacteria to respond to environmental signals. These networks are published in biological databases as a valuable resource for experimental and bioinformatics researchers. Despite the efforts to publish TRNs of diverse bacteria, many of them still lack one and many of the existing TRNs are incomplete. In addition, the manual extraction of information from biomedical literature ("literature curation") has been the traditional way to extract these networks, despite this being demanding and time-consuming. Recently, language models based on pretrained transformers have been used to extract relevant knowledge from biomedical literature. Moreover, the benefit of fine-tuning a large pretrained model with new limited data for a specific task ("transfer learning") opens roads to address new problems of biomedical information extraction. Here, to alleviate this lack of knowledge and assist literature curation, we present a new approach based on the Bidirectional Transformer for Language Understanding (BERT) architecture to classify transcriptional regulatory interactions of bacteria as a first step to extract TRNs from literature. The approach achieved a significant performance in a test dataset of sentences of Escherichia coli (F1-Score: 0.8685, Matthew's correlation coefficient: 0.8163). The examination of model predictions revealed that the model learned different ways to express the regulatory interaction. The approach was evaluated to extract a TRN of Salmonella using 264 complete articles. The evaluation showed that the approach was able to accurately extract 82% of the network and that it was able to extract interactions absent in curation data. To the best of our knowledge, the present study is the first effort to obtain a BERT-based approach to extract this specific kind of interaction. This approach is a starting point to address the limitations of reconstructing TRNs of bacteria and diseases of biological interest. Database URL: https://github.com/laigen-unam/BERT-trn-extraction.

转录调控网络(TRN)提供了细菌响应环境信号的调控机制的全局视图。这些网络发布在生物数据库中,是实验和生物信息学研究人员的宝贵资源。尽管人们努力发布各种细菌的调控网络,但许多细菌仍然缺乏调控网络,而且许多现有的调控网络并不完整。此外,从生物医学文献中手动提取信息("文献整理")一直是提取这些网络的传统方法,尽管这种方法要求高且耗时。最近,基于预训练转换器的语言模型被用于从生物医学文献中提取相关知识。此外,针对特定任务使用新的有限数据对大型预训练模型进行微调("迁移学习")的好处为解决生物医学信息提取的新问题开辟了道路。在此,为了缓解这种知识匮乏并协助文献整理,我们提出了一种基于双向语言理解转换器(BERT)架构的新方法,对细菌的转录调控相互作用进行分类,作为从文献中提取 TRN 的第一步。该方法在大肠杆菌句子测试数据集中取得了显著的性能(F1-分数:0.8685,马修相关系数:0.8163)。对模型预测的检查表明,该模型学会了表达调控相互作用的不同方式。利用 264 篇完整文章对该方法进行了评估,以提取沙门氏菌的 TRN。评估结果表明,该方法能够准确提取网络中 82% 的内容,而且还能提取出馆藏数据中不存在的相互作用。据我们所知,本研究是首次采用基于 BERT 的方法来提取这种特定的交互作用。这种方法是解决重建细菌和生物疾病 TRN 的局限性的一个起点。数据库网址:https://github.com/laigen-unam/BERT-trn-extraction.
{"title":"Automatic extraction of transcriptional regulatory interactions of bacteria from biomedical literature using a BERT-based approach.","authors":"Alfredo Varela-Vega, Ali-Berenice Posada-Reyes, Carlos-Francisco Méndez-Cruz","doi":"10.1093/database/baae094","DOIUrl":"10.1093/database/baae094","url":null,"abstract":"<p><p>Transcriptional regulatory networks (TRNs) give a global view of the regulatory mechanisms of bacteria to respond to environmental signals. These networks are published in biological databases as a valuable resource for experimental and bioinformatics researchers. Despite the efforts to publish TRNs of diverse bacteria, many of them still lack one and many of the existing TRNs are incomplete. In addition, the manual extraction of information from biomedical literature (\"literature curation\") has been the traditional way to extract these networks, despite this being demanding and time-consuming. Recently, language models based on pretrained transformers have been used to extract relevant knowledge from biomedical literature. Moreover, the benefit of fine-tuning a large pretrained model with new limited data for a specific task (\"transfer learning\") opens roads to address new problems of biomedical information extraction. Here, to alleviate this lack of knowledge and assist literature curation, we present a new approach based on the Bidirectional Transformer for Language Understanding (BERT) architecture to classify transcriptional regulatory interactions of bacteria as a first step to extract TRNs from literature. The approach achieved a significant performance in a test dataset of sentences of Escherichia coli (F1-Score: 0.8685, Matthew's correlation coefficient: 0.8163). The examination of model predictions revealed that the model learned different ways to express the regulatory interaction. The approach was evaluated to extract a TRN of Salmonella using 264 complete articles. The evaluation showed that the approach was able to accurately extract 82% of the network and that it was able to extract interactions absent in curation data. To the best of our knowledge, the present study is the first effort to obtain a BERT-based approach to extract this specific kind of interaction. This approach is a starting point to address the limitations of reconstructing TRNs of bacteria and diseases of biological interest. Database URL: https://github.com/laigen-unam/BERT-trn-extraction.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363960/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acupuncture indication knowledge bases: meridian entity recognition and classification based on ACUBERT. 针灸适应症知识库:基于 ACUBERT 的经络实体识别与分类。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-30 DOI: 10.1093/database/baae083
TianCheng Xu, Jing Wen, Lei Wang, YueYing Huang, ZiJing Zhu, Qian Zhu, Yi Fang, ChengBiao Yang, YouBing Xia

In acupuncture diagnosis and treatment, non-quantitative clinical descriptions have limited the development of standardized treatment methods. This study explores the effectiveness and the reasons for discrepancies in the entity recognition and classification of meridians in acupuncture indication using the Acupuncture Bidirectional Encoder Representations from Transformers (ACUBERT) model. During the research process, we selected 54 593 different entities from 82 acupuncture medical books as the pretraining corpus for medical literature, conducting classification research on Chinese medical literature using the BERT model. Additionally, we employed the support vector machine and Random Forest models as comparative benchmarks and optimized them through parameter tuning, ultimately leading to the development of the ACUBERT model. The results show that the ACUBERT model outperforms other baseline models in classification effectiveness, achieving the best performance at Epoch = 5. The model's "precision," "recall," and F1 scores reached above 0.8. Moreover, our study has a unique feature: it trains the meridian differentiation model based on the eight principles of differentiation and zang-fu differentiation as foundational labels. It establishes an acupuncture-indication knowledge base (ACU-IKD) and ACUBERT model with traditional Chinese medicine characteristics. In summary, the ACUBERT model significantly enhances the classification effectiveness of meridian attribution in the acupuncture indication database and also demonstrates the classification advantages of deep learning methods based on BERT in multi-category, large-scale training sets. Database URL: http://acuai.njucm.edu.cn:8081/#/user/login?tenantUrl=default.

在针灸诊断和治疗中,非量化的临床描述限制了标准化治疗方法的发展。本研究利用针灸双向变换编码器表征(ACUBERT)模型,探讨了针灸指征中经络实体识别和分类的有效性及其差异原因。在研究过程中,我们从 82 本针灸医书中选取了 54 593 个不同的实体作为医学文献的预训练语料库,利用 BERT 模型对中医文献进行分类研究。此外,我们还采用支持向量机和随机森林模型作为比较基准,并通过参数调整对其进行优化,最终开发出 ACUBERT 模型。结果表明,ACUBERT 模型的分类效果优于其他基准模型,在 Epoch = 5 时表现最佳。该模型的 "精确度"、"召回率 "和 F1 分数都达到了 0.8 以上。此外,我们的研究还有一个独特之处:它以八纲辨证和藏府辨证为基础标签,训练经络辨证模型。它建立了具有中医特色的针灸辨证知识库(ACU-IKD)和 ACUBERT 模型。总之,ACUBERT模型显著提高了针灸指征数据库中经络归属的分类效果,同时也证明了基于BERT的深度学习方法在多类别、大规模训练集中的分类优势。数据库网址:http://acuai.njucm.edu.cn:8081/#/user/login?tenantUrl=default。
{"title":"Acupuncture indication knowledge bases: meridian entity recognition and classification based on ACUBERT.","authors":"TianCheng Xu, Jing Wen, Lei Wang, YueYing Huang, ZiJing Zhu, Qian Zhu, Yi Fang, ChengBiao Yang, YouBing Xia","doi":"10.1093/database/baae083","DOIUrl":"10.1093/database/baae083","url":null,"abstract":"<p><p>In acupuncture diagnosis and treatment, non-quantitative clinical descriptions have limited the development of standardized treatment methods. This study explores the effectiveness and the reasons for discrepancies in the entity recognition and classification of meridians in acupuncture indication using the Acupuncture Bidirectional Encoder Representations from Transformers (ACUBERT) model. During the research process, we selected 54 593 different entities from 82 acupuncture medical books as the pretraining corpus for medical literature, conducting classification research on Chinese medical literature using the BERT model. Additionally, we employed the support vector machine and Random Forest models as comparative benchmarks and optimized them through parameter tuning, ultimately leading to the development of the ACUBERT model. The results show that the ACUBERT model outperforms other baseline models in classification effectiveness, achieving the best performance at Epoch = 5. The model's \"precision,\" \"recall,\" and F1 scores reached above 0.8. Moreover, our study has a unique feature: it trains the meridian differentiation model based on the eight principles of differentiation and zang-fu differentiation as foundational labels. It establishes an acupuncture-indication knowledge base (ACU-IKD) and ACUBERT model with traditional Chinese medicine characteristics. In summary, the ACUBERT model significantly enhances the classification effectiveness of meridian attribution in the acupuncture indication database and also demonstrates the classification advantages of deep learning methods based on BERT in multi-category, large-scale training sets. Database URL: http://acuai.njucm.edu.cn:8081/#/user/login?tenantUrl=default.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363959/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Autoinhibited Protein Database: a curated database of autoinhibitory domains and their autoinhibition mechanisms. 自动抑制蛋白质数据库:一个关于自动抑制结构域及其自动抑制机制的编辑数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-28 DOI: 10.1093/database/baae085
Daeahn Cho, Hyang-Mi Lee, Ji Ah Kim, Jae Gwang Song, Su-Hee Hwang, Bomi Lee, Jinsil Park, Kha Mong Tran, Jiwon Kim, Phuong Ngoc Lam Vo, Jooeun Bae, Teerapat Pimt, Kangseok Lee, Jörg Gsponer, Hyung Wook Kim, Dokyun Na

Autoinhibition, a crucial allosteric self-regulation mechanism in cell signaling, ensures signal propagation exclusively in the presence of specific molecular inputs. The heightened focus on autoinhibited proteins stems from their implication in human diseases, positioning them as potential causal factors or therapeutic targets. However, the absence of a comprehensive knowledgebase impedes a thorough understanding of their roles and applications in drug discovery. Addressing this gap, we introduce Autoinhibited Protein Database (AiPD), a curated database standardizing information on autoinhibited proteins. AiPD encompasses details on autoinhibitory domains (AIDs), their targets, regulatory mechanisms, experimental validation methods, and implications in diseases, including associated mutations and post-translational modifications. AiPD comprises 698 AIDs from 532 experimentally characterized autoinhibited proteins and 2695 AIDs from their 2096 homologs, which were retrieved from 864 published articles. AiPD also includes 42 520 AIDs of computationally predicted autoinhibited proteins. In addition, AiPD facilitates users in investigating potential AIDs within a query sequence through comparisons with documented autoinhibited proteins. As the inaugural autoinhibited protein repository, AiPD significantly aids researchers studying autoinhibition mechanisms and their alterations in human diseases. It is equally valuable for developing computational models, analyzing allosteric protein regulation, predicting new drug targets, and understanding intervention mechanisms AiPD serves as a valuable resource for diverse researchers, contributing to the understanding and manipulation of autoinhibition in cellular processes. Database URL: http://ssbio.cau.ac.kr/databases/AiPD.

自抑制是细胞信号传导过程中一种重要的异位自我调节机制,它确保信号只在特定分子输入的情况下传播。人们对自动抑制蛋白的高度关注源于它们对人类疾病的影响,并将其定位为潜在的致病因素或治疗靶点。然而,缺乏全面的知识库阻碍了对它们在药物发现中的作用和应用的透彻理解。为了填补这一空白,我们引入了自体抑制蛋白数据库(AiPD),这是一个对自体抑制蛋白的信息进行标准化编辑的数据库。AiPD 包含有关自体抑制结构域(AID)、其靶点、调控机制、实验验证方法和对疾病的影响的详细信息,包括相关突变和翻译后修饰。AiPD 包括从 864 篇已发表文章中检索到的 532 个经实验鉴定的自体抑制蛋白中的 698 个 AID 和它们的 2096 个同源物中的 2695 个 AID。AiPD 还包括 42 520 个经计算预测的自身抑制蛋白的 AID。此外,AiPD 还通过与已发表的自体抑制蛋白进行比较,帮助用户研究查询序列中潜在的自体抑制蛋白。作为首个自身抑制蛋白库,AiPD 极大地帮助了研究自身抑制机制及其在人类疾病中的改变的研究人员。它对于开发计算模型、分析异位蛋白调控、预测新的药物靶点和了解干预机制也同样有价值。 AiPD 是各种研究人员的宝贵资源,有助于了解和操纵细胞过程中的自动抑制。数据库网址:http://ssbio.cau.ac.kr/databases/AiPD。
{"title":"Autoinhibited Protein Database: a curated database of autoinhibitory domains and their autoinhibition mechanisms.","authors":"Daeahn Cho, Hyang-Mi Lee, Ji Ah Kim, Jae Gwang Song, Su-Hee Hwang, Bomi Lee, Jinsil Park, Kha Mong Tran, Jiwon Kim, Phuong Ngoc Lam Vo, Jooeun Bae, Teerapat Pimt, Kangseok Lee, Jörg Gsponer, Hyung Wook Kim, Dokyun Na","doi":"10.1093/database/baae085","DOIUrl":"10.1093/database/baae085","url":null,"abstract":"<p><p>Autoinhibition, a crucial allosteric self-regulation mechanism in cell signaling, ensures signal propagation exclusively in the presence of specific molecular inputs. The heightened focus on autoinhibited proteins stems from their implication in human diseases, positioning them as potential causal factors or therapeutic targets. However, the absence of a comprehensive knowledgebase impedes a thorough understanding of their roles and applications in drug discovery. Addressing this gap, we introduce Autoinhibited Protein Database (AiPD), a curated database standardizing information on autoinhibited proteins. AiPD encompasses details on autoinhibitory domains (AIDs), their targets, regulatory mechanisms, experimental validation methods, and implications in diseases, including associated mutations and post-translational modifications. AiPD comprises 698 AIDs from 532 experimentally characterized autoinhibited proteins and 2695 AIDs from their 2096 homologs, which were retrieved from 864 published articles. AiPD also includes 42 520 AIDs of computationally predicted autoinhibited proteins. In addition, AiPD facilitates users in investigating potential AIDs within a query sequence through comparisons with documented autoinhibited proteins. As the inaugural autoinhibited protein repository, AiPD significantly aids researchers studying autoinhibition mechanisms and their alterations in human diseases. It is equally valuable for developing computational models, analyzing allosteric protein regulation, predicting new drug targets, and understanding intervention mechanisms AiPD serves as a valuable resource for diverse researchers, contributing to the understanding and manipulation of autoinhibition in cellular processes. Database URL: http://ssbio.cau.ac.kr/databases/AiPD.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11349611/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142079544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Australian Biosecurity Genomic Database: a new resource for high-throughput sequencing analysis based on the National Notifiable Disease List of Terrestrial Animals. 澳大利亚生物安全基因组数据库:基于陆生动物国家通报疾病清单的高通量测序分析新资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-28 DOI: 10.1093/database/baae084
Jana Batovska, Natasha D Brohier, Peter T Mee, Fiona E Constable, Brendan C Rodoni, Stacey E Lynch

The Australian Biosecurity Genomic Database (ABGD) is a curated collection of reference viral genome sequences based on the Australian National Notifiable Disease List of Terrestrial Animals. It was created to facilitate the screening of high-throughput sequencing (HTS) data for the potential presence of viruses associated with notifiable disease. The database includes a single verified sequence (the exemplar species sequence, where relevant) for each of the 60 virus species across 21 viral families that are associated with or cause these notifiable diseases, as recognized by the World Organisation for Animal Health. The open-source ABGD on GitHub provides usage guidance documents and is intended to support building a culture in Australian HTS communities that promotes the use of quality-assured, standardized, and verified databases for Australia's national biosecurity interests. Future expansion of the database will include the addition of more strains or subtypes for highly variable viruses, viruses causing diseases of aquatic animals, and genomes of other types of pathogens associated with notifiable diseases, such as bacteria. Database URL: https://github.com/ausbiopathgenDB/AustralianBiosecurityGenomicDatabase.

澳大利亚生物安全基因组数据库(ABGD)是根据《澳大利亚陆生动物国家应报疾病清单》编辑的参考病毒基因组序列库。建立该数据库的目的是为了方便筛选高通量测序 (HTS) 数据,以确定是否存在与应呈报疾病相关的病毒。该数据库包括世界动物卫生组织认定的与这些应呈报疾病相关或导致这些疾病的 21 个病毒科的 60 种病毒中每种病毒的单个验证序列(相关情况下为示范物种序列)。GitHub 上的开源 ABGD 提供了使用指南文档,旨在支持澳大利亚 HTS 社区建立一种文化,促进使用有质量保证、标准化和经过验证的数据库来维护澳大利亚的国家生物安全利益。数据库未来的扩展将包括添加更多的高变异病毒毒株或亚型、导致水生动物疾病的病毒以及与应申报疾病相关的其他类型病原体(如细菌)的基因组。数据库网址:https://github.com/ausbiopathgenDB/AustralianBiosecurityGenomicDatabase。
{"title":"The Australian Biosecurity Genomic Database: a new resource for high-throughput sequencing analysis based on the National Notifiable Disease List of Terrestrial Animals.","authors":"Jana Batovska, Natasha D Brohier, Peter T Mee, Fiona E Constable, Brendan C Rodoni, Stacey E Lynch","doi":"10.1093/database/baae084","DOIUrl":"10.1093/database/baae084","url":null,"abstract":"<p><p>The Australian Biosecurity Genomic Database (ABGD) is a curated collection of reference viral genome sequences based on the Australian National Notifiable Disease List of Terrestrial Animals. It was created to facilitate the screening of high-throughput sequencing (HTS) data for the potential presence of viruses associated with notifiable disease. The database includes a single verified sequence (the exemplar species sequence, where relevant) for each of the 60 virus species across 21 viral families that are associated with or cause these notifiable diseases, as recognized by the World Organisation for Animal Health. The open-source ABGD on GitHub provides usage guidance documents and is intended to support building a culture in Australian HTS communities that promotes the use of quality-assured, standardized, and verified databases for Australia's national biosecurity interests. Future expansion of the database will include the addition of more strains or subtypes for highly variable viruses, viruses causing diseases of aquatic animals, and genomes of other types of pathogens associated with notifiable diseases, such as bacteria. Database URL: https://github.com/ausbiopathgenDB/AustralianBiosecurityGenomicDatabase.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11352597/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142085939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MACSFeD-a database of mosquito acoustic communication and swarming features. MACSFeD--蚊子声学通讯和成群特征数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-28 DOI: 10.1093/database/baae086
YuMin M Loh, Matthew P Su, Kayla G Haruni, Azusa Kamikouchi

Acoustic communication plays an important role during the courtship of many mosquito species. Male mosquitoes show strong attraction to female wing beat frequencies, mediated via spectral matching between female wing beat frequency and male ear mechanical tuning frequency. Such acoustic communication typically occurs within swarms, male-dominated aggregations with species-specific properties. Despite hundreds of relevant publications being available, the lack of a central platform hosting all associated data hinders research efforts and limits cross-species comparisons. Here, we introduce MACSFeD (Mosquito Acoustic Communication and Swarming Features Database), an interactive platform for the exploration of our comprehensive database containing 251 unique reports focusing on different aspects of mosquito acoustic communication, including hearing function, wing beat frequency and phonotaxis, as well as male swarming parameters. MACSFeD serves as an easily accessible, efficient, and robust data visualization tool for mosquito acoustic communication research. We envision that further in-depth studies could arise following the use of this new platform. Database URL: https://minmatt.shinyapps.io/MACSFeD/.

声学交流在许多蚊子物种的求偶过程中发挥着重要作用。雄蚊对雌蚊的拍翅频率有很强的吸引力,雌蚊拍翅频率和雄蚊耳朵机械调谐频率之间的频谱匹配是其媒介。这种声学交流通常发生在蚊群中,即由雄性主导的具有物种特异性的聚集。尽管有数以百计的相关出版物,但由于缺乏一个承载所有相关数据的中央平台,阻碍了研究工作并限制了跨物种比较。在此,我们介绍了MACSFeD(蚊子声学通讯和蜂群特征数据库),这是一个用于探索我们的综合数据库的互动平台,该数据库包含251份独特的报告,侧重于蚊子声学通讯的不同方面,包括听觉功能、拍翅频率和声轴,以及雄性蜂群参数。MACSFeD 可作为蚊子声学通讯研究的一个易于访问、高效和强大的数据可视化工具。我们设想,在使用这一新平台后,可能会有进一步的深入研究。数据库网址:https://minmatt.shinyapps.io/MACSFeD/.
{"title":"MACSFeD-a database of mosquito acoustic communication and swarming features.","authors":"YuMin M Loh, Matthew P Su, Kayla G Haruni, Azusa Kamikouchi","doi":"10.1093/database/baae086","DOIUrl":"10.1093/database/baae086","url":null,"abstract":"<p><p>Acoustic communication plays an important role during the courtship of many mosquito species. Male mosquitoes show strong attraction to female wing beat frequencies, mediated via spectral matching between female wing beat frequency and male ear mechanical tuning frequency. Such acoustic communication typically occurs within swarms, male-dominated aggregations with species-specific properties. Despite hundreds of relevant publications being available, the lack of a central platform hosting all associated data hinders research efforts and limits cross-species comparisons. Here, we introduce MACSFeD (Mosquito Acoustic Communication and Swarming Features Database), an interactive platform for the exploration of our comprehensive database containing 251 unique reports focusing on different aspects of mosquito acoustic communication, including hearing function, wing beat frequency and phonotaxis, as well as male swarming parameters. MACSFeD serves as an easily accessible, efficient, and robust data visualization tool for mosquito acoustic communication research. We envision that further in-depth studies could arise following the use of this new platform. Database URL: https://minmatt.shinyapps.io/MACSFeD/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2024 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11352598/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142085937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1