首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
A longitudinal analysis of function annotations of the human proteome reveals consistently high biases. 对人类蛋白质组功能注释的纵向分析显示出一贯的高偏差。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-07 DOI: 10.1093/database/baaf036
An Phan, Parnal Joshi, Claus Kadelka, Iddo Friedberg

The resources required to study gene function are limited, especially when considering the number of genes in the human genome and the complexity of their function. Therefore, genes are prioritized for experimental studies based on many different considerations, including, but not limited to, perceived biomedical importance, such as disease-associated genes, or the understanding of biological processes, such as cell signalling pathways. At the same time, most genes are not studied or are under-characterized, which hampers our understanding of their function and potential effects on human health and wellness. Understanding function annotation disparity is a necessary first step toward understanding how much functional knowledge is gained from the human genome, and toward guidelines for better targeting future studies of the genes in the human genome effectively. Here, we present a comprehensive longitudinal analysis of the human proteome utilizing data analysis tools from economics and information theory. Specifically, we view the human proteome as a population of proteins within a knowledge economy: we treat the quantified knowledge of the protein's function as the analogue of wealth and examine the distribution of information in a population of proteins in the proteome in the same manner distribution of wealth is studied in societies. Our results show a highly skewed distribution of information about human proteins over the last decade, in which the inequality in the annotations given to the proteins remains high. Additionally, we examine the correlation between the knowledge about protein function as captured in databases and the interest in proteins as reflected by mentions in the scientific literature. We show a large gap between knowledge and interest and dissect the factors leading to this gap. In conclusion, our study shows that research efforts should be redirected to less studied proteins to mitigate the disparity among human proteins both in databases and literature.

研究基因功能所需的资源是有限的,特别是考虑到人类基因组中基因的数量及其功能的复杂性。因此,基于许多不同的考虑因素,包括但不限于感知到的生物医学重要性,如疾病相关基因,或对生物过程的理解,如细胞信号传导途径,对实验研究的基因进行优先排序。与此同时,大多数基因没有被研究或特征不充分,这阻碍了我们对它们的功能和对人类健康的潜在影响的理解。了解功能注释差异是了解人类基因组功能知识的必要第一步,也是更好地针对人类基因组中基因的未来研究的指导方针。在这里,我们利用经济学和信息论的数据分析工具,对人类蛋白质组进行了全面的纵向分析。具体而言,我们将人类蛋白质组视为知识经济中的蛋白质群体:我们将蛋白质功能的量化知识视为财富的类似物,并以研究社会中财富分布的相同方式检查蛋白质组中蛋白质群体中的信息分布。我们的结果表明,在过去十年中,关于人类蛋白质的信息分布高度倾斜,其中给予蛋白质的注释中的不平等仍然很高。此外,我们研究了数据库中捕获的关于蛋白质功能的知识与科学文献中提到的对蛋白质的兴趣之间的相关性。我们展示了知识和兴趣之间的巨大差距,并剖析了导致这种差距的因素。总之,我们的研究表明,研究工作应该转向研究较少的蛋白质,以减轻数据库和文献中人类蛋白质之间的差异。
{"title":"A longitudinal analysis of function annotations of the human proteome reveals consistently high biases.","authors":"An Phan, Parnal Joshi, Claus Kadelka, Iddo Friedberg","doi":"10.1093/database/baaf036","DOIUrl":"10.1093/database/baaf036","url":null,"abstract":"<p><p>The resources required to study gene function are limited, especially when considering the number of genes in the human genome and the complexity of their function. Therefore, genes are prioritized for experimental studies based on many different considerations, including, but not limited to, perceived biomedical importance, such as disease-associated genes, or the understanding of biological processes, such as cell signalling pathways. At the same time, most genes are not studied or are under-characterized, which hampers our understanding of their function and potential effects on human health and wellness. Understanding function annotation disparity is a necessary first step toward understanding how much functional knowledge is gained from the human genome, and toward guidelines for better targeting future studies of the genes in the human genome effectively. Here, we present a comprehensive longitudinal analysis of the human proteome utilizing data analysis tools from economics and information theory. Specifically, we view the human proteome as a population of proteins within a knowledge economy: we treat the quantified knowledge of the protein's function as the analogue of wealth and examine the distribution of information in a population of proteins in the proteome in the same manner distribution of wealth is studied in societies. Our results show a highly skewed distribution of information about human proteins over the last decade, in which the inequality in the annotations given to the proteins remains high. Additionally, we examine the correlation between the knowledge about protein function as captured in databases and the interest in proteins as reflected by mentions in the scientific literature. We show a large gap between knowledge and interest and dissect the factors leading to this gap. In conclusion, our study shows that research efforts should be redirected to less studied proteins to mitigate the disparity among human proteins both in databases and literature.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12060720/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143984205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CancerPPD2: an updated repository of anticancer peptides and proteins. CancerPPD2:抗癌肽和蛋白质的更新库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-07 DOI: 10.1093/database/baaf030
Milind Chauhan, Amisha Gupta, Ritu Tomer, Gajendra P S Raghava

CancerPPD2 (http://webs.iiitd.edu.in/raghava/cancerppd2/) is an updated version of CancerPPD, developed to maintain comprehensive information about anticancer peptides and proteins. It contains 6521 entries, each entry provides detailed information about an anticancer peptide/protein that include origin of the peptide, cancer cell line, type of cancer, peptide sequence, and structure. These anticancer peptides have been tested against 392 types of cancer cell lines and 28 types of cancer-associated tissues. In addition to natural anticancer peptides, CancerPPD2 contains 781 entries for chemically modified and 3018 entries for N-/C- terminus modified anticancer peptides. Few entries are also linked with 47 clinical studies and have provided the cross reference to Uniprot, DrugBank, and ThPDB2. The possible entries also linked with clinical trials. On average, CancerPPD2 contains around 85% more information than its previous version, CancerPPD. The structures of these anticancer peptides and proteins were either obtained from the Protein Data Bank (PDB) or predicted using PEPstrMOD, I-TASSER, and AlphaFold. A wide range of tools have been integrated into CancerPPD2 for data retrieval and similarity searches. Additionally, we integrated a REST API into this repository to facilitate automatic data retrieval via program. Database URL: https://webs.iiitd.edu.in/raghava/cancerppd2/api/rest.html.

CancerPPD2 (http://webs.iiitd.edu.in/raghava/cancerppd2/)是CancerPPD的更新版本,旨在维护有关抗癌肽和蛋白质的全面信息。它包含6521个条目,每个条目提供有关抗癌肽/蛋白质的详细信息,包括肽的来源、癌细胞系、癌症类型、肽序列和结构。这些抗癌肽已经对392种癌细胞系和28种癌症相关组织进行了测试。除天然抗癌肽外,CancerPPD2含有781个化学修饰的片段和3018个N /C末端修饰的抗癌肽片段。少数条目还与47项临床研究相关联,并为Uniprot、DrugBank和ThPDB2提供了交叉参考。可能的条目还与临床试验有关。平均而言,CancerPPD2比之前的版本CancerPPD多包含约85%的信息。这些抗癌肽和蛋白质的结构要么从蛋白质数据库(PDB)中获得,要么使用PEPstrMOD、I-TASSER和AlphaFold预测。CancerPPD2中集成了多种工具,用于数据检索和相似性搜索。此外,我们将一个REST API集成到这个存储库中,以方便通过程序自动检索数据。数据库地址:https://webs.iiitd.edu.in/raghava/cancerppd2/api/rest.html。
{"title":"CancerPPD2: an updated repository of anticancer peptides and proteins.","authors":"Milind Chauhan, Amisha Gupta, Ritu Tomer, Gajendra P S Raghava","doi":"10.1093/database/baaf030","DOIUrl":"10.1093/database/baaf030","url":null,"abstract":"<p><p>CancerPPD2 (http://webs.iiitd.edu.in/raghava/cancerppd2/) is an updated version of CancerPPD, developed to maintain comprehensive information about anticancer peptides and proteins. It contains 6521 entries, each entry provides detailed information about an anticancer peptide/protein that include origin of the peptide, cancer cell line, type of cancer, peptide sequence, and structure. These anticancer peptides have been tested against 392 types of cancer cell lines and 28 types of cancer-associated tissues. In addition to natural anticancer peptides, CancerPPD2 contains 781 entries for chemically modified and 3018 entries for N-/C- terminus modified anticancer peptides. Few entries are also linked with 47 clinical studies and have provided the cross reference to Uniprot, DrugBank, and ThPDB2. The possible entries also linked with clinical trials. On average, CancerPPD2 contains around 85% more information than its previous version, CancerPPD. The structures of these anticancer peptides and proteins were either obtained from the Protein Data Bank (PDB) or predicted using PEPstrMOD, I-TASSER, and AlphaFold. A wide range of tools have been integrated into CancerPPD2 for data retrieval and similarity searches. Additionally, we integrated a REST API into this repository to facilitate automatic data retrieval via program. Database URL: https://webs.iiitd.edu.in/raghava/cancerppd2/api/rest.html.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12060709/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143981885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STCDB4ND: a signal transduction classification database for neurological diseases. STCDB4ND:神经系统疾病信号转导分类数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-02 DOI: 10.1093/database/baaf032
Boyan Gong, Sida Li, Yifan Chen, Liya Liu, Ralf Hofestädt, Ming Chen

Neurological disorders pose significant global health challenges due to their complex etiology and insufficient understanding of underlying mechanisms. Signal transduction pathways are critical in the pathophysiology of these diseases and have been extensively studied to develop therapeutic interventions. However, existing databases for biological signal pathways often overlook the dynamic interactions between entities within these pathways and lack standardized representations of the signaling processes. To address these limitations, we present STCDB4ND, a specialized database focused on signal transduction pathways associated with neurological diseases. Utilizing the ST classification system, STCDB4ND provides a unified framework for pathway representation, emphasizing interactions and pathway characteristics. The database features advanced visualization tools, network analysis capabilities, and a key factor identification module, enabling researchers to comprehensively study these complex networks. Our analysis of neurological disease-related pathways using STCDB4ND revealed key signaling factors and supported existing findings on pathogenic mechanisms STCDB4ND serves as a valuable resource for advancing the understanding of neurological disease pathways and promoting novel therapeutic approaches. And we believe that STCDB will provide greater convenience for researchers in various fields as we expand the STCDB system's database in the future. Database URL: https://bis.zju.edu.cn/STCDB.

神经系统疾病由于其复杂的病因和对其潜在机制的了解不足,构成了重大的全球健康挑战。信号转导通路在这些疾病的病理生理学中至关重要,并已被广泛研究以开发治疗干预措施。然而,现有的生物信号通路数据库往往忽略了这些通路中实体之间的动态相互作用,并且缺乏信号过程的标准化表示。为了解决这些限制,我们提出了STCDB4ND,一个专门的数据库,专注于与神经系统疾病相关的信号转导途径。利用ST分类系统,STCDB4ND提供了一个统一的通路表示框架,强调相互作用和通路特征。该数据库具有先进的可视化工具、网络分析能力和关键因素识别模块,使研究人员能够全面研究这些复杂的网络。我们使用STCDB4ND对神经系统疾病相关通路进行分析,揭示了关键的信号因子,并支持了现有的致病机制发现,STCDB4ND为促进对神经系统疾病通路的理解和促进新的治疗方法提供了宝贵的资源。我们相信,随着未来STCDB系统数据库的扩展,STCDB将为各个领域的研究人员提供更大的便利。数据库地址:https://bis.zju.edu.cn/STCDB。
{"title":"STCDB4ND: a signal transduction classification database for neurological diseases.","authors":"Boyan Gong, Sida Li, Yifan Chen, Liya Liu, Ralf Hofestädt, Ming Chen","doi":"10.1093/database/baaf032","DOIUrl":"10.1093/database/baaf032","url":null,"abstract":"<p><p>Neurological disorders pose significant global health challenges due to their complex etiology and insufficient understanding of underlying mechanisms. Signal transduction pathways are critical in the pathophysiology of these diseases and have been extensively studied to develop therapeutic interventions. However, existing databases for biological signal pathways often overlook the dynamic interactions between entities within these pathways and lack standardized representations of the signaling processes. To address these limitations, we present STCDB4ND, a specialized database focused on signal transduction pathways associated with neurological diseases. Utilizing the ST classification system, STCDB4ND provides a unified framework for pathway representation, emphasizing interactions and pathway characteristics. The database features advanced visualization tools, network analysis capabilities, and a key factor identification module, enabling researchers to comprehensively study these complex networks. Our analysis of neurological disease-related pathways using STCDB4ND revealed key signaling factors and supported existing findings on pathogenic mechanisms STCDB4ND serves as a valuable resource for advancing the understanding of neurological disease pathways and promoting novel therapeutic approaches. And we believe that STCDB will provide greater convenience for researchers in various fields as we expand the STCDB system's database in the future. Database URL: https://bis.zju.edu.cn/STCDB.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12047452/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STCDB4ND: a signal transduction classification database for neurological diseases. STCDB4ND:神经系统疾病信号转导分类数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-02 DOI: 10.1093/database/baaf032
Boyan Gong, Sida Li, Yifan Chen, Liya Liu, Ralf Hofestädt, Ming Chen

Neurological disorders pose significant global health challenges due to their complex etiology and insufficient understanding of underlying mechanisms. Signal transduction pathways are critical in the pathophysiology of these diseases and have been extensively studied to develop therapeutic interventions. However, existing databases for biological signal pathways often overlook the dynamic interactions between entities within these pathways and lack standardized representations of the signaling processes. To address these limitations, we present STCDB4ND, a specialized database focused on signal transduction pathways associated with neurological diseases. Utilizing the ST classification system, STCDB4ND provides a unified framework for pathway representation, emphasizing interactions and pathway characteristics. The database features advanced visualization tools, network analysis capabilities, and a key factor identification module, enabling researchers to comprehensively study these complex networks. Our analysis of neurological disease-related pathways using STCDB4ND revealed key signaling factors and supported existing findings on pathogenic mechanisms STCDB4ND serves as a valuable resource for advancing the understanding of neurological disease pathways and promoting novel therapeutic approaches. And we believe that STCDB will provide greater convenience for researchers in various fields as we expand the STCDB system's database in the future. Database URL: https://bis.zju.edu.cn/STCDB.

神经系统疾病由于其复杂的病因和对其潜在机制的了解不足,构成了重大的全球健康挑战。信号转导通路在这些疾病的病理生理学中至关重要,并已被广泛研究以开发治疗干预措施。然而,现有的生物信号通路数据库往往忽略了这些通路中实体之间的动态相互作用,并且缺乏信号过程的标准化表示。为了解决这些限制,我们提出了STCDB4ND,一个专门的数据库,专注于与神经系统疾病相关的信号转导途径。利用ST分类系统,STCDB4ND提供了一个统一的通路表示框架,强调相互作用和通路特征。该数据库具有先进的可视化工具、网络分析能力和关键因素识别模块,使研究人员能够全面研究这些复杂的网络。我们使用STCDB4ND对神经系统疾病相关通路进行分析,揭示了关键的信号因子,并支持了现有的致病机制发现,STCDB4ND为促进对神经系统疾病通路的理解和促进新的治疗方法提供了宝贵的资源。我们相信,随着未来STCDB系统数据库的扩展,STCDB将为各个领域的研究人员提供更大的便利。数据库地址:https://bis.zju.edu.cn/STCDB。
{"title":"STCDB4ND: a signal transduction classification database for neurological diseases.","authors":"Boyan Gong, Sida Li, Yifan Chen, Liya Liu, Ralf Hofestädt, Ming Chen","doi":"10.1093/database/baaf032","DOIUrl":"https://doi.org/10.1093/database/baaf032","url":null,"abstract":"<p><p>Neurological disorders pose significant global health challenges due to their complex etiology and insufficient understanding of underlying mechanisms. Signal transduction pathways are critical in the pathophysiology of these diseases and have been extensively studied to develop therapeutic interventions. However, existing databases for biological signal pathways often overlook the dynamic interactions between entities within these pathways and lack standardized representations of the signaling processes. To address these limitations, we present STCDB4ND, a specialized database focused on signal transduction pathways associated with neurological diseases. Utilizing the ST classification system, STCDB4ND provides a unified framework for pathway representation, emphasizing interactions and pathway characteristics. The database features advanced visualization tools, network analysis capabilities, and a key factor identification module, enabling researchers to comprehensively study these complex networks. Our analysis of neurological disease-related pathways using STCDB4ND revealed key signaling factors and supported existing findings on pathogenic mechanisms STCDB4ND serves as a valuable resource for advancing the understanding of neurological disease pathways and promoting novel therapeutic approaches. And we believe that STCDB will provide greater convenience for researchers in various fields as we expand the STCDB system's database in the future. Database URL: https://bis.zju.edu.cn/STCDB.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping assays to the key characteristics of carcinogens to support decision-making. 对致癌物的关键特征进行制图分析,以支持决策。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-22 DOI: 10.1093/database/baaf026
Gabrielle Rigutto, Cliona M McHale, Ettayapuram Ramaprasad Azhagiya Singam, Iemaan Rana, Luoping Zhang, Martyn T Smith

The key characteristics (KCs) of carcinogens are the properties common to known human carcinogens that can be used to search for, organize, and evaluate mechanistic data in support of hazard identification. A limiting factor in this approach is that relevant in vitro and in vivo assays, as well as corresponding biomarkers and endpoints, have been only partially documented for each of the 10 KCs (Smith MT, Guyton KZ, Kleinstreuer N et al. The key characteristics of carcinogens: relationship to the hallmarks of cancer, relevant biomarkers, and assays to measure them. Cancer Epidemiol Biomarkers Prev 2020;29:1887-903. https://doi.org/10.1158/1055-9965.EPI-19-1346). To address this limitation, a comprehensive database is described that catalogues these previously described methods and endpoints/biomarkers pertinent to the 10 KCs of carcinogens as well as those referenced as supporting evidence for each KC in the International Agency of Research on Cancer Monograph Volumes 112-131. Our comprehensive mapping of KCs to assays and endpoints can be used to facilitate mechanistic data searches, presents a useful tool for searching for assays and endpoints relevant to the 10 KCs, and can be used to create a roadmap for utilizing data to evaluate the strength of the evidence for each KC. The KC-Assay database is available to the public on the web at https://kcad.cchem.berkeley.edu and acts as a 'living document', with the ability to be updated and refined. Database URL: https://kcad.cchem.berkeley.edu.

致癌物的关键特征(KCs)是已知人类致癌物的共同特性,可用于搜索、组织和评估支持危害识别的机制数据。这种方法的一个限制因素是,相关的体外和体内试验,以及相应的生物标志物和终点,仅部分记录了10种KCs中的每一种(Smith MT, Guyton KZ, Kleinstreuer N等)。致癌物的主要特征:与癌症特征的关系,相关的生物标志物,以及测量它们的方法。癌症流行病学杂志,2020;29:1887-903。https://doi.org/10.1158/1055 - 9965. - epi - 19 - 1346)。为了解决这一限制,本文描述了一个综合数据库,该数据库将这些先前描述的与10种致癌物质相关的方法和终点/生物标志物以及国际癌症研究机构专著第112-131卷中作为每种致癌物质的支持证据的方法和终点/生物标志物进行了分类。我们全面的映射;化验和端点可以用来促进机械的数据搜索,提出了一种有用的工具,寻找相关化验和端点10;,,可以用来创建一个路线图利用数据来评估每个KC的证据的力量。KC-Assay数据库向公众提供在网络上https://kcad.cchem.berkeley.edu和充当“活文件”,能够被更新和改进。数据库地址:https://kcad.cchem.berkeley.edu。
{"title":"Mapping assays to the key characteristics of carcinogens to support decision-making.","authors":"Gabrielle Rigutto, Cliona M McHale, Ettayapuram Ramaprasad Azhagiya Singam, Iemaan Rana, Luoping Zhang, Martyn T Smith","doi":"10.1093/database/baaf026","DOIUrl":"10.1093/database/baaf026","url":null,"abstract":"<p><p>The key characteristics (KCs) of carcinogens are the properties common to known human carcinogens that can be used to search for, organize, and evaluate mechanistic data in support of hazard identification. A limiting factor in this approach is that relevant in vitro and in vivo assays, as well as corresponding biomarkers and endpoints, have been only partially documented for each of the 10 KCs (Smith MT, Guyton KZ, Kleinstreuer N et al. The key characteristics of carcinogens: relationship to the hallmarks of cancer, relevant biomarkers, and assays to measure them. Cancer Epidemiol Biomarkers Prev 2020;29:1887-903. https://doi.org/10.1158/1055-9965.EPI-19-1346). To address this limitation, a comprehensive database is described that catalogues these previously described methods and endpoints/biomarkers pertinent to the 10 KCs of carcinogens as well as those referenced as supporting evidence for each KC in the International Agency of Research on Cancer Monograph Volumes 112-131. Our comprehensive mapping of KCs to assays and endpoints can be used to facilitate mechanistic data searches, presents a useful tool for searching for assays and endpoints relevant to the 10 KCs, and can be used to create a roadmap for utilizing data to evaluate the strength of the evidence for each KC. The KC-Assay database is available to the public on the web at https://kcad.cchem.berkeley.edu and acts as a 'living document', with the ability to be updated and refined. Database URL: https://kcad.cchem.berkeley.edu.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12013474/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping assays to the key characteristics of carcinogens to support decision-making. 对致癌物的关键特征进行制图分析,以支持决策。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-22 DOI: 10.1093/database/baaf026
Gabrielle Rigutto, Cliona M McHale, Ettayapuram Ramaprasad Azhagiya Singam, Iemaan Rana, Luoping Zhang, Martyn T Smith

The key characteristics (KCs) of carcinogens are the properties common to known human carcinogens that can be used to search for, organize, and evaluate mechanistic data in support of hazard identification. A limiting factor in this approach is that relevant in vitro and in vivo assays, as well as corresponding biomarkers and endpoints, have been only partially documented for each of the 10 KCs (Smith MT, Guyton KZ, Kleinstreuer N et al. The key characteristics of carcinogens: relationship to the hallmarks of cancer, relevant biomarkers, and assays to measure them. Cancer Epidemiol Biomarkers Prev 2020;29:1887-903. https://doi.org/10.1158/1055-9965.EPI-19-1346). To address this limitation, a comprehensive database is described that catalogues these previously described methods and endpoints/biomarkers pertinent to the 10 KCs of carcinogens as well as those referenced as supporting evidence for each KC in the International Agency of Research on Cancer Monograph Volumes 112-131. Our comprehensive mapping of KCs to assays and endpoints can be used to facilitate mechanistic data searches, presents a useful tool for searching for assays and endpoints relevant to the 10 KCs, and can be used to create a roadmap for utilizing data to evaluate the strength of the evidence for each KC. The KC-Assay database is available to the public on the web at https://kcad.cchem.berkeley.edu and acts as a 'living document', with the ability to be updated and refined. Database URL: https://kcad.cchem.berkeley.edu.

致癌物的关键特征(KCs)是已知人类致癌物的共同特性,可用于搜索、组织和评估支持危害识别的机制数据。这种方法的一个限制因素是,相关的体外和体内试验,以及相应的生物标志物和终点,仅部分记录了10种KCs中的每一种(Smith MT, Guyton KZ, Kleinstreuer N等)。致癌物的主要特征:与癌症特征的关系,相关的生物标志物,以及测量它们的方法。癌症流行病学杂志,2020;29:1887-903。https://doi.org/10.1158/1055 - 9965. - epi - 19 - 1346)。为了解决这一限制,本文描述了一个综合数据库,该数据库将这些先前描述的与10种致癌物质相关的方法和终点/生物标志物以及国际癌症研究机构专著第112-131卷中作为每种致癌物质的支持证据的方法和终点/生物标志物进行了分类。我们全面的映射;化验和端点可以用来促进机械的数据搜索,提出了一种有用的工具,寻找相关化验和端点10;,,可以用来创建一个路线图利用数据来评估每个KC的证据的力量。KC-Assay数据库向公众提供在网络上https://kcad.cchem.berkeley.edu和充当“活文件”,能够被更新和改进。数据库地址:https://kcad.cchem.berkeley.edu。
{"title":"Mapping assays to the key characteristics of carcinogens to support decision-making.","authors":"Gabrielle Rigutto, Cliona M McHale, Ettayapuram Ramaprasad Azhagiya Singam, Iemaan Rana, Luoping Zhang, Martyn T Smith","doi":"10.1093/database/baaf026","DOIUrl":"https://doi.org/10.1093/database/baaf026","url":null,"abstract":"<p><p>The key characteristics (KCs) of carcinogens are the properties common to known human carcinogens that can be used to search for, organize, and evaluate mechanistic data in support of hazard identification. A limiting factor in this approach is that relevant in vitro and in vivo assays, as well as corresponding biomarkers and endpoints, have been only partially documented for each of the 10 KCs (Smith MT, Guyton KZ, Kleinstreuer N et al. The key characteristics of carcinogens: relationship to the hallmarks of cancer, relevant biomarkers, and assays to measure them. Cancer Epidemiol Biomarkers Prev 2020;29:1887-903. https://doi.org/10.1158/1055-9965.EPI-19-1346). To address this limitation, a comprehensive database is described that catalogues these previously described methods and endpoints/biomarkers pertinent to the 10 KCs of carcinogens as well as those referenced as supporting evidence for each KC in the International Agency of Research on Cancer Monograph Volumes 112-131. Our comprehensive mapping of KCs to assays and endpoints can be used to facilitate mechanistic data searches, presents a useful tool for searching for assays and endpoints relevant to the 10 KCs, and can be used to create a roadmap for utilizing data to evaluate the strength of the evidence for each KC. The KC-Assay database is available to the public on the web at https://kcad.cchem.berkeley.edu and acts as a 'living document', with the ability to be updated and refined. Database URL: https://kcad.cchem.berkeley.edu.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CPDMS: a database system for crop physiological disorder management. CPDMS:作物生理失调管理数据库系统。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-22 DOI: 10.1093/database/baaf031
Jae-Hyeon Oh, Hwang-Weon Jeong, Il Pyung Ahn, Seon-Hwa Bae, Sung Mi Kim, Eunhee Kim, Su Jung Ra, Jinjeong Lee, Hye Yeon Choi, Young-Joo Seol

As the importance of precision agriculture grows, scalable and efficient methods for real-time data collection and analysis have become essential. In this study, we developed a system to collect real-time crop images, focusing on physiological disorders in tomatoes. This system systematically collects crop images and related data, with the potential to evolve into a valuable tool for researchers and agricultural practitioners. A total of 58 479 images were produced under stress conditions, including bacterial wilt (BW), Tomato Yellow Leaf Curl Virus (TYLCV), Tomato Spotted Wilt Virus (TSWV), drought, and salinity, across seven tomato varieties. The images include front views at 0 degrees, 120 degrees, 240 degrees, and top views and petiole images. Of these, 43 894 images were suitable for labeling. Based on this, 24 000 images were used for AI model training, and 13 037 images for model testing. By training a deep learning model, we achieved a mean Average Precision (mAP) of 0.46 and a recall rate of 0.60. Additionally, we discussed data augmentation and hyperparameter tuning strategies to improve AI model performance and explored the potential for generalizing the system across various agricultural environments. The database constructed in this study will serve as a crucial resource for the future development of agricultural AI. Database URL: https://crops.phyzen.com/.

随着精准农业重要性的增长,实时数据收集和分析的可扩展和高效方法变得至关重要。在这项研究中,我们开发了一个系统来收集实时作物图像,专注于番茄的生理失调。该系统系统地收集作物图像和相关数据,有可能发展成为研究人员和农业从业者的宝贵工具。在包括细菌性枯萎病(BW)、番茄黄卷叶病毒(TYLCV)、番茄斑点枯萎病(TSWV)、干旱和盐度在内的胁迫条件下,共生成了58 479张图像,涉及7个番茄品种。这些图像包括0度、120度、240度的前视图,以及俯视图和叶柄图像。其中,43 894幅图像适合标记。在此基础上,人工智能模型训练使用了2.4万张图像,模型测试使用了13 037张图像。通过训练深度学习模型,我们实现了0.46的平均精度(mAP)和0.60的召回率。此外,我们讨论了数据增强和超参数调整策略,以提高人工智能模型的性能,并探索了在各种农业环境中推广系统的潜力。本研究构建的数据库将成为未来农业人工智能发展的重要资源。数据库地址:https://crops.phyzen.com/。
{"title":"CPDMS: a database system for crop physiological disorder management.","authors":"Jae-Hyeon Oh, Hwang-Weon Jeong, Il Pyung Ahn, Seon-Hwa Bae, Sung Mi Kim, Eunhee Kim, Su Jung Ra, Jinjeong Lee, Hye Yeon Choi, Young-Joo Seol","doi":"10.1093/database/baaf031","DOIUrl":"https://doi.org/10.1093/database/baaf031","url":null,"abstract":"<p><p>As the importance of precision agriculture grows, scalable and efficient methods for real-time data collection and analysis have become essential. In this study, we developed a system to collect real-time crop images, focusing on physiological disorders in tomatoes. This system systematically collects crop images and related data, with the potential to evolve into a valuable tool for researchers and agricultural practitioners. A total of 58 479 images were produced under stress conditions, including bacterial wilt (BW), Tomato Yellow Leaf Curl Virus (TYLCV), Tomato Spotted Wilt Virus (TSWV), drought, and salinity, across seven tomato varieties. The images include front views at 0 degrees, 120 degrees, 240 degrees, and top views and petiole images. Of these, 43 894 images were suitable for labeling. Based on this, 24 000 images were used for AI model training, and 13 037 images for model testing. By training a deep learning model, we achieved a mean Average Precision (mAP) of 0.46 and a recall rate of 0.60. Additionally, we discussed data augmentation and hyperparameter tuning strategies to improve AI model performance and explored the potential for generalizing the system across various agricultural environments. The database constructed in this study will serve as a crucial resource for the future development of agricultural AI. Database URL: https://crops.phyzen.com/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CPDMS: a database system for crop physiological disorder management. CPDMS:作物生理失调管理数据库系统。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-22 DOI: 10.1093/database/baaf031
Jae-Hyeon Oh, Hwang-Weon Jeong, Il Pyung Ahn, Seon-Hwa Bae, Sung Mi Kim, Eunhee Kim, Su Jung Ra, Jinjeong Lee, Hye Yeon Choi, Young-Joo Seol

As the importance of precision agriculture grows, scalable and efficient methods for real-time data collection and analysis have become essential. In this study, we developed a system to collect real-time crop images, focusing on physiological disorders in tomatoes. This system systematically collects crop images and related data, with the potential to evolve into a valuable tool for researchers and agricultural practitioners. A total of 58 479 images were produced under stress conditions, including bacterial wilt (BW), Tomato Yellow Leaf Curl Virus (TYLCV), Tomato Spotted Wilt Virus (TSWV), drought, and salinity, across seven tomato varieties. The images include front views at 0 degrees, 120 degrees, 240 degrees, and top views and petiole images. Of these, 43 894 images were suitable for labeling. Based on this, 24 000 images were used for AI model training, and 13 037 images for model testing. By training a deep learning model, we achieved a mean Average Precision (mAP) of 0.46 and a recall rate of 0.60. Additionally, we discussed data augmentation and hyperparameter tuning strategies to improve AI model performance and explored the potential for generalizing the system across various agricultural environments. The database constructed in this study will serve as a crucial resource for the future development of agricultural AI. Database URL: https://crops.phyzen.com/.

随着精准农业重要性的增长,实时数据收集和分析的可扩展和高效方法变得至关重要。在这项研究中,我们开发了一个系统来收集实时作物图像,专注于番茄的生理失调。该系统系统地收集作物图像和相关数据,有可能发展成为研究人员和农业从业者的宝贵工具。在包括细菌性枯萎病(BW)、番茄黄卷叶病毒(TYLCV)、番茄斑点枯萎病(TSWV)、干旱和盐度在内的胁迫条件下,共生成了58 479张图像,涉及7个番茄品种。这些图像包括0度、120度、240度的前视图,以及俯视图和叶柄图像。其中,43 894幅图像适合标记。在此基础上,人工智能模型训练使用了2.4万张图像,模型测试使用了13 037张图像。通过训练深度学习模型,我们实现了0.46的平均精度(mAP)和0.60的召回率。此外,我们讨论了数据增强和超参数调整策略,以提高人工智能模型的性能,并探索了在各种农业环境中推广系统的潜力。本研究构建的数据库将成为未来农业人工智能发展的重要资源。数据库地址:https://crops.phyzen.com/。
{"title":"CPDMS: a database system for crop physiological disorder management.","authors":"Jae-Hyeon Oh, Hwang-Weon Jeong, Il Pyung Ahn, Seon-Hwa Bae, Sung Mi Kim, Eunhee Kim, Su Jung Ra, Jinjeong Lee, Hye Yeon Choi, Young-Joo Seol","doi":"10.1093/database/baaf031","DOIUrl":"10.1093/database/baaf031","url":null,"abstract":"<p><p>As the importance of precision agriculture grows, scalable and efficient methods for real-time data collection and analysis have become essential. In this study, we developed a system to collect real-time crop images, focusing on physiological disorders in tomatoes. This system systematically collects crop images and related data, with the potential to evolve into a valuable tool for researchers and agricultural practitioners. A total of 58 479 images were produced under stress conditions, including bacterial wilt (BW), Tomato Yellow Leaf Curl Virus (TYLCV), Tomato Spotted Wilt Virus (TSWV), drought, and salinity, across seven tomato varieties. The images include front views at 0 degrees, 120 degrees, 240 degrees, and top views and petiole images. Of these, 43 894 images were suitable for labeling. Based on this, 24 000 images were used for AI model training, and 13 037 images for model testing. By training a deep learning model, we achieved a mean Average Precision (mAP) of 0.46 and a recall rate of 0.60. Additionally, we discussed data augmentation and hyperparameter tuning strategies to improve AI model performance and explored the potential for generalizing the system across various agricultural environments. The database constructed in this study will serve as a crucial resource for the future development of agricultural AI. Database URL: https://crops.phyzen.com/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12013473/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database. MIPD:分子,图像和临床表型集成数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf029
Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng

Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.

由于肿瘤的异质性,一部分患者不能从目前的治疗策略中获益。然而,对影像学特征、遗传分子和临床表型的综合分析可以表征肿瘤的异质性,从而开发出更加个性化的治疗方法。尽管具有潜力,但跨模式数据库仍未得到充分开发。为了弥补这一空白,我们建立了一个包含9965个基因、5449个蛋白质、1121个代谢物、283个通路、854个成像特征和73个结直肠癌患者临床因素的综合数据库。该数据库识别与临床表型相关的显著不同的分子和成像特征,并提供基于这些特征的生存分析。此外,它还提供遗传分子注释,肿瘤和正常组织之间的比较表达水平,与遗传分子相关的成像特征,以及用于预测基因表达水平的基于成像的模型。此外,该数据库突出了遗传分子、临床因素和影像学特征之间的相关性。总之,我们提出了MIPD(分子、成像和临床表型相关数据库),这是一个用户友好的、互动的、专门的平台,可在http://corgenerf.com上访问。MIPD通过提供查询、浏览、搜索、可视化和下载功能,促进了跨模式数据的可解释性,从而为推进结直肠癌的精准医学提供了宝贵的资源。数据库地址:http://corgenerf。
{"title":"MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database.","authors":"Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng","doi":"10.1093/database/baaf029","DOIUrl":"https://doi.org/10.1093/database/baaf029","url":null,"abstract":"<p><p>Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Localizatome: a database for stress-dependent subcellular localization changes in proteins. Localizatome:一个蛋白质中应力依赖性亚细胞定位变化的数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf028
Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara

Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.

了解蛋白质亚细胞定位及其动态变化对于阐明细胞功能和疾病机制至关重要,特别是在应激条件下,蛋白质定位变化可以调节细胞反应。目前可用的数据库提供了在稳态条件下蛋白质定位的见解;然而,与应力相关的动态局部化变化仍然知之甚少。在这里,我们介绍了Localizatome,一个捕获活细胞中应力诱导的蛋白质定位动态的综合数据库。利用原始的高通量显微镜系统和机器学习算法,我们分析了氧化应激前后HeLa细胞中10 287种荧光蛋白融合的人蛋白的定位模式。我们的分析显示,1910蛋白表现出氧化应激依赖的定位变化,特别是形成不同的焦点。其中包括胁迫颗粒组装因子和自噬相关蛋白,以及各种信号通路的组分。随后的表征确定了一些特定的氨基酸基序和与应力诱导的蛋白质再分配相关的内在紊乱区域。Localizatome通过基于网络的界面提供对这些数据的开放访问,支持对细胞应激反应和疾病机制的广泛研究。数据库URL https://localizatome.embrys.jp/。
{"title":"Localizatome: a database for stress-dependent subcellular localization changes in proteins.","authors":"Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara","doi":"10.1093/database/baaf028","DOIUrl":"https://doi.org/10.1093/database/baaf028","url":null,"abstract":"<p><p>Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1