{"title":"感知纤毛,数字化采集纤毛数据用于比较基因组学研究。","authors":"Karen R Christie, Judith A Blake","doi":"10.1186/s13630-018-0057-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Cilia are specialized, hair-like structures that project from the cell bodies of eukaryotic cells. With increased understanding of the distribution and functions of various types of cilia, interest in these organelles is accelerating. To effectively use this great expansion in knowledge, this information must be made digitally accessible and available for large-scale analytical and computational investigation. Capture and integration of knowledge about cilia into existing knowledge bases, thus providing the ability to improve comparative genomic data analysis, is the objective of this work.</p><p><strong>Methods: </strong>We focused on the capture of information about cilia as studied in the laboratory mouse, a primary model of human biology. The workflow developed establishes a standard for capture of comparative functional data relevant to human biology. We established the 310 closest mouse orthologs of the 302 human genes defined in the SYSCILIA Gold Standard set of ciliary genes. For the mouse genes, we identified biomedical literature for curation and used Gene Ontology (GO) curation paradigms to provide functional annotations from these publications.</p><p><strong>Results: </strong>Employing a methodology for comprehensive capture of experimental data about cilia genes in structured, digital form, we established a workflow for curation of experimental literature detailing molecular function and roles of cilia proteins starting with the mouse orthologs of the human SYSCILIA gene set. We worked closely with the GO Consortium ontology development editors and the SYSCILIA Consortium to improve the representation of ciliary biology within the GO. During the time frame of the ontology improvement project, we have fully curated 134 of these 310 mouse genes, resulting in an increase in the number of ciliary and other experimental annotations.</p><p><strong>Conclusions: </strong>We have improved the GO annotations available for mouse genes orthologous to the human genes in the SYSCILIA Consortium's Gold Standard set. In addition, ciliary terminology in the GO itself was improved in collaboration with GO ontology developers and the SYSCILIA Consortium. These improvements to the GO terms for the functions and roles of ciliary proteins, along with the increase in annotations of the corresponding genes, enhance the representation of ciliary processes and localizations and improve access to these data during large-scale bioinformatic analyses.</p>","PeriodicalId":38134,"journal":{"name":"Cilia","volume":"7 ","pages":"3"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907423/pdf/","citationCount":"0","resultStr":"{\"title\":\"Sensing the cilium, digital capture of ciliary data for comparative genomics investigations.\",\"authors\":\"Karen R Christie, Judith A Blake\",\"doi\":\"10.1186/s13630-018-0057-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Cilia are specialized, hair-like structures that project from the cell bodies of eukaryotic cells. With increased understanding of the distribution and functions of various types of cilia, interest in these organelles is accelerating. To effectively use this great expansion in knowledge, this information must be made digitally accessible and available for large-scale analytical and computational investigation. Capture and integration of knowledge about cilia into existing knowledge bases, thus providing the ability to improve comparative genomic data analysis, is the objective of this work.</p><p><strong>Methods: </strong>We focused on the capture of information about cilia as studied in the laboratory mouse, a primary model of human biology. The workflow developed establishes a standard for capture of comparative functional data relevant to human biology. We established the 310 closest mouse orthologs of the 302 human genes defined in the SYSCILIA Gold Standard set of ciliary genes. For the mouse genes, we identified biomedical literature for curation and used Gene Ontology (GO) curation paradigms to provide functional annotations from these publications.</p><p><strong>Results: </strong>Employing a methodology for comprehensive capture of experimental data about cilia genes in structured, digital form, we established a workflow for curation of experimental literature detailing molecular function and roles of cilia proteins starting with the mouse orthologs of the human SYSCILIA gene set. We worked closely with the GO Consortium ontology development editors and the SYSCILIA Consortium to improve the representation of ciliary biology within the GO. During the time frame of the ontology improvement project, we have fully curated 134 of these 310 mouse genes, resulting in an increase in the number of ciliary and other experimental annotations.</p><p><strong>Conclusions: </strong>We have improved the GO annotations available for mouse genes orthologous to the human genes in the SYSCILIA Consortium's Gold Standard set. In addition, ciliary terminology in the GO itself was improved in collaboration with GO ontology developers and the SYSCILIA Consortium. These improvements to the GO terms for the functions and roles of ciliary proteins, along with the increase in annotations of the corresponding genes, enhance the representation of ciliary processes and localizations and improve access to these data during large-scale bioinformatic analyses.</p>\",\"PeriodicalId\":38134,\"journal\":{\"name\":\"Cilia\",\"volume\":\"7 \",\"pages\":\"3\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907423/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cilia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s13630-018-0057-0\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2018/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cilia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s13630-018-0057-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2018/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0
摘要
背景:纤毛是从真核细胞的细胞体中伸出的特化的毛发状结构。随着人们对各种类型纤毛的分布和功能有了更多的了解,对这些细胞器的兴趣正在加速增长。为了有效利用这一巨大的知识增长,必须以数字化方式获取这些信息,并进行大规模的分析和计算研究。捕获纤毛知识并将其整合到现有的知识库中,从而提供改进比较基因组数据分析的能力,是这项工作的目标:我们的研究重点是捕捉实验鼠(人类生物学的主要模型)纤毛的相关信息。所开发的工作流程为获取与人类生物学相关的比较功能数据建立了标准。我们确定了 SYSCILIA 黄金标准纤毛基因集中定义的 302 个人类基因的 310 个最接近的小鼠同源基因。对于小鼠基因,我们确定了需要整理的生物医学文献,并使用基因本体(GO)整理范例提供了这些文献中的功能注释:我们采用了一种以结构化、数字化形式全面采集纤毛基因实验数据的方法,建立了一套实验文献整理工作流程,从人类 SYSCILIA 基因组的小鼠直向同源物入手,详细描述了纤毛蛋白的分子功能和作用。我们与 GO 联盟本体开发编辑和 SYSCILIA 联盟密切合作,以改进纤毛生物学在 GO 中的表述。在本体改进项目期间,我们对这 310 个小鼠基因中的 134 个基因进行了完整的注释,从而增加了睫状肌和其他实验注释的数量:结论:我们改进了 SYSCILIA 联盟黄金标准集中与人类基因同源的小鼠基因的 GO 注释。此外,我们还与 GO 本体开发人员和 SYSCILIA 联盟合作,改进了 GO 本身中的睫状肌术语。这些对睫状肌蛋白功能和作用的 GO 术语的改进,以及相应基因注释的增加,增强了睫状肌过程和定位的代表性,并改善了大规模生物信息学分析中对这些数据的访问。
Sensing the cilium, digital capture of ciliary data for comparative genomics investigations.
Background: Cilia are specialized, hair-like structures that project from the cell bodies of eukaryotic cells. With increased understanding of the distribution and functions of various types of cilia, interest in these organelles is accelerating. To effectively use this great expansion in knowledge, this information must be made digitally accessible and available for large-scale analytical and computational investigation. Capture and integration of knowledge about cilia into existing knowledge bases, thus providing the ability to improve comparative genomic data analysis, is the objective of this work.
Methods: We focused on the capture of information about cilia as studied in the laboratory mouse, a primary model of human biology. The workflow developed establishes a standard for capture of comparative functional data relevant to human biology. We established the 310 closest mouse orthologs of the 302 human genes defined in the SYSCILIA Gold Standard set of ciliary genes. For the mouse genes, we identified biomedical literature for curation and used Gene Ontology (GO) curation paradigms to provide functional annotations from these publications.
Results: Employing a methodology for comprehensive capture of experimental data about cilia genes in structured, digital form, we established a workflow for curation of experimental literature detailing molecular function and roles of cilia proteins starting with the mouse orthologs of the human SYSCILIA gene set. We worked closely with the GO Consortium ontology development editors and the SYSCILIA Consortium to improve the representation of ciliary biology within the GO. During the time frame of the ontology improvement project, we have fully curated 134 of these 310 mouse genes, resulting in an increase in the number of ciliary and other experimental annotations.
Conclusions: We have improved the GO annotations available for mouse genes orthologous to the human genes in the SYSCILIA Consortium's Gold Standard set. In addition, ciliary terminology in the GO itself was improved in collaboration with GO ontology developers and the SYSCILIA Consortium. These improvements to the GO terms for the functions and roles of ciliary proteins, along with the increase in annotations of the corresponding genes, enhance the representation of ciliary processes and localizations and improve access to these data during large-scale bioinformatic analyses.