{"title":"基因表达研究中的聚类概念方法","authors":"M. Bogatyrev, K. Samodurov","doi":"10.17537/ICMBB18.81","DOIUrl":null,"url":null,"abstract":"The work contains an overview of clustering methods used in the study of gene expression and new results of application of Formal Concepts Analysis to clustering data extracted from the public functional genomics data repository GEO. Methods of Formal Concept Analysis allow clustering of multidimensional data under the single condition of partial ordering of such data sets. As a result, clusters are separate sublattices in the concept lattice, where each sublattice contains hierarchically related formal concepts. This solution of the clustering problem allows deep investigating both mutual influences of genes and their influence on other data obtained in experiments on gene expression. The paper describes a new information technology developed for the implementation of the proposed approach. The technology uses modern solutions in the field of Big Data processing, it has functions for communication with external data sources and other information systems. Preliminary results of the application of this technology to three-dimensional gene expression data obtained from the GEO system are presented.","PeriodicalId":168323,"journal":{"name":"Proceedings of the International Conference \"Mathematical Biology and Bioinformatics\"","volume":"293 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Conceptual Approach to Clustering in the Study of Gene Expression\",\"authors\":\"M. Bogatyrev, K. Samodurov\",\"doi\":\"10.17537/ICMBB18.81\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The work contains an overview of clustering methods used in the study of gene expression and new results of application of Formal Concepts Analysis to clustering data extracted from the public functional genomics data repository GEO. Methods of Formal Concept Analysis allow clustering of multidimensional data under the single condition of partial ordering of such data sets. As a result, clusters are separate sublattices in the concept lattice, where each sublattice contains hierarchically related formal concepts. This solution of the clustering problem allows deep investigating both mutual influences of genes and their influence on other data obtained in experiments on gene expression. The paper describes a new information technology developed for the implementation of the proposed approach. The technology uses modern solutions in the field of Big Data processing, it has functions for communication with external data sources and other information systems. Preliminary results of the application of this technology to three-dimensional gene expression data obtained from the GEO system are presented.\",\"PeriodicalId\":168323,\"journal\":{\"name\":\"Proceedings of the International Conference \\\"Mathematical Biology and Bioinformatics\\\"\",\"volume\":\"293 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Conference \\\"Mathematical Biology and Bioinformatics\\\"\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17537/ICMBB18.81\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference \"Mathematical Biology and Bioinformatics\"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17537/ICMBB18.81","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Conceptual Approach to Clustering in the Study of Gene Expression
The work contains an overview of clustering methods used in the study of gene expression and new results of application of Formal Concepts Analysis to clustering data extracted from the public functional genomics data repository GEO. Methods of Formal Concept Analysis allow clustering of multidimensional data under the single condition of partial ordering of such data sets. As a result, clusters are separate sublattices in the concept lattice, where each sublattice contains hierarchically related formal concepts. This solution of the clustering problem allows deep investigating both mutual influences of genes and their influence on other data obtained in experiments on gene expression. The paper describes a new information technology developed for the implementation of the proposed approach. The technology uses modern solutions in the field of Big Data processing, it has functions for communication with external data sources and other information systems. Preliminary results of the application of this technology to three-dimensional gene expression data obtained from the GEO system are presented.