Associating gene functional groups with multiple clinical conditions using Jaccard similarity
N. A. Yousri, D. Elkaffash
2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW), pp. 241-246, 2011-11-12
DOI: 10.1109/BIBMW.2011.6112381
Citations: 2
Abstract
Gene expression arrays provide a rich source of information on the behaviour of thousands of genes across several clinical conditions in a particular tumor or cancer. When integrated with a functional classification of genes, such expression sets enrich the information provided by both sources. Stemming from the need to score relations between functional groups of genes and the multiple clinical types associated with a tumor, this study proposes the use of Jaccard similarity. For any set of genes, this measure can quantify the association between two sets of gene classes/groups obtained from two different sources of information. In the proposed study, we particularly consider subsets of overexpressed genes in cancer expression sets. This enables the identification of unique genes and the association of their most correlated sample clinical types with their functional groups. Experiments on a breast cancer expression set illustrate the use of the proposed measure.
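As a rough illustration of the measure named above, the Jaccard similarity of two sets A and B is |A ∩ B| / |A ∪ B|. The sketch below scores every pairing of a functional group with a clinical-type group over the same pool of genes. The gene identifiers, group names, and the way genes are assigned to clinical types are hypothetical placeholders, not data or procedures taken from the study, and returning 0.0 for two empty sets is a convention chosen here.

```python
from itertools import product


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity |A ∩ B| / |A ∪ B|; returns 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical groupings of the same overexpressed genes from two sources.
# Gene identifiers and group names are placeholders, not data from the study.
functional_groups = {
    "cell_cycle": {"g1", "g2", "g3", "g4"},
    "immune_response": {"g5", "g6", "g7"},
}
clinical_groups = {
    "type_A": {"g1", "g2", "g3", "g5"},
    "type_B": {"g6", "g7", "g8"},
}

# Score every (functional group, clinical group) pair.
scores = {
    (f, c): jaccard(functional_groups[f], clinical_groups[c])
    for f, c in product(functional_groups, clinical_groups)
}

# List pairs from most to least associated.
for (f, c), s in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{f} vs {c}: {s:.2f}")
```

A score near 1 means the two groups contain nearly the same genes, while 0 means they share none, so the highest-scoring pairs are candidate associations between a functional group and a clinical type.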