{"title":"Multi-relational factorizations for cancer subclassification","authors":"Liviu Badea","doi":"10.1109/ICACTE.2010.5579024","DOIUrl":null,"url":null,"abstract":"We introduce a novel multi-relational learning algorithm based on simultaneous nonnegative matrix factorizations, able to distinguish between “target” and “background” relations, deal with incomplete data and so-called “link” functions. The ability to handle incomplete data allows us to tackle both relation prediction and clustering. Moreover, the nonnegativity constraints are essential for the interpretability of the resulting clusters. We apply our approach to a large breast cancer dataset for which we find 5 subclasses that agree very well with the known subclassification of this disease, while emphasizing the main biological processes and genes involved in the corresponding subtypes.","PeriodicalId":255806,"journal":{"name":"2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACTE.2010.5579024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
We introduce a novel multi-relational learning algorithm based on simultaneous nonnegative matrix factorizations, able to distinguish between “target” and “background” relations, deal with incomplete data and so-called “link” functions. The ability to handle incomplete data allows us to tackle both relation prediction and clustering. Moreover, the nonnegativity constraints are essential for the interpretability of the resulting clusters. We apply our approach to a large breast cancer dataset for which we find 5 subclasses that agree very well with the known subclassification of this disease, while emphasizing the main biological processes and genes involved in the corresponding subtypes.