{"title":"Distance based knowledge retrieval through rule mining for complex biomarker recognition from tri-omics profiles","authors":"Saurav Mallik, Zhongming Zhao","doi":"10.1504/IJCBDD.2019.10021269","DOIUrl":null,"url":null,"abstract":"Biomarker discovery from complex biomedical data has become an important topic to unveil the significant new disease signals for disease diagnosis and treatment during past two decades. The earlier methods were proposed on a single genomic profile, and most of them utilize a single minimum support/confidence/lift cutoff. To overcome these shortcomings, here, we developed a framework for identifying complex markers using shortest distance based rule mining from the tri-omics profiles (gene expression, methylation and protein-protein interaction). We applied our method to a high-grade soft-tissue sarcomas multi-omics dataset. The novel markers were {GRB2-, STAT3-}('-' and '+' denote decreased and increased gene activities, respectively), {STAT3+, TP53-, MAPK3+} and {STAT3+, FYN+, MAPK3+}. We showed the superiority of our method vs. others, as it generates fewer rules and lower mean of the shortest distance than others. Moreover, our method is useful to extract complex markers from tri-omics profiles for the complex disease.","PeriodicalId":13612,"journal":{"name":"Int. J. Comput. Biol. Drug Des.","volume":"12 1","pages":"105-127"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Comput. Biol. Drug Des.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCBDD.2019.10021269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Biomarker discovery from complex biomedical data has become an important topic to unveil the significant new disease signals for disease diagnosis and treatment during past two decades. The earlier methods were proposed on a single genomic profile, and most of them utilize a single minimum support/confidence/lift cutoff. To overcome these shortcomings, here, we developed a framework for identifying complex markers using shortest distance based rule mining from the tri-omics profiles (gene expression, methylation and protein-protein interaction). We applied our method to a high-grade soft-tissue sarcomas multi-omics dataset. The novel markers were {GRB2-, STAT3-}('-' and '+' denote decreased and increased gene activities, respectively), {STAT3+, TP53-, MAPK3+} and {STAT3+, FYN+, MAPK3+}. We showed the superiority of our method vs. others, as it generates fewer rules and lower mean of the shortest distance than others. Moreover, our method is useful to extract complex markers from tri-omics profiles for the complex disease.