{"title":"基于熵谱聚类的胰腺癌数据集生物标志物检测","authors":"Purbanka Pahari, Piyali Basak, Anasua Sarkar","doi":"10.1109/ICRCICN.2017.8234508","DOIUrl":null,"url":null,"abstract":"Pancreatic ductal adenocarcinoma (PDAC) is one of most aggressive malignancy. The identification of Biomarker for PDAC is an ongoing challenge. The high dimensional PDAC gene expression dataset in Gene Expression Omnibus(GEO) database, is analyzed in this work. To select those genes which are relevant as well as with least redundancy among them, we use successive approaches like Filter methods and Normalization phase. In this work, after pre-processing of the data, we have used three types of spectral clustering methods, Unnormalized, Ng-Jordan and proposed entropy based Shi-Malik spectral clustering algorithms to find important genetic and biological information. There we have applied new Shannon's Entropy based distance measure to identify the clusters on Pancreatic dataset. Some Biomarkers are identified through KEGG Pathway analysis. The Biological analysis and functional correlation of genes based on Gene Ontology(GO) terms show that the proposed method is helpful for the selection of Biomarkers.","PeriodicalId":166298,"journal":{"name":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Biomarker detection on Pancreatic cancer dataset using entropy based spectral clustering\",\"authors\":\"Purbanka Pahari, Piyali Basak, Anasua Sarkar\",\"doi\":\"10.1109/ICRCICN.2017.8234508\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pancreatic ductal adenocarcinoma (PDAC) is one of most aggressive malignancy. The identification of Biomarker for PDAC is an ongoing challenge. The high dimensional PDAC gene expression dataset in Gene Expression Omnibus(GEO) database, is analyzed in this work. To select those genes which are relevant as well as with least redundancy among them, we use successive approaches like Filter methods and Normalization phase. In this work, after pre-processing of the data, we have used three types of spectral clustering methods, Unnormalized, Ng-Jordan and proposed entropy based Shi-Malik spectral clustering algorithms to find important genetic and biological information. There we have applied new Shannon's Entropy based distance measure to identify the clusters on Pancreatic dataset. Some Biomarkers are identified through KEGG Pathway analysis. The Biological analysis and functional correlation of genes based on Gene Ontology(GO) terms show that the proposed method is helpful for the selection of Biomarkers.\",\"PeriodicalId\":166298,\"journal\":{\"name\":\"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRCICN.2017.8234508\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2017.8234508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Biomarker detection on Pancreatic cancer dataset using entropy based spectral clustering
Pancreatic ductal adenocarcinoma (PDAC) is one of most aggressive malignancy. The identification of Biomarker for PDAC is an ongoing challenge. The high dimensional PDAC gene expression dataset in Gene Expression Omnibus(GEO) database, is analyzed in this work. To select those genes which are relevant as well as with least redundancy among them, we use successive approaches like Filter methods and Normalization phase. In this work, after pre-processing of the data, we have used three types of spectral clustering methods, Unnormalized, Ng-Jordan and proposed entropy based Shi-Malik spectral clustering algorithms to find important genetic and biological information. There we have applied new Shannon's Entropy based distance measure to identify the clusters on Pancreatic dataset. Some Biomarkers are identified through KEGG Pathway analysis. The Biological analysis and functional correlation of genes based on Gene Ontology(GO) terms show that the proposed method is helpful for the selection of Biomarkers.