{"title":"Combining multiple clustering and network analysis for discoveries in gene expression data","authors":"Sleiman Alhajj, A. Alhajj, S. Özyer","doi":"10.1145/3487351.3490961","DOIUrl":null,"url":null,"abstract":"Clustering is a challenging research task which could benefit a wide range of practical applications, including bioinformatics. It targets success by optimizing a number of objectives, a characteristic mostly ignored by clustering approaches. This paper describes a synthetic clustering algorithm which first applies multi-objective based approach to produce the alternative clustering solutions. Then the best clusters from each solution are selected and combined into a seed for a compact and effective solution which is expected to be better than all the individual solutions because it combines the best of each. This way, the developed algorithm may be classified as a fuzzy clustering approach because each object may belong to more than one cluster in the synthesized solution with a degree of membership in each cluster. Another interesting aspect of the algorithm is that it identifies the outliers. Further, a network is built from the relationships of the objects within the various clusters. The network is analyzed to reveal interesting discoveries not clearly reflected in the clustering outcome. The validity and applicability of the presented methodology has been assessed using synthetic and real data from the cancer.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3487351.3490961","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Clustering is a challenging research task which could benefit a wide range of practical applications, including bioinformatics. It targets success by optimizing a number of objectives, a characteristic mostly ignored by clustering approaches. This paper describes a synthetic clustering algorithm which first applies multi-objective based approach to produce the alternative clustering solutions. Then the best clusters from each solution are selected and combined into a seed for a compact and effective solution which is expected to be better than all the individual solutions because it combines the best of each. This way, the developed algorithm may be classified as a fuzzy clustering approach because each object may belong to more than one cluster in the synthesized solution with a degree of membership in each cluster. Another interesting aspect of the algorithm is that it identifies the outliers. Further, a network is built from the relationships of the objects within the various clusters. The network is analyzed to reveal interesting discoveries not clearly reflected in the clustering outcome. The validity and applicability of the presented methodology has been assessed using synthetic and real data from the cancer.