{"title":"A Feature Vector Based Approach for Software Component Clustering and Reuse Using K-means","authors":"C. Srinivas, C. V. Rao","doi":"10.1145/2832987.2833080","DOIUrl":null,"url":null,"abstract":"Software component clustering is an unsupervised learning approach which is used to cluster the software components. These clusters may then be used to study, analyze, understand behavior of the software components. In this paper, we use the k-means clustering algorithm to cluster the software components. The main difference lies in the use of distance measure which is designed to find the similarity between the software components. We use the distance measure [12], to find the pair wise project distance matrix and apply the k-means algorithm on this distance matrix. The main idea is to use more than one distance measure, to explore consensus based technique, so as to cluster software components, instead of using only one measure to cluster the components. This approach may also be applied for software architecture recovery problem by using our distance measure.","PeriodicalId":416001,"journal":{"name":"Proceedings of the The International Conference on Engineering & MIS 2015","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the The International Conference on Engineering & MIS 2015","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2832987.2833080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
Software component clustering is an unsupervised learning approach which is used to cluster the software components. These clusters may then be used to study, analyze, understand behavior of the software components. In this paper, we use the k-means clustering algorithm to cluster the software components. The main difference lies in the use of distance measure which is designed to find the similarity between the software components. We use the distance measure [12], to find the pair wise project distance matrix and apply the k-means algorithm on this distance matrix. The main idea is to use more than one distance measure, to explore consensus based technique, so as to cluster software components, instead of using only one measure to cluster the components. This approach may also be applied for software architecture recovery problem by using our distance measure.