Clustering software systems to identify subsystem structures using knowledgebase

2011 Malaysian Conference in Software Engineering Pub Date : 2011-12-01 DOI:10.1109/MYSEC.2011.6140714

Md. Nasim Adnan, Md. Rashedul Islam, S. Hossain

{"title":"Clustering software systems to identify subsystem structures using knowledgebase","authors":"Md. Nasim Adnan, Md. Rashedul Islam, S. Hossain","doi":"10.1109/MYSEC.2011.6140714","DOIUrl":null,"url":null,"abstract":"The structure of a software system deteriorates as a result of continuous maintenance activity. For the purpose of software reengineering or reverse engineering, software engineers often get the original source code as the most updated source of information due to lack of current documentation and limited or nonexistent availability of the original designers. The application of clustering techniques to software systems aiming to discover feature-oriented and meaningful subsystems can help software engineers involved in software reengineering or reverse engineering to understand high-level features provided by those subsystems. Continuous research is going on in the recent years — addressing different issues in the software clustering problem. Our software clustering approach introduces the use of Knowledgebase, which leads to considerable improvement than the existing approaches. Similarity measurement is the key to perform successful clustering. Similarity measurement in the existing approaches has a common drawback that they do not incorporate the diversity of software systems. Our approach uses Knowledgebase which acts as a repository of information about the internal structure of the generic types of the software systems to provide guidelines on similarity measurement criteria and their respective weightages. The final clustering is done by populating automatically generated subsystems along with the known subsystems (provided by Knowledgebase). In our research, we have developed a tool named “ULAB Cluster 1.0” which implements our new clustering approach. This clustering tool has been evaluated by using a benchmark named “MoJo distance” for different well-known software systems. The experimental results show that our approach generates more appropriate subsystems than the other existing clustering approaches and outperforms others in different dimensions of software clustering quality.","PeriodicalId":137714,"journal":{"name":"2011 Malaysian Conference in Software Engineering","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Malaysian Conference in Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MYSEC.2011.6140714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

The structure of a software system deteriorates as a result of continuous maintenance activity. For the purpose of software reengineering or reverse engineering, software engineers often get the original source code as the most updated source of information due to lack of current documentation and limited or nonexistent availability of the original designers. The application of clustering techniques to software systems aiming to discover feature-oriented and meaningful subsystems can help software engineers involved in software reengineering or reverse engineering to understand high-level features provided by those subsystems. Continuous research is going on in the recent years — addressing different issues in the software clustering problem. Our software clustering approach introduces the use of Knowledgebase, which leads to considerable improvement than the existing approaches. Similarity measurement is the key to perform successful clustering. Similarity measurement in the existing approaches has a common drawback that they do not incorporate the diversity of software systems. Our approach uses Knowledgebase which acts as a repository of information about the internal structure of the generic types of the software systems to provide guidelines on similarity measurement criteria and their respective weightages. The final clustering is done by populating automatically generated subsystems along with the known subsystems (provided by Knowledgebase). In our research, we have developed a tool named “ULAB Cluster 1.0” which implements our new clustering approach. This clustering tool has been evaluated by using a benchmark named “MoJo distance” for different well-known software systems. The experimental results show that our approach generates more appropriate subsystems than the other existing clustering approaches and outperforms others in different dimensions of software clustering quality.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

聚类软件系统，利用知识库识别子系统结构

由于持续的维护活动，软件系统的结构会恶化。为了软件再工程或逆向工程的目的，由于缺乏当前文档和原始设计人员有限或不存在的可用性，软件工程师通常将原始源代码作为最新的信息来源。将聚类技术应用于软件系统，旨在发现面向特征和有意义的子系统，可以帮助参与软件再工程或逆向工程的软件工程师理解这些子系统提供的高级特征。近年来，人们对软件聚类问题进行了不断的研究，解决了软件聚类问题中的不同问题。我们的软件聚类方法引入了知识库的使用，比现有的方法有了很大的改进。相似度度量是成功进行聚类的关键。现有方法中的相似性度量有一个共同的缺点，即它们没有考虑到软件系统的多样性。我们的方法使用知识库，知识库作为软件系统的一般类型的内部结构的信息库，为相似性测量标准及其各自的权重提供指导。最终的集群是通过将自动生成的子系统与已知的子系统(由Knowledgebase提供)一起填充来完成的。在我们的研究中，我们开发了一个名为“ULAB集群1.0”的工具来实现我们的新集群方法。这个聚类工具已经通过使用一个名为“MoJo距离”的基准对不同的知名软件系统进行了评估。实验结果表明，该方法比现有的聚类方法生成了更多合适的子系统，并且在软件聚类质量的不同维度上都优于其他方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2011 Malaysian Conference in Software Engineering

自引率

0.00%

发文量

期刊最新文献

Model checking and verification method of engine control units Assessment of knowledge ability towards decision-making in information systems from managers perspective A comparison of software reuse in software development communities Software engineering approach to enterprise-strength ontology engineering Software Engineering Curriculum: A systematic mapping study