发现软件构件之间的关系

Job M. Champagne, D. Carver
{"title":"发现软件构件之间的关系","authors":"Job M. Champagne, D. Carver","doi":"10.1109/AERO47225.2020.9172288","DOIUrl":null,"url":null,"abstract":"Software systems have become ubiquitous in today's world. Most software will evolve after initial deployment. Software changes that are a part of that evolution often are documented in a requirements change document. One of the challenges when changing software is understanding the portions of the existing requirements and the existing code that could be affected by the change in order to avoid or minimize unexpected side effects from the changes. Researchers have addressed the problem of minimizing the effect of changes by using different methods, including text mining and clustering. Some approaches to determine change impact are based on information retrieval (IR) techniques using both term frequency-inverse document frequency (TF—IDF) and latent semantic indexing (LSI) methods. Other approaches are based on visualization techniques using degree and betweenness centrality measures. In this research, we approach the problem by applying IR techniques along with data mining. We apply TF-IDF and LSI to investigate which changes have a high potential of modifying existing requirements. We also analyze similarities between changes that do not map to existing requirements. In both cases, our threshold for identifying similarity is 80%. We designed our approach to identify, for a given change, one or more requirements that have a high potential of being associated with the change as well as identifying intra-document requirements or changes that have a high potential for consolidation. We were able to identify requirements that had a similarity of at least 80% to a change request using TF-IDF and LSI. We were also able to isolate changes that did not show a high similarity to any requirement, thus indicating that the change request was likely a request for a new requirement. The results are encouraging for assessing the impact of software change requests on requirements of an existing system.","PeriodicalId":114560,"journal":{"name":"2020 IEEE Aerospace Conference","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Discovering Relationships Among Software Artifacts\",\"authors\":\"Job M. Champagne, D. Carver\",\"doi\":\"10.1109/AERO47225.2020.9172288\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Software systems have become ubiquitous in today's world. Most software will evolve after initial deployment. Software changes that are a part of that evolution often are documented in a requirements change document. One of the challenges when changing software is understanding the portions of the existing requirements and the existing code that could be affected by the change in order to avoid or minimize unexpected side effects from the changes. Researchers have addressed the problem of minimizing the effect of changes by using different methods, including text mining and clustering. Some approaches to determine change impact are based on information retrieval (IR) techniques using both term frequency-inverse document frequency (TF—IDF) and latent semantic indexing (LSI) methods. Other approaches are based on visualization techniques using degree and betweenness centrality measures. In this research, we approach the problem by applying IR techniques along with data mining. We apply TF-IDF and LSI to investigate which changes have a high potential of modifying existing requirements. We also analyze similarities between changes that do not map to existing requirements. In both cases, our threshold for identifying similarity is 80%. We designed our approach to identify, for a given change, one or more requirements that have a high potential of being associated with the change as well as identifying intra-document requirements or changes that have a high potential for consolidation. We were able to identify requirements that had a similarity of at least 80% to a change request using TF-IDF and LSI. We were also able to isolate changes that did not show a high similarity to any requirement, thus indicating that the change request was likely a request for a new requirement. The results are encouraging for assessing the impact of software change requests on requirements of an existing system.\",\"PeriodicalId\":114560,\"journal\":{\"name\":\"2020 IEEE Aerospace Conference\",\"volume\":\"107 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Aerospace Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AERO47225.2020.9172288\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Aerospace Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AERO47225.2020.9172288","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

软件系统在当今世界已经无处不在。大多数软件在初始部署之后会不断发展。作为演进的一部分的软件变更通常记录在需求变更文档中。更改软件时的挑战之一是理解可能受更改影响的现有需求和现有代码的部分,以避免或最小化更改带来的意外副作用。研究人员已经通过使用不同的方法解决了最小化变化影响的问题,包括文本挖掘和聚类。一些确定变更影响的方法是基于信息检索(IR)技术,使用术语频率-逆文档频率(TF-IDF)和潜在语义索引(LSI)方法。其他方法基于使用度和中间度中心性度量的可视化技术。在本研究中,我们通过应用IR技术和数据挖掘来解决这个问题。我们应用TF-IDF和LSI来研究哪些变化有很大的潜力来修改现有的要求。我们还分析没有映射到现有需求的变更之间的相似性。在这两种情况下,我们识别相似性的阈值都是80%。我们设计了我们的方法来识别,对于一个给定的变更,一个或多个具有与变更相关联的高潜力的需求,以及识别文档内部需求或具有高整合潜力的变更。使用TF-IDF和LSI,我们能够识别出与变更请求至少有80%相似的需求。我们还能够隔离那些没有显示出与任何需求高度相似的变更,从而表明变更请求很可能是对新需求的请求。评估软件变更请求对现有系统需求的影响的结果是令人鼓舞的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Discovering Relationships Among Software Artifacts
Software systems have become ubiquitous in today's world. Most software will evolve after initial deployment. Software changes that are a part of that evolution often are documented in a requirements change document. One of the challenges when changing software is understanding the portions of the existing requirements and the existing code that could be affected by the change in order to avoid or minimize unexpected side effects from the changes. Researchers have addressed the problem of minimizing the effect of changes by using different methods, including text mining and clustering. Some approaches to determine change impact are based on information retrieval (IR) techniques using both term frequency-inverse document frequency (TF—IDF) and latent semantic indexing (LSI) methods. Other approaches are based on visualization techniques using degree and betweenness centrality measures. In this research, we approach the problem by applying IR techniques along with data mining. We apply TF-IDF and LSI to investigate which changes have a high potential of modifying existing requirements. We also analyze similarities between changes that do not map to existing requirements. In both cases, our threshold for identifying similarity is 80%. We designed our approach to identify, for a given change, one or more requirements that have a high potential of being associated with the change as well as identifying intra-document requirements or changes that have a high potential for consolidation. We were able to identify requirements that had a similarity of at least 80% to a change request using TF-IDF and LSI. We were also able to isolate changes that did not show a high similarity to any requirement, thus indicating that the change request was likely a request for a new requirement. The results are encouraging for assessing the impact of software change requests on requirements of an existing system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Integrated Innovative 3D Radiation Protection Fabric for Advanced Spacesuits and Systems Model-based Tools designed for the FACE™ Technical Standard, Editions 3.0 & 2.1 Can Adaptive Response and Evolution Make Survival of Extremophile Bacteria Possible on Mars? Initial Orbit Determination Using Simplex Fusion Headline-based visualization to prioritize events
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1