Comparison of Information Retrieval Techniques for Traceability Link Recovery

Danissa V. Rodriguez, D. Carver
{"title":"Comparison of Information Retrieval Techniques for Traceability Link Recovery","authors":"Danissa V. Rodriguez, D. Carver","doi":"10.1109/INFOCT.2019.8710919","DOIUrl":null,"url":null,"abstract":"Requirements traceability supports many software engineering activities such as change impact analysis and requirements validation, providing benefits to the overall quality of software systems. Factors such as lack of communication, time pressure problems, and unsuccessfully implemented traceability practices result in developers losing track of requirements. Requirements traceability is a primary means to address completeness and accuracy of requirements. It is an active research topic for software engineers. Textual analysis and information retrieval techniques have been applied to the requirements traceability recovery problem for many years, due to the textual components of requirements and source code. Information retrieval techniques are semiautomatic techniques for recovering traceability links and on occasion, they have become the baseline for automatic methods applied to requirements traceability recovery. We evaluate the performance of IR techniques applied to the requirement traceability recovery process. The most popular information retrieval techniques applied to the requirements traceability recovery problem are the IR Probabilistic, Vector Space Model, and Latent Semantic Index approach. All three approaches rank documents by using one of the documents for extracting queries and the other as the documents being search using those extracted queries; however, they apply different internal logics for establishing similarities. We compared IR Probabilistic, Vector Space Model, and Latent Semantic Index approaches to evaluate their performance for requirement traceability recovery using the metrics of precision and recall. Experimental results indicate a low precision and recall for the LSI technique and high precision and low recall for both the IR probabilistic and the VSM techniques.","PeriodicalId":369231,"journal":{"name":"2019 IEEE 2nd International Conference on Information and Computer Technologies (ICICT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 2nd International Conference on Information and Computer Technologies (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCT.2019.8710919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Requirements traceability supports many software engineering activities such as change impact analysis and requirements validation, providing benefits to the overall quality of software systems. Factors such as lack of communication, time pressure problems, and unsuccessfully implemented traceability practices result in developers losing track of requirements. Requirements traceability is a primary means to address completeness and accuracy of requirements. It is an active research topic for software engineers. Textual analysis and information retrieval techniques have been applied to the requirements traceability recovery problem for many years, due to the textual components of requirements and source code. Information retrieval techniques are semiautomatic techniques for recovering traceability links and on occasion, they have become the baseline for automatic methods applied to requirements traceability recovery. We evaluate the performance of IR techniques applied to the requirement traceability recovery process. The most popular information retrieval techniques applied to the requirements traceability recovery problem are the IR Probabilistic, Vector Space Model, and Latent Semantic Index approach. All three approaches rank documents by using one of the documents for extracting queries and the other as the documents being search using those extracted queries; however, they apply different internal logics for establishing similarities. We compared IR Probabilistic, Vector Space Model, and Latent Semantic Index approaches to evaluate their performance for requirement traceability recovery using the metrics of precision and recall. Experimental results indicate a low precision and recall for the LSI technique and high precision and low recall for both the IR probabilistic and the VSM techniques.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
溯源链路恢复信息检索技术比较
需求可追溯性支持许多软件工程活动,例如变更影响分析和需求验证,为软件系统的整体质量提供了好处。诸如缺乏沟通、时间压力问题和未成功实现的可追溯性实践等因素导致开发人员失去对需求的跟踪。需求可追溯性是处理需求完整性和准确性的主要手段。对于软件工程师来说,这是一个活跃的研究课题。由于需求和源代码的文本组件,文本分析和信息检索技术已经应用于需求可追溯性恢复问题多年。信息检索技术是用于恢复可追溯性链接的半自动技术,有时,它们已经成为应用于需求可追溯性恢复的自动方法的基线。我们评估了应用于需求可追溯性恢复过程的IR技术的性能。应用于需求可追溯性恢复问题的最流行的信息检索技术是IR概率、向量空间模型和潜在语义索引方法。这三种方法对文档进行排序,方法是使用一个文档提取查询,另一个文档作为使用提取的查询进行搜索的文档;然而,它们采用不同的内部逻辑来建立相似性。我们比较了IR概率、向量空间模型和潜在语义索引方法,以使用精度和召回率的度量来评估它们在需求可追溯性恢复方面的性能。实验结果表明,大规模集成电路技术具有较低的准确率和召回率,而红外概率和VSM技术具有较高的准确率和较低的召回率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Utilization of Data Mining for Generalizable, All-Admission Prediction of Inpatient Mortality Development of Navigation Monitoring & Assistance Service Data Model ITIKI Plus: A Mobile Based Application for Integrating Indigenous Knowledge and Scientific Agro-Climate Decision Support for Africa’s Small-Scale Farmers TFDroid: Android Malware Detection by Topics and Sensitive Data Flows Using Machine Learning Techniques Weighted DV-Hop Localization Algorithm for Wireless Sensor Network based on Differential Evolution Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1