TEFSE '11: Latest Publications

Creating operational profiles of software systems by transforming their log files to directed cyclic graphs

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987869

M. Nagappan, Brian P. Robinson

Most log files are of one format: a flat file with the events of execution recorded one after the other. Each line in the file contains at least a timestamp, a combination of one or more event identifiers, and the actual log message with information of which event was executed and what the values for the dynamic parameters of that event are. Since log files have this trace information, we can use it for many purposes, such as operational profiling and anomalous execution path detection. However, the current flat file format of a log file makes it very unintuitive to detect the existence of a repeating pattern. In this paper we propose a transformation of the current serial order format of a log file to a directed cyclic graph (such as a non-finite state machine) format and show how the operational profile of a system can be built from this representation of the log file. We built a tool (in C++) that transforms a log file with a set of log events in a serial order to an adjacency matrix for the resulting graphical representation. We can then easily apply existing graph-theory-based algorithms on the adjacency matrix to analyze the log file of the system. The directed cyclic graph and the analysis of it can be visualized by rendering the adjacency matrix with graph visualization tools, like Graphviz.

Citations: 9
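
The transformation described in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' C++ tool. The log line layout (`<timestamp> <event_id> <message>`), the function names, and the row-normalized transition frequencies used as an "operational profile" are all assumptions made for illustration, since the abstract does not specify them.

```python
def parse_events(lines):
    """Extract the event identifier from each log line.

    Assumed (hypothetical) layout: '<timestamp> <event_id> <message...>'.
    """
    events = []
    for line in lines:
        parts = line.split(maxsplit=2)
        if len(parts) >= 2:
            events.append(parts[1])
    return events


def build_adjacency(events):
    """Count transitions between consecutive events.

    Returns the list of distinct event ids (first-seen order) and a
    square matrix where matrix[i][j] is how often ids[j] followed ids[i].
    """
    ids = list(dict.fromkeys(events))
    index = {event: i for i, event in enumerate(ids)}
    matrix = [[0] * len(ids) for _ in ids]
    for src, dst in zip(events, events[1:]):
        matrix[index[src]][index[dst]] += 1
    return ids, matrix


def operational_profile(matrix):
    """Row-normalize the counts into relative transition frequencies.

    This is one simple notion of a profile, assumed here for illustration.
    """
    profile = []
    for row in matrix:
        total = sum(row)
        profile.append([count / total if total else 0.0 for count in row])
    return profile


def to_dot(ids, matrix):
    """Render the adjacency matrix as Graphviz DOT text."""
    lines = ["digraph log {"]
    for i, src in enumerate(ids):
        for j, dst in enumerate(ids):
            if matrix[i][j]:
                lines.append(f'  "{src}" -> "{dst}" [label="{matrix[i][j]}"];')
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up sample log, not taken from the paper.
    log = [
        "2011-05-23T10:00:00 OPEN file=a.txt",
        "2011-05-23T10:00:01 READ bytes=10",
        "2011-05-23T10:00:02 READ bytes=20",
        "2011-05-23T10:00:03 CLOSE file=a.txt",
        "2011-05-23T10:00:04 OPEN file=b.txt",
    ]
    ids, matrix = build_adjacency(parse_events(log))
    print(to_dot(ids, matrix))
```

The printed DOT text can be passed to the Graphviz `dot` command to draw the graph. A repeated event such as `READ` shows up as a self-loop, which is the kind of repeating pattern that is hard to spot in the flat file.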

Improving traceability link recovery methods through software artifact summarization

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987867

Jairo Aponte, Andrian Marcus

Analyzing candidate traceability links is a difficult, time-consuming, and error-prone task, as it usually requires a detailed study of a long list of software artifacts of various kinds. One option to alleviate this problem is to select the most important features of the software artifacts that the developers would investigate. We discuss in this position paper how text summarization techniques could be used to address this problem. The potential gains in using summaries are both in terms of time and correctness of the traceability link recovery process.

Citations: 16

Formalizing traceability relations for product lines

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987866

L. Lamb, Waraporn Jirapanthong, A. Zisman

Traceability is considered an important activity during the development of software systems. Despite the various classifications that have been proposed for different types of traceability relations, there is still a lack of standard semantic definitions for traceability relations. In this paper, we present an ontology-based formalism for semantic representation of various types of traceability relations for product line systems and associations between these various types of traceability relations.

Citations: 19

Grand challenges, benchmarks, and TraceLab: developing infrastructure for the software traceability research community

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987861

J. Cleland-Huang, Adam Czauderna, Alex Dekhtyar, O. Gotel, J. Hayes, E. Keenan, Greg Leach, Jonathan I. Maletic, D. Poshyvanyk, Yonghee Shin, A. Zisman, G. Antoniol, B. Berenbach, Alexander Egyed, Patrick Mäder

The challenges of implementing successful and cost-effective traceability have created a compelling research agenda that has addressed a broad range of traceability related issues, ranging from qualitative studies of traceability users in industry to very technical and quantitative studies. Unfortunately, advances are hampered by the significant time and effort needed to establish a traceability research environment and to perform comparative evaluations of new results against existing baselines. In this panel we discuss ongoing efforts by members of the Center of Excellence for Software Traceability (CoEST) to define the Grand Challenges of Traceability, develop benchmarks, and to construct TraceLab, an extensible and scalable visual environment for designing and executing a broad range of traceability experiments.

Citations: 52

Towards a model of analyst effort for traceability research

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987870

Alex Dekhtyar, J. Hayes, Matty Smith

This paper posits that a theoretical model of analyst effort in tracing tasks is necessary to assist with study of the analyst. Specifically, it is clear from prior work by numerous research groups that the important factors in such a model are: the amount of time it takes for an analyst to vet a given candidate link and the amount of time it takes an analyst to find a missing link. This paper introduces a theoretical model of analyst effort as well as a simplified model. A number of simulations were undertaken in order to build effort curves to assist in evaluating numerous tracing scenarios, such as determining at what point in time an analyst should switch from vetting candidate links to manually searching for links not in the candidate list.

Citations: 6

Source code indexing for automated tracing

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987859

Anas Mahmoud, Nan Niu

Requirements-to-source-code traceability employs information retrieval (IR) methods to automatically link requirements to the source code that implements them. A crucial step in this process is indexing, where partial and important information from the software artifacts is converted into a representation that is compatible with the underlying IR model. Source code demands special attention in the indexing process. In this paper, we investigate source code indexing for supporting automatic traceability. We introduce a feature diagram that captures the key components and their relationships in the domain of source code indexing. We then present an experiment to examine the features of the diagram and their dependencies. Results show that utilizing comments has a significant effect on traceability link generation, and stemming is required when comments are considered.

Citations: 29
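
To make the indexing step in the abstract above concrete, here is a minimal Python sketch of turning source code into IR terms, with comments and stemming as switchable features. It is not the authors' feature diagram or experimental setup. The stop-word list, the regular expressions, and the `crude_stem` suffix stripper (a simple stand-in for a real stemmer such as Porter's) are assumptions made for illustration.

```python
import re

# Hypothetical stop-word list mixing English words and Java keywords.
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "is", "in", "for",
              "int", "return", "public", "void"}

# Matches // line comments and /* block */ comments (ignores string literals).
COMMENT_RE = re.compile(r"//[^\n]*|/\*.*?\*/", re.DOTALL)
IDENT_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def split_identifier(token):
    """Split camelCase and snake_case identifiers into lowercase words."""
    words = []
    for part in token.split("_"):
        words.extend(re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", part))
    return [word.lower() for word in words]


def crude_stem(word):
    """Strip a few common suffixes; a rough stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 4:
            return word[: -len(suffix)]
    return word


def index_source(source, use_comments=True, stem=True):
    """Return the list of index terms for one source file.

    Terms from code identifiers come first, followed by terms from
    comments when use_comments is True.
    """
    comments = " ".join(COMMENT_RE.findall(source))
    code = COMMENT_RE.sub(" ", source)
    text = code + (" " + comments if use_comments else "")
    terms = []
    for token in IDENT_RE.findall(text):
        for word in split_identifier(token):
            if word in STOP_WORDS or len(word) < 2 or word.isdigit():
                continue
            terms.append(crude_stem(word) if stem else word)
    return terms


if __name__ == "__main__":
    # Made-up Java fragment for demonstration.
    sample = """
// Validates the user passwords
public void checkLogin(String userName) { return; }
"""
    print(index_source(sample, use_comments=False))
    print(index_source(sample, use_comments=True))
```

Comparing the two printed term lists shows the vocabulary that only comments contribute, and toggling `stem` shows how inflected comment words (for example "passwords") are mapped onto the forms used in identifiers.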

Traceability challenge 2011: using TraceLab to evaluate the impact of local versus global IDF on trace retrieval

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987874

Adam Czauderna, M. Gibiec, Greg Leach, Yubin Li, Yonghee Shin, E. Keenan, J. Cleland-Huang

Numerous trace retrieval algorithms incorporate the standard tf-idf (term frequency, inverse document frequency) technique to weight various terms. In this paper we address Grand Challenge C-GC1 by comparing the effectiveness of computing idf based only on the local terms in the query, versus computing it based on general term usage as documented in the American National Corpus. We also address Grand Challenges L-GC1 and L-GC2 by setting ourselves the additional task of designing and conducting the experiments using the alpha-release of TraceLab. TraceLab is an experimental workbench which allows researchers to graphically model and execute a traceability experiment as a workflow of components. Results of the experiment show that the local idf approach exceeds or matches the global approach in all of the cases studied.

Citations: 13
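
The difference between the two idf variants compared in the abstract above can be sketched in a few lines of Python. This is not the TraceLab experiment itself. The sketch assumes the common definition idf(t) = log(N / df(t)), and the function names and the shape of the reference table (document frequencies from some general corpus) are hypothetical.

```python
import math
from collections import Counter


def local_idf(documents):
    """idf computed from the project's own artifacts (lists of tokens)."""
    n = len(documents)
    doc_freq = Counter(term for doc in documents for term in set(doc))
    return {term: math.log(n / count) for term, count in doc_freq.items()}


def global_idf(reference_doc_freq, reference_size):
    """idf computed from a general-usage reference corpus.

    reference_doc_freq maps a term to the number of reference documents
    containing it; the table is assumed to be supplied by the caller.
    """
    return {term: math.log(reference_size / count)
            for term, count in reference_doc_freq.items()}


def tfidf(tokens, idf):
    """Weight raw term frequency by idf; unknown terms get weight 0."""
    tf = Counter(tokens)
    return {term: tf[term] * idf.get(term, 0.0) for term in tf}


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank(query, documents, idf):
    """Return document indices ordered by decreasing similarity to the query."""
    q = tfidf(query, idf)
    scored = [(cosine(q, tfidf(doc, idf)), i) for i, doc in enumerate(documents)]
    return [i for _, i in sorted(scored, key=lambda pair: (-pair[0], pair[1]))]
```

Calling `rank` twice on the same query and artifacts, once with `local_idf(...)` and once with `global_idf(...)`, gives the two rankings whose retrieval quality such an experiment would compare. One design consequence is visible in `tfidf`: a project-specific term missing from the reference table gets zero weight under the global variant.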

A rich traceability model for social interactions

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987871

Maurício Serrano, Julio Cesar Sampaio do Prado Leite

In 1993, Goguen published a research note addressing the social issues in Requirements Engineering. He identified in the requirements process three major social groups: the client organization; the requirements team; and the development team. However, nowadays there is a lack of technological support that traces requirements to social issues on the requirements team or development team. From early published traceability metamodels to current requirements traceability literature, the client organization and the stakeholders are first-class citizens, but the software engineers and the interactions between these groups are not. In this paper we present a partially formalized RichPicture traceability model to fill this gap. ITrace is a flexible model to weave together the social network graph, the information sources graph, the social interactions graph, and the Requirements Engineering artifacts evolution graph. We empirically developed our traceability model tracking a Transparency catalogue evolution. We also compare our model structure to Contribution Structures.

Citations: 7

Recovering traceability links between source code and fixed bugs via patch analysis

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987863

Christopher S. Corley, Nicholas A. Kraft, L. Etzkorn, Stacy K. Lukins

Traceability links can be recovered using data mined from a revision control system, such as CVS, and an issue tracking system, such as Bugzilla. Existing approaches to recover links between a bug and the methods changed to fix the bug rely on the presence of the bug's identifier in a CVS log message. In this paper we present an approach that relies instead on the presence of a patch in the issue report for the bug. That is, rather than analyzing deltas retrieved from CVS to recover links, our approach analyzes patches retrieved from Bugzilla. We use BugTrace, the tool implementing our approach, to conduct a case study in which we compare the links recovered by our approach to links recovered by manual inspection. The results of the case study support the efficacy of our approach. After describing the limitations of our case study, we conclude by reviewing closely related work and suggesting possible future work.

Citations: 28
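
The general idea of recovering links from a patch, as described in the abstract above, can be sketched in Python. This is not BugTrace. It assumes the patch attached to the issue report is in unified diff format and that method line spans for the pre-patch files are already available from some parser; the function names and input shapes are hypothetical.

```python
import re

# Unified diff hunk header, e.g. "@@ -10,7 +10,8 @@".
HUNK_RE = re.compile(r"^@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def changed_ranges(patch_text):
    """Return {file_path: [(start_line, line_count), ...]} for pre-patch files.

    Simplification: any line starting with '--- ' is treated as a file
    header, so a removed line that itself begins with '-- ' would be misread.
    """
    ranges = {}
    current = None
    for line in patch_text.splitlines():
        if line.startswith("--- "):
            # Drop the optional tab-separated timestamp after the path.
            current = line[4:].split("\t")[0].strip()
            ranges.setdefault(current, [])
        elif current is not None:
            match = HUNK_RE.match(line)
            if match:
                start = int(match.group(1))
                count = int(match.group(2)) if match.group(2) is not None else 1
                ranges[current].append((start, count))
    return ranges


def link_methods(ranges, method_spans):
    """Link each hunk to the methods whose line spans it overlaps.

    method_spans: {file_path: [(method_name, first_line, last_line), ...]},
    assumed to come from parsing the pre-patch source files.
    Hunks include context lines, so the result may over-approximate.
    """
    links = set()
    for path, hunks in ranges.items():
        for start, count in hunks:
            end = start + max(count, 1) - 1
            for name, first, last in method_spans.get(path, []):
                if first <= end and start <= last:
                    links.add((path, name))
    return sorted(links)
```

Associating the returned `(file, method)` pairs with the identifier of the bug report that carried the patch would give candidate bug-to-method links, without needing the bug identifier to appear in a revision log message.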

On the use of eye tracking in software traceability

TEFSE '11

Pub Date : 2011-05-23 DOI: 10.1145/1987856.1987872

Bonita Sharif, Huzefa H. Kagdi

The paper advocates for the induction of eye tracking technology in software traceability and takes a position that the use of eye tracking metrics can contribute to several software traceability tasks. The authors posit that the role of eye tracking is not simply restricted to an instrument for empirical studies, but also could extend to providing a foundation of a new software traceability methodology. Several scenarios where eye-tracking metrics could be meaningful are presented. The specific research directions include conducting empirical studies with eye-tracking metrics and replicating previously reported empirical studies, eye-tracking enabled traceability link recovery and management methodology, and visualization support.

Citations: 37
