Creating operational profiles of software systems by transforming their log files to directed cyclic graphs

TEFSE '11 Pub Date : 2011-05-23 DOI:10.1145/1987856.1987869
M. Nagappan, Brian P. Robinson
Citations: 9

Abstract

Most log files share one format: a flat file in which execution events are recorded one after the other. Each line contains at least a timestamp, a combination of one or more event identifiers, and the actual log message, which states which event was executed and the values of that event's dynamic parameters. Because log files carry this trace information, we can use them for many purposes, such as operational profiling and anomalous execution path detection. However, the flat file format makes it hard to see whether a repeating pattern exists. In this paper we propose transforming the serial-order format of a log file into a directed cyclic graph (such as a non-finite state machine) and show how the operational profile of a system can be built from this representation of the log file. We built a tool in C++ that transforms a log file, with its set of log events in serial order, into an adjacency matrix for the resulting graph. Existing graph-theory algorithms can then be applied to the adjacency matrix to analyze the system's log file. The directed cyclic graph and its analysis can be visualized by rendering the adjacency matrix with graph visualization tools such as Graphviz.
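The transformation described in the abstract can be illustrated with a minimal sketch. This is not the authors' C++ tool: it is a Python illustration under stated assumptions. The log layout (`<timestamp> <event_id> <message>`), the function names, and the sample log are all hypothetical. Using transition counts as edge weights and normalizing each row into transition probabilities is one plausible reading of "operational profile"; the abstract does not say how the profile is computed.

```python
def parse_event_id(line):
    """Return the event identifier of a log line, or None if the line is too short.

    Assumed layout (hypothetical): '<timestamp> <event_id> <message...>'.
    """
    parts = line.split(maxsplit=2)
    return parts[1] if len(parts) >= 2 else None


def build_adjacency(lines):
    """Map a serial list of log lines to (events, matrix).

    matrix[i][j] counts how often event j directly follows event i.
    """
    events = []    # event ids in first-seen order
    index = {}     # event id -> row/column index
    sequence = []  # the log as a sequence of indices
    for line in lines:
        ev = parse_event_id(line)
        if ev is None:
            continue
        if ev not in index:
            index[ev] = len(events)
            events.append(ev)
        sequence.append(index[ev])

    n = len(events)
    matrix = [[0] * n for _ in range(n)]
    for src, dst in zip(sequence, sequence[1:]):
        matrix[src][dst] += 1
    return events, matrix


def transition_probabilities(matrix):
    """Normalize each row so it sums to 1 (assumed form of the profile)."""
    probs = []
    for row in matrix:
        total = sum(row)
        probs.append([c / total if total else 0.0 for c in row])
    return probs


def to_dot(events, matrix):
    """Render the adjacency matrix as Graphviz DOT text."""
    out = ["digraph log {"]
    for i, row in enumerate(matrix):
        for j, count in enumerate(row):
            if count:
                out.append(f'  "{events[i]}" -> "{events[j]}" [label="{count}"];')
    out.append("}")
    return "\n".join(out)


# Hypothetical sample log, invented for illustration only.
sample_log = [
    "2011-05-23T10:00:00 LOGIN user=alice",
    "2011-05-23T10:00:01 QUERY table=orders",
    "2011-05-23T10:00:02 QUERY table=items",
    "2011-05-23T10:00:03 LOGOUT user=alice",
    "2011-05-23T10:00:04 LOGIN user=bob",
    "2011-05-23T10:00:05 LOGOUT user=bob",
]

events, matrix = build_adjacency(sample_log)
probs = transition_probabilities(matrix)
dot = to_dot(events, matrix)
print(dot)
```

In this sample the graph is cyclic: `QUERY` has a self-loop and `LOGOUT` leads back to `LOGIN`, which is the kind of repeating pattern that is hard to spot in the flat file. The DOT text can be saved to a file and rendered with the Graphviz `dot` command.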