Leveraging many simple statistical models to adaptively monitor software systems

M. A. Munawar, Paul A. S. Ward
{"title":"Leveraging many simple statistical models to adaptively monitor software systems","authors":"M. A. Munawar, Paul A. S. Ward","doi":"10.1504/IJHPCN.2011.038708","DOIUrl":null,"url":null,"abstract":"Self-managing systems require continuous monitoring to ensure correct operation. Detailed monitoring is often too costly to use in production. An alternative is adaptive monitoring, whereby monitoring is kept to a minimal level while the system behaves as expected, and the monitoring level is increased if a problem is suspected. To enable such an approach, we must model the system, both at a minimal level to ensure correct operation, and at a detailed level, to diagnose faulty components. To avoid the complexity of developing an explicit model based on the system structure, we employ simple statistical techniques to identify relationships in the monitored data. These relationships are used to characterize normal operation and identify problematic areas. \n \nWe develop and evaluate a prototype for the adaptive monitoring of J2EE applications. We experiment with 29 different fault scenarios of three general types, and show that we are able to detect the presence of faults in 80% of cases, where all but one instance of non-detection is attributable to a single fault type. We are able to shortlist the faulty component in 65% of cases where anomalies are observed.","PeriodicalId":384857,"journal":{"name":"International Journal of High Performance Computing and Networking","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of High Performance Computing and Networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJHPCN.2011.038708","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 32

Abstract

Self-managing systems require continuous monitoring to ensure correct operation. Detailed monitoring is often too costly to use in production. An alternative is adaptive monitoring, whereby monitoring is kept to a minimal level while the system behaves as expected, and the monitoring level is increased if a problem is suspected. To enable such an approach, we must model the system, both at a minimal level to ensure correct operation, and at a detailed level, to diagnose faulty components. To avoid the complexity of developing an explicit model based on the system structure, we employ simple statistical techniques to identify relationships in the monitored data. These relationships are used to characterize normal operation and identify problematic areas. We develop and evaluate a prototype for the adaptive monitoring of J2EE applications. We experiment with 29 different fault scenarios of three general types, and show that we are able to detect the presence of faults in 80% of cases, where all but one instance of non-detection is attributable to a single fault type. We are able to shortlist the faulty component in 65% of cases where anomalies are observed.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用许多简单的统计模型自适应地监视软件系统
自我管理系统需要持续监测以确保正确运行。在生产中使用详细的监控通常成本太高。另一种选择是自适应监视,即在系统按预期运行时将监视保持在最低级别,如果怀疑存在问题则提高监视级别。要启用这样的方法,我们必须对系统进行建模,既要在最小的级别上确保正确的操作,又要在详细的级别上诊断有故障的组件。为了避免开发基于系统结构的显式模型的复杂性,我们采用简单的统计技术来识别被监视数据中的关系。这些关系用于描述正常操作并识别有问题的区域。我们开发并评估了一个用于自适应监视J2EE应用程序的原型。我们对三种一般类型的29种不同故障场景进行了实验,并表明我们能够在80%的情况下检测到故障的存在,其中所有未检测到的实例都可归因于单一故障类型。在观察到异常的情况下,我们能够在65%的情况下列出故障部件。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Semi-supervised dimensionality reduction based on local estimation error Leveraging many simple statistical models to adaptively monitor software systems Queuing algorithm for speculative Network Processors MPI scalability of a large memory LES code Adaptive time/space sharing with SCOJO
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1