首页 > 最新文献

2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems最新文献

英文 中文
Analysis of a Simple Approach to Modeling Performance for Streaming Data Applications 流数据应用建模性能的一种简单方法分析
J. Beard, R. Chamberlain
Current state of the art systems contain various types of multicore processors, General Purpose Graphics Processing Units (GPGPUs) and occasionally Digital Signal Processors (DSPs) or Field-Programmable Gate Arrays (FPGAs). With heterogeneity comes multiple abstraction layers that hide underlying complexity. While necessary to ease programmability of these systems, this hidden complexity makes quantitative performance modeling a difficult task. This paper outlines a computationally simple approach to modeling the overall throughput and buffering needs of a streaming application deployed on heterogeneous hardware.
当前最先进的系统包含各种类型的多核处理器,通用图形处理单元(gpgpu)和偶尔的数字信号处理器(dsp)或现场可编程门阵列(fpga)。异质性带来了隐藏底层复杂性的多个抽象层。虽然有必要简化这些系统的可编程性,但这种隐藏的复杂性使得定量性能建模成为一项困难的任务。本文概述了一种计算简单的方法来对部署在异构硬件上的流应用程序的总体吞吐量和缓冲需求进行建模。
{"title":"Analysis of a Simple Approach to Modeling Performance for Streaming Data Applications","authors":"J. Beard, R. Chamberlain","doi":"10.1109/MASCOTS.2013.49","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.49","url":null,"abstract":"Current state of the art systems contain various types of multicore processors, General Purpose Graphics Processing Units (GPGPUs) and occasionally Digital Signal Processors (DSPs) or Field-Programmable Gate Arrays (FPGAs). With heterogeneity comes multiple abstraction layers that hide underlying complexity. While necessary to ease programmability of these systems, this hidden complexity makes quantitative performance modeling a difficult task. This paper outlines a computationally simple approach to modeling the overall throughput and buffering needs of a streaming application deployed on heterogeneous hardware.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114984031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Overcoming Limitations of Off-the-Shelf Priority Schedulers in Dynamic Environments 克服现有优先级调度器在动态环境中的局限性
Feng Yan, S. Hughes, Alma Riska, E. Smirni
It is common nowadays to architect and design scaled-out systems with off-the-shelf computing components operated and managed by off-the-shelf open-source tools. While web services represent the critical set of services offered at scale, big data analytics is emerging as a preferred service to be colocated with cloud web services at a lower priority raising the need for off-the-shelf priority scheduling. In this paper we report on the perils of Linux priority scheduling tools when used to differentiate between such complex services. We demonstrate that simple priority scheduling utilities such as nice and ionice can result in dramatically erratic behavior. We provide a remedy by proposing an autonomic priority scheduling algorithm that adjusts its execution parameters based on on-line measurements of the current resource usage of critical applications. Detailed experimentation with a user-space prototype of the algorithm on a Linux system using popular benchmarks such as SPEC and TPC-W illustrate the robustness and versatility of the proposed technique, as it provides consistency to the expected performance of a high-priority application when running simultaneously with multiple low priority jobs.
如今,使用现成的计算组件构建和设计可扩展的系统是很常见的,这些组件由现成的开源工具操作和管理。虽然web服务代表了大规模提供的关键服务集,但大数据分析正在成为与云web服务搭配的首选服务,优先级较低,这增加了对现成优先级调度的需求。在本文中,我们报告了Linux优先级调度工具在用于区分这些复杂服务时的危险。我们演示了简单的优先级调度实用程序(如nice和ionice)可能导致非常不稳定的行为。我们提供了一种补救方法,提出了一种自主优先级调度算法,该算法根据关键应用程序当前资源使用的在线测量来调整其执行参数。在Linux系统上使用SPEC和TPC-W等流行基准测试对算法的用户空间原型进行了详细的实验,说明了所建议技术的健壮性和多功能性,因为它在同时运行多个低优先级作业时,为高优先级应用程序的预期性能提供了一致性。
{"title":"Overcoming Limitations of Off-the-Shelf Priority Schedulers in Dynamic Environments","authors":"Feng Yan, S. Hughes, Alma Riska, E. Smirni","doi":"10.1109/MASCOTS.2013.72","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.72","url":null,"abstract":"It is common nowadays to architect and design scaled-out systems with off-the-shelf computing components operated and managed by off-the-shelf open-source tools. While web services represent the critical set of services offered at scale, big data analytics is emerging as a preferred service to be colocated with cloud web services at a lower priority raising the need for off-the-shelf priority scheduling. In this paper we report on the perils of Linux priority scheduling tools when used to differentiate between such complex services. We demonstrate that simple priority scheduling utilities such as nice and ionice can result in dramatically erratic behavior. We provide a remedy by proposing an autonomic priority scheduling algorithm that adjusts its execution parameters based on on-line measurements of the current resource usage of critical applications. Detailed experimentation with a user-space prototype of the algorithm on a Linux system using popular benchmarks such as SPEC and TPC-W illustrate the robustness and versatility of the proposed technique, as it provides consistency to the expected performance of a high-priority application when running simultaneously with multiple low priority jobs.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130651136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Automatic Meter-Reading Simulation through Power Line Communication 电力线通信自动抄表仿真
J. Matanza, S. Alexandres, C. Rodríguez-Morcillo
This paper proposes a simulation's architecture that allows for the analysis of the performance when using the Power Line Communication's technology. In concrete, it studies the viability of PRIME' standard, to send Automatic Meter Reading (AMR) messages through a low voltage network. In contrast with other studies, physical phenomena-such as background and impulsive noise sources, channel attenuation and multipath effect-are taken into account by Mat lab simulations. Additionally, OMNeT++ network simulator is used to model the telematic effects that occur in the communication process. As an example of the kind of output that can be obtained by the proposed architecture, the paper analyses the end-to-end's performance at application layer in terms of round-trip latency. Several simulations are performed in a European low-voltage network topology to compute the number of meters that can be polled within 15 minutes. Additionally, one experiment tries to determine the optimal position of one of the key nodes in PRIME's networks: the SWITCH node.
本文提出了一种仿真体系结构,可以对电力线通信技术的性能进行分析。具体来说,研究了PRIME标准在低压网络中发送自动抄表(AMR)信息的可行性。与其他研究相比,Mat实验室模拟考虑了物理现象,如背景和脉冲噪声源、信道衰减和多径效应。此外,还使用omnet++网络模拟器对通信过程中的远程信息处理效果进行建模。作为该架构可获得的输出类型的一个示例,本文从往返延迟的角度分析了应用层端到端性能。在欧洲低压网络拓扑中进行了几次模拟,以计算15分钟内可以轮询的仪表数量。此外,一个实验试图确定PRIME网络中一个关键节点的最佳位置:SWITCH节点。
{"title":"Automatic Meter-Reading Simulation through Power Line Communication","authors":"J. Matanza, S. Alexandres, C. Rodríguez-Morcillo","doi":"10.1109/MASCOTS.2013.36","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.36","url":null,"abstract":"This paper proposes a simulation's architecture that allows for the analysis of the performance when using the Power Line Communication's technology. In concrete, it studies the viability of PRIME' standard, to send Automatic Meter Reading (AMR) messages through a low voltage network. In contrast with other studies, physical phenomena-such as background and impulsive noise sources, channel attenuation and multipath effect-are taken into account by Mat lab simulations. Additionally, OMNeT++ network simulator is used to model the telematic effects that occur in the communication process. As an example of the kind of output that can be obtained by the proposed architecture, the paper analyses the end-to-end's performance at application layer in terms of round-trip latency. Several simulations are performed in a European low-voltage network topology to compute the number of meters that can be polled within 15 minutes. Additionally, one experiment tries to determine the optimal position of one of the key nodes in PRIME's networks: the SWITCH node.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134559750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
HTTP Adaptive Streaming in Mobile Networks: Characteristics and Caching Opportunities 移动网络中的HTTP自适应流:特征和缓存机会
Ali Gouta, D. Hong, Anne-Marie Kermarrec, Yannick Le Louédec
Cellular networks have witnessed the emergence of the HTTP Adaptive Streaming (HAS) as a new video delivery method. In HAS, several qualities of the same videos are made available in the network so that clients can choose the best quality that fits their bandwidth capacity. This has particular implications on caching strategies with respect to the viewing patterns and the switching behavior between video qualities. In this paper we present analysis of a real HAS dataset collected in France and provided by the country's largest mobile phone operator. Firstly, we analyse the viewing patterns of HAS contents and the distribution of the encoding bit rates requested by mobile clients. Secondly, we give an in-depth analysis of the switching pattern between video bit rates during a video session and assess the implication on the caching efficiency. We also model this switching based on empirical observations. Finally, we propose WA-LRU a new caching algorithm tailored for HAS contents and compare it to the standard LRU. Our evaluations demonstrate that WA-LRU performs better and achieves its goals.
蜂窝网络见证了HTTP自适应流(HAS)作为一种新的视频传输方法的出现。在HAS中,同一视频的几种质量在网络中可用,以便客户可以选择适合其带宽容量的最佳质量。这对与观看模式和视频质量之间的切换行为有关的缓存策略具有特殊的含义。在这篇论文中,我们对法国最大的移动电话运营商提供的真实HAS数据集进行了分析。首先,我们分析了HAS内容的观看模式和移动客户端所要求的编码码率分布。其次,我们深入分析了视频会话中视频比特率之间的切换模式,并评估了对缓存效率的影响。我们还根据经验观察建立了这种转换的模型。最后,我们提出了一种新的针对HAS内容的缓存算法WA-LRU,并将其与标准LRU进行了比较。我们的评估表明,WA-LRU表现更好,达到了目标。
{"title":"HTTP Adaptive Streaming in Mobile Networks: Characteristics and Caching Opportunities","authors":"Ali Gouta, D. Hong, Anne-Marie Kermarrec, Yannick Le Louédec","doi":"10.1109/MASCOTS.2013.17","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.17","url":null,"abstract":"Cellular networks have witnessed the emergence of the HTTP Adaptive Streaming (HAS) as a new video delivery method. In HAS, several qualities of the same videos are made available in the network so that clients can choose the best quality that fits their bandwidth capacity. This has particular implications on caching strategies with respect to the viewing patterns and the switching behavior between video qualities. In this paper we present analysis of a real HAS dataset collected in France and provided by the country's largest mobile phone operator. Firstly, we analyse the viewing patterns of HAS contents and the distribution of the encoding bit rates requested by mobile clients. Secondly, we give an in-depth analysis of the switching pattern between video bit rates during a video session and assess the implication on the caching efficiency. We also model this switching based on empirical observations. Finally, we propose WA-LRU a new caching algorithm tailored for HAS contents and compare it to the standard LRU. Our evaluations demonstrate that WA-LRU performs better and achieves its goals.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131568931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Analytic Models of Applications in Multi-core Computers 分析模型在多核计算机中的应用
Shouvik Bardhan, D. Menascé
Multi-core computers are becoming increasingly ubiquitous. Understanding and being able to predict the performance of applications that run on such machines is paramount. This paper first shows experimentally that memory contention resulting from multiple cores accessing shared memory can become a significant component of an application's execution time. Then, the paper develops an approximate single-class analytic performance model that captures the effect of memory contention. The model is validated through measurements taken on a micro-benchmark and on well known Unix memory benchmark programs on machines with 4, 12, and 16 cores. The paper also shows that there is a significant difference in the predictions when memory contention is not considered.
多核计算机正变得越来越普遍。理解并能够预测在这些机器上运行的应用程序的性能是至关重要的。本文首先通过实验证明,多核访问共享内存导致的内存争用可能成为应用程序执行时间的重要组成部分。然后,本文开发了一个近似的单类分析性能模型,以捕获内存争用的影响。该模型通过在4核、12核和16核机器上的微基准测试和著名的Unix内存基准测试程序进行的测量得到验证。本文还表明,在不考虑内存争用的情况下,预测结果有显著差异。
{"title":"Analytic Models of Applications in Multi-core Computers","authors":"Shouvik Bardhan, D. Menascé","doi":"10.1109/MASCOTS.2013.43","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.43","url":null,"abstract":"Multi-core computers are becoming increasingly ubiquitous. Understanding and being able to predict the performance of applications that run on such machines is paramount. This paper first shows experimentally that memory contention resulting from multiple cores accessing shared memory can become a significant component of an application's execution time. Then, the paper develops an approximate single-class analytic performance model that captures the effect of memory contention. The model is validated through measurements taken on a micro-benchmark and on well known Unix memory benchmark programs on machines with 4, 12, and 16 cores. The paper also shows that there is a significant difference in the predictions when memory contention is not considered.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128047102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Towards Instruction Level Record and Replay of Sensor Network Applications 传感器网络应用的指令级记录与重放研究
Lipeng Wan, Qing Cao
Debugging wireless sensor network (WSN) applications has been complicated for multiple reasons, among which the lack of visibility is one of the most challenging. To address this issue, in this paper, we present a systematic approach to record and replay WSN applications at the granularity of instructions. This approach differs from previous ones in that it is purely software based, therefore, no additional hardware component is needed. Our key idea is to combine the static, structural information of the assembly-level code with their dynamic, run-time traces as measured by timestamps and basic block counters, so that we can faithfully infer and replay the actual execution paths of applications at instruction level in a post-mortem manner. The evaluation results show that this approach is feasible despite of the resource constraints of sensor nodes. We also provide two case studies to demonstrate that our instruction level record-and-replay approach can be used to: (1) discover randomness of EEPROM writing time, (2) localize stack smashing bugs in sensor network applications.
由于多种原因,调试无线传感器网络(WSN)应用变得非常复杂,其中缺乏可见性是最具挑战性的原因之一。为了解决这个问题,在本文中,我们提出了一种在指令粒度上记录和重播WSN应用程序的系统方法。这种方法不同于以前的方法,因为它完全基于软件,因此不需要额外的硬件组件。我们的关键思想是将汇编级代码的静态、结构化信息与它们的动态、运行时跟踪(通过时间戳和基本块计数器测量)结合起来,这样我们就可以以事后分析的方式忠实地推断和重播指令级应用程序的实际执行路径。评估结果表明,该方法在传感器节点资源约束下是可行的。我们还提供了两个案例研究来证明我们的指令级记录和重播方法可以用于:(1)发现EEPROM写入时间的随机性,(2)定位传感器网络应用中的堆栈破坏错误。
{"title":"Towards Instruction Level Record and Replay of Sensor Network Applications","authors":"Lipeng Wan, Qing Cao","doi":"10.1109/MASCOTS.2013.69","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.69","url":null,"abstract":"Debugging wireless sensor network (WSN) applications has been complicated for multiple reasons, among which the lack of visibility is one of the most challenging. To address this issue, in this paper, we present a systematic approach to record and replay WSN applications at the granularity of instructions. This approach differs from previous ones in that it is purely software based, therefore, no additional hardware component is needed. Our key idea is to combine the static, structural information of the assembly-level code with their dynamic, run-time traces as measured by timestamps and basic block counters, so that we can faithfully infer and replay the actual execution paths of applications at instruction level in a post-mortem manner. The evaluation results show that this approach is feasible despite of the resource constraints of sensor nodes. We also provide two case studies to demonstrate that our instruction level record-and-replay approach can be used to: (1) discover randomness of EEPROM writing time, (2) localize stack smashing bugs in sensor network applications.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125360173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
I/O Performance Modeling of Virtualized Storage Systems 虚拟化存储系统I/O性能建模
Qais Noorshams, Kiana Rostami, Samuel Kounev, P. Tůma, Ralf H. Reussner
Server virtualization is a key technology to share physical resources efficiently and flexibly. With the increasing popularity of I/O-intensive applications, however, the virtualized storage used in shared environments can easily become a bottleneck and cause performance and scalability issues. Performance modeling and evaluation techniques applied prior to system deployment help to avoid such issues. In current practice, however, virtualized storage and its effects on the overall system performance are often neglected or treated as a black-box. In this paper, we present a systematic I/O performance modeling approach for virtualized storage systems based on queueing theory. We first propose a general performance model building methodology. Then, we demonstrate our methodology creating I/O queueing models of a real-world representative environment based on IBM System z and IBM DS8700 server hardware. Finally, we present an in-depth evaluation of our models considering both interpolation and extrapolation scenarios as well as scenarios with multiple virtual machines. Overall, we effectively create performance models with less than 11% mean prediction error in the worst case and less than 5% prediction error on average.
服务器虚拟化是实现物理资源高效、灵活共享的关键技术。然而,随着I/ o密集型应用程序的日益普及,共享环境中使用的虚拟化存储很容易成为瓶颈,并导致性能和可伸缩性问题。在系统部署之前应用的性能建模和评估技术有助于避免此类问题。然而,在当前的实践中,虚拟化存储及其对系统整体性能的影响往往被忽略或视为一个黑箱。本文提出了一种基于排队理论的虚拟化存储系统I/O性能建模方法。我们首先提出了一种通用的性能模型构建方法。然后,我们演示了基于IBM System z和IBM DS8700服务器硬件创建现实世界代表性环境的I/O队列模型的方法。最后,我们对我们的模型进行了深入的评估,考虑了插值和外推场景以及多个虚拟机的场景。总的来说,我们有效地创建了在最坏情况下平均预测误差小于11%,平均预测误差小于5%的性能模型。
{"title":"I/O Performance Modeling of Virtualized Storage Systems","authors":"Qais Noorshams, Kiana Rostami, Samuel Kounev, P. Tůma, Ralf H. Reussner","doi":"10.1109/MASCOTS.2013.20","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.20","url":null,"abstract":"Server virtualization is a key technology to share physical resources efficiently and flexibly. With the increasing popularity of I/O-intensive applications, however, the virtualized storage used in shared environments can easily become a bottleneck and cause performance and scalability issues. Performance modeling and evaluation techniques applied prior to system deployment help to avoid such issues. In current practice, however, virtualized storage and its effects on the overall system performance are often neglected or treated as a black-box. In this paper, we present a systematic I/O performance modeling approach for virtualized storage systems based on queueing theory. We first propose a general performance model building methodology. Then, we demonstrate our methodology creating I/O queueing models of a real-world representative environment based on IBM System z and IBM DS8700 server hardware. Finally, we present an in-depth evaluation of our models considering both interpolation and extrapolation scenarios as well as scenarios with multiple virtual machines. Overall, we effectively create performance models with less than 11% mean prediction error in the worst case and less than 5% prediction error on average.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134173010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Effect of Latent Errors on the Reliability of Data Storage Systems 潜在错误对数据存储系统可靠性的影响
V. Venkatesan, I. Iliadis
The reliability of data storage systems is adversely affected by the presence of latent sector errors. As the number of occurrences of such errors increases with the storage capacity, latent sector errors have become more prevalent in today's high capacity storage devices. Such errors are typically not detected until an attempt is made to read the affected sectors. When a latent sector error is detected, the redundant data corresponding to the affected sector is used to recover its data. However, if no such redundant data is available, then the data of the affected sector is irrecoverably lost from the storage system. Therefore, the reliability of data storage systems is affected by both the complete failure of storage nodes and the latent sector errors within them. In this article, closed-form expressions for the mean time to data loss (MTTDL) of erasure coded storage systems in the presence of latent errors are derived. The effect of latent errors on systems with various types of redundancy, data placement, and sector error probabilities is studied. For small latent sector error probabilities, it is shown that the MTTDL is reduced by a factor that is independent of the number of parities in the data redundancy scheme as well as the number of nodes in the system. However, for large latent sector error probabilities, the MTTDL is similar to that of a system using a data redundancy scheme with one parity less. The reduction of the MTTDL in the latter case is more pronounced than in the former one.
潜在扇区错误的存在,会严重影响数据存储系统的可靠性。由于此类错误的发生次数随着存储容量的增加而增加,在当今的高容量存储设备中,潜在扇区错误变得更加普遍。直到尝试读取受影响的扇区时,通常不会检测到此类错误。当检测到潜在扇区错误时,将使用受影响扇区对应的冗余数据来恢复其数据。但是,如果没有这样的冗余数据,则受影响扇区的数据将不可恢复地从存储系统中丢失。因此,数据存储系统的可靠性既受到存储节点完全失效的影响,也受到存储节点扇区潜在错误的影响。本文推导了存在潜在错误的擦除编码存储系统的平均数据丢失时间(MTTDL)的封闭表达式。研究了潜在错误对具有不同类型冗余、数据放置和扇区错误概率的系统的影响。对于较小的潜在扇区错误概率,表明MTTDL被一个与数据冗余方案中的对等体数量和系统中的节点数量无关的因素所减少。然而,对于较大的潜在扇区错误概率,MTTDL类似于使用少一个奇偶校验的数据冗余方案的系统。在后一种情况下,MTTDL的减少比前一种情况更为明显。
{"title":"Effect of Latent Errors on the Reliability of Data Storage Systems","authors":"V. Venkatesan, I. Iliadis","doi":"10.1109/MASCOTS.2013.38","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.38","url":null,"abstract":"The reliability of data storage systems is adversely affected by the presence of latent sector errors. As the number of occurrences of such errors increases with the storage capacity, latent sector errors have become more prevalent in today's high capacity storage devices. Such errors are typically not detected until an attempt is made to read the affected sectors. When a latent sector error is detected, the redundant data corresponding to the affected sector is used to recover its data. However, if no such redundant data is available, then the data of the affected sector is irrecoverably lost from the storage system. Therefore, the reliability of data storage systems is affected by both the complete failure of storage nodes and the latent sector errors within them. In this article, closed-form expressions for the mean time to data loss (MTTDL) of erasure coded storage systems in the presence of latent errors are derived. The effect of latent errors on systems with various types of redundancy, data placement, and sector error probabilities is studied. For small latent sector error probabilities, it is shown that the MTTDL is reduced by a factor that is independent of the number of parities in the data redundancy scheme as well as the number of nodes in the system. However, for large latent sector error probabilities, the MTTDL is similar to that of a system using a data redundancy scheme with one parity less. The reduction of the MTTDL in the latter case is more pronounced than in the former one.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"143 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120886526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
"The Tail Wags the Dog": A Study of Anomaly Detection in Commercial Application Performance “尾巴摇狗”:商业应用性能异常检测研究
Richard Gow, S. Venugopal, P. Ray
The IT industry needs systems management models that leverage available application information to detect quality of service, scalability and health of service. Ideally this technique would be common for varying application types with different n-tier architectures under normal production conditions of varying load, user session traffic, transaction type, transaction mix, and hosting environment. This paper shows that a whole of service measurement paradigm utilizing a black box M/M/1 queuing model and auto regression curve fitting of the associated CDF are an accurate model to characterize system performance signatures. This modeling method is used to detect application slow down events. The method did not rely on customizations specific to the n-tier architecture of the systems being analyzed and so the performance anomaly detection technique was shown to be platform and configuration agnostic.
IT行业需要利用可用应用程序信息来检测服务质量、可伸缩性和服务运行状况的系统管理模型。理想情况下,在负载、用户会话流量、事务类型、事务组合和托管环境变化的正常生产条件下,对于具有不同n层体系结构的不同应用程序类型,这种技术是常见的。本文表明,利用黑盒M/M/1排队模型和相关CDF的自动回归曲线拟合的整体服务度量范式是表征系统性能特征的准确模型。此建模方法用于检测应用程序变慢事件。该方法不依赖于特定于被分析系统的n层体系结构的定制,因此性能异常检测技术与平台和配置无关。
{"title":"\"The Tail Wags the Dog\": A Study of Anomaly Detection in Commercial Application Performance","authors":"Richard Gow, S. Venugopal, P. Ray","doi":"10.1109/MASCOTS.2013.51","DOIUrl":"https://doi.org/10.1109/MASCOTS.2013.51","url":null,"abstract":"The IT industry needs systems management models that leverage available application information to detect quality of service, scalability and health of service. Ideally this technique would be common for varying application types with different n-tier architectures under normal production conditions of varying load, user session traffic, transaction type, transaction mix, and hosting environment. This paper shows that a whole of service measurement paradigm utilizing a black box M/M/1 queuing model and auto regression curve fitting of the associated CDF are an accurate model to characterize system performance signatures. This modeling method is used to detect application slow down events. The method did not rely on customizations specific to the n-tier architecture of the systems being analyzed and so the performance anomaly detection technique was shown to be platform and configuration agnostic.","PeriodicalId":385538,"journal":{"name":"2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128868886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1