首页 > 最新文献

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)最新文献

英文 中文
Fault injection acceleration by architectural importance sampling 通过架构重要性采样加速故障注入
Mojtaba Ebrahimi, Nour Sayed, Maryam Rashvand, M. Tahoori
Radiation-induced soft errors are major reliability concerns in advanced technology nodes. The de facto approach for evaluation of the soft error vulnerability is to perform a costly fault injection campaign. Due to the long residency of some errors in system states, the error has to be traced for even millions of cycles. However, only a very small portion of injected errors leads to the failure. This means that many simulation cycles are wasted as they contribute to no failure due to various masking effects. In this paper, we present an importance sampling technique based on Architecturally Correct Execution (ACE) analysis to identify the non-vulnerable time intervals in memory arrays and avoid unnecessary fault injections to speedup the soft error vulnerability evaluation process without sacrificing the accuracy. Our analysis reveals that this approach significantly expedites our architecture-level fault injection technique (on average by 13X).
辐射引起的软误差是先进技术节点的主要可靠性问题。评估软错误漏洞的实际方法是执行代价高昂的错误注入活动。由于某些误差在系统状态中长时间驻留,因此必须跟踪误差甚至数百万个周期。然而,只有很小一部分注入错误会导致失败。这意味着许多模拟周期被浪费了,因为它们不会由于各种掩蔽效应而导致失败。本文提出了一种基于架构正确执行(ACE)分析的重要采样技术,用于识别存储阵列中的非漏洞时间间隔,避免不必要的错误注入,从而在不牺牲准确性的前提下加快软错误漏洞评估过程。我们的分析表明,这种方法显著加快了架构级故障注入技术的速度(平均提高了13倍)。
{"title":"Fault injection acceleration by architectural importance sampling","authors":"Mojtaba Ebrahimi, Nour Sayed, Maryam Rashvand, M. Tahoori","doi":"10.1109/CODESISSS.2015.7331384","DOIUrl":"https://doi.org/10.1109/CODESISSS.2015.7331384","url":null,"abstract":"Radiation-induced soft errors are major reliability concerns in advanced technology nodes. The de facto approach for evaluation of the soft error vulnerability is to perform a costly fault injection campaign. Due to the long residency of some errors in system states, the error has to be traced for even millions of cycles. However, only a very small portion of injected errors leads to the failure. This means that many simulation cycles are wasted as they contribute to no failure due to various masking effects. In this paper, we present an importance sampling technique based on Architecturally Correct Execution (ACE) analysis to identify the non-vulnerable time intervals in memory arrays and avoid unnecessary fault injections to speedup the soft error vulnerability evaluation process without sacrificing the accuracy. Our analysis reveals that this approach significantly expedites our architecture-level fault injection technique (on average by 13X).","PeriodicalId":281383,"journal":{"name":"2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115262964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Completeness bounds and sequentialization for model checking of interacting firmware and hardware 固件和硬件交互模型检验的完备性边界和顺序化
Sunha Ahn, S. Malik, Aarti Gupta
An emerging trend in system design is to implement complex system management functions in firmware (FW). This changing design scenario requires support for verifying FW in the context of its hardware (HW) environment. As shown in previous work, there is value in a unified HW-FW model for driving the verification tasks. This model can help identify specific commonly-occurring interaction patterns between the HW and FW. These patterns enable pruning the verification search space as demonstrated in previous work in automating FW test generation using concolic testing. In this work, we introduce a bounded model checking (BMC)-based methodology for FW verification. Although BMC is effective for finding bugs by unrolling the underlying transition system up to some bound, it requires a completeness threshold on the bound to ensure complete verification. We show how commonly occurring FW code patterns can be exploited, using inexpensive static analysis techniques, to determine this completeness bound. Further, we show how this bound analysis, combined with the interaction patterns in the unified HW-FW model, is used to sequentialize the concurrent FW and HW code, i.e., to derive a sequential program that represents the parallel interaction of the FW and HW. This enables the direct application of standard software model checkers such as CBMC on this sequentialized program. We have automated this process by implementing: (i) a static completeness bound analyzer on top of the tool Frama-C, and (ii) a sequentializer to generate code for verification by the CBMC model checker. We evaluate the resulting tool using three real FW benchmarks, each consisting of a Linux device driver and its interacting QEMU-emulated HW code with multiple correctness properties. We successfully computed the BMC completeness bounds for 41 out of 46 properties and completed model checking for 12 out of 16 FW transactions.
在固件(FW)中实现复杂的系统管理功能是系统设计的一个新趋势。这种不断变化的设计场景需要支持在硬件环境中对FW进行验证。如前所述,统一的HW-FW模型对于驱动验证任务是有价值的。该模型可以帮助识别硬件和FW之间常见的特定交互模式。这些模式可以修剪验证搜索空间,正如前面使用集合测试自动化FW测试生成的工作中所演示的那样。在这项工作中,我们引入了一种基于有界模型检查(BMC)的FW验证方法。尽管BMC通过将底层转换系统展开到某个边界,可以有效地发现bug,但它需要在该边界上设置一个完整性阈值,以确保完成验证。我们展示了如何利用常见的FW代码模式,使用廉价的静态分析技术来确定这个完整性界限。此外,我们还展示了如何结合统一的HW-FW模型中的交互模式,使用这种界分析对并发的FW和HW代码进行顺序化,即推导出一个表示FW和HW并行交互的顺序程序。这使得标准软件模型检查器(如CBMC)可以直接应用于这个顺序化的程序。我们通过实现:(i)在工具Frama-C之上的静态完整性绑定分析器,以及(ii)一个序列化器来生成代码,以供CBMC模型检查器进行验证。我们使用三个真实的FW基准测试来评估生成的工具,每个基准测试都包含一个Linux设备驱动程序及其交互的qemu模拟的具有多个正确性属性的硬件代码。我们成功地计算了46个属性中的41个属性的BMC完整性边界,并完成了16个FW事务中的12个事务的模型检查。
{"title":"Completeness bounds and sequentialization for model checking of interacting firmware and hardware","authors":"Sunha Ahn, S. Malik, Aarti Gupta","doi":"10.1109/CODESISSS.2015.7331383","DOIUrl":"https://doi.org/10.1109/CODESISSS.2015.7331383","url":null,"abstract":"An emerging trend in system design is to implement complex system management functions in firmware (FW). This changing design scenario requires support for verifying FW in the context of its hardware (HW) environment. As shown in previous work, there is value in a unified HW-FW model for driving the verification tasks. This model can help identify specific commonly-occurring interaction patterns between the HW and FW. These patterns enable pruning the verification search space as demonstrated in previous work in automating FW test generation using concolic testing. In this work, we introduce a bounded model checking (BMC)-based methodology for FW verification. Although BMC is effective for finding bugs by unrolling the underlying transition system up to some bound, it requires a completeness threshold on the bound to ensure complete verification. We show how commonly occurring FW code patterns can be exploited, using inexpensive static analysis techniques, to determine this completeness bound. Further, we show how this bound analysis, combined with the interaction patterns in the unified HW-FW model, is used to sequentialize the concurrent FW and HW code, i.e., to derive a sequential program that represents the parallel interaction of the FW and HW. This enables the direct application of standard software model checkers such as CBMC on this sequentialized program. We have automated this process by implementing: (i) a static completeness bound analyzer on top of the tool Frama-C, and (ii) a sequentializer to generate code for verification by the CBMC model checker. We evaluate the resulting tool using three real FW benchmarks, each consisting of a Linux device driver and its interacting QEMU-emulated HW code with multiple correctness properties. We successfully computed the BMC completeness bounds for 41 out of 46 properties and completed model checking for 12 out of 16 FW transactions.","PeriodicalId":281383,"journal":{"name":"2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133573454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
How to improve the space utilization of dedup-based PCM storage devices? 如何提高基于深度挖掘的PCM存储设备的空间利用率?
Chun-Ta Lin, Yuan-Hao Chang, Tei-Wei Kuo, Hung-Sheng Chang, Hsiang-Pang Li
There is a growing demand to introduce more and more intelligence to storage devices in recent years, especially with the rapid increasing of hardware computing power. This paper targets on essential design issues in space utilization for dedup-based non-volatile phase-change memory (PCM). We explore the adoption of data duplication techniques to reduce potential data duplicates over PCM storage devices to provide more storage space than the physical storage space does. Among various data deduplication techniques, variable-sized chunking is considered in less cost-effective PCM-based storage devices because variable-sized chunking has better data deduplication capability than fixed-sized chunking. However, in a typical system architecture, data are written or updated in the fixed management units (e.g., LBAs). Thus, to ultimately improve the space utilization of PCM-based storage device, the technical problem falls on (1) how to map fixed-sized LBAs to variable-sized chunks and (2) how to efficiently manage (i.e., allocated and deallocate) free PCM storage space for variable-sized chunks. In this work, we propose a free space manager, called container-based space manager, to resolve the above two issues by exploiting the fact that (1) a storage system initially has more free space to relax the complexity on space management and (2) the space optimization of a storage system can grow with the time when it contains more and more data. The proposed design is evaluated over popular benchmarks, for which we have very encouraging results.
近年来,特别是随着硬件计算能力的迅速提高,人们对存储设备的智能化要求越来越高。本文研究了基于深度挖掘的非易失性相变存储器(PCM)空间利用的基本设计问题。我们探索采用数据复制技术来减少PCM存储设备上潜在的数据重复,从而提供比物理存储空间更多的存储空间。在各种重复数据删除技术中,可变大小的分块被认为是成本较低的基于pcm的存储设备,因为可变大小的分块比固定大小的分块具有更好的重复数据删除能力。然而,在典型的系统架构中,数据是在固定的管理单元(例如,lba)中写入或更新的。因此,为了最终提高基于PCM的存储设备的空间利用率,技术问题在于(1)如何将固定大小的lba映射到可变大小的块,(2)如何有效地管理(即分配和释放)可变大小块的空闲PCM存储空间。为了解决以上两个问题,本文提出了一种基于容器的空间管理器,即基于容器的空间管理器,利用存储系统在初始阶段拥有更多的自由空间来缓解空间管理的复杂性,以及当存储系统的数据量越来越大时,存储系统的空间优化可以随着时间的推移而增长。建议的设计在流行的基准上进行了评估,我们得到了非常令人鼓舞的结果。
{"title":"How to improve the space utilization of dedup-based PCM storage devices?","authors":"Chun-Ta Lin, Yuan-Hao Chang, Tei-Wei Kuo, Hung-Sheng Chang, Hsiang-Pang Li","doi":"10.1109/CODESISSS.2015.7331363","DOIUrl":"https://doi.org/10.1109/CODESISSS.2015.7331363","url":null,"abstract":"There is a growing demand to introduce more and more intelligence to storage devices in recent years, especially with the rapid increasing of hardware computing power. This paper targets on essential design issues in space utilization for dedup-based non-volatile phase-change memory (PCM). We explore the adoption of data duplication techniques to reduce potential data duplicates over PCM storage devices to provide more storage space than the physical storage space does. Among various data deduplication techniques, variable-sized chunking is considered in less cost-effective PCM-based storage devices because variable-sized chunking has better data deduplication capability than fixed-sized chunking. However, in a typical system architecture, data are written or updated in the fixed management units (e.g., LBAs). Thus, to ultimately improve the space utilization of PCM-based storage device, the technical problem falls on (1) how to map fixed-sized LBAs to variable-sized chunks and (2) how to efficiently manage (i.e., allocated and deallocate) free PCM storage space for variable-sized chunks. In this work, we propose a free space manager, called container-based space manager, to resolve the above two issues by exploiting the fact that (1) a storage system initially has more free space to relax the complexity on space management and (2) the space optimization of a storage system can grow with the time when it contains more and more data. The proposed design is evaluated over popular benchmarks, for which we have very encouraging results.","PeriodicalId":281383,"journal":{"name":"2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134178192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Improved hard real-time scheduling of CSDF-modeled streaming applications 改进了csdf模型流应用程序的硬实时调度
J. Spasić, Di Liu, E. Cannella, T. Stefanov
Recently, it has been shown that hard real-time scheduling theory can be applied to streaming applications modeled as acyclic Cyclo-Static Dataflow (CSDF) graphs. However, that approach is not efficient in terms of throughput and processor utilization. Therefore, in this paper, we propose an improved hard real-time scheduling approach to schedule streaming applications modeled as acyclic CSDF graphs on a Multi-Processor System-on-Chip (MPSoC) platform. The proposed approach converts each actor in a CSDF graph to a set of real-time periodic tasks. The conversion enables application of many hard real-time scheduling algorithms which offer fast calculation of the required number of processors for scheduling the tasks. We evaluate the performance and time complexity of our approach in comparison to several existing scheduling approaches. Experiments on a set of real-life streaming applications demonstrate that our approach: 1) results in systems with higher throughput and better processor utilization in comparison to the existing hard real-time scheduling approach for CSDF graphs while requiring comparable time for the system derivation; 2) gives the same throughput as the existing periodic scheduling approach for CSDF graphs but requires much shorter time to derive the task schedule and tasks' parameters (periods, start times, etc.); and 3) gives the throughput that is equal or very close to the maximum achievable throughput of an application obtained via self-timed scheduling, but requires much shorter time to derive the schedule. The total time needed for the proposed conversion approach and the calculation of the minimum number of processors needed to schedule the tasks and the calculation of the size of communication buffers between tasks is in the range of seconds.
最近有研究表明,硬实时调度理论可以应用于以无循环循环静态数据流(CSDF)图为模型的流应用。然而,就吞吐量和处理器利用率而言,这种方法并不高效。因此,在本文中,我们提出了一种改进的硬实时调度方法,以调度多处理器片上系统(MPSoC)平台上的非循环CSDF图建模的流应用程序。该方法将CSDF图中的每个参与者转换为一组实时周期性任务。这种转换使许多硬实时调度算法的应用成为可能,这些算法可以快速计算调度任务所需的处理器数量。我们将该方法的性能和时间复杂度与几种现有的调度方法进行了比较。在一组现实生活中的流应用程序上的实验表明,我们的方法:1)与现有的CSDF图硬实时调度方法相比,系统具有更高的吞吐量和更好的处理器利用率,同时需要相当的系统推导时间;2)对CSDF图给出了与现有周期调度方法相同的吞吐量,但需要更短的时间来导出任务调度和任务参数(周期,开始时间等);3)给出的吞吐量等于或非常接近通过自定时调度获得的应用程序的最大可实现吞吐量,但需要更短的时间来推导调度。所提议的转换方法所需的总时间、调度任务所需的最小处理器数量的计算以及任务之间通信缓冲区大小的计算都在秒的范围内。
{"title":"Improved hard real-time scheduling of CSDF-modeled streaming applications","authors":"J. Spasić, Di Liu, E. Cannella, T. Stefanov","doi":"10.1109/CODESISSS.2015.7331369","DOIUrl":"https://doi.org/10.1109/CODESISSS.2015.7331369","url":null,"abstract":"Recently, it has been shown that hard real-time scheduling theory can be applied to streaming applications modeled as acyclic Cyclo-Static Dataflow (CSDF) graphs. However, that approach is not efficient in terms of throughput and processor utilization. Therefore, in this paper, we propose an improved hard real-time scheduling approach to schedule streaming applications modeled as acyclic CSDF graphs on a Multi-Processor System-on-Chip (MPSoC) platform. The proposed approach converts each actor in a CSDF graph to a set of real-time periodic tasks. The conversion enables application of many hard real-time scheduling algorithms which offer fast calculation of the required number of processors for scheduling the tasks. We evaluate the performance and time complexity of our approach in comparison to several existing scheduling approaches. Experiments on a set of real-life streaming applications demonstrate that our approach: 1) results in systems with higher throughput and better processor utilization in comparison to the existing hard real-time scheduling approach for CSDF graphs while requiring comparable time for the system derivation; 2) gives the same throughput as the existing periodic scheduling approach for CSDF graphs but requires much shorter time to derive the task schedule and tasks' parameters (periods, start times, etc.); and 3) gives the throughput that is equal or very close to the maximum achievable throughput of an application obtained via self-timed scheduling, but requires much shorter time to derive the schedule. The total time needed for the proposed conversion approach and the calculation of the minimum number of processors needed to schedule the tasks and the calculation of the size of communication buffers between tasks is in the range of seconds.","PeriodicalId":281383,"journal":{"name":"2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115744533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1