首页 > 最新文献

IPSJ Transactions on System LSI Design Methodology最新文献

英文 中文
An Area Efficient Regular Expression Matching Engine Using Partial Reconfiguration for Quick Pattern Updating 基于局部重构的区域高效正则表达式匹配引擎
Q4 Engineering Pub Date : 2014-01-01 DOI: 10.2197/ipsjtsldm.7.110
Yoichi Wakaba, S. Wakabayashi, Shinobu Nagayama, Masato Inagi
This paper proposes a method using partial reconfiguration to realize a compact regular expression matching engine, which can update a pattern quickly. In the proposed method, a set of partial circuits, each of which handles a different class of regular expressions, are provided in advance. When a regular expression pattern is given, a compact matching engine dedicated to the pattern is implemented on FPGA by combining the partial circuits according to the given pattern using partial reconfiguration. The method can update a pattern quickly, since it does not need re-design of a circuit. Experimental results show that the proposed method reduces 60% circuit size compared with the previous method without increasing the pattern updating time significantly.
本文提出了一种利用部分重构实现精简正则表达式匹配引擎的方法,该引擎可以快速更新模式。在该方法中,预先提供了一组局部电路,每个电路处理不同类型的正则表达式。当给定正则表达式模式时,通过局部重构将给定模式的部分电路组合在一起,在FPGA上实现了一个专用于该模式的紧凑匹配引擎。该方法不需要重新设计电路,可以快速更新模式。实验结果表明,该方法在不显著增加模式更新时间的情况下,电路尺寸比原方法减小了60%。
{"title":"An Area Efficient Regular Expression Matching Engine Using Partial Reconfiguration for Quick Pattern Updating","authors":"Yoichi Wakaba, S. Wakabayashi, Shinobu Nagayama, Masato Inagi","doi":"10.2197/ipsjtsldm.7.110","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.7.110","url":null,"abstract":"This paper proposes a method using partial reconfiguration to realize a compact regular expression matching engine, which can update a pattern quickly. In the proposed method, a set of partial circuits, each of which handles a different class of regular expressions, are provided in advance. When a regular expression pattern is given, a compact matching engine dedicated to the pattern is implemented on FPGA by combining the partial circuits according to the given pattern using partial reconfiguration. The method can update a pattern quickly, since it does not need re-design of a circuit. Experimental results show that the proposed method reduces 60% circuit size compared with the previous method without increasing the pattern updating time significantly.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86109074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Efficient Algorithm for 3D NoC Architecture Optimization 一种高效的三维NoC结构优化算法
Q4 Engineering Pub Date : 2013-02-01 DOI: 10.2197/ipsjtsldm.6.34
Xin Jiang, Ran Zhang, Takahiro Watanabe
{"title":"An Efficient Algorithm for 3D NoC Architecture Optimization","authors":"Xin Jiang, Ran Zhang, Takahiro Watanabe","doi":"10.2197/ipsjtsldm.6.34","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.34","url":null,"abstract":"","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87300161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
VLSI Architecture Design for H.264/AVC Intra-frame Video Encoding H.264/AVC帧内视频编码的VLSI架构设计
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.76
Huang-Chih Kuo, Y. Lin
Intra-frame encoding is useful for many video applications such as security surveillance, digital cinema, and video conferencing because it supports random access to every video frame for easy editing and has low computational complexity that results in low hardware cost. H.264/AVC, which is the most popular video coding standard today, also defines novel intra-coding tools to achieve high compression performance at the expense of significantly increased computational complexity. We present a VLSI design for H.264/AVC intra-frame encoder. The paper summaries several novel approaches to alleviate the performance bottleneck caused by the long data dependency loop among 4 × 4 luma blocks, integrate a high-performance hardwired CABAC entropy encoder, and apply a clock-gating technique to reduce power consumption. Synthesized with a TSMC 130 nm CMOS cell library, our design requires 194.1 K gates at 108 MHz and consumes 19.8 mW to encode 1080p (1920 × 1088) video sequences at 30 frames per second (fps). It also delivers the same video quality as the H.264/AVC reference software. We suggest a figure of merit called Design Efficiency for fair comparison of different works. Experimental results show that the proposed design is more efficient than prior arts.
帧内编码对于许多视频应用程序(如安全监控、数字影院和视频会议)非常有用,因为它支持随机访问每个视频帧以方便编辑,并且具有较低的计算复杂度,从而降低硬件成本。H.264/AVC是当今最流行的视频编码标准,它也定义了新的内部编码工具,以显著增加计算复杂性为代价实现高压缩性能。提出了一种用于H.264/AVC帧内编码器的VLSI设计方案。本文总结了几种新方法来缓解4 × 4亮度块之间的长数据依赖环路造成的性能瓶颈,集成了高性能硬连线CABAC熵编码器,并应用时钟门控技术来降低功耗。我们的设计采用台积电130纳米CMOS单元库合成,需要194.1 K栅极,频率为108 MHz,功耗为19.8 mW,以每秒30帧(fps)的速度编码1080p (1920 × 1088)视频序列。它还提供与H.264/AVC参考软件相同的视频质量。为了公平地比较不同的作品,我们建议使用一个叫做“设计效率”的价值指标。实验结果表明,该设计比现有技术效率更高。
{"title":"VLSI Architecture Design for H.264/AVC Intra-frame Video Encoding","authors":"Huang-Chih Kuo, Y. Lin","doi":"10.2197/ipsjtsldm.6.76","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.76","url":null,"abstract":"Intra-frame encoding is useful for many video applications such as security surveillance, digital cinema, and video conferencing because it supports random access to every video frame for easy editing and has low computational complexity that results in low hardware cost. H.264/AVC, which is the most popular video coding standard today, also defines novel intra-coding tools to achieve high compression performance at the expense of significantly increased computational complexity. We present a VLSI design for H.264/AVC intra-frame encoder. The paper summaries several novel approaches to alleviate the performance bottleneck caused by the long data dependency loop among 4 × 4 luma blocks, integrate a high-performance hardwired CABAC entropy encoder, and apply a clock-gating technique to reduce power consumption. Synthesized with a TSMC 130 nm CMOS cell library, our design requires 194.1 K gates at 108 MHz and consumes 19.8 mW to encode 1080p (1920 × 1088) video sequences at 30 frames per second (fps). It also delivers the same video quality as the H.264/AVC reference software. We suggest a figure of merit called Design Efficiency for fair comparison of different works. Experimental results show that the proposed design is more efficient than prior arts.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78802015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A New Formal Verification Approach for Hardware-dependent Embedded System Software 基于硬件的嵌入式系统软件形式化验证新方法
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.135
Bernard Schmidt, Carlos Villarraga, Thomas Fehmel, J. Bormann, Markus Wedler, Minh D. Nguyen, D. Stoffel, W. Kunz
: This paper describes a method to generate a computational model for formal verification of hardware- dependent software in embedded systems. The computational model of the combined HW / SW system is a program netlist (PN) consisting of instruction cells connected in a directed acyclic graph that compactly represents all execution paths of the software. The model can be easily integrated into SAT-based verification environments such as those based on Bounded Model Checking (BMC). The proposed construction of the model allows for an e ffi cient reasoning of the SAT solver over entire execution paths. Program netlists are compositional. The paper presents how they can be com- bined to model interrupt-driven systems. We demonstrate the e ffi ciency of our approach by presenting experimental results from the formal verification of an industrial LIN (Local Interconnect Network) bus node, implemented as a software driver on a 32-bit RISC machine.
本文描述了一种生成嵌入式系统中硬件相关软件形式化验证计算模型的方法。硬件/软件组合系统的计算模型是一个由指令单元组成的程序网表(PN),这些指令单元连接在一个有向无环图中,该图紧凑地表示软件的所有执行路径。该模型可以很容易地集成到基于sat的验证环境中,例如基于有界模型检查(BMC)的验证环境。提出的模型结构允许在整个执行路径上对SAT求解器进行有效的推理。程序网表是组合的。本文介绍了如何将它们结合起来对中断驱动系统进行建模。我们通过在32位RISC机器上作为软件驱动程序实现的工业LIN(本地互连网络)总线节点的形式化验证的实验结果来证明我们方法的有效性。
{"title":"A New Formal Verification Approach for Hardware-dependent Embedded System Software","authors":"Bernard Schmidt, Carlos Villarraga, Thomas Fehmel, J. Bormann, Markus Wedler, Minh D. Nguyen, D. Stoffel, W. Kunz","doi":"10.2197/ipsjtsldm.6.135","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.135","url":null,"abstract":": This paper describes a method to generate a computational model for formal verification of hardware- dependent software in embedded systems. The computational model of the combined HW / SW system is a program netlist (PN) consisting of instruction cells connected in a directed acyclic graph that compactly represents all execution paths of the software. The model can be easily integrated into SAT-based verification environments such as those based on Bounded Model Checking (BMC). The proposed construction of the model allows for an e ffi cient reasoning of the SAT solver over entire execution paths. Program netlists are compositional. The paper presents how they can be com- bined to model interrupt-driven systems. We demonstrate the e ffi ciency of our approach by presenting experimental results from the formal verification of an industrial LIN (Local Interconnect Network) bus node, implemented as a software driver on a 32-bit RISC machine.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87111110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Power Converter-aware Design of Electronics Systems 电子系统的电源变换器感知设计
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.2
Sangyoung Park, Younghyun Kim, Jaehyun Park, N. Chang
Semiconductor scaling makes the individual part can no longer share the same supply voltage, and some chips even require multiple different supply voltage levels. Different input and output voltage standard specification of each device make use of multiple supply voltage levels. Various devices such as display, RF, USB, SD card, etc. increase the number of supply voltage levels. Moreover, analog devices often do not allow sharing power supply due to coupling noise. However, those components are commonly powered by a single power source such as a battery. Consequently, power converters such as onand off-chip switching-mode DC–DC converters, low-dropout linear regulators and charge pumps are largely populated even on a single circuit board. Efficiency of the power converters is known to be high enough and often ignored during power management policy development. However, their actual conversion efficiency varies significantly according to device activity and power mode, which sometimes results in substantially lower efficiency than the value provided in datasheets. Moreover, hardware designers generally optimize the power converters for the maximum power supply current of the device and even perform over-design while the actual device power consumption during runtime could be largely offset from the energy-optimal operating point. This tutorial paper covers a wide range of topics on power converter-aware design and introduces several design practices; i) power converter basics and the conversion efficiency, ii) power converter voltage transition overhead, iii) power converter-aware design of embedded systems, and iv) maximum energy transfer of energy harvesting devices.
半导体的缩放使得单个部件不能再共享相同的供电电压,有些芯片甚至需要多个不同的供电电压水平。每个设备的输入输出电压标准规格不同,使用多个电源电压等级。各种设备,如显示器,RF, USB, SD卡等,增加了供电电压等级的数量。此外,由于耦合噪声,模拟设备通常不允许共享电源。然而,这些组件通常由单一电源供电,如电池。因此,电源转换器,如片上和片外开关模式DC-DC转换器,低差线性稳压器和电荷泵,即使在单个电路板上也大量填充。众所周知,电源转换器的效率足够高,但在电源管理政策制定过程中往往被忽视。然而,它们的实际转换效率根据设备活动和电源模式有很大差异,这有时会导致效率大大低于数据表中提供的值。此外,硬件设计人员通常会根据器件的最大供电电流对功率转换器进行优化,甚至进行过度设计,而器件在运行时的实际功耗可能在很大程度上与能量最优工作点相抵消。本教程涵盖了电源转换器感知设计的广泛主题,并介绍了几个设计实践;I)功率转换器的基础知识和转换效率,ii)功率转换器电压转换开销,iii)嵌入式系统的功率转换器感知设计,iv)能量收集设备的最大能量传输。
{"title":"Power Converter-aware Design of Electronics Systems","authors":"Sangyoung Park, Younghyun Kim, Jaehyun Park, N. Chang","doi":"10.2197/ipsjtsldm.6.2","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.2","url":null,"abstract":"Semiconductor scaling makes the individual part can no longer share the same supply voltage, and some chips even require multiple different supply voltage levels. Different input and output voltage standard specification of each device make use of multiple supply voltage levels. Various devices such as display, RF, USB, SD card, etc. increase the number of supply voltage levels. Moreover, analog devices often do not allow sharing power supply due to coupling noise. However, those components are commonly powered by a single power source such as a battery. Consequently, power converters such as onand off-chip switching-mode DC–DC converters, low-dropout linear regulators and charge pumps are largely populated even on a single circuit board. Efficiency of the power converters is known to be high enough and often ignored during power management policy development. However, their actual conversion efficiency varies significantly according to device activity and power mode, which sometimes results in substantially lower efficiency than the value provided in datasheets. Moreover, hardware designers generally optimize the power converters for the maximum power supply current of the device and even perform over-design while the actual device power consumption during runtime could be largely offset from the energy-optimal operating point. This tutorial paper covers a wide range of topics on power converter-aware design and introduces several design practices; i) power converter basics and the conversion efficiency, ii) power converter voltage transition overhead, iii) power converter-aware design of embedded systems, and iv) maximum energy transfer of energy harvesting devices.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88357464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantitative Evaluation of Resource Sharing in High-level Synthesis Using Realistic Benchmarks 基于现实基准的高水平综合资源共享定量评价
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.122
Yuko Hara-Azumi, Toshinobu Matsuba, H. Tomiyama, S. Honda, H. Takada
For FPGA-based designs generated through high-level synthesis (HLS), effects of resource sharing/unsharing on clock frequency, execution time, and area are quantitatively evaluated for several practically large benchmarks on multiple FPGA devices. Through experiments, we observed five important findings about resource sharing/unsharing, which are contrary to conventional wisdom or have not been sufficiently handled. These five findings will be useful for the further development and advance of the practical HLS technology.
对于通过高级综合(HLS)生成的基于FPGA的设计,资源共享/取消共享对时钟频率、执行时间和面积的影响在多个FPGA设备上进行了几个实际大型基准测试,并进行了定量评估。通过实验,我们观察到与传统观点相反或尚未得到充分处理的关于资源共享/不共享的五个重要发现。这五项研究成果将有助于进一步发展和推进实用HLS技术。
{"title":"Quantitative Evaluation of Resource Sharing in High-level Synthesis Using Realistic Benchmarks","authors":"Yuko Hara-Azumi, Toshinobu Matsuba, H. Tomiyama, S. Honda, H. Takada","doi":"10.2197/ipsjtsldm.6.122","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.122","url":null,"abstract":"For FPGA-based designs generated through high-level synthesis (HLS), effects of resource sharing/unsharing on clock frequency, execution time, and area are quantitatively evaluated for several practically large benchmarks on multiple FPGA devices. Through experiments, we observed five important findings about resource sharing/unsharing, which are contrary to conventional wisdom or have not been sufficiently handled. These five findings will be useful for the further development and advance of the practical HLS technology.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90253966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A New Variation of Adaptive Simulated Annealing for 2D/3D Packing Optimization 一种新的自适应模拟退火算法用于二维/三维包装优化
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.94
Yiqiang Sheng, A. Takahashi
: 2D / 3D packing optimization is facing big challenges to get better solution with less runtime. In this paper, we propose a new variation of adaptive simulated annealing (ASA) to solve packing problem. In the traditional ASA, the parameters that control temperature scheduling and random step selection are adjusted according to search progress. In the proposed ASA, a guide with adaptive probabilities is used to automatically select moving methods, including crossover to improve its e ffi ciency. The interesting point is the traditional SA with crossover is ine ffi cient, while the proposed ASA with crossover is e ffi cient due to the adaptive guide. Based on the experiment using MCNC, ami49 X and ami98 3D benchmarks, the computational performance is considerably improved. In the case of area minimization, the results gotten by the proposed ASA are normally better than the published data of 2D packing. In the case of volume minimization for 3D packing, the results gotten by the proposed ASA are better than the data of traditional ASA and SA.
2D / 3D包装优化面临着以更短的运行时间获得更好的解决方案的巨大挑战。本文提出了一种新的自适应模拟退火(ASA)方法来解决包装问题。在传统的ASA中,控制温度调度和随机步长选择的参数是根据搜索进度进行调整的。在该算法中,采用自适应概率指南来自动选择包括交叉在内的移动方法,以提高其效率。有趣的是,传统的带交叉的ASA效率是线性的,而本文提出的带交叉的ASA由于自适应导引而效率为零。在MCNC、ami49x和ami98 3D基准测试中,计算性能得到了显著提高。在面积最小化的情况下,所提出的ASA得到的结果通常优于已发表的二维填充数据。在体积最小化的情况下,所提出的ASA得到的结果优于传统ASA和SA的数据。
{"title":"A New Variation of Adaptive Simulated Annealing for 2D/3D Packing Optimization","authors":"Yiqiang Sheng, A. Takahashi","doi":"10.2197/ipsjtsldm.6.94","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.94","url":null,"abstract":": 2D / 3D packing optimization is facing big challenges to get better solution with less runtime. In this paper, we propose a new variation of adaptive simulated annealing (ASA) to solve packing problem. In the traditional ASA, the parameters that control temperature scheduling and random step selection are adjusted according to search progress. In the proposed ASA, a guide with adaptive probabilities is used to automatically select moving methods, including crossover to improve its e ffi ciency. The interesting point is the traditional SA with crossover is ine ffi cient, while the proposed ASA with crossover is e ffi cient due to the adaptive guide. Based on the experiment using MCNC, ami49 X and ami98 3D benchmarks, the computational performance is considerably improved. In the case of area minimization, the results gotten by the proposed ASA are normally better than the published data of 2D packing. In the case of volume minimization for 3D packing, the results gotten by the proposed ASA are better than the data of traditional ASA and SA.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78724809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Efficient Fault Simulation Algorithms for Analyzing Soft Error Propagation in Sequential Circuits 时序电路软误差传播分析的有效故障仿真算法
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.127
Taiga Takata, Masayoshi Yoshimura, Y. Matsunaga
This paper presents two acceleration techniques of fault simulation for analyzing soft error propagation in sequential circuits. One is an exact technique and the other is a heuristic technique. Since these techniques are independent on how the logic functions of circuits are evaluated, they can be combined with other techniques which accelerate evaluations of the logic functions of circuits, such as event-driven simulation, single pattern parallel fault propagation (SPPFP). Experimental results show that applying the exact technique makes a fault simulator with event-driven simulation and SPPFP 30–143 times faster. A fault simulator with the exact technique finished for several large-scale circuits in 4.6 hours or less, while a fault simulator without the exact technique could not finish for such circuits in 72 hours. Furthermore, applying the heuristic technique makes a fault simulator with the exact technique about 7–17 times faster with only 0.5–2.2% estimation error.
本文提出了两种加速故障仿真技术,用于分析顺序电路中的软误差传播。一种是精确技术,另一种是启发式技术。由于这些技术与如何评估电路的逻辑功能无关,因此它们可以与其他加速电路逻辑功能评估的技术相结合,例如事件驱动仿真,单模式并行故障传播(SPPFP)。实验结果表明,采用精确的技术,具有事件驱动仿真和SPPFP的故障模拟器的速度提高了30-143倍。具有精确技术的故障模拟器在4.6小时内完成了几个大规模电路的测试,而没有精确技术的故障模拟器在72小时内无法完成这些电路的测试。此外,应用启发式技术可以使采用精确技术的故障模拟器的估计速度提高7-17倍,估计误差仅为0.5-2.2%。
{"title":"Efficient Fault Simulation Algorithms for Analyzing Soft Error Propagation in Sequential Circuits","authors":"Taiga Takata, Masayoshi Yoshimura, Y. Matsunaga","doi":"10.2197/ipsjtsldm.6.127","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.127","url":null,"abstract":"This paper presents two acceleration techniques of fault simulation for analyzing soft error propagation in sequential circuits. One is an exact technique and the other is a heuristic technique. Since these techniques are independent on how the logic functions of circuits are evaluated, they can be combined with other techniques which accelerate evaluations of the logic functions of circuits, such as event-driven simulation, single pattern parallel fault propagation (SPPFP). Experimental results show that applying the exact technique makes a fault simulator with event-driven simulation and SPPFP 30–143 times faster. A fault simulator with the exact technique finished for several large-scale circuits in 4.6 hours or less, while a fault simulator without the exact technique could not finish for such circuits in 72 hours. Furthermore, applying the heuristic technique makes a fault simulator with the exact technique about 7–17 times faster with only 0.5–2.2% estimation error.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78392951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Design and Implementation of IP-based iSCSI Offload Engine on an FPGA 基于ip的iSCSI分流引擎的FPGA设计与实现
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.112
Amila Akagic, H. Amano
The IP-based storage systems often require bandwidth intensive access to storage devices, thus they exhibit high CPU utilization and low throughput when executed in a principally software implementation. This is especially evident for multi-Gbps networks where the impact of computational overhead is so pronounced that the current state of the art processors cannot take advantage of the capacity of the network. In this paper we propose new iSCSI Offload Engine architecture for high data rate storage networking. Based on our analysis of open source Open-iSCSI initiator, we offload the most computationally intensive and the most executed functions in a common case scenario, while other functions are implemented in a modified Open-iSCSI initiator on a general purpose processor. Our architecture overcomes the performance limitations imposed by a single processor which runs on 15x higher operating frequency than our accelerator. It exhibits very low CPU utilization of approximately 3% on the host CPU, which is 10–15x reduction compared with software implementation. The maximum transmission throughput is 7.81 Gbps, while reception throughput is 7.34 Gbps, which is 2 times speedup over software. The new architecture also shows comparable performance with Chelsio T110 ASIC-based HBA, and has more flexibility.
基于ip的存储系统通常需要对存储设备进行带宽密集型访问,因此,当主要在软件实现中执行时,它们表现出高CPU利用率和低吞吐量。这对于多gbps网络尤其明显,因为计算开销的影响非常明显,以至于当前的先进处理器无法利用网络的容量。在本文中,我们提出了新的iSCSI卸载引擎架构,用于高数据速率存储网络。基于我们对开源open - iscsi启动器的分析,我们在一个常见的情况下卸载了计算最密集和执行最多的功能,而其他功能则在一个通用处理器上修改的open - iscsi启动器中实现。我们的架构克服了单个处理器的性能限制,它的运行频率比我们的加速器高15倍。它在主机CPU上的CPU利用率非常低,大约为3%,与软件实现相比降低了10 - 15倍。最大传输吞吐量为7.81 Gbps,接收吞吐量为7.34 Gbps,比软件提速2倍。新架构也显示出与切尔西T110基于asic的HBA相当的性能,并且具有更大的灵活性。
{"title":"Design and Implementation of IP-based iSCSI Offload Engine on an FPGA","authors":"Amila Akagic, H. Amano","doi":"10.2197/ipsjtsldm.6.112","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.112","url":null,"abstract":"The IP-based storage systems often require bandwidth intensive access to storage devices, thus they exhibit high CPU utilization and low throughput when executed in a principally software implementation. This is especially evident for multi-Gbps networks where the impact of computational overhead is so pronounced that the current state of the art processors cannot take advantage of the capacity of the network. In this paper we propose new iSCSI Offload Engine architecture for high data rate storage networking. Based on our analysis of open source Open-iSCSI initiator, we offload the most computationally intensive and the most executed functions in a common case scenario, while other functions are implemented in a modified Open-iSCSI initiator on a general purpose processor. Our architecture overcomes the performance limitations imposed by a single processor which runs on 15x higher operating frequency than our accelerator. It exhibits very low CPU utilization of approximately 3% on the host CPU, which is 10–15x reduction compared with software implementation. The maximum transmission throughput is 7.81 Gbps, while reception throughput is 7.34 Gbps, which is 2 times speedup over software. The new architecture also shows comparable performance with Chelsio T110 ASIC-based HBA, and has more flexibility.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85014173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Secure and Testable Scan Design Utilizing Shift Register Quasi-equivalents 利用移位寄存器准等效物的安全可测试扫描设计
Q4 Engineering Pub Date : 2013-01-01 DOI: 10.2197/ipsjtsldm.6.27
Katsuya Fujiwara, H. Fujiwara, H. Tamamoto
Scan design makes digital circuits easily testable, however, it can also be exploited to be used for hacking the chip. We have reported a secure and testable scan design approach by using extended shift registers called "SR- equivalents" that are functionally equivalent but not structurally equivalent to shift registers(14), (15), (16), (17), (18). In this paper, to further extend the class of SR-equivalents we introduce a wider class of circuits called "SR-quasi- equivalents" which still satisfy the testability and security similar to SR-equivalents. To estimate the security level, we clarify the cardinality of each equivalent class in SR-quasi-equivalents for several linear structural circuits, and also present the actual number of SR-quasi-equivalents obtained by the enhanced program SREEP.
扫描设计使数字电路易于测试,然而,它也可以被利用来攻击芯片。我们已经报道了一种安全的、可测试的扫描设计方法,通过使用被称为“SR等效”的扩展移位寄存器,它在功能上等效,但在结构上不等同于移位寄存器(14)、(15)、(16)、(17)、(18)。在本文中,为了进一步扩展sr -等效电路的范畴,我们引入了一类更广泛的电路,称为“sr -准等效电路”,它仍然满足与sr -等效电路相似的可测试性和安全性。为了估计安全水平,我们澄清了几个线性结构电路的sr -准等价中每个等价类的基数,并给出了通过增强程序SREEP获得的sr -准等价的实际数量。
{"title":"Secure and Testable Scan Design Utilizing Shift Register Quasi-equivalents","authors":"Katsuya Fujiwara, H. Fujiwara, H. Tamamoto","doi":"10.2197/ipsjtsldm.6.27","DOIUrl":"https://doi.org/10.2197/ipsjtsldm.6.27","url":null,"abstract":"Scan design makes digital circuits easily testable, however, it can also be exploited to be used for hacking the chip. We have reported a secure and testable scan design approach by using extended shift registers called \"SR- equivalents\" that are functionally equivalent but not structurally equivalent to shift registers(14), (15), (16), (17), (18). In this paper, to further extend the class of SR-equivalents we introduce a wider class of circuits called \"SR-quasi- equivalents\" which still satisfy the testability and security similar to SR-equivalents. To estimate the security level, we clarify the cardinality of each equivalent class in SR-quasi-equivalents for several linear structural circuits, and also present the actual number of SR-quasi-equivalents obtained by the enhanced program SREEP.","PeriodicalId":38964,"journal":{"name":"IPSJ Transactions on System LSI Design Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89295142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
IPSJ Transactions on System LSI Design Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1