首页 > 最新文献

2013 8th IEEE Design and Test Symposium最新文献

英文 中文
Systolic architecture for hardware implementation of two-dimensional non-separable filter-bank 二维不可分滤波器组硬件实现的收缩结构
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727130
B. K. Mohanty, S. Al-Maadeed, A. Amira
In this paper, we present an efficient poly-phase decomposition scheme for implementation of 2-D non-separable filter bank. Poly-phase decomposition scheme offers multiplexing of filter bank computations or/and reduce the data clocking without affecting the overall throughput rate. Both these features can be used conveniently depending on resources availability or processor-technology. Time-multiplexing could be the choice for resource-constrained applications. Slower clocking rate could be chosen if processor-technology is the constraint. In that case, the design could be realized with cheaper and slower processor-technology. Time-multiplexed design needs proper data scheduling to perform filter bank computation interleavingly without data overlapping. Keeping this in mind, we have derived a systolic architecture for hardware realization of time-multiplexed filter bank where we have used novel data buffering scheme for the filter coefficients of the filter bank. Comparison result show that, the proposed structure involves almost J times less hardware resource than the non poly-phase filter bank structure and it provides the same throughput rate as the other, where J is the filter bank size. The hardware saving is significant for large size filter banks like Gabor. The proposed structure could be a good candidate for efficient hardware implementation of non-separable filter bank used in various image processing applications such as biometrics systems.
在本文中,我们提出了一种有效的多相分解方案来实现二维不可分离滤波器组。多相分解方案提供了滤波器组计算的多路复用或/并在不影响总体吞吐率的情况下减少数据时钟。这两个特性都可以根据资源可用性或处理器技术方便地使用。时间复用可能是资源受限应用程序的选择。如果处理器技术有限制,可以选择较慢的时钟速率。在这种情况下,设计可以用更便宜和更慢的处理器技术来实现。时间复用设计需要合理的数据调度,以实现滤波器组的交错计算,避免数据重叠。考虑到这一点,我们推导了一个时间复用滤波器组硬件实现的收缩架构,其中我们对滤波器组的滤波器系数使用了新的数据缓冲方案。对比结果表明,该结构比非多相滤波器组结构少占用近J倍的硬件资源,并提供与非多相滤波器组结构相同的吞吐率,其中J为滤波器组大小。对于像Gabor这样的大型滤波器组来说,硬件节省意义重大。所提出的结构可以为各种图像处理应用(如生物识别系统)中使用的不可分离滤波器组的有效硬件实现提供良好的候选者。
{"title":"Systolic architecture for hardware implementation of two-dimensional non-separable filter-bank","authors":"B. K. Mohanty, S. Al-Maadeed, A. Amira","doi":"10.1109/IDT.2013.6727130","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727130","url":null,"abstract":"In this paper, we present an efficient poly-phase decomposition scheme for implementation of 2-D non-separable filter bank. Poly-phase decomposition scheme offers multiplexing of filter bank computations or/and reduce the data clocking without affecting the overall throughput rate. Both these features can be used conveniently depending on resources availability or processor-technology. Time-multiplexing could be the choice for resource-constrained applications. Slower clocking rate could be chosen if processor-technology is the constraint. In that case, the design could be realized with cheaper and slower processor-technology. Time-multiplexed design needs proper data scheduling to perform filter bank computation interleavingly without data overlapping. Keeping this in mind, we have derived a systolic architecture for hardware realization of time-multiplexed filter bank where we have used novel data buffering scheme for the filter coefficients of the filter bank. Comparison result show that, the proposed structure involves almost J times less hardware resource than the non poly-phase filter bank structure and it provides the same throughput rate as the other, where J is the filter bank size. The hardware saving is significant for large size filter banks like Gabor. The proposed structure could be a good candidate for efficient hardware implementation of non-separable filter bank used in various image processing applications such as biometrics systems.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114689648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Design space exploration and synthesis for digital signal processing algorithms from Simulink models 设计空间探索和综合数字信号处理算法从Simulink模型
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727109
S. Butt, L. Lavagno
Design teams are increasingly looking for design flows that can rapidly lead to high performance and low power implementation of DSP algorithms. Model-based design can satisfy this requirement, but it must be (1) coupled with efficient high-level synthesis support in order to provide good Quality of Results, and (2) controlled to derive the desired area/performance/throughput trade-off. We present a semi-automatic design flow for rapid high level synthesis-based hardware design space exploration starting from Simulink digital signal processing models. We illustrate our flow with a realistically complex signal processing algorithm for estimating the direction of arrival of a sound source. We show how one can start from a functionally validated fixed point model in Simulink and then go through a relatively simple design flow for hardware synthesis and automatic design space exploration, obtaining a very efficient hardware implementation that is competitive with the RTL implementation generated by another commercial model-based design tool.
设计团队越来越多地寻找能够快速实现高性能和低功耗DSP算法的设计流程。基于模型的设计可以满足这一需求,但它必须(1)与高效的高级综合支持相结合,以提供良好的结果质量,并且(2)控制以获得所需的面积/性能/吞吐量权衡。从Simulink数字信号处理模型出发,提出了一种基于快速高级合成的硬件设计空间探索的半自动设计流程。我们用一个实际复杂的信号处理算法来说明我们的流程,该算法用于估计声源的到达方向。我们展示了如何在Simulink中从功能验证的不动点模型开始,然后通过相对简单的设计流程进行硬件合成和自动设计空间探索,从而获得与另一个基于商业模型的设计工具生成的RTL实现相竞争的非常有效的硬件实现。
{"title":"Design space exploration and synthesis for digital signal processing algorithms from Simulink models","authors":"S. Butt, L. Lavagno","doi":"10.1109/IDT.2013.6727109","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727109","url":null,"abstract":"Design teams are increasingly looking for design flows that can rapidly lead to high performance and low power implementation of DSP algorithms. Model-based design can satisfy this requirement, but it must be (1) coupled with efficient high-level synthesis support in order to provide good Quality of Results, and (2) controlled to derive the desired area/performance/throughput trade-off. We present a semi-automatic design flow for rapid high level synthesis-based hardware design space exploration starting from Simulink digital signal processing models. We illustrate our flow with a realistically complex signal processing algorithm for estimating the direction of arrival of a sound source. We show how one can start from a functionally validated fixed point model in Simulink and then go through a relatively simple design flow for hardware synthesis and automatic design space exploration, obtaining a very efficient hardware implementation that is competitive with the RTL implementation generated by another commercial model-based design tool.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132664291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Enabling difference-based dynamic partial self reconfiguration for large differences 为较大差异启用基于差异的动态部分自重构
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727108
Sezer Gören, Ozgur Ozkurt, Yusuf Turk, Abdullah Yildiz, H. F. Ugurdag
This paper presents a new Dynamic Partial Self Reconfiguration (DPSR) flow for Xilinx FPGAs. Leveraging the Xilinx FPGA Editor and PlanAhead tools, we provide two implementation approaches that enable partial reconfiguration for large configuration changes without Xilinx's paid tool. The flow is difference-based but still allows a modular design, which is made up of Partial Reconfiguration (PR) modules and a static design. It works regardless of the amount of difference between PR modules. We call this flow DPSR-LD, where LD stands for Large Differences. DPSR-LD is an enabler especially for Spartan-6 FPGA family., as Xilinx currently supports PR on Spartan-6 only through the difference-based flow and only for small differences. DPSR-LD also includes an ICAP controller that makes DPSR possible and offers bitstream compression.
提出了一种新的Xilinx fpga动态部分自重构(DPSR)流程。利用Xilinx FPGA Editor和PlanAhead工具,我们提供了两种实现方法,可以在没有Xilinx付费工具的情况下对大型配置更改进行部分重新配置。该流程是基于差异的,但仍然允许模块化设计,它由部分重新配置(PR)模块和静态设计组成。不管PR模块之间的差异有多大,它都可以工作。我们称这个流程为DPSR-LD,其中LD代表大差异。DPSR-LD是Spartan-6 FPGA家族的推动者。,因为Xilinx目前仅通过基于差异的流程支持Spartan-6上的PR,并且仅支持较小的差异。DPSR- ld还包括一个ICAP控制器,使DPSR成为可能,并提供比特流压缩。
{"title":"Enabling difference-based dynamic partial self reconfiguration for large differences","authors":"Sezer Gören, Ozgur Ozkurt, Yusuf Turk, Abdullah Yildiz, H. F. Ugurdag","doi":"10.1109/IDT.2013.6727108","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727108","url":null,"abstract":"This paper presents a new Dynamic Partial Self Reconfiguration (DPSR) flow for Xilinx FPGAs. Leveraging the Xilinx FPGA Editor and PlanAhead tools, we provide two implementation approaches that enable partial reconfiguration for large configuration changes without Xilinx's paid tool. The flow is difference-based but still allows a modular design, which is made up of Partial Reconfiguration (PR) modules and a static design. It works regardless of the amount of difference between PR modules. We call this flow DPSR-LD, where LD stands for Large Differences. DPSR-LD is an enabler especially for Spartan-6 FPGA family., as Xilinx currently supports PR on Spartan-6 only through the difference-based flow and only for small differences. DPSR-LD also includes an ICAP controller that makes DPSR possible and offers bitstream compression.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133251201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Reducing random-dopant fluctuation impact on core-speed and power variability in many-core platforms 在多核平台中减少随机掺杂对核速度和功率可变性的影响
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727103
S. Majzoub, Z. Al-Ars, S. Hamdioui
In this paper, we propose a novel technique that uses multi-Vt design to reduce the impact of random process variation on delay and power in a many-core platform. Random variation is mostly attributed to the random-dopant fluctuation. The proposed technique reduces this fluctuation by lowering the dopant density and then compensating the threshold voltage using a footer transistor. The results show a reduction of the total standard deviation from 25% down to 17% using the proposed method.
在本文中,我们提出了一种新的技术,利用多vt设计来减少随机过程变化对多核平台延迟和功耗的影响。随机变化主要归因于随机掺杂的波动。所提出的技术通过降低掺杂剂密度,然后使用脚晶体管补偿阈值电压来减小这种波动。结果表明,使用该方法可以将总标准差从25%降低到17%。
{"title":"Reducing random-dopant fluctuation impact on core-speed and power variability in many-core platforms","authors":"S. Majzoub, Z. Al-Ars, S. Hamdioui","doi":"10.1109/IDT.2013.6727103","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727103","url":null,"abstract":"In this paper, we propose a novel technique that uses multi-Vt design to reduce the impact of random process variation on delay and power in a many-core platform. Random variation is mostly attributed to the random-dopant fluctuation. The proposed technique reduces this fluctuation by lowering the dopant density and then compensating the threshold voltage using a footer transistor. The results show a reduction of the total standard deviation from 25% down to 17% using the proposed method.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134218858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Accurate and efficient identification of worst-case execution time for multicore processors: A survey 多核处理器最坏情况执行时间的准确和有效识别:综述
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727080
Hamid Mushtaq, Z. Al-Ars, K. Bertels
Parallel systems were for a long time confined to high-performance computing. However, with the increasing popularity of multicore processors, parallelization has also become important for other computing domains, such as desktops and embedded systems. Mission-critical embedded software, like that used in avionics and automotive industry, also needs to guarantee real time behavior. For that purpose, tools are needed to calculate the worst-case execution time (WCET) of tasks running on a processor, so that the real time system can make sure that real time guarantees are met. However, due to the shared resources present in a multicore system, this task is made much more difficult as compared to finding WCET for a single core processor. In this paper, we will discuss how recent research has tried to solve this problem and what the open research problems are.
并行系统在很长一段时间内都局限于高性能计算。然而,随着多核处理器的日益普及,并行化对于其他计算领域(如桌面和嵌入式系统)也变得非常重要。关键任务的嵌入式软件,如用于航空电子和汽车工业的软件,也需要保证实时行为。为此,需要工具来计算在处理器上运行的任务的最坏情况执行时间(WCET),以便实时系统可以确保满足实时保证。然而,由于多核系统中存在共享资源,与为单核处理器查找WCET相比,此任务变得更加困难。在本文中,我们将讨论最近的研究如何试图解决这个问题,以及开放的研究问题是什么。
{"title":"Accurate and efficient identification of worst-case execution time for multicore processors: A survey","authors":"Hamid Mushtaq, Z. Al-Ars, K. Bertels","doi":"10.1109/IDT.2013.6727080","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727080","url":null,"abstract":"Parallel systems were for a long time confined to high-performance computing. However, with the increasing popularity of multicore processors, parallelization has also become important for other computing domains, such as desktops and embedded systems. Mission-critical embedded software, like that used in avionics and automotive industry, also needs to guarantee real time behavior. For that purpose, tools are needed to calculate the worst-case execution time (WCET) of tasks running on a processor, so that the real time system can make sure that real time guarantees are met. However, due to the shared resources present in a multicore system, this task is made much more difficult as compared to finding WCET for a single core processor. In this paper, we will discuss how recent research has tried to solve this problem and what the open research problems are.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134251870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Design tradeoffs for voltage controlled crystal oscillators with built-in calibration mechanisms 具有内置校准机构的电压控制晶体振荡器的设计权衡
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727100
Jose Pedro Cardoso, J. M. D. Silva
Timing is a critical issue in communication systems, especially for synchronous communications. These show a high dependence on the clock signal purity due to errors that can be introduced into the decision process. This paper addresses the design, on a 130nm CMOS process, of a Radiation Tolerant Voltage Controlled Quartz Crystal Oscillator (VCXO), including techniques to reduce the influence of radiation and noise on its performance. The VCXO is included on a PLL designed to work within High Energy Physics (HEP) experiments.
时序是通信系统中的一个关键问题,尤其是同步通信。由于可能引入决策过程的误差,这些显示了对时钟信号纯度的高度依赖。本文介绍了一种基于130nm CMOS工艺的容辐射压控石英晶体振荡器(VCXO)的设计,包括降低辐射和噪声对其性能影响的技术。VCXO包含在设计用于高能物理(HEP)实验的锁相环上。
{"title":"Design tradeoffs for voltage controlled crystal oscillators with built-in calibration mechanisms","authors":"Jose Pedro Cardoso, J. M. D. Silva","doi":"10.1109/IDT.2013.6727100","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727100","url":null,"abstract":"Timing is a critical issue in communication systems, especially for synchronous communications. These show a high dependence on the clock signal purity due to errors that can be introduced into the decision process. This paper addresses the design, on a 130nm CMOS process, of a Radiation Tolerant Voltage Controlled Quartz Crystal Oscillator (VCXO), including techniques to reduce the influence of radiation and noise on its performance. The VCXO is included on a PLL designed to work within High Energy Physics (HEP) experiments.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115520292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Verifying generic IEC 61508 CPU self-tests with fault injection 验证通用IEC 61508 CPU自检与故障注入
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727145
C. Preschern, N. Kajtazovic, Andrea Höller, C. Steger, Christian Kreiner
In this paper we present generic CPU self-test programs and we check if the test programs conform to the IEC 61508 safety standard. We use processor architecture independent test programs to indirectly test the CPU components. We present a fault injection framework which we use to verify the fault detection ratio of the self-tests through simulation on a Plasma/MIPS and on a LEON3 processor.
本文提出了通用的CPU自检程序,并对其是否符合IEC 61508安全标准进行了检验。我们使用独立于处理器体系结构的测试程序来间接测试CPU组件。我们提出了一个故障注入框架,并通过在Plasma/MIPS和LEON3处理器上的仿真验证了自检的故障检测率。
{"title":"Verifying generic IEC 61508 CPU self-tests with fault injection","authors":"C. Preschern, N. Kajtazovic, Andrea Höller, C. Steger, Christian Kreiner","doi":"10.1109/IDT.2013.6727145","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727145","url":null,"abstract":"In this paper we present generic CPU self-test programs and we check if the test programs conform to the IEC 61508 safety standard. We use processor architecture independent test programs to indirectly test the CPU components. We present a fault injection framework which we use to verify the fault detection ratio of the self-tests through simulation on a Plasma/MIPS and on a LEON3 processor.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121178173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Silicon CMOS interdigitated-MSM photodetector and self-mixer for low-cost crash-avoidance Ladar system 用于低成本防撞雷达系统的硅CMOS数字化- msm光电探测器和自混频器
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727126
E. Awad, Theodora Rezk, A. Abou-Auf
We present theoretical model of interdigitated MSM photo-detector and RF electro-optic self-mixer based on standard CMOS technology. The model allows for simulation and analysis of photodetection performance and RF self-mixing capabilities of MSM. A performance comparison is performed between Si and GaAs materials in case of steady-state and transient operation.
提出了基于标准CMOS技术的交叉数字化MSM光电探测器和射频电光自混频器的理论模型。该模型允许模拟和分析MSM的光探测性能和射频自混合能力。比较了硅和砷化镓材料在稳态和瞬态工作下的性能。
{"title":"Silicon CMOS interdigitated-MSM photodetector and self-mixer for low-cost crash-avoidance Ladar system","authors":"E. Awad, Theodora Rezk, A. Abou-Auf","doi":"10.1109/IDT.2013.6727126","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727126","url":null,"abstract":"We present theoretical model of interdigitated MSM photo-detector and RF electro-optic self-mixer based on standard CMOS technology. The model allows for simulation and analysis of photodetection performance and RF self-mixing capabilities of MSM. A performance comparison is performed between Si and GaAs materials in case of steady-state and transient operation.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117075952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A functional test algorithm for the register forwarding and pipeline interlocking unit in pipelined microprocessors 流水线微处理器中寄存器转发和流水线联锁单元的功能测试算法
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727120
P. Bernardi, D. Boyang, Lyl M. Ciganda Brasca, E. Sánchez, M. Reorda, M. Grosso, O. Ballan
When the result of a previous instruction is needed in the pipeline before it is available, a “data hazard” occurs. Register Forwarding and Pipeline Interlock (RF&PI) are mechanisms suitable to avoid data corruption and to limit the performance penalty caused by data hazards in pipelined microprocessors. Data hazards handling is part of the microprocessor control logic; its test can hardly be achieved with a functional approach, unless a specific test algorithm is adopted. In this paper we analyze the causes for the low functional testability of the RF&PI logic and propose some techniques able to effectively perform its test. In particular, we describe a strategy to perform Software-Based Self-Test (SBST) on the RF&PI unit. The general structure of the unit is analyzed, a suitable test algorithm is proposed and the strategy to observe the test responses is explained. The method can be exploited for test both at the end of manufacturing and in the operational phase. Feasibility and effectiveness of the proposed approach are demonstrated on both an academic MIPS-like processor and an industrial System-on-Chip based on the Power ArchitectureTM.
当管道中需要之前的指令的结果时,就会发生“数据危害”。寄存器转发和管道互锁(RF&PI)是一种适用于避免数据损坏和限制在流水线微处理器中由于数据危害而造成的性能损失的机制。数据危害处理是微处理器控制逻辑的一部分;除非采用特定的测试算法,否则它的测试很难用函数方法来实现。本文分析了RF&PI逻辑功能可测试性低的原因,并提出了一些能够有效进行RF&PI逻辑测试的技术。特别地,我们描述了在RF&PI单元上执行基于软件的自测(SBST)的策略。分析了该装置的一般结构,提出了一种合适的测试算法,并说明了观察测试响应的策略。该方法既可用于制造末期的测试,也可用于运行阶段的测试。在类似mips的学术处理器和基于功率架构的工业片上系统上验证了该方法的可行性和有效性。
{"title":"A functional test algorithm for the register forwarding and pipeline interlocking unit in pipelined microprocessors","authors":"P. Bernardi, D. Boyang, Lyl M. Ciganda Brasca, E. Sánchez, M. Reorda, M. Grosso, O. Ballan","doi":"10.1109/IDT.2013.6727120","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727120","url":null,"abstract":"When the result of a previous instruction is needed in the pipeline before it is available, a “data hazard” occurs. Register Forwarding and Pipeline Interlock (RF&PI) are mechanisms suitable to avoid data corruption and to limit the performance penalty caused by data hazards in pipelined microprocessors. Data hazards handling is part of the microprocessor control logic; its test can hardly be achieved with a functional approach, unless a specific test algorithm is adopted. In this paper we analyze the causes for the low functional testability of the RF&PI logic and propose some techniques able to effectively perform its test. In particular, we describe a strategy to perform Software-Based Self-Test (SBST) on the RF&PI unit. The general structure of the unit is analyzed, a suitable test algorithm is proposed and the strategy to observe the test responses is explained. The method can be exploited for test both at the end of manufacturing and in the operational phase. Feasibility and effectiveness of the proposed approach are demonstrated on both an academic MIPS-like processor and an industrial System-on-Chip based on the Power ArchitectureTM.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126530355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Functional verification of complete sequential behaviors: A formal treatment of discrepancies between system-level and RTL descriptions 完整顺序行为的功能验证:对系统级和RTL描述之间差异的正式处理
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727074
Carlos Ivan Castro Marquez, M. Strum, J. Wang
Formal techniques allow exhaustive verification on circuit design (at least in theory), but due to actual computational limitations, workarounds must always be adopted to check only a portion of the design at a time. Sequential equivalence checking is an effective approach, but it can only be applied between circuit descriptions where a one-to-one correspondence for states, as well as for memory elements, is expected. This paper presents a formal methodology to verify RTL descriptions through direct comparison with high-level reference models. By doing so, there is no need to specify or analyze formal properties, as the complete behavior is already contained in the reference model. We also consider the natural discrepancies between system level and RTL code, including non-matching interface and memory elements, and state mapping. In this manner, we are able to prove the functional coherence for the overall sequential behavior of the design under verification.
正式技术允许对电路设计进行详尽的验证(至少在理论上),但由于实际的计算限制,必须始终采用变通方法,一次只能检查设计的一部分。顺序等效性检查是一种有效的方法,但它只能应用于电路描述之间,其中状态和存储元素的一对一对应是预期的。本文提出了一种通过与高级参考模型直接比较来验证RTL描述的形式化方法。通过这样做,就不需要指定或分析形式属性,因为完整的行为已经包含在参考模型中。我们还考虑了系统级和RTL代码之间的自然差异,包括不匹配的接口和内存元素,以及状态映射。通过这种方式,我们能够证明在验证下设计的整体顺序行为的功能一致性。
{"title":"Functional verification of complete sequential behaviors: A formal treatment of discrepancies between system-level and RTL descriptions","authors":"Carlos Ivan Castro Marquez, M. Strum, J. Wang","doi":"10.1109/IDT.2013.6727074","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727074","url":null,"abstract":"Formal techniques allow exhaustive verification on circuit design (at least in theory), but due to actual computational limitations, workarounds must always be adopted to check only a portion of the design at a time. Sequential equivalence checking is an effective approach, but it can only be applied between circuit descriptions where a one-to-one correspondence for states, as well as for memory elements, is expected. This paper presents a formal methodology to verify RTL descriptions through direct comparison with high-level reference models. By doing so, there is no need to specify or analyze formal properties, as the complete behavior is already contained in the reference model. We also consider the natural discrepancies between system level and RTL code, including non-matching interface and memory elements, and state mapping. In this manner, we are able to prove the functional coherence for the overall sequential behavior of the design under verification.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126539079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2013 8th IEEE Design and Test Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1