首页 > 最新文献

2009 IEEE Computer Society Annual Symposium on VLSI最新文献

英文 中文
Synthesis Oriented Scheduling of Multiparty Rendezvous in Transaction Level Models 事务级模型中面向综合的多方交会调度
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.8
Vyas Venkataraman, Di Wang, A. Mahram, W. Qin, Mrinal Bose, J. Bhadra
Due to the large semantic gap between transaction level models and actual implementations, hardware synthesis based on system level models has been a great challenge. Aiming to close the semantic gap, we studied an approach that uses rendezvous to model communication. By allowing both conjunctive and disjunctive composition of rendezvous, the approach supports flexible communication patterns involving multiple processes. However, a practical issue of the model is the complexity of scheduling of multiparty rendezvous, which is NP hard in general. This paper proposes an efficient scheduling algorithm. It begins by encapsulating state transition information of processes into a relation graph. It then creates a tree that relates edge combinations. The tree is used to guide the scheduler at run time to search for schedulable sets. Experimental results prove that this algorithm improves our scheduler significantly. The algorithm lays the ground for the synthesis of the communication and synchronization circuitry for the system.
由于事务级模型与实际实现之间存在很大的语义差距,基于系统级模型的硬件综合一直是一个巨大的挑战。为了缩小语义差距,我们研究了一种使用集合来建模通信的方法。通过允许交会的合取和析取组合,该方法支持涉及多个进程的灵活通信模式。然而,该模型的一个实际问题是多方集合调度的复杂性,通常是NP困难的。本文提出了一种高效的调度算法。它首先将流程的状态转换信息封装到关系图中。然后,它创建一个关联边组合的树。该树用于指导调度程序在运行时搜索可调度集。实验结果表明,该算法显著改善了我们的调度程序。该算法为系统通信和同步电路的综合奠定了基础。
{"title":"Synthesis Oriented Scheduling of Multiparty Rendezvous in Transaction Level Models","authors":"Vyas Venkataraman, Di Wang, A. Mahram, W. Qin, Mrinal Bose, J. Bhadra","doi":"10.1109/ISVLSI.2009.8","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.8","url":null,"abstract":"Due to the large semantic gap between transaction level models and actual implementations, hardware synthesis based on system level models has been a great challenge. Aiming to close the semantic gap, we studied an approach that uses rendezvous to model communication. By allowing both conjunctive and disjunctive composition of rendezvous, the approach supports flexible communication patterns involving multiple processes. However, a practical issue of the model is the complexity of scheduling of multiparty rendezvous, which is NP hard in general. This paper proposes an efficient scheduling algorithm. It begins by encapsulating state transition information of processes into a relation graph. It then creates a tree that relates edge combinations. The tree is used to guide the scheduler at run time to search for schedulable sets. Experimental results prove that this algorithm improves our scheduler significantly. The algorithm lays the ground for the synthesis of the communication and synchronization circuitry for the system.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134196295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The Ternary Quantum-dot Cellular Automata Memorizing Cell 三元量子点元胞自动机记忆单元
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.32
P. Pecar, M. Janež, N. Zimic, M. Mraz, I. L. Bajec
Quantum-dot Cellular Automata (QCA) were demonstrated to be a possible candidate for the implementation of a future multi-valued processing platform. Recent papers show that the introduction of adiabatic switching and the elegant application of the adiabatic pipelining concept in the QCA logic design can be used to efficiently solve the issues of the elementary ternary QCA logic primitives. The architectures of the resulting ternary QCAs become similar to their binary counterparts and thus the design rules for large circuit design remain similar to those developed for the binary QCA domain. In spite of this the design of the binary QCA SR memorizing cell cannot be directly transferred to the ternary domain, mostly because the control logic cannot properly handle the third value. We here propose a ternary QCA memorizing cell that efficiently exploits the pipelining mechanism at a wire level. It is centered on the circulating memory model (i.e. the memory in motion concept), which proved to be an efficient concept in memorizing cell design in the binary QCA domain. The proposed memorizing cell is capable of serving as one trit (ternary digit) of memory and represents a step forward to the ternary register, one of the basic building blocks of a ternary processor.
量子点元胞自动机(QCA)被证明是实现未来多值处理平台的可能候选。近年来的研究表明,在QCA逻辑设计中引入绝热开关和巧妙地应用绝热流水线概念可以有效地解决初等三元QCA逻辑原语的问题。由此产生的三元QCA的体系结构变得类似于它们的二进制对应物,因此大型电路设计的设计规则仍然类似于为二进制QCA领域开发的规则。尽管如此,二进制QCA SR记忆单元的设计不能直接转移到三元域,主要是因为控制逻辑不能正确处理第三值。我们在此提出了一种三元QCA记忆单元,它有效地利用了线级的流水线机制。以循环记忆模型(即运动记忆概念)为中心,在二进制QCA领域中,循环记忆模型被证明是一种有效的记忆单元设计概念。所提出的存储单元能够作为存储器的一个三位数(三元数字),并且代表了向三元寄存器迈出的一步,三元处理器的基本构建块之一。
{"title":"The Ternary Quantum-dot Cellular Automata Memorizing Cell","authors":"P. Pecar, M. Janež, N. Zimic, M. Mraz, I. L. Bajec","doi":"10.1109/ISVLSI.2009.32","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.32","url":null,"abstract":"Quantum-dot Cellular Automata (QCA) were demonstrated to be a possible candidate for the implementation of a future multi-valued processing platform. Recent papers show that the introduction of adiabatic switching and the elegant application of the adiabatic pipelining concept in the QCA logic design can be used to efficiently solve the issues of the elementary ternary QCA logic primitives. The architectures of the resulting ternary QCAs become similar to their binary counterparts and thus the design rules for large circuit design remain similar to those developed for the binary QCA domain. In spite of this the design of the binary QCA SR memorizing cell cannot be directly transferred to the ternary domain, mostly because the control logic cannot properly handle the third value. We here propose a ternary QCA memorizing cell that efficiently exploits the pipelining mechanism at a wire level. It is centered on the circulating memory model (i.e. the memory in motion concept), which proved to be an efficient concept in memorizing cell design in the binary QCA domain. The proposed memorizing cell is capable of serving as one trit (ternary digit) of memory and represents a step forward to the ternary register, one of the basic building blocks of a ternary processor.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116707743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Low Phase-Noise and Wide Tuning-Range CMOS Differential VCO for Frequency ?S Modulator 低频调制器的低相位噪声和宽调谐范围CMOS差分压控振荡器
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.36
T. Cao, D. Wisland, T. Lande, F. Moradi
In this paper, a novel low phase-noise and wide tuning-range CMOS differential Voltage-Controlled Oscillator (VCO) for a frequency ΔΣ modulator (FDSM) is presented. The VCO which converts an analog input voltage to phase information is based on a differential ring oscillator with modified symmetric load and a positive feedback in the differential delay cells, combined with a new bias circuit. The proposed VCO with two stages operating at a low power supply voltage of 0.6V can achieve low power consumption of 212uW, and wide tuning-range by increasing the operating frequency range by about 22%. The phase noise is -132dBc/Hz at 600KHz offset from the centre frequency of 480MHz. The new VCO has a good linearity reducing harmonic distortion in the ΔΣ modulator. The circuits are designed using a 65nm CMOS process.
本文提出了一种用于频率ΔΣ调制器(FDSM)的新型低相位噪声、宽调谐范围CMOS差分压控振荡器(VCO)。将模拟输入电压转换为相位信息的压控振荡器是基于一个改进对称负载的差分环振荡器和差分延迟单元中的正反馈,并结合一个新的偏置电路。所提出的两级压控振荡器工作在0.6V的低电源电压下,工作频率范围提高约22%,可实现212uW的低功耗和宽调谐范围。相位噪声为-132dBc/Hz,偏离480MHz的中心频率为600KHz。新型压控振荡器具有良好的线性度,降低了ΔΣ调制器中的谐波失真。电路采用65nm CMOS工艺设计。
{"title":"Low Phase-Noise and Wide Tuning-Range CMOS Differential VCO for Frequency ?S Modulator","authors":"T. Cao, D. Wisland, T. Lande, F. Moradi","doi":"10.1109/ISVLSI.2009.36","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.36","url":null,"abstract":"In this paper, a novel low phase-noise and wide tuning-range CMOS differential Voltage-Controlled Oscillator (VCO) for a frequency ΔΣ modulator (FDSM) is presented. The VCO which converts an analog input voltage to phase information is based on a differential ring oscillator with modified symmetric load and a positive feedback in the differential delay cells, combined with a new bias circuit. The proposed VCO with two stages operating at a low power supply voltage of 0.6V can achieve low power consumption of 212uW, and wide tuning-range by increasing the operating frequency range by about 22%. The phase noise is -132dBc/Hz at 600KHz offset from the centre frequency of 480MHz. The new VCO has a good linearity reducing harmonic distortion in the ΔΣ modulator. The circuits are designed using a 65nm CMOS process.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123663826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Non-Uniform Grid Based Ground Plane Model for High Performance Nodes: The Impact of Heterogeneous Cores on Ground Voltage Gradient 基于非均匀网格的高性能节点地平面模型:异质磁芯对地电压梯度的影响
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.52
N. Venkateswaran, Ravindhiran Mukundrajan, Mrigank Sharma, B. Ravi
With shift towards heterogeneous core architectures imminent, the uniform grid based ground plane model that is currently employed for chip-multiprocessors will no longer suf¿ce. It is practically impossible to achieve absolute zero potential at all grid nodes of the uniform ground plane model with advent of heterogeneous cores. Differential injection of current into the ground plane by different heterogeneous core partitions results in voltage gradients across the ground plane, which is detrimental to the operation of the processor. The extremely stochastic spiking activity of different cores further accentuates the problem. To overcome the problem of varying voltage distribution across the ground plane, we propose a ¿rst-ever ground plane model structured as a non-uniform RLC interconnect grid. A simulated annealing optimization is employed with parameter of ‘temperature’ as each node in the grid and impedance as the cost function ’δe’ to arrive at the non-uniform grid structure.
随着向异构核心架构的转变迫在眉睫,目前用于芯片多处理器的基于统一网格的地平面模型将不再适用。随着非均匀地心的出现,在均匀地平面模型的所有网格节点上实现绝对零电位实际上是不可能的。不同的异质磁芯分区向地平面注入电流的差异会导致整个地平面上的电压梯度,这对处理器的运行是不利的。不同地核的极端随机尖峰活动进一步加剧了这个问题。为了克服整个接地面电压分布变化的问题,我们首次提出了一个非均匀RLC互连网格结构的接地面模型。采用模拟退火优化,以温度参数作为网格中的每个节点,以阻抗为代价函数δe,得到非均匀网格结构。
{"title":"A Non-Uniform Grid Based Ground Plane Model for High Performance Nodes: The Impact of Heterogeneous Cores on Ground Voltage Gradient","authors":"N. Venkateswaran, Ravindhiran Mukundrajan, Mrigank Sharma, B. Ravi","doi":"10.1109/ISVLSI.2009.52","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.52","url":null,"abstract":"With shift towards heterogeneous core architectures imminent, the uniform grid based ground plane model that is currently employed for chip-multiprocessors will no longer suf¿ce. It is practically impossible to achieve absolute zero potential at all grid nodes of the uniform ground plane model with advent of heterogeneous cores. Differential injection of current into the ground plane by different heterogeneous core partitions results in voltage gradients across the ground plane, which is detrimental to the operation of the processor. The extremely stochastic spiking activity of different cores further accentuates the problem. To overcome the problem of varying voltage distribution across the ground plane, we propose a ¿rst-ever ground plane model structured as a non-uniform RLC interconnect grid. A simulated annealing optimization is employed with parameter of ‘temperature’ as each node in the grid and impedance as the cost function ’δe’ to arrive at the non-uniform grid structure.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"51 Pt 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126237819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of Efficient Reversible Binary Subtractors Based on a New Reversible Gate 基于一种新型可逆门的高效可逆二进制减法器设计
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.49
H. Thapliyal, N. Ranganathan
Reversible logic has extensive applications in quantum computing, low power VLSI design, quantum dot cellular automata and optical computing. While several researchers have investigated the design of reversible logic elements, there is not much work reported on reversible binary subtractors. In this paper, we propose the design of a new reversible gate called TR gate. Further, we investigate the design of reversible binary subtractors based on the proposed TR gate. The proposed TR gate is better for designing reversible binary subtractor compared to such gates discussed in literature in terms of quantum cost, garbage outputs and complexity of gates.
可逆逻辑在量子计算、低功耗VLSI设计、量子点元胞自动机和光计算等领域有着广泛的应用。虽然有一些研究者对可逆逻辑元件的设计进行了研究,但关于可逆二进制减法器的研究还不多。在本文中,我们提出了一种新的可逆栅极的设计称为TR栅极。进一步,我们研究了基于所提出的TR门的可逆二进制减法器的设计。从量子成本、垃圾输出和门的复杂度等方面来看,本文提出的TR门比文献中讨论的可逆二进制减法器门更适合设计。
{"title":"Design of Efficient Reversible Binary Subtractors Based on a New Reversible Gate","authors":"H. Thapliyal, N. Ranganathan","doi":"10.1109/ISVLSI.2009.49","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.49","url":null,"abstract":"Reversible logic has extensive applications in quantum computing, low power VLSI design, quantum dot cellular automata and optical computing. While several researchers have investigated the design of reversible logic elements, there is not much work reported on reversible binary subtractors. In this paper, we propose the design of a new reversible gate called TR gate. Further, we investigate the design of reversible binary subtractors based on the proposed TR gate. The proposed TR gate is better for designing reversible binary subtractor compared to such gates discussed in literature in terms of quantum cost, garbage outputs and complexity of gates.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125797203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 167
TEPS: Transient Error Protection Utilizing Sub-word Parallelism 利用子字并行的瞬态错误保护
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.21
Seokin Hong, Soontae Kim
Future microprocessors are expected to observe higher transient error rates in combinational logic due to technology scaling and dense integration. We propose a simple transient error protection mechanism for embedded systems exploiting frequent small operand values of instructions and frequently used shift operations. The conditions for applicable instructions for the proposed mechanism are explored, which account for 84% of total instructions executed on average. The operands of these instructions are replicated in ALU directly and other instructions are protected using time-redundant double execution. Our experimental results show that the proposed mechanism incurs 12% performance hit and 4% energy hit, on average, with a low impact on the chip area (7% of the execution unit area).
由于技术的规模化和密集集成,未来的微处理器有望在组合逻辑中观察到更高的瞬态错误率。我们提出了一种简单的瞬态错误保护机制,用于嵌入式系统利用指令的频繁小操作数值和频繁使用的移位操作。探讨了所提出的机制的适用指令的条件,这些条件占平均执行指令总数的84%。这些指令的操作数直接在ALU中复制,其他指令使用时间冗余双执行来保护。我们的实验结果表明,该机制平均造成12%的性能损失和4%的能量损失,对芯片面积的影响很小(占执行单元面积的7%)。
{"title":"TEPS: Transient Error Protection Utilizing Sub-word Parallelism","authors":"Seokin Hong, Soontae Kim","doi":"10.1109/ISVLSI.2009.21","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.21","url":null,"abstract":"Future microprocessors are expected to observe higher transient error rates in combinational logic due to technology scaling and dense integration. We propose a simple transient error protection mechanism for embedded systems exploiting frequent small operand values of instructions and frequently used shift operations. The conditions for applicable instructions for the proposed mechanism are explored, which account for 84% of total instructions executed on average. The operands of these instructions are replicated in ALU directly and other instructions are protected using time-redundant double execution. Our experimental results show that the proposed mechanism incurs 12% performance hit and 4% energy hit, on average, with a low impact on the chip area (7% of the execution unit area).","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129581210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Maximally Redundant High-Radix Signed-Digit Adder: New Algorithm and Implementation 最大冗余高基数符号加法器:新算法与实现
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.30
S. Timarchi, K. Navi, O. Kavehei
Redundant Number Systems have been widely used in fast arithmetic circuits design. Signed-Digit (SD) or generally High-Radix SD (HRSD) number system is one of the most important redundant number systems. HRSD additions are used in many arithmetic functions as basic operations. Hence, improving the additions characteristics will improve the performance of almost all arithmetic modules. Several HRSD adders have been introduced in literatures. In this paper a new maximally redundant HRSD adder is proposed. This adder is compared to some most efficient HRSD adders previously published. The proposed adder is fabricated using a standard TSMC 65nm CMOS technology at 1volt supply voltage. The adder consumes 2.5% less power than the best previous published HRSD design. These implementations are also synthesized with FPGA flow on Xilinx Virtex2. The experimental result shows 5% and 6% decreases in the area and delay, respectively.
冗余数系统在快速算术电路设计中得到了广泛的应用。符号数字(SD)或高基数SD (HRSD)数字系统是最重要的冗余数字系统之一。HRSD加法作为基本运算在许多算术函数中使用。因此,改进加法特性将提高几乎所有算术模块的性能。文献中介绍了几种HRSD加法器。本文提出了一种新的最大冗余HRSD加法器。将此加法器与以前发布的一些最有效的HRSD加法器进行比较。所提出的加法器采用标准台积电65nm CMOS技术在1伏电源电压下制造。该加法器的功耗比之前发布的最佳HRSD设计低2.5%。这些实现也在Xilinx Virtex2上用FPGA流程进行了合成。实验结果表明,该算法的面积和时延分别降低了5%和6%。
{"title":"Maximally Redundant High-Radix Signed-Digit Adder: New Algorithm and Implementation","authors":"S. Timarchi, K. Navi, O. Kavehei","doi":"10.1109/ISVLSI.2009.30","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.30","url":null,"abstract":"Redundant Number Systems have been widely used in fast arithmetic circuits design. Signed-Digit (SD) or generally High-Radix SD (HRSD) number system is one of the most important redundant number systems. HRSD additions are used in many arithmetic functions as basic operations. Hence, improving the additions characteristics will improve the performance of almost all arithmetic modules. Several HRSD adders have been introduced in literatures. In this paper a new maximally redundant HRSD adder is proposed. This adder is compared to some most efficient HRSD adders previously published. The proposed adder is fabricated using a standard TSMC 65nm CMOS technology at 1volt supply voltage. The adder consumes 2.5% less power than the best previous published HRSD design. These implementations are also synthesized with FPGA flow on Xilinx Virtex2. The experimental result shows 5% and 6% decreases in the area and delay, respectively.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116550178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Transition Inversion Based Low Power Data Coding Scheme for Synchronous Serial Communication 基于转换反转的同步串行通信低功耗数据编码方案
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.43
A. Ramachandran, Bharghava Rajaram, M. Srinivas
Reducing off-chip bus power consumption has become one of the key issues for low power system design. Although methods have been proposed to reduce the power dissipated in parallel buses, these techniques do not apply to serial communication since they work on consecutive data words. The data line in synchronous serial communication is a major source of power dissipation, apart from the clock line. The clock line cannot be modified due to the requirements of data recovery. This work outlines a novel transition inversion based data coding protocol by which these transitions on the data line can be reduced for synchronous serial buses like JTAG, SPI, I2C etc. Simulation results show up to 31.9% reduction in transitions, with negligible performance loss. Analysis on the utility of the proposed technique for error detection shows that the technique can be used instead of the parity bit technique since both are found to have the same average error detection capability.
降低片外总线功耗已成为低功耗系统设计的关键问题之一。虽然已经提出了减少并行总线功耗的方法,但这些技术不适用于串行通信,因为它们处理连续的数据字。在同步串行通信中,数据线是除时钟线外的主要功耗来源。由于数据恢复的需要,不能修改时钟线。这项工作概述了一种新的基于转换反转的数据编码协议,通过该协议可以减少数据线上的这些转换,如JTAG, SPI, I2C等同步串行总线。仿真结果表明,转换减少了31.9%,而性能损失可以忽略不计。对所提出的错误检测技术的效用分析表明,由于发现两者具有相同的平均错误检测能力,因此可以使用该技术代替奇偶校验位技术。
{"title":"Transition Inversion Based Low Power Data Coding Scheme for Synchronous Serial Communication","authors":"A. Ramachandran, Bharghava Rajaram, M. Srinivas","doi":"10.1109/ISVLSI.2009.43","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.43","url":null,"abstract":"Reducing off-chip bus power consumption has become one of the key issues for low power system design. Although methods have been proposed to reduce the power dissipated in parallel buses, these techniques do not apply to serial communication since they work on consecutive data words. The data line in synchronous serial communication is a major source of power dissipation, apart from the clock line. The clock line cannot be modified due to the requirements of data recovery. This work outlines a novel transition inversion based data coding protocol by which these transitions on the data line can be reduced for synchronous serial buses like JTAG, SPI, I2C etc. Simulation results show up to 31.9% reduction in transitions, with negligible performance loss. Analysis on the utility of the proposed technique for error detection shows that the technique can be used instead of the parity bit technique since both are found to have the same average error detection capability.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131565101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
A High Performance Unified BCD and Binary Adder/Subtractor 一种高性能统一BCD和二进制加减法器
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.40
Anshul Singh, Aman Gupta, S. Veeramachaneni, M. Srinivas
Decimal data processing applications have grown exponentially in recent years thereby increasing the need to have hardware support for decimal arithmetic. In this paper, an improved architecture for efficient Binary Coded Decimal (BCD) addition/subtraction is presented that performs binary addition/subtraction without any extra hardware. The architecture works for both signed and unsigned numbers. The design is runtime reconfigurable and maximum utilization of the hardware is a feature of the architecture. Simulation results show that the proposed architecture is at least 32% better in terms of power-delay product than the existing designs.
近年来,十进制数据处理应用程序呈指数级增长,从而增加了对十进制算术的硬件支持的需求。本文提出了一种改进的高效二进制编码十进制(BCD)加减法体系结构,该体系结构无需额外的硬件即可执行二进制加减法。该体系结构适用于有符号数和无符号数。该设计是运行时可重构的,最大限度地利用硬件是该体系结构的一个特点。仿真结果表明,该架构在功率延迟积方面比现有设计至少提高32%。
{"title":"A High Performance Unified BCD and Binary Adder/Subtractor","authors":"Anshul Singh, Aman Gupta, S. Veeramachaneni, M. Srinivas","doi":"10.1109/ISVLSI.2009.40","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.40","url":null,"abstract":"Decimal data processing applications have grown exponentially in recent years thereby increasing the need to have hardware support for decimal arithmetic. In this paper, an improved architecture for efficient Binary Coded Decimal (BCD) addition/subtraction is presented that performs binary addition/subtraction without any extra hardware. The architecture works for both signed and unsigned numbers. The design is runtime reconfigurable and maximum utilization of the hardware is a feature of the architecture. Simulation results show that the proposed architecture is at least 32% better in terms of power-delay product than the existing designs.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132310324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Comparative Analysis of Timing Yield Improvement under Process Variations of Flip-Flops Circuits 触发器电路工艺变化下定时良率提高的比较分析
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.23
H. Mostafa, M. Anis, M. Elmasry
In synchronous systems, any violation of the timing constraints of the flip-flops can cause the overall system to malfunction. Moreover, the process variations create a large variability in the flip-flop delay in scaled technologies impacting the timing yield. Overtime, many gate sizing algorithms have been introduced to improve the timing yield. This paper presents an analysis of timing yield improvement of four commonly used flip-flops under process variations. These flip-flops are designed using STMicroelectronics 65-nm CMOS technology. The analyzed flip-flops are compared for power and power-delay product (PDP) overheads to achieve this timing yield improvement. The analysis shows that the sense amplifier based flip flop (SA-FF) has a power overhead and PDP overhead of 1.7X and 2.8X, respectively, much higher than that of the transmission-gate master-slave flip flop(TG-MSFF) . The TG-MSFF exhibits the lowest relative power and PDP overheads of 30.87% and 9% ,respectively.
在同步系统中,任何对触发器时序约束的违反都可能导致整个系统故障。此外,在影响时序良率的规模化技术中,工艺变化会造成触发器延迟的巨大变化。随着时间的推移,引入了许多门尺寸算法来提高时序良率。本文分析了四种常用触发器在工艺变化下的定时良率提高。这些触发器采用意法半导体65nm CMOS技术设计。对所分析的触发器进行了功率和功率延迟产品(PDP)开销的比较,以实现时序良率的提高。分析表明,基于感测放大器的触发器(SA-FF)的功率开销和PDP开销分别为1.7倍和2.8倍,远高于传输门主从触发器(TG-MSFF)。TG-MSFF的相对功率和PDP开销最低,分别为30.87%和9%。
{"title":"Comparative Analysis of Timing Yield Improvement under Process Variations of Flip-Flops Circuits","authors":"H. Mostafa, M. Anis, M. Elmasry","doi":"10.1109/ISVLSI.2009.23","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.23","url":null,"abstract":"In synchronous systems, any violation of the timing constraints of the flip-flops can cause the overall system to malfunction. Moreover, the process variations create a large variability in the flip-flop delay in scaled technologies impacting the timing yield. Overtime, many gate sizing algorithms have been introduced to improve the timing yield. This paper presents an analysis of timing yield improvement of four commonly used flip-flops under process variations. These flip-flops are designed using STMicroelectronics 65-nm CMOS technology. The analyzed flip-flops are compared for power and power-delay product (PDP) overheads to achieve this timing yield improvement. The analysis shows that the sense amplifier based flip flop (SA-FF) has a power overhead and PDP overhead of 1.7X and 2.8X, respectively, much higher than that of the transmission-gate master-slave flip flop(TG-MSFF) . The TG-MSFF exhibits the lowest relative power and PDP overheads of 30.87% and 9% ,respectively.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117270956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
期刊
2009 IEEE Computer Society Annual Symposium on VLSI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1