首页 > 最新文献

Proceedings Eleventh International Conference on VLSI Design最新文献

英文 中文
Coefficient transformations for area-efficient implementation of multiplier-less FIR filters 系数变换的面积效率实现的乘法器无FIR滤波器
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646587
M. Mehendale, S. B. Roy, S. Sherlekar, G. Venkatesh
Techniques based on common sub-computation extraction can be used to minimize number of additions in the multiplier-less implementations of Finite Impulse Response (FIR) filters. We present two types of coefficient transforms which used in conjunction with these techniques enable area-efficient realization of multiplier-less FIR filters. (i) Number theoretic transforms-that use redundant binary representations such as Canonical Sign Digit (CSD) (ii) Signal Flow Graph transformations that modify the coefficient values while retaining the output functionality. We demonstrate this with results of 6 different coefficient transforms for 14 low pass FIR filters with number of taps ranging from 16 to 128.
基于公共子计算提取的技术可用于在有限脉冲响应(FIR)滤波器的无乘法器实现中最小化加法数量。我们提出了两种类型的系数变换,它们与这些技术结合使用,可以实现无乘法器FIR滤波器的面积效率。(i)数论变换——使用冗余二进制表示,如规范符号数字(CSD); (ii)信号流图变换,在保留输出功能的同时修改系数值。我们用14个低通FIR滤波器的6种不同系数变换的结果证明了这一点,抽头数量从16到128不等。
{"title":"Coefficient transformations for area-efficient implementation of multiplier-less FIR filters","authors":"M. Mehendale, S. B. Roy, S. Sherlekar, G. Venkatesh","doi":"10.1109/ICVD.1998.646587","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646587","url":null,"abstract":"Techniques based on common sub-computation extraction can be used to minimize number of additions in the multiplier-less implementations of Finite Impulse Response (FIR) filters. We present two types of coefficient transforms which used in conjunction with these techniques enable area-efficient realization of multiplier-less FIR filters. (i) Number theoretic transforms-that use redundant binary representations such as Canonical Sign Digit (CSD) (ii) Signal Flow Graph transformations that modify the coefficient values while retaining the output functionality. We demonstrate this with results of 6 different coefficient transforms for 14 low pass FIR filters with number of taps ranging from 16 to 128.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"292 1-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132035300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A technique to improve capture range of a PLL in PRML read channel 一种提高PRML读通道锁相环捕获范围的技术
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646593
C. Srinivasan
A new technique to improve the capture range of a Phase Locked Loop (PLL) in the context of partial response signalling is presented. A known preamble is transmitted at the beginning to aid phase and frequency locking. Previous timing recovery techniques have a false locking problem for large initial frequency errors. The new technique eliminates this problem by using information available in the sampled preamble sequence. The improvement obtained is demonstrated using computer simulations.
提出了一种在部分响应信号环境下提高锁相环捕获范围的新技术。在开始时发送一个已知的前导,以帮助相位和频率锁定。以往的定时恢复技术在初始频率误差较大时存在误锁定问题。新技术通过利用采样前序中可用的信息消除了这个问题。通过计算机仿真验证了所得到的改进。
{"title":"A technique to improve capture range of a PLL in PRML read channel","authors":"C. Srinivasan","doi":"10.1109/ICVD.1998.646593","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646593","url":null,"abstract":"A new technique to improve the capture range of a Phase Locked Loop (PLL) in the context of partial response signalling is presented. A known preamble is transmitted at the beginning to aid phase and frequency locking. Previous timing recovery techniques have a false locking problem for large initial frequency errors. The new technique eliminates this problem by using information available in the sampled preamble sequence. The improvement obtained is demonstrated using computer simulations.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"516 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133035951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testability preserving and enhancing transformations for robust delay fault testability 鲁棒延迟故障可测性的保持和增强变换
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646635
Amey Karkare, M. Singla, Ajai Jain
Multilevel logic optimization transformations for DFT (design for testability) used in existing logic systems, are characterized with respect to their testability preserving and testability enhancing properties. In this paper, we propose three new transformations which preserve or improve path delay testability with reduction in circuitry. The paper also includes a theorem showing the condition under which a testability preserving transformation (TPT) will be a testability enhancing transformations (TET).
在现有的逻辑系统中,DFT(可测试性设计)的多级逻辑优化变换具有保持可测试性和增强可测试性的特点。在本文中,我们提出了三种新的变换,它们在电路减少的同时保持或提高了路径延迟的可测试性。本文还给出了一个定理,证明了保持可测试性变换(TPT)是增强可测试性变换(TET)的条件。
{"title":"Testability preserving and enhancing transformations for robust delay fault testability","authors":"Amey Karkare, M. Singla, Ajai Jain","doi":"10.1109/ICVD.1998.646635","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646635","url":null,"abstract":"Multilevel logic optimization transformations for DFT (design for testability) used in existing logic systems, are characterized with respect to their testability preserving and testability enhancing properties. In this paper, we propose three new transformations which preserve or improve path delay testability with reduction in circuitry. The paper also includes a theorem showing the condition under which a testability preserving transformation (TPT) will be a testability enhancing transformations (TET).","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133701071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Top-down approach to technology migration for full-custom mask layouts 自定义掩码布局的自顶向下技术迁移方法
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646577
Z. Apanovich, A. Marchuk
The approach to technology migration presented in this paper is based on a compaction and rerouting strategy. It takes as input the full-chip mask layout hierarchical description (CIF format) and produces as output the mask layout in the target design rules. The applicability of the compaction and rerouting facilities, and the flexibility of the routing layers redistribution between different levels of mask layout hierarchy, are provided by a procedure for mask layout decomposition. The decomposition procedure takes as input any node of the mask layout hierarchical description and extracts the fragments which should be transformed by means of compaction. The size of the extracted fragments is controlled by decomposition parameters. Each extracted fragment is processed by a symbolisation procedure which provides resizing and regeneration of elementary objects such as transistors, contacts and wires. The target mask layout for each fragment is generated by a compaction procedure which is controlled by the constraints extracted during the symbolisation step. The resulting chip mask layout is generated by a routing procedure which is controlled by the data structures (netlist and floorplan) extracted during the decomposition step.
本文提出的技术迁移方法是基于压缩和重路由策略。它将全芯片掩码布局分层描述(CIF格式)作为输入,并产生目标设计规则中的掩码布局作为输出。通过掩码布局分解程序,提供了压缩和重路由功能的适用性和路由层在不同掩码布局层次之间重新分配的灵活性。分解过程以掩模布局分层描述的任意节点为输入,通过压缩提取需要变换的碎片。提取的碎片大小由分解参数控制。每个提取的片段都通过一个符号化程序进行处理,该程序提供了晶体管、触点和电线等基本对象的大小调整和再生。每个片段的目标掩码布局由一个压缩过程生成,该压缩过程由符号化步骤中提取的约束控制。由此产生的芯片掩码布局由路由过程生成,路由过程由分解步骤中提取的数据结构(网表和平面图)控制。
{"title":"Top-down approach to technology migration for full-custom mask layouts","authors":"Z. Apanovich, A. Marchuk","doi":"10.1109/ICVD.1998.646577","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646577","url":null,"abstract":"The approach to technology migration presented in this paper is based on a compaction and rerouting strategy. It takes as input the full-chip mask layout hierarchical description (CIF format) and produces as output the mask layout in the target design rules. The applicability of the compaction and rerouting facilities, and the flexibility of the routing layers redistribution between different levels of mask layout hierarchy, are provided by a procedure for mask layout decomposition. The decomposition procedure takes as input any node of the mask layout hierarchical description and extracts the fragments which should be transformed by means of compaction. The size of the extracted fragments is controlled by decomposition parameters. Each extracted fragment is processed by a symbolisation procedure which provides resizing and regeneration of elementary objects such as transistors, contacts and wires. The target mask layout for each fragment is generated by a compaction procedure which is controlled by the constraints extracted during the symbolisation step. The resulting chip mask layout is generated by a routing procedure which is controlled by the data structures (netlist and floorplan) extracted during the decomposition step.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"173 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114445574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Arbitrary precision arithmetic-SIMD style 任意精度算术- simd风格
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646590
S. Balakrishnan, S. Nandy
Current day general purpose processors have been enhanced with what is called "media instruction set" to achieve performance gains in applications that are media processing intensive. The instruction set that has been added exploits the fact that media applications have small native datatypes and have widths much less than that supported by commercial processors and the plethora of data-parallelism in such applications. Current processors enhanced with the "media instruction set" support arithmetic on sub-datatypes of only 8-bit, 16-bit, 32-bit and 64-bit precision. In this paper we motivate the need for arbitrary precision packed arithmetic wherein the width of the sub-datatypes are programmable by the user and propose an implementation for arithmetic on such packed datatypes. The proposed scheme has marginal hardware overhead over conventional implementations of arithmetic on processors incorporating a multimedia extended instruction set.
当前的通用处理器已经通过所谓的“媒体指令集”进行了增强,以在媒体处理密集型应用程序中实现性能提升。添加的指令集利用了媒体应用程序具有较小的本机数据类型和宽度远小于商业处理器所支持的数据类型和宽度这一事实,以及此类应用程序中大量的数据并行性。当前使用“媒体指令集”增强的处理器只支持8位、16位、32位和64位精度的子数据类型运算。在本文中,我们激发了对任意精度封装算法的需求,其中子数据类型的宽度是由用户可编程的,并提出了在这种封装数据类型上的算法实现。与传统的包含多媒体扩展指令集的处理器上的算法实现相比,所提出的方案具有边际的硬件开销。
{"title":"Arbitrary precision arithmetic-SIMD style","authors":"S. Balakrishnan, S. Nandy","doi":"10.1109/ICVD.1998.646590","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646590","url":null,"abstract":"Current day general purpose processors have been enhanced with what is called \"media instruction set\" to achieve performance gains in applications that are media processing intensive. The instruction set that has been added exploits the fact that media applications have small native datatypes and have widths much less than that supported by commercial processors and the plethora of data-parallelism in such applications. Current processors enhanced with the \"media instruction set\" support arithmetic on sub-datatypes of only 8-bit, 16-bit, 32-bit and 64-bit precision. In this paper we motivate the need for arbitrary precision packed arithmetic wherein the width of the sub-datatypes are programmable by the user and propose an implementation for arithmetic on such packed datatypes. The proposed scheme has marginal hardware overhead over conventional implementations of arithmetic on processors incorporating a multimedia extended instruction set.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114261869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
New net models for spectral netlist partitioning 谱网表划分的新网络模型
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646642
P. Rao, C.S. Jayathirtha, C.S. Raghavendraprasad
Spectral approaches for partitioning netlists that use the eigenvectors of a matrix derived from a weighted graph model of the netlist (hypergraph) have been attracting considerable attention. There are several known ways in which a weighted graph could be derived from the netlist. However, the effectiveness of these alternate net models for netlist partitioning has remained unexplored. In this paper we first evaluate the relative performance of these approaches and establish that the quality of the partition is sensitive to the choice of the model. We also propose and investigate a number of new approaches for deriving a weighted graph model for a netlist. We show through test results on benchmark partitioning problems that one of the new models proposed here, performs consistently better than all the other models.
划分网表的谱方法使用从网表(超图)的加权图模型派生的矩阵的特征向量,已经引起了相当大的关注。有几种已知的方法可以从网表导出加权图。然而,这些用于网表划分的替代网络模型的有效性仍未得到探索。在本文中,我们首先评价了这些方法的相对性能,并建立了分区的质量对模型的选择是敏感的。我们还提出并研究了一些新的方法来推导网表的加权图模型。我们通过对基准划分问题的测试结果表明,这里提出的一个新模型的性能始终优于所有其他模型。
{"title":"New net models for spectral netlist partitioning","authors":"P. Rao, C.S. Jayathirtha, C.S. Raghavendraprasad","doi":"10.1109/ICVD.1998.646642","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646642","url":null,"abstract":"Spectral approaches for partitioning netlists that use the eigenvectors of a matrix derived from a weighted graph model of the netlist (hypergraph) have been attracting considerable attention. There are several known ways in which a weighted graph could be derived from the netlist. However, the effectiveness of these alternate net models for netlist partitioning has remained unexplored. In this paper we first evaluate the relative performance of these approaches and establish that the quality of the partition is sensitive to the choice of the model. We also propose and investigate a number of new approaches for deriving a weighted graph model for a netlist. We show through test results on benchmark partitioning problems that one of the new models proposed here, performs consistently better than all the other models.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129617812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
False path detection at transistor level 在晶体管级误路检测
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646607
Abhijit Das, Samrat Sen, Mohan Rangan, Rupesh Nayak, G. Nandakumar
The prevalent concern over static timing analysis is that it might produce a very pessimistic result in presence of false paths in the circuit. It is therefore essential to detect and avoid the false paths during timing analysis, to estimate the timing characteristics of the design better. In this paper a framework is presented to automatically detect false paths at the transistor level. The false path detection involves logic extraction from a transistor level netlist and then detecting false paths, with more accurate estimation of the path delays.
对静态时序分析的普遍关注是,它可能在电路中存在假路径时产生非常悲观的结果。因此,在时序分析中检测和避免假路径,以更好地估计设计的时序特性是至关重要的。本文提出了一个在晶体管级自动检测误路的框架。假路径检测包括从晶体管级网表中提取逻辑,然后检测假路径,更准确地估计路径延迟。
{"title":"False path detection at transistor level","authors":"Abhijit Das, Samrat Sen, Mohan Rangan, Rupesh Nayak, G. Nandakumar","doi":"10.1109/ICVD.1998.646607","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646607","url":null,"abstract":"The prevalent concern over static timing analysis is that it might produce a very pessimistic result in presence of false paths in the circuit. It is therefore essential to detect and avoid the false paths during timing analysis, to estimate the timing characteristics of the design better. In this paper a framework is presented to automatically detect false paths at the transistor level. The false path detection involves logic extraction from a transistor level netlist and then detecting false paths, with more accurate estimation of the path delays.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129961116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Circuit design using resonant tunneling diodes 使用谐振隧道二极管的电路设计
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646656
P. Mazumder, S. Kulkarni, M. Bhattacharya, Alejandro F. González
Picosecond switching speeds and folded current voltage characteristics have made quantum tunneling devices promising alternatives for high-speed and compact VLSI circuit design. This paper describes new bistable digital logic circuit topologies that use resonant tunneling diodes (RTDs) in conjunction with heterojunction bipolar transistors (HBTs) and modulation-doped field effect transistors (MODFETs). The designed circuits include a single-gate, self-latching MAJORITY function besides basic NAND, NOR and inverter gates. The application of these circuits in the design of high-performance adders and parallel correlators is discussed. We also review multiple-valued logic (MVL) applications of RTDs that achieve significant compaction in terms of device count over comparable binary logic implementations in conventional technologies. These include a four-valued 4:1 multiplexer using 13 resonant tunneling bipolar transistors (RTBTs) and HBTs, a mask programmable four-valued, single-input gate using 4 RTDs and 14 HBTs, and a four-step countdown circuit using 1 RTD and 3 HBTs.
皮秒开关速度和折叠电流电压特性使量子隧道器件成为高速和紧凑VLSI电路设计的有希望的替代方案。本文描述了一种新的双稳态数字逻辑电路拓扑结构,该电路使用谐振隧道二极管(rtd)与异质结双极晶体管(HBTs)和调制掺杂场效应晶体管(modfet)相结合。设计的电路除了基本的NAND门、NOR门和逆变门外,还包括一个单门、自锁存的MAJORITY功能。讨论了这些电路在高性能加法器和并行相关器设计中的应用。我们还回顾了rtd的多值逻辑(MVL)应用,这些应用在器件数量方面比传统技术中可比较的二进制逻辑实现实现显著压缩。其中包括使用13个谐振隧道双极晶体管(rtbt)和hbt的四值4:1多路复用器,使用4个RTD和14个hbt的掩模可编程四值单输入门,以及使用1个RTD和3个hbt的四步倒计时电路。
{"title":"Circuit design using resonant tunneling diodes","authors":"P. Mazumder, S. Kulkarni, M. Bhattacharya, Alejandro F. González","doi":"10.1109/ICVD.1998.646656","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646656","url":null,"abstract":"Picosecond switching speeds and folded current voltage characteristics have made quantum tunneling devices promising alternatives for high-speed and compact VLSI circuit design. This paper describes new bistable digital logic circuit topologies that use resonant tunneling diodes (RTDs) in conjunction with heterojunction bipolar transistors (HBTs) and modulation-doped field effect transistors (MODFETs). The designed circuits include a single-gate, self-latching MAJORITY function besides basic NAND, NOR and inverter gates. The application of these circuits in the design of high-performance adders and parallel correlators is discussed. We also review multiple-valued logic (MVL) applications of RTDs that achieve significant compaction in terms of device count over comparable binary logic implementations in conventional technologies. These include a four-valued 4:1 multiplexer using 13 resonant tunneling bipolar transistors (RTBTs) and HBTs, a mask programmable four-valued, single-input gate using 4 RTDs and 14 HBTs, and a four-step countdown circuit using 1 RTD and 3 HBTs.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114715623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Speeding up program execution using reconfigurable hardware and a hardware function library 使用可重构硬件和硬件函数库加速程序执行
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646641
S. Jain, M. Balakrishnan, Anshul Kumar, Shashi Kumar
This paper describes a co-design environment which follows a new approach for speeding up compute intensive applications. The environment consists of three major components. First, a target architecture consisting of a uniprocessor host and a board with dynamically reconfigurable FPGAs and memory modules; second, a library of functions pre-synthesized for hardware or software implementation; and third, a tool which takes as input an application described in C and partitions it into hardware and software parts at functional granularity using information obtained by profiling the application. An important feature of the partitioning tool is a new efficient heuristic specifically suited for the architecture with reconfigurable hardware.
本文描述了一种协同设计环境,它遵循了一种加速计算密集型应用程序的新方法。环境由三个主要部分组成。首先,目标架构由单处理器主机和具有动态可重构fpga和存储模块的板组成;第二,预合成用于硬件或软件实现的函数库;第三,该工具以C语言描述的应用程序为输入,并根据应用程序分析获得的信息按功能粒度将其划分为硬件和软件部分。分区工具的一个重要特性是一种新的有效的启发式方法,特别适用于具有可重构硬件的体系结构。
{"title":"Speeding up program execution using reconfigurable hardware and a hardware function library","authors":"S. Jain, M. Balakrishnan, Anshul Kumar, Shashi Kumar","doi":"10.1109/ICVD.1998.646641","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646641","url":null,"abstract":"This paper describes a co-design environment which follows a new approach for speeding up compute intensive applications. The environment consists of three major components. First, a target architecture consisting of a uniprocessor host and a board with dynamically reconfigurable FPGAs and memory modules; second, a library of functions pre-synthesized for hardware or software implementation; and third, a tool which takes as input an application described in C and partitions it into hardware and software parts at functional granularity using information obtained by profiling the application. An important feature of the partitioning tool is a new efficient heuristic specifically suited for the architecture with reconfigurable hardware.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115011555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A pipelined parallel processor to implement MD4 message digest algorithm on Xilinx FPGA 在Xilinx FPGA上实现MD4消息摘要算法的流水线并行处理器
Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646640
M. B. Sherigar, A. Mahadevan, K. S. Kumar, D. S. Sumam
The paper presents a pipelined parallel processor architecture design to implement the MD4 message digest algorithm which computes the message digest or the fingerprint of 128 bit fixed length, for any arbitrary length of input message. The processor implements the arithmetic, logic and circular shift operations by a pipelined parallel process. The architecture is designed to suit the design flexibility of the Xilinx Field Programmable Gate Arrays (FPGAs) The processor reads the message from an external RAM, 16-bit at a time and internal operations are performed with 32-bit data. The major advantage of the design is increased speed of computation and minimum hardware. The processor computes the digest with a speed approximately three times faster than the software version implemented in DSP processors.
本文提出了一种实现MD4消息摘要算法的流水线并行处理器架构设计,该算法可以对任意长度的输入消息计算128位固定长度的消息摘要或指纹。处理器通过流水线并行处理实现算术、逻辑和循环移位操作。该架构旨在适应Xilinx现场可编程门阵列(fpga)的设计灵活性。处理器从外部RAM读取消息,每次16位,内部操作使用32位数据执行。该设计的主要优点是提高了计算速度和最小的硬件。该处理器计算摘要的速度比在DSP处理器中实现的软件版本快大约三倍。
{"title":"A pipelined parallel processor to implement MD4 message digest algorithm on Xilinx FPGA","authors":"M. B. Sherigar, A. Mahadevan, K. S. Kumar, D. S. Sumam","doi":"10.1109/ICVD.1998.646640","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646640","url":null,"abstract":"The paper presents a pipelined parallel processor architecture design to implement the MD4 message digest algorithm which computes the message digest or the fingerprint of 128 bit fixed length, for any arbitrary length of input message. The processor implements the arithmetic, logic and circular shift operations by a pipelined parallel process. The architecture is designed to suit the design flexibility of the Xilinx Field Programmable Gate Arrays (FPGAs) The processor reads the message from an external RAM, 16-bit at a time and internal operations are performed with 32-bit data. The major advantage of the design is increased speed of computation and minimum hardware. The processor computes the digest with a speed approximately three times faster than the software version implemented in DSP processors.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123705402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings Eleventh International Conference on VLSI Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1