首页 > 最新文献

Proceedings Design, Automation and Test in Europe Conference and Exhibition最新文献

英文 中文
Design of very deep pipelined multipliers for FPGAs fpga的深管道乘法器设计
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269200
A. Panato, S. V. Silva, F. Wagner, M. Johann, R. Reis, S. Bampi
This work investigates the use of very deep pipelines for implementing circuits in FPGAs, where each pipeline stage is limited to a single FPGA logic element (LE). The architecture and VHDL design of a parameterized integer array multiplier is presented and also an IEEE 754 compliant 32-bit floating-point multiplier. We show how to write VHDL cells that implement such approach, and how the array multiplier architecture was adapted. Synthesis and simulation were performed for Altera Apex20KE devices, although the VHDL code should be portable to other devices. For this family, a 16 bit integer multiplier achieves a frequency of 266 MHz, while the floating point unit reaches 235 MHz, performing 235 MFLOPS in an FPGA. Additional cells are inserted to synchronize data, what imposes significant area penalties. This and other considerations to apply the technique in real designs are also addressed.
这项工作研究了在FPGA中使用非常深的管道来实现电路,其中每个管道阶段仅限于单个FPGA逻辑元件(LE)。给出了参数化整数阵列乘法器的结构和VHDL设计,并给出了一个符合IEEE 754标准的32位浮点乘法器。我们展示了如何编写实现这种方法的VHDL单元,以及如何适应阵列乘法器架构。对Altera Apex20KE设备进行了综合和仿真,尽管VHDL代码应该可移植到其他设备。对于该系列,16位整数乘法器的频率达到266 MHz,而浮点单元的频率达到235 MHz,在FPGA中执行235 MFLOPS。插入额外的单元格来同步数据,这会造成很大的面积损失。本文还讨论了将该技术应用于实际设计中的其他考虑因素。
{"title":"Design of very deep pipelined multipliers for FPGAs","authors":"A. Panato, S. V. Silva, F. Wagner, M. Johann, R. Reis, S. Bampi","doi":"10.1109/DATE.2004.1269200","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269200","url":null,"abstract":"This work investigates the use of very deep pipelines for implementing circuits in FPGAs, where each pipeline stage is limited to a single FPGA logic element (LE). The architecture and VHDL design of a parameterized integer array multiplier is presented and also an IEEE 754 compliant 32-bit floating-point multiplier. We show how to write VHDL cells that implement such approach, and how the array multiplier architecture was adapted. Synthesis and simulation were performed for Altera Apex20KE devices, although the VHDL code should be portable to other devices. For this family, a 16 bit integer multiplier achieves a frequency of 266 MHz, while the floating point unit reaches 235 MHz, performing 235 MFLOPS in an FPGA. Additional cells are inserted to synchronize data, what imposes significant area penalties. This and other considerations to apply the technique in real designs are also addressed.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115596506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Efficient implementations of mobile video computations on domain-specific reconfigurable arrays 针对特定领域可重构阵列的移动视频计算的高效实现
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269064
S. Khawam, S. Baloch, A. Pai, Imran Ahmed, N. Aydin, T. Arslan, F. Westall
Mobile video processing as defined in standards like MPEG-4 and H.263 contains a number of timeconsuming computations that cannot be efficiently executed on current hardware architectures. The authors recently introduced a reconfigurable SoC platform that permits a low-power, high-throughput and flexible implementation of the motion estimation and DCT algorithms. The computations are done using domainspecific reconfigurable arrays that have demonstrated up to 75% reduction in power consumption when compared to generic FPGA architecture, which makes them suitable for portable devices. This paper presents and compares different configurations of the arrays to efficiently implementing DCT and motion estimation algorithms. A number of algorithms are mapped into the various reconfigurable fabrics demonstrating the flexibility of the new reconfigurable SoC architecture and its ability to support a number of implementations having different performance characteristics.
在MPEG-4和H.263等标准中定义的移动视频处理包含大量耗时的计算,在当前的硬件架构上无法有效执行。作者最近推出了一种可重构的SoC平台,该平台允许低功耗,高吞吐量和灵活地实现运动估计和DCT算法。计算是使用特定领域的可重构阵列完成的,与通用FPGA架构相比,该阵列的功耗降低了75%,这使得它们适用于便携式设备。本文提出并比较了阵列的不同配置,以有效地实现DCT和运动估计算法。许多算法被映射到各种可重构结构中,展示了新的可重构SoC架构的灵活性,以及它支持具有不同性能特征的许多实现的能力。
{"title":"Efficient implementations of mobile video computations on domain-specific reconfigurable arrays","authors":"S. Khawam, S. Baloch, A. Pai, Imran Ahmed, N. Aydin, T. Arslan, F. Westall","doi":"10.1109/DATE.2004.1269064","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269064","url":null,"abstract":"Mobile video processing as defined in standards like MPEG-4 and H.263 contains a number of timeconsuming computations that cannot be efficiently executed on current hardware architectures. The authors recently introduced a reconfigurable SoC platform that permits a low-power, high-throughput and flexible implementation of the motion estimation and DCT algorithms. The computations are done using domainspecific reconfigurable arrays that have demonstrated up to 75% reduction in power consumption when compared to generic FPGA architecture, which makes them suitable for portable devices. This paper presents and compares different configurations of the arrays to efficiently implementing DCT and motion estimation algorithms. A number of algorithms are mapped into the various reconfigurable fabrics demonstrating the flexibility of the new reconfigurable SoC architecture and its ability to support a number of implementations having different performance characteristics.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114711740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Bandwidth-constrained mapping of cores onto NoC architectures 带宽受限的内核映射到NoC架构
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269002
S. Murali, G. Micheli
We address the design of complex monolithic systems, where processing cores generate and consume a varying and large amount of data, thus bringing the communication links to the edge of congestion. Typical applications are in the area of multi-media processing. We consider a mesh-based networks on chip (NoC) architecture, and we explore the assignment of cores to mesh cross-points so that the traffic on links satisfies bandwidth constraints. A single-path deterministic routing between the cores places high bandwidth demands on the links. The bandwidth requirements can be significantly reduced by splitting the traffic between the cores across multiple paths. In this paper, we present NMAP, a fast algorithm that maps the cores onto a mesh NoC architecture under bandwidth constraints, minimizing the average communication delay. The NMAP algorithm is presented for both single minimum-path routing and split-traffic routing. The algorithm is applied to a benchmark DSP design and the resulting NoC is built and simulated at cycle accurate level in SystemC using macros from the /spl times/pipes library. Also, experiments with six video processing applications show significant savings in bandwidth and communication cost for NMAP algorithm when compared to existing algorithms.
我们解决了复杂单片系统的设计,其中处理核心生成和消耗大量变化的数据,从而将通信链路带到拥塞的边缘。典型的应用是在多媒体处理领域。我们考虑了一种基于网格的片上网络(NoC)架构,并探索了将核心分配到网格交叉点,从而使链路上的流量满足带宽限制。核心之间的单路径确定性路由对链路提出了高带宽要求。通过在多个路径上分割核心之间的流量,可以显著降低带宽需求。在本文中,我们提出了NMAP,一种在带宽限制下将核心映射到网状NoC架构的快速算法,最大限度地减少了平均通信延迟。提出了单最小路径路由和分流路由的NMAP算法。该算法应用于基准DSP设计,并在SystemC中使用/spl times/pipes库中的宏构建和模拟了周期精确级别的NoC。此外,在6个视频处理应用中进行的实验表明,与现有算法相比,NMAP算法显著节省了带宽和通信成本。
{"title":"Bandwidth-constrained mapping of cores onto NoC architectures","authors":"S. Murali, G. Micheli","doi":"10.1109/DATE.2004.1269002","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269002","url":null,"abstract":"We address the design of complex monolithic systems, where processing cores generate and consume a varying and large amount of data, thus bringing the communication links to the edge of congestion. Typical applications are in the area of multi-media processing. We consider a mesh-based networks on chip (NoC) architecture, and we explore the assignment of cores to mesh cross-points so that the traffic on links satisfies bandwidth constraints. A single-path deterministic routing between the cores places high bandwidth demands on the links. The bandwidth requirements can be significantly reduced by splitting the traffic between the cores across multiple paths. In this paper, we present NMAP, a fast algorithm that maps the cores onto a mesh NoC architecture under bandwidth constraints, minimizing the average communication delay. The NMAP algorithm is presented for both single minimum-path routing and split-traffic routing. The algorithm is applied to a benchmark DSP design and the resulting NoC is built and simulated at cycle accurate level in SystemC using macros from the /spl times/pipes library. Also, experiments with six video processing applications show significant savings in bandwidth and communication cost for NMAP algorithm when compared to existing algorithms.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"241 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114466275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 737
Efficient modular testing of SoCs using dual-speed TAM architectures 采用双速TAM架构的soc高效模块化测试
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268883
Anuja Sehgal, K. Chakrabarty
The increasing complexity of system-on-chip (SOC) integrated circuits has spurred the development of versatile automatic test equipment (ATE) that can simultaneously drive different channels at different data rates. Examples of such ATEs include the Agilent 93000 series tester based on port scalability and the test processor-per-pin architecture, and the Tiger system from Teradyne. The number of tester channels with high data rates may be constrained in practice however due to ATE resource limitations, the power rating of the SOC, and scan frequency limits for the embedded cores. Therefore, we formulate the following optimization problem: given two available data rates for the tester channels, an SOC-level test access mechanism (TAM) width W, V (V < W) channels that can transport test data at the higher data rate, determine an SOC TAM architecture that minimizes the testing time. We present an efficient heuristic algorithm for TAM optimization that exploits port scalability of ATEs to reduce SOC testing time and test cost. We present experimental results on dual-speed TAM optimization for the ITC'2002 SOC test benchmarks.
片上系统(SOC)集成电路的复杂性日益增加,促使了多功能自动测试设备(ATE)的发展,这些设备可以同时以不同的数据速率驱动不同的通道。此类ATEs的示例包括基于端口可伸缩性和测试处理器每引脚架构的Agilent 93000系列测试仪,以及Teradyne的Tiger系统。然而,由于ATE资源限制、SOC的额定功率以及嵌入式内核的扫描频率限制,具有高数据速率的测试通道的数量可能在实践中受到限制。因此,我们制定了以下优化问题:给定两种可用的测试通道数据速率,一个SOC级测试访问机制(TAM)宽度为W, V (V < W)的通道可以以更高的数据速率传输测试数据,确定一个SOC TAM架构,使测试时间最小化。我们提出了一种有效的启发式算法,该算法利用ATEs的端口可扩展性来减少SOC测试时间和测试成本。我们在ITC'2002 SOC测试基准上给出了双速度TAM优化的实验结果。
{"title":"Efficient modular testing of SoCs using dual-speed TAM architectures","authors":"Anuja Sehgal, K. Chakrabarty","doi":"10.1109/DATE.2004.1268883","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268883","url":null,"abstract":"The increasing complexity of system-on-chip (SOC) integrated circuits has spurred the development of versatile automatic test equipment (ATE) that can simultaneously drive different channels at different data rates. Examples of such ATEs include the Agilent 93000 series tester based on port scalability and the test processor-per-pin architecture, and the Tiger system from Teradyne. The number of tester channels with high data rates may be constrained in practice however due to ATE resource limitations, the power rating of the SOC, and scan frequency limits for the embedded cores. Therefore, we formulate the following optimization problem: given two available data rates for the tester channels, an SOC-level test access mechanism (TAM) width W, V (V < W) channels that can transport test data at the higher data rate, determine an SOC TAM architecture that minimizes the testing time. We present an efficient heuristic algorithm for TAM optimization that exploits port scalability of ATEs to reduce SOC testing time and test cost. We present experimental results on dual-speed TAM optimization for the ITC'2002 SOC test benchmarks.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"6 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116742643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Managing don't cares in Boolean satisfiability 管理并不关心布尔的可满足性
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268858
Sean Safarpour, A. Veneris, R. Drechsler, Joanne Lee
Advances in Boolean satisfiability solvers have popularized their use in many of today's CAD VLSI challenges. Existing satisfiability solvers operate on a circuit representation that does not capture all of the structural circuit characteristics and properties. This work proposes algorithms that take into account the circuit don't care conditions thus enhancing the performance of these tools. Don't care sets are addressed in this work both statically and dynamically to reduce the search space and guide the decision making process. Experiments demonstrate performance gains.
布尔可满足性解算器的进步已经普及了它们在当今许多CAD VLSI挑战中的应用。现有的可满足性求解器在电路表示上运行,不能捕获所有结构电路的特征和属性。这项工作提出了考虑电路不关心条件的算法,从而提高了这些工具的性能。在这项工作中,不关心集被静态和动态地处理,以减少搜索空间并指导决策过程。实验证明了性能的提高。
{"title":"Managing don't cares in Boolean satisfiability","authors":"Sean Safarpour, A. Veneris, R. Drechsler, Joanne Lee","doi":"10.1109/DATE.2004.1268858","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268858","url":null,"abstract":"Advances in Boolean satisfiability solvers have popularized their use in many of today's CAD VLSI challenges. Existing satisfiability solvers operate on a circuit representation that does not capture all of the structural circuit characteristics and properties. This work proposes algorithms that take into account the circuit don't care conditions thus enhancing the performance of these tools. Don't care sets are addressed in this work both statically and dynamically to reduce the search space and guide the decision making process. Experiments demonstrate performance gains.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116177854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Building the hierarchy from a flat netlist for a fast and accurate post-layout simulation with parasitic components 从平面网表构建层次结构,用于快速准确的寄生组件布局后仿真
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269268
P. Daglio, D. Iezzi, Danilo Rimondi, C. Roma, Salvatore Santapa
Main concerns related to post-layout simulation, today, are about the format of the netlist coming out from the parasitic extractor. In fact, such a netlist is usually flat so that readability, whether compared to the pre-layout hierarchical one, is very poor due to device and net names which often change and to the difficulty to compare pre-layout and post-layout output signals. Furthermore, simulating such large flat netlists is frequently time consuming because it is not possible to exploit algorithms like hierarchical array reduction (HAR) and isomorphic matching (IM), strength points of state-of-the-art full chip simulators. In this paper, we present a new approach that, starting from a flat netlist with parasitic components and a pre-layout hierarchical one, allows to create a fully hierarchical post-layout netlist containing device parameters and parasitic components directly extracted from the layout. In this way, a fast and accurate post-layout simulation is made possible by the use of look-up table simulators, taking advantages from the HAR and IM algorithms as mentioned before. This methodology has been integrated in a complete design flow to guarantee first silicon success, cut down time-to-design, improve time-to-market and streamline design quality.
今天,与布局后仿真相关的主要问题是寄生提取器产生的网表格式。事实上,这样的网络列表通常是扁平的,因此,无论与布局前的分层列表相比,由于设备和网络名称经常变化,并且难以比较布局前和布局后的输出信号,因此可读性非常差。此外,模拟如此大的平面网络列表通常非常耗时,因为不可能利用分层阵列缩减(HAR)和同构匹配(IM)等算法,这些算法是最先进的全芯片模拟器的优势点。在本文中,我们提出了一种新的方法,从具有寄生组件的平面网络列表和布局前分层网络列表开始,允许创建一个完全分层的布局后网络列表,其中包含直接从布局中提取的设备参数和寄生组件。这样,通过使用查找表模拟器,利用前面提到的HAR和IM算法的优势,可以实现快速准确的布局后仿真。该方法已集成在一个完整的设计流程中,以保证第一个硅的成功,缩短设计时间,提高上市时间并简化设计质量。
{"title":"Building the hierarchy from a flat netlist for a fast and accurate post-layout simulation with parasitic components","authors":"P. Daglio, D. Iezzi, Danilo Rimondi, C. Roma, Salvatore Santapa","doi":"10.1109/DATE.2004.1269268","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269268","url":null,"abstract":"Main concerns related to post-layout simulation, today, are about the format of the netlist coming out from the parasitic extractor. In fact, such a netlist is usually flat so that readability, whether compared to the pre-layout hierarchical one, is very poor due to device and net names which often change and to the difficulty to compare pre-layout and post-layout output signals. Furthermore, simulating such large flat netlists is frequently time consuming because it is not possible to exploit algorithms like hierarchical array reduction (HAR) and isomorphic matching (IM), strength points of state-of-the-art full chip simulators. In this paper, we present a new approach that, starting from a flat netlist with parasitic components and a pre-layout hierarchical one, allows to create a fully hierarchical post-layout netlist containing device parameters and parasitic components directly extracted from the layout. In this way, a fast and accurate post-layout simulation is made possible by the use of look-up table simulators, taking advantages from the HAR and IM algorithms as mentioned before. This methodology has been integrated in a complete design flow to guarantee first silicon success, cut down time-to-design, improve time-to-market and streamline design quality.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115133464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
SoftContract: an assertion-based software development process that enables design-by-contract SoftContract:基于断言的软件开发过程,支持契约式设计
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268873
J. Brunel, M. Natale, A. Ferrari, P. Giusto, L. Lavagno
This paper discusses a model-based design flow for requirements in distributed embedded software development. Such requirements are specified using a language similar to linear temporal logic which allows one to reason about time and sequencing. They consist of assertions which must hold for a design, given some assumptions on its environment. They can be checked both during simulation and, at least for a subset, even on the target. The key contribution of the paper is the extension to the embedded software domain of assertion-based verification, and the automated generation of property-checking code in multiple target languages, from simulation, to prototyping, to final production.
本文讨论了分布式嵌入式软件开发中基于模型的需求设计流程。这样的需求是用一种类似于线性时间逻辑的语言来指定的,这种语言允许人们对时间和顺序进行推理。它们由断言组成,这些断言必须在给定其环境的某些假设的情况下支持设计。它们可以在模拟期间进行检查,至少对于一个子集,甚至可以在目标上进行检查。本文的主要贡献是扩展了基于断言的验证的嵌入式软件领域,以及从仿真到原型设计再到最终产品的多种目标语言的属性检查代码的自动生成。
{"title":"SoftContract: an assertion-based software development process that enables design-by-contract","authors":"J. Brunel, M. Natale, A. Ferrari, P. Giusto, L. Lavagno","doi":"10.1109/DATE.2004.1268873","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268873","url":null,"abstract":"This paper discusses a model-based design flow for requirements in distributed embedded software development. Such requirements are specified using a language similar to linear temporal logic which allows one to reason about time and sequencing. They consist of assertions which must hold for a design, given some assumptions on its environment. They can be checked both during simulation and, at least for a subset, even on the target. The key contribution of the paper is the extension to the embedded software domain of assertion-based verification, and the automated generation of property-checking code in multiple target languages, from simulation, to prototyping, to final production.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124676588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
An asynchronous synthesis toolset using Verilog 使用Verilog的异步合成工具集
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268948
F. Burns, D. Shang, A. Koelmans, A. Yakovlev
We present a new CAD tool set for generating asynchronous circuits from high-level Verilog level-sensitive specifications. Initially, high-level Verilog descriptions are compiled and converted into a novel intermediate Petri net format. The intermediate format is subsequently passed to optimization tools and mapping tools where it is directly mapped into asynchronous datapath and control circuits using David cells (DCs). Finally, logic optimization tools are applied to generate speed-independent (SI) circuits. The speed independent circuits generated perform well compared to circuits generated by existing asynchronous tools.
我们提出了一种新的CAD工具集,用于从高级Verilog电平敏感规范生成异步电路。最初,编译高级Verilog描述并将其转换为一种新的中间Petri网格式。中间格式随后被传递给优化工具和映射工具,在那里它被直接映射到使用戴维单元(dc)的异步数据路径和控制电路中。最后,应用逻辑优化工具生成速度无关(SI)电路。与现有异步工具生成的电路相比,所生成的速度无关电路性能良好。
{"title":"An asynchronous synthesis toolset using Verilog","authors":"F. Burns, D. Shang, A. Koelmans, A. Yakovlev","doi":"10.1109/DATE.2004.1268948","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268948","url":null,"abstract":"We present a new CAD tool set for generating asynchronous circuits from high-level Verilog level-sensitive specifications. Initially, high-level Verilog descriptions are compiled and converted into a novel intermediate Petri net format. The intermediate format is subsequently passed to optimization tools and mapping tools where it is directly mapped into asynchronous datapath and control circuits using David cells (DCs). Finally, logic optimization tools are applied to generate speed-independent (SI) circuits. The speed independent circuits generated perform well compared to circuits generated by existing asynchronous tools.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124937861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An application of parallel discrete event simulation algorithms to mixed domain system simulation 并行离散事件仿真算法在混合域系统仿真中的应用
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269085
D. K. Reed, S. Levitan, J. Boles, J. A. Martínez, D. Chiarulli
We present our system-level co-simulation environment for mixed domain microsystems. The environment provides synchronization and co-simulation between the Chatoyant MOEMS (micro-electro mechanical systems) simulator and ModelTech ModelSim. By using shared memory IPC (inter-process communication) and PDES (parallel discrete event simulation) techniques, we achieve two orders of magnitude speedup over standard pipe/socket communication.
提出了混合域微系统的系统级协同仿真环境。该环境提供了Chatoyant MOEMS(微机电系统)模拟器和ModelTech ModelSim之间的同步和联合仿真。通过使用共享内存IPC(进程间通信)和PDES(并行离散事件模拟)技术,我们实现了比标准管道/套接字通信两个数量级的加速。
{"title":"An application of parallel discrete event simulation algorithms to mixed domain system simulation","authors":"D. K. Reed, S. Levitan, J. Boles, J. A. Martínez, D. Chiarulli","doi":"10.1109/DATE.2004.1269085","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269085","url":null,"abstract":"We present our system-level co-simulation environment for mixed domain microsystems. The environment provides synchronization and co-simulation between the Chatoyant MOEMS (micro-electro mechanical systems) simulator and ModelTech ModelSim. By using shared memory IPC (inter-process communication) and PDES (parallel discrete event simulation) techniques, we achieve two orders of magnitude speedup over standard pipe/socket communication.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121549218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An efficient on-chip network interface offering guaranteed services, shared-memory abstraction, and flexible network configuration 高效的片上网络接口,提供有保证的服务、共享内存抽象和灵活的网络配置
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268998
A. Radulescu, J. Dielissen, K. Goossens, E. Rijpkema, P. Wielage
In this paper we present a network interface for an on-chip network. Our network interface decouples computation from communication by offering a shared-memory abstraction, which is independent of the network implementation. We use a transaction-based protocol to achieve backward compatibility with existing bus protocols such as AXI, OCP and DTL. Our network interface has a modular architecture, which allows flexible instantiation. It provides both guaranteed and best-effort services via connections. These are configured via network interface ports using the network itself, instead of a separate control interconnect. An example instance of this network interface with 4 ports has an area of 0.143 mm/sup 2/ in a 0.13 /spl mu/m technology, and runs at 500 MHz.
本文提出了一种用于片上网络的网络接口。我们的网络接口通过提供一个独立于网络实现的共享内存抽象,将计算与通信解耦。我们使用基于事务的协议来实现与现有总线协议(如AXI、OCP和DTL)的向后兼容。我们的网络接口具有模块化架构,允许灵活的实例化。它通过连接提供有保证的和最努力的服务。这些都是通过使用网络本身的网络接口端口配置的,而不是单独的控制互连。在0.13 /spl mu/m技术下,该网络接口具有4个端口的示例实例的面积为0.143 mm/sup 2/,运行频率为500 MHz。
{"title":"An efficient on-chip network interface offering guaranteed services, shared-memory abstraction, and flexible network configuration","authors":"A. Radulescu, J. Dielissen, K. Goossens, E. Rijpkema, P. Wielage","doi":"10.1109/DATE.2004.1268998","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268998","url":null,"abstract":"In this paper we present a network interface for an on-chip network. Our network interface decouples computation from communication by offering a shared-memory abstraction, which is independent of the network implementation. We use a transaction-based protocol to achieve backward compatibility with existing bus protocols such as AXI, OCP and DTL. Our network interface has a modular architecture, which allows flexible instantiation. It provides both guaranteed and best-effort services via connections. These are configured via network interface ports using the network itself, instead of a separate control interconnect. An example instance of this network interface with 4 ports has an area of 0.143 mm/sup 2/ in a 0.13 /spl mu/m technology, and runs at 500 MHz.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122782746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 148
期刊
Proceedings Design, Automation and Test in Europe Conference and Exhibition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1