首页 > 最新文献

Proceedings Design, Automation and Test in Europe Conference and Exhibition最新文献

英文 中文
A configurable logic architecture for dynamic hardware/software partitioning 用于动态硬件/软件分区的可配置逻辑体系结构
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268892
Roman L. Lysecky, F. Vahid
In previous work, we showed the benefits and feasibility of having a processor dynamically partition its executing software such that critical software kernels are transparently partitioned to execute as a hardware coprocessor on configurable logic - an approach we call warp processing. The configurable logic place and route step is the most computationally intensive part of such hardware/software partitioning, normally running for many minutes or hours on powerful desktop processors. In contrast, dynamic partitioning requires place and route to execute in just seconds and on a lean embedded processor. We have therefore designed a configurable logic architecture specifically for dynamic hardware/software partitioning. Through experiments with popular benchmarks, we show that by specifically focusing on the goal of software kernel speedup when designing the FPGA architecture, rather than on the more general goal of ASIC prototyping, we can perform place and route for our architecture 50 times faster, using 10,000 times less data memory, and 1,000 times less code memory, than popular commercial tools mapping to commercial configurable logic. Yet, we show that we obtain speedups (2x on average, and as much as 4x) and energy savings (33% on average, and up to 74%) when partitioning even just one loop, which are comparable to commercial tools and fabrics. Thus, our configurable logic architecture represents a good candidate for platforms that will support dynamic hardware/software partitioning, and enables ultra-fast desktop tools for hardware/software partitioning, and even for fast configurable logic design in general.
在之前的工作中,我们展示了让处理器动态分区其正在执行的软件的好处和可行性,这样关键的软件内核就可以透明地分区,作为硬件协处理器在可配置逻辑上执行——我们将这种方法称为warp处理。可配置逻辑位置和路由步骤是此类硬件/软件分区中计算最密集的部分,通常在功能强大的桌面处理器上运行数分钟或数小时。相比之下,动态分区需要在几秒钟内在一个精简的嵌入式处理器上执行位置和路由。因此,我们设计了一个专门用于动态硬件/软件分区的可配置逻辑体系结构。通过流行的基准测试实验,我们表明,通过在设计FPGA架构时特别关注软件内核加速的目标,而不是更一般的ASIC原型目标,我们可以比流行的商业工具映射到商业可配置逻辑快50倍,使用少10000倍的数据内存和少1000倍的代码内存。然而,我们表明,即使只划分一个循环,我们也可以获得速度提升(平均2倍,最多4倍)和能源节约(平均33%,最高74%),这与商业工具和结构相当。因此,我们的可配置逻辑架构代表了支持动态硬件/软件分区的平台的一个很好的候选平台,并且支持用于硬件/软件分区的超快速桌面工具,甚至支持一般的快速可配置逻辑设计。
{"title":"A configurable logic architecture for dynamic hardware/software partitioning","authors":"Roman L. Lysecky, F. Vahid","doi":"10.1109/DATE.2004.1268892","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268892","url":null,"abstract":"In previous work, we showed the benefits and feasibility of having a processor dynamically partition its executing software such that critical software kernels are transparently partitioned to execute as a hardware coprocessor on configurable logic - an approach we call warp processing. The configurable logic place and route step is the most computationally intensive part of such hardware/software partitioning, normally running for many minutes or hours on powerful desktop processors. In contrast, dynamic partitioning requires place and route to execute in just seconds and on a lean embedded processor. We have therefore designed a configurable logic architecture specifically for dynamic hardware/software partitioning. Through experiments with popular benchmarks, we show that by specifically focusing on the goal of software kernel speedup when designing the FPGA architecture, rather than on the more general goal of ASIC prototyping, we can perform place and route for our architecture 50 times faster, using 10,000 times less data memory, and 1,000 times less code memory, than popular commercial tools mapping to commercial configurable logic. Yet, we show that we obtain speedups (2x on average, and as much as 4x) and energy savings (33% on average, and up to 74%) when partitioning even just one loop, which are comparable to commercial tools and fabrics. Thus, our configurable logic architecture represents a good candidate for platforms that will support dynamic hardware/software partitioning, and enables ultra-fast desktop tools for hardware/software partitioning, and even for fast configurable logic design in general.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134479535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 73
A scalable ODC-based algorithm for RTL insertion of gated clocks 一种可扩展的基于odc的门控时钟RTL插入算法
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268895
P. Babighian, L. Benini, E. Macii
This paper describes a new automatic clock-gating extraction working at the RT-level. The key features of our approach are: (i) seamless merging with existing industrial design flows and commercial tools; (ii) high scalability to deal with large circuits; (iii) improved quality of results with respect to available commercial tools; (iv) smaller and well-controlled overhead in speed and area. Experimental results, on a set of industrial RTL designs, demonstrate the viability and practical impact of our approach.
本文介绍了一种工作在rt级的自动时钟门控提取方法。我们方法的主要特点是:(i)与现有的工业设计流程和商业工具无缝融合;(ii)处理大型电路的高可扩展性;(iii)与现有商业工具相比,提高了结果的质量;(四)在速度和面积上的开销更小,控制得更好。在一组工业RTL设计上的实验结果证明了我们方法的可行性和实际影响。
{"title":"A scalable ODC-based algorithm for RTL insertion of gated clocks","authors":"P. Babighian, L. Benini, E. Macii","doi":"10.1109/DATE.2004.1268895","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268895","url":null,"abstract":"This paper describes a new automatic clock-gating extraction working at the RT-level. The key features of our approach are: (i) seamless merging with existing industrial design flows and commercial tools; (ii) high scalability to deal with large circuits; (iii) improved quality of results with respect to available commercial tools; (iv) smaller and well-controlled overhead in speed and area. Experimental results, on a set of industrial RTL designs, demonstrate the viability and practical impact of our approach.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131909033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
SCORE: SPICE compatible reluctance extraction SCORE: SPICE兼容磁阻萃取
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269014
Rong Jiang, C. C. Chen
Presently, a necessary modification to mainstream analysis tools prevents the direct application of reluctance k. In this paper, we propose a reluctance realization algorithm (RRA) by directly converting reluctances to circuit elements compatible with general simulation engines, such as SPICE. Reluctance realization is applicable to arbitrary circuit topology and no accuracy penalty is involved in the realization process. Since the stability of the converted circuit largely depends on the stability of the reluctance matrix, we present an efficient improved recursive bisection cutting algorithm (IRBCA) to obtain stability-guaranteed reluctance matrices, and integrate IRBCA and RRA into a SPICE compatible reluctance extraction tool, SCORE.
目前,主流分析工具的必要修改阻止了磁阻k的直接应用。在本文中,我们提出了一种磁阻实现算法(RRA),该算法将磁阻直接转换为与通用仿真引擎(如SPICE)兼容的电路元件。磁阻实现适用于任意电路拓扑结构,且在实现过程中不存在精度损失。由于转换电路的稳定性很大程度上取决于磁阻矩阵的稳定性,因此我们提出了一种高效的改进递推对分切割算法(IRBCA)来获得稳定保证的磁阻矩阵,并将IRBCA和RRA集成到SPICE兼容的磁阻提取工具SCORE中。
{"title":"SCORE: SPICE compatible reluctance extraction","authors":"Rong Jiang, C. C. Chen","doi":"10.1109/DATE.2004.1269014","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269014","url":null,"abstract":"Presently, a necessary modification to mainstream analysis tools prevents the direct application of reluctance k. In this paper, we propose a reluctance realization algorithm (RRA) by directly converting reluctances to circuit elements compatible with general simulation engines, such as SPICE. Reluctance realization is applicable to arbitrary circuit topology and no accuracy penalty is involved in the realization process. Since the stability of the converted circuit largely depends on the stability of the reluctance matrix, we present an efficient improved recursive bisection cutting algorithm (IRBCA) to obtain stability-guaranteed reluctance matrices, and integrate IRBCA and RRA into a SPICE compatible reluctance extraction tool, SCORE.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133774247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Value-conscious cache: simple technique for reducing cache access power 值感知缓存:降低缓存访问能力的简单技术
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268821
Yen-Jen Chang, Chia-Lin Yang, F. Lai
Most microprocessors employ the on-chip caches to bridge the performance gap between the processor and main memory. However, the cache accesses usually contribute significantly to the total power consumption of the chip. Based on the observation that an overwhelming majority of the cache access bits are '0', in this paper we propose a value-conscious (VC) cache to reduce the average cache power consumption during an access. Unlike the conventional cache with differential-bitline implementation, the VC cache is a single-bitline design. Depending on the access bit value, the VC cache can dynamically prevent the bitline from being discharged such that the power dissipated in accessing '0' is much less than the power dissipated in accessing '1'. The implementation of the VC cache is a circuit-level technique, which is software independent and orthogonal to other low power techniques at architecture-level. The experimental results based on the SPEC2000 and MediaBench traces show that without compromise of both performance and stability, by exploiting the prevalence of '0' bits in access data the VC cache can reduce the average cache read and write power by about 18%/spl sim/22% and 36%/spl sim/40%, respectively.
大多数微处理器采用片上缓存来弥补处理器和主存之间的性能差距。然而,缓存访问通常对芯片的总功耗有很大的贡献。基于对绝大多数缓存访问位为“0”的观察,在本文中,我们提出了一种值感知(VC)缓存,以减少访问期间的平均缓存功耗。与差分位线实现的传统缓存不同,VC缓存是单位线设计。根据访问位值的不同,VC缓存可以动态地防止位线被释放,使得访问“0”所消耗的功率远远小于访问“1”所消耗的功率。VC缓存的实现是一种电路级技术,它与软件无关,并且在体系结构级与其他低功耗技术正交。基于SPEC2000和mediabbench跟踪的实验结果表明,在不影响性能和稳定性的情况下,通过利用访问数据中普遍存在的“0”位,VC缓存可以将平均缓存读写功率分别降低约18%/spl sim/22%和36%/spl sim/40%。
{"title":"Value-conscious cache: simple technique for reducing cache access power","authors":"Yen-Jen Chang, Chia-Lin Yang, F. Lai","doi":"10.1109/DATE.2004.1268821","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268821","url":null,"abstract":"Most microprocessors employ the on-chip caches to bridge the performance gap between the processor and main memory. However, the cache accesses usually contribute significantly to the total power consumption of the chip. Based on the observation that an overwhelming majority of the cache access bits are '0', in this paper we propose a value-conscious (VC) cache to reduce the average cache power consumption during an access. Unlike the conventional cache with differential-bitline implementation, the VC cache is a single-bitline design. Depending on the access bit value, the VC cache can dynamically prevent the bitline from being discharged such that the power dissipated in accessing '0' is much less than the power dissipated in accessing '1'. The implementation of the VC cache is a circuit-level technique, which is software independent and orthogonal to other low power techniques at architecture-level. The experimental results based on the SPEC2000 and MediaBench traces show that without compromise of both performance and stability, by exploiting the prevalence of '0' bits in access data the VC cache can reduce the average cache read and write power by about 18%/spl sim/22% and 36%/spl sim/40%, respectively.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124166686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Synthesis of embedded systemC design: a case study of digital neural networks 嵌入式系统综合设计:以数字神经网络为例
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269239
D. Lettnin, A. Braun, M. Bogdan, J. Gerlach, W. Rosenstiel
This work presents the whole system-on-silicon design flow using systemC system specification language. In this study, systemC is used to design a multilayer perceptron neural network, which is applied to an electrocardiogram pattern recognition system. The objective of this work is to exemplify the synthesis of RTL-and behavioral integrated systems. To achieve this, a preprocessing methodology was used to optimize the three main constraints of hardware neural network (HNN) design: accuracy, space and processing speed. This allows a complex HNN to be implemented on a single field programmable gate array (FPGA). The high level systemC synthesis allows the straightforward translation of system level into hardware level, avoiding the error prone and the time consuming translation into another hardware description language.
本文用systemC系统规范语言描述了整个单片系统的设计流程。本研究利用systemC设计多层感知器神经网络,并将其应用于心电图模式识别系统。这项工作的目的是举例说明rtl和行为集成系统的综合。为了实现这一目标,采用预处理方法优化硬件神经网络(HNN)设计的三个主要约束:精度、空间和处理速度。这允许在单个现场可编程门阵列(FPGA)上实现复杂的HNN。高级systemC综合允许直接将系统级转换为硬件级,避免容易出错和费时的转换为另一种硬件描述语言。
{"title":"Synthesis of embedded systemC design: a case study of digital neural networks","authors":"D. Lettnin, A. Braun, M. Bogdan, J. Gerlach, W. Rosenstiel","doi":"10.1109/DATE.2004.1269239","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269239","url":null,"abstract":"This work presents the whole system-on-silicon design flow using systemC system specification language. In this study, systemC is used to design a multilayer perceptron neural network, which is applied to an electrocardiogram pattern recognition system. The objective of this work is to exemplify the synthesis of RTL-and behavioral integrated systems. To achieve this, a preprocessing methodology was used to optimize the three main constraints of hardware neural network (HNN) design: accuracy, space and processing speed. This allows a complex HNN to be implemented on a single field programmable gate array (FPGA). The high level systemC synthesis allows the straightforward translation of system level into hardware level, avoiding the error prone and the time consuming translation into another hardware description language.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124558769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Evaluation of an object-oriented hardware design methodology for automotive applications 评价面向对象的汽车硬件设计方法
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269247
N. Bannow, Karsten Haug
In this paper we present results in using the new object-oriented design approach OSSS (ODETTE system synthesis subset). The methodology and tools of the ODETTE (object-oriented co-design and functional test techniques) project have been developed within the context of the IST programme of the European Commission. Main focus of OSSS lies in the field of hardware design and in synthesis capability. The strategy is based on an extension of the synthesizable subset of standard systemC. The approach supports real object-oriented and synthesizable design features like classes, inheritance, templates, polymorphism and global object access. Therefore OSSS promises high efficiency in sense of capability to handle complex designs, faster development time, improved code quality and faster time to market. In contrast, standard systemC is also based on C++ constructs, but no object-oriented constructs are available yet for a synthesizable system description. We have evaluated OSSS on an automotive design example. It was chosen for the implementation of a component that is part of all video projects: A camera's exposure control unit (ExpoCU). The first main goal that was achieved is a synthesizable design by the automatic generation of an FPGA netlist from an OSSS description. Furthermore we have also proved the methodology to fulfill industrial requirements such as usability for complex system development, integration of existing IP, improved code quality and decreased development effort. Comparison will be done against existing VHDL based design flow. We especially focus on the implementation and testability by comparing the new object-oriented synthesis approach with a standard VHDL flow by laying emphasis on synthesizability. OSSS and equivalent kinds of methodology show a large potential to handle new generations of complex HW-SW systems. Moreover the gap between increasing design complexity and available methodologies already now gets bigger and bigger and thus needs to be closed by new solutions such as OSSS.
在本文中,我们介绍了使用新的面向对象设计方法OSSS (ODETTE系统合成子集)的结果。ODETTE(面向对象的共同设计和功能测试技术)项目的方法和工具是在欧洲委员会的技术创新方案范围内开发的。OSSS主要关注硬件设计和综合能力。该策略基于标准systemC的可合成子集的扩展。该方法支持真正的面向对象和可合成的设计特性,如类、继承、模板、多态性和全局对象访问。因此,OSSS承诺在处理复杂设计的能力、更快的开发时间、改进的代码质量和更快的上市时间方面具有高效率。相比之下,标准systemC也是基于c++构造的,但是还没有面向对象的构造可用于可合成的系统描述。我们以汽车设计为例对OSSS进行了评估。它被选中用于实现所有视频项目的组成部分:相机的曝光控制单元(ExpoCU)。实现的第一个主要目标是通过从OSSS描述自动生成FPGA网络列表来实现可合成设计。此外,我们还证明了该方法可以满足工业需求,如复杂系统开发的可用性、现有IP的集成、改进的代码质量和减少的开发工作量。将与现有的基于VHDL的设计流程进行比较。通过将新的面向对象的合成方法与标准的VHDL流程进行比较,强调了可合成性,重点关注了实现和可测试性。OSSS和类似的方法显示出处理新一代复杂HW-SW系统的巨大潜力。此外,不断增加的设计复杂性和可用方法之间的差距现在已经变得越来越大,因此需要通过新的解决方案(如OSSS)来缩小。
{"title":"Evaluation of an object-oriented hardware design methodology for automotive applications","authors":"N. Bannow, Karsten Haug","doi":"10.1109/DATE.2004.1269247","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269247","url":null,"abstract":"In this paper we present results in using the new object-oriented design approach OSSS (ODETTE system synthesis subset). The methodology and tools of the ODETTE (object-oriented co-design and functional test techniques) project have been developed within the context of the IST programme of the European Commission. Main focus of OSSS lies in the field of hardware design and in synthesis capability. The strategy is based on an extension of the synthesizable subset of standard systemC. The approach supports real object-oriented and synthesizable design features like classes, inheritance, templates, polymorphism and global object access. Therefore OSSS promises high efficiency in sense of capability to handle complex designs, faster development time, improved code quality and faster time to market. In contrast, standard systemC is also based on C++ constructs, but no object-oriented constructs are available yet for a synthesizable system description. We have evaluated OSSS on an automotive design example. It was chosen for the implementation of a component that is part of all video projects: A camera's exposure control unit (ExpoCU). The first main goal that was achieved is a synthesizable design by the automatic generation of an FPGA netlist from an OSSS description. Furthermore we have also proved the methodology to fulfill industrial requirements such as usability for complex system development, integration of existing IP, improved code quality and decreased development effort. Comparison will be done against existing VHDL based design flow. We especially focus on the implementation and testability by comparing the new object-oriented synthesis approach with a standard VHDL flow by laying emphasis on synthesizability. OSSS and equivalent kinds of methodology show a large potential to handle new generations of complex HW-SW systems. Moreover the gap between increasing design complexity and available methodologies already now gets bigger and bigger and thus needs to be closed by new solutions such as OSSS.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114728860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Formal verification coverage: are the RTL-properties covering the design's architectural intent? 正式验证覆盖范围:rtl属性是否覆盖了设计的架构意图?
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268922
P. Basu, Sayantan Das, P. Dasgupta, P. Chakrabarti, C. Mohan, L. Fix
It is essential to formally ascertain whether the RTL validation effort effectively guarantees the correctness with respect to the design's architectural intent. The design's architectural intent can be expressed in formal properties. However, due to the capacity limitation of formal verification, these architectural-properties cannot be directly verified on the RTL. As a result, a set of lower level RTL-properties are developed and verified against the RTL. In this paper we present: (1) a method for checking whether the RTL-properties are covering the architectural-properties, that is, whether verifying the RTL-properties guarantee the correctness of the design's architectural intent; and (2) a method to identify the coverage holes in terms of the architectural properties (or their sub-properties) that are not covered.
必须正式确定RTL验证工作是否有效地保证了设计的体系结构意图的正确性。设计的建筑意图可以用形式属性来表达。然而,由于形式验证的能力限制,这些架构属性不能直接在RTL上进行验证。因此,开发了一组较低级的RTL属性,并针对RTL进行了验证。本文提出:(1)一种检查rtl属性是否覆盖了体系结构属性的方法,即验证rtl属性是否保证了设计的体系结构意图的正确性;(2)根据未被覆盖的体系结构属性(或其子属性)识别覆盖漏洞的方法。
{"title":"Formal verification coverage: are the RTL-properties covering the design's architectural intent?","authors":"P. Basu, Sayantan Das, P. Dasgupta, P. Chakrabarti, C. Mohan, L. Fix","doi":"10.1109/DATE.2004.1268922","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268922","url":null,"abstract":"It is essential to formally ascertain whether the RTL validation effort effectively guarantees the correctness with respect to the design's architectural intent. The design's architectural intent can be expressed in formal properties. However, due to the capacity limitation of formal verification, these architectural-properties cannot be directly verified on the RTL. As a result, a set of lower level RTL-properties are developed and verified against the RTL. In this paper we present: (1) a method for checking whether the RTL-properties are covering the architectural-properties, that is, whether verifying the RTL-properties guarantee the correctness of the design's architectural intent; and (2) a method to identify the coverage holes in terms of the architectural properties (or their sub-properties) that are not covered.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114938629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Sensitivity-based modeling and methodology for full-chip substrate noise analysis 基于灵敏度的全芯片衬底噪声分析建模和方法
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268912
R. Murgai, S. Reddy, T. Miyoshi, T. Horie, M. Tahoori
Substrate noise (SN) is an important problem in mixed-signal designs. With increasing design complexity, it is not possible to simulate for SN with a detailed SPICE model that uses an accurate model for each transistor. In this paper, we propose a sensitivity analysis- and static timing analysis-based methodology to derive a reduced model that computes the worst case substrate noise in the design. The reduced model contains only passive components, which are very few, and is very quick to simulate. The main feature of our methodology is that, unlike previous approaches, it is independent of input patterns and does not need to simulate for millions of clock cycles. This lets us apply it to a full-chip design in reasonable CPU time. We validate our reduced model on several benchmark circuits against a detailed and highly accurate reference model. On average, the reduced model is within 16.4% of the reference model and is up to 38 times faster. Finally, we apply our methodology to a mixed-signal switch chip design consisting of 8 million gates and show that it finishes in 17 minutes.
基片噪声是混合信号设计中的一个重要问题。随着设计复杂性的增加,不可能使用详细的SPICE模型对每个晶体管使用精确的模型来模拟SN。在本文中,我们提出了一种基于灵敏度分析和静态时序分析的方法,以推导出计算设计中最坏情况下衬底噪声的简化模型。简化后的模型只包含很少的无源元件,而且仿真速度很快。我们的方法的主要特点是,与以前的方法不同,它独立于输入模式,不需要模拟数百万个时钟周期。这使我们能够在合理的CPU时间内将其应用于全芯片设计。我们在几个基准电路上对一个详细和高度精确的参考模型验证了我们的简化模型。平均而言,简化后的模型与参考模型的误差在16.4%以内,速度提高了38倍。最后,我们将我们的方法应用于由800万个门组成的混合信号开关芯片设计,并表明它在17分钟内完成。
{"title":"Sensitivity-based modeling and methodology for full-chip substrate noise analysis","authors":"R. Murgai, S. Reddy, T. Miyoshi, T. Horie, M. Tahoori","doi":"10.1109/DATE.2004.1268912","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268912","url":null,"abstract":"Substrate noise (SN) is an important problem in mixed-signal designs. With increasing design complexity, it is not possible to simulate for SN with a detailed SPICE model that uses an accurate model for each transistor. In this paper, we propose a sensitivity analysis- and static timing analysis-based methodology to derive a reduced model that computes the worst case substrate noise in the design. The reduced model contains only passive components, which are very few, and is very quick to simulate. The main feature of our methodology is that, unlike previous approaches, it is independent of input patterns and does not need to simulate for millions of clock cycles. This lets us apply it to a full-chip design in reasonable CPU time. We validate our reduced model on several benchmark circuits against a detailed and highly accurate reference model. On average, the reduced model is within 16.4% of the reference model and is up to 38 times faster. Finally, we apply our methodology to a mixed-signal switch chip design consisting of 8 million gates and show that it finishes in 17 minutes.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123639239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Platform based on open-source cores for industrial applications 基于开源核心的工业应用平台
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269026
M. Bolado, H. Posadas, J. Castillo, P. Huerta, P. Sánchez, C. Sánchez, H. Fouren, Francisco Blasco
The latest version of the international technology roadmap for semiconductors predicts that design reuse will be essential in the near future to face the constantly increasing design complexity. The concept comes from software engineering in which reuse is a fundamental technology. In order to provide libraries and applications to reuse in software development, some open-source initiatives (e.g. Linux, gcc, X, mysql) have appeared during the last decades. The basic idea is to distribute the library or application source code (normally free-of-charge) and allow any developer to use, modify, debug and improve it. Several initiatives have tried to port this idea to hardware development. The main goal of this paper is to develop a synthesizable platform described in SystemC from an open architecture. The platform includes a CPU (OpenRISC) and some basic peripherals. A set of software development tools (compiler, assembler, debugger) and RTOS (eCos) has also been developed. This work enables the evaluation of the advantages and disadvantages of the open-source model in electronic system design.
最新版国际半导体技术路线图预测,在不久的将来,面对不断增加的设计复杂性,设计重用将是必不可少的。这个概念来自软件工程,其中重用是一项基本技术。为了提供可在软件开发中重用的库和应用程序,在过去几十年中出现了一些开源计划(例如Linux、gcc、X、mysql)。其基本思想是分发库或应用程序源代码(通常是免费的),并允许任何开发人员使用、修改、调试和改进它。有几个项目试图将这个想法移植到硬件开发中。本文的主要目标是在一个开放的体系结构上开发一个用SystemC描述的可合成平台。该平台包括一个CPU (OpenRISC)和一些基本外设。开发了一套软件开发工具(编译器、汇编器、调试器)和实时操作系统(eCos)。这项工作使得评估开放源代码模型在电子系统设计中的优缺点成为可能。
{"title":"Platform based on open-source cores for industrial applications","authors":"M. Bolado, H. Posadas, J. Castillo, P. Huerta, P. Sánchez, C. Sánchez, H. Fouren, Francisco Blasco","doi":"10.1109/DATE.2004.1269026","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269026","url":null,"abstract":"The latest version of the international technology roadmap for semiconductors predicts that design reuse will be essential in the near future to face the constantly increasing design complexity. The concept comes from software engineering in which reuse is a fundamental technology. In order to provide libraries and applications to reuse in software development, some open-source initiatives (e.g. Linux, gcc, X, mysql) have appeared during the last decades. The basic idea is to distribute the library or application source code (normally free-of-charge) and allow any developer to use, modify, debug and improve it. Several initiatives have tried to port this idea to hardware development. The main goal of this paper is to develop a synthesizable platform described in SystemC from an open architecture. The platform includes a CPU (OpenRISC) and some basic peripherals. A set of software development tools (compiler, assembler, debugger) and RTOS (eCos) has also been developed. This work enables the evaluation of the advantages and disadvantages of the open-source model in electronic system design.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122033963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
An interconnect channel design methodology for high performance integrated circuits 高性能集成电路互连通道设计方法
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269045
V. Chandra, A. Xu, H. Schmit, L. Pileggi
On-chip communication is becoming a bottleneck for high performance designs. Conventional interconnect design methodology does not account for architectures and/or communication schemes that require storage buffers (first-in-first-out queues or FIFOs) in the interconnect channel. For example, FIFOs and flow-control are needed for Network-on-Chip, high performance ASICs and multiple clock domain designs. These IC implementation architectures require an efficient methodology to determine the size of the FIFOs in the channel since the FIFO sizes affect system performance. In this work we devised a methodology to size the FIFOs in an interconnect channel containing one or more FIFOs connected in series. We show that the sizing of the FIFOs in the channel is a function of system parameters such as data production rate and consumption rate, data burstiness, number of channel stages etc. and we also quantify their effect on performance. For a single clock design, we have developed an efficient algorithm which reduces the search space for the optimal sizing of the FIFOs in the channel.
片上通信正成为高性能设计的瓶颈。传统的互连设计方法没有考虑到互连通道中需要存储缓冲区(先进先出队列或fifo)的架构和/或通信方案。例如,fifo和流量控制是片上网络、高性能asic和多时钟域设计所需要的。这些IC实现架构需要一种有效的方法来确定通道中FIFO的大小,因为FIFO的大小会影响系统性能。在这项工作中,我们设计了一种方法来确定包含一个或多个串行连接的fifo的互连通道中的fifo的大小。我们表明,通道中fifo的大小是系统参数的函数,如数据产生速率和消耗率,数据突发性,通道级数等,我们还量化了它们对性能的影响。对于单时钟设计,我们开发了一种有效的算法,该算法减少了通道中fifo的最佳大小的搜索空间。
{"title":"An interconnect channel design methodology for high performance integrated circuits","authors":"V. Chandra, A. Xu, H. Schmit, L. Pileggi","doi":"10.1109/DATE.2004.1269045","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269045","url":null,"abstract":"On-chip communication is becoming a bottleneck for high performance designs. Conventional interconnect design methodology does not account for architectures and/or communication schemes that require storage buffers (first-in-first-out queues or FIFOs) in the interconnect channel. For example, FIFOs and flow-control are needed for Network-on-Chip, high performance ASICs and multiple clock domain designs. These IC implementation architectures require an efficient methodology to determine the size of the FIFOs in the channel since the FIFO sizes affect system performance. In this work we devised a methodology to size the FIFOs in an interconnect channel containing one or more FIFOs connected in series. We show that the sizing of the FIFOs in the channel is a function of system parameters such as data production rate and consumption rate, data burstiness, number of channel stages etc. and we also quantify their effect on performance. For a single clock design, we have developed an efficient algorithm which reduces the search space for the optimal sizing of the FIFOs in the channel.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124724905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
期刊
Proceedings Design, Automation and Test in Europe Conference and Exhibition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1