首页 > 最新文献

Proceedings Design, Automation and Test in Europe Conference and Exhibition最新文献

英文 中文
Using BDDs and ZBDDs for efficient identification of testable path delay faults 利用bdd和zbdd对可测试路径延迟故障进行有效识别
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268826
Saravanan Padmanaban, S. Tragoudas
We present a novel framework to identify all the robustly testable and untestable path delay faults in a circuit. The method uses a combination of decision diagrams for manipulating path delay faults and Boolean functions. The approach benefits from processing partial paths or fanout free segments in the circuit rather than the entire path. The effectiveness of the proposed framework is demonstrated experimentally. It is observed that the methodology identifies 350% more testable faults in the ISCAS'85 benchmark C6288 than any existing technique by utilizing only a fraction of the time compared to earlier work.
我们提出了一种新的框架来识别电路中所有鲁棒可测试和不可测试的路径延迟故障。该方法使用决策图和布尔函数的组合来处理路径延迟故障。该方法的优点是处理电路中的部分路径或无扇出段,而不是处理整个路径。实验证明了该框架的有效性。可以观察到,该方法在ISCAS'85基准C6288中识别的可测试故障比任何现有技术多350%,与早期工作相比,只利用了一小部分时间。
{"title":"Using BDDs and ZBDDs for efficient identification of testable path delay faults","authors":"Saravanan Padmanaban, S. Tragoudas","doi":"10.1109/DATE.2004.1268826","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268826","url":null,"abstract":"We present a novel framework to identify all the robustly testable and untestable path delay faults in a circuit. The method uses a combination of decision diagrams for manipulating path delay faults and Boolean functions. The approach benefits from processing partial paths or fanout free segments in the circuit rather than the entire path. The effectiveness of the proposed framework is demonstrated experimentally. It is observed that the methodology identifies 350% more testable faults in the ISCAS'85 benchmark C6288 than any existing technique by utilizing only a fraction of the time compared to earlier work.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124576231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A low power strategy for future mobile terminals 未来移动终端的低功耗策略
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268938
Mladen Nikitovic, M. Brorsson
In this paper, we have investigated the efficiency of two power-saving strategies that reduces both static and dynamic power consumption when applied to a chip-multiprocessor (CMP). They are evaluated under two workload scenarios and compared against a conventional uni-processor architecture and a CMP without any power-aware scheduling. The results show that energy due to static and dynamic power consumption can be reduced by up to 78% and that further 8% energy can be saved at the expense of response-time of non-critical applications. Furthermore, a small study on the potential impact of system-level events showed that system calls can contribute significantly to the total energy consumed.
在本文中,我们研究了两种节能策略在应用于芯片多处理器(CMP)时降低静态和动态功耗的效率。在两种工作负载场景下对它们进行评估,并与传统的单处理器架构和没有任何功耗感知调度的CMP进行比较。结果表明,由于静态和动态功耗的能量可以减少高达78%,并且可以在牺牲非关键应用的响应时间的情况下进一步节省8%的能量。此外,一项关于系统级事件潜在影响的小型研究表明,系统调用对消耗的总能量有很大贡献。
{"title":"A low power strategy for future mobile terminals","authors":"Mladen Nikitovic, M. Brorsson","doi":"10.1109/DATE.2004.1268938","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268938","url":null,"abstract":"In this paper, we have investigated the efficiency of two power-saving strategies that reduces both static and dynamic power consumption when applied to a chip-multiprocessor (CMP). They are evaluated under two workload scenarios and compared against a conventional uni-processor architecture and a CMP without any power-aware scheduling. The results show that energy due to static and dynamic power consumption can be reduced by up to 78% and that further 8% energy can be saved at the expense of response-time of non-critical applications. Furthermore, a small study on the potential impact of system-level events showed that system calls can contribute significantly to the total energy consumed.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132294319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Tuning in-sensor data filtering to reduce energy consumption in wireless sensor networks 调整传感器内数据滤波以降低无线传感器网络的能耗
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268992
I. Kadayif, M. Kandemir
In recent years, research on wireless sensor networks has been undergoing a revolution, promising to have significant impact on a broad range of applications from military to health care to food safety. An important problem in many sensor network applications is to decide the amount of computation (or filtering) that needs to be done in the sensor nodes before the data are shifted to a central base station. Right amount of data filtering in the sensor nodes can lead to large savings in network-wide energy consumption. The main goal of this paper is to develop an automated strategy for data filtering in wireless sensor nodes. Assuming that one needs to reduce the overall energy consumption (as opposed to reducing just computation energy or communication energy), the proposed strategy attempts to strike a balance between computation energy consumption and communication energy consumption. Our experimental results clearly indicate that the proposed data filtering strategy generates substantial energy savings in practice.
近年来,无线传感器网络的研究正在经历一场革命,有望对从军事到医疗保健到食品安全的广泛应用产生重大影响。在许多传感器网络应用中,一个重要的问题是确定在数据转移到中心基站之前需要在传感器节点中进行的计算量(或滤波)。在传感器节点中进行适量的数据过滤可以大大节省整个网络的能耗。本文的主要目标是开发一种无线传感器节点数据过滤的自动化策略。假设需要减少总体能耗(而不是仅仅减少计算能耗或通信能耗),所提出的策略试图在计算能耗和通信能耗之间取得平衡。我们的实验结果清楚地表明,所提出的数据过滤策略在实践中产生了大量的能源节约。
{"title":"Tuning in-sensor data filtering to reduce energy consumption in wireless sensor networks","authors":"I. Kadayif, M. Kandemir","doi":"10.1109/DATE.2004.1268992","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268992","url":null,"abstract":"In recent years, research on wireless sensor networks has been undergoing a revolution, promising to have significant impact on a broad range of applications from military to health care to food safety. An important problem in many sensor network applications is to decide the amount of computation (or filtering) that needs to be done in the sensor nodes before the data are shifted to a central base station. Right amount of data filtering in the sensor nodes can lead to large savings in network-wide energy consumption. The main goal of this paper is to develop an automated strategy for data filtering in wireless sensor nodes. Assuming that one needs to reduce the overall energy consumption (as opposed to reducing just computation energy or communication energy), the proposed strategy attempts to strike a balance between computation energy consumption and communication energy consumption. Our experimental results clearly indicate that the proposed data filtering strategy generates substantial energy savings in practice.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"200 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133380751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Scan power minimization through stimulus and response transformations 扫描功率最小化通过刺激和响应转换
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268880
O. Sinanoglu, A. Orailoglu
Scan-based cores impose considerable test power challenges due to excessive switching activity during shift cycles. The consequent test power constraints force SOC designers to sacrifice parallelism among core tests, as exceeding power thresholds may damage the chip being tested. Reduction of test power for SOC cores can thus increase the number of cores that can be tested in parallel, improving significantly SOC test application time. In this paper, we propose a scan chain modification technique that inserts logic gates on the scan path. The consequent beneficial test data transformations are utilized to reduce the scan chain transitions during shift cycles and hence test power. We introduce a matrix band algebra that models the impact of logic gate insertion between scan cells on the test stimulus and response transformations realized. As we have successfully modeled the response transformations as well, the methodology we propose is capable of truly minimizing the overall test power. The test vectors and responses are analyzed in an intertwined manner, identifying the best possible scan chain modification, which is realized at minimal area cost. Experimental results justify the efficacy of the proposed methodology as well.
由于在移位周期中过度的开关活动,基于扫描的核心对测试功率提出了相当大的挑战。随后的测试功率限制迫使SOC设计人员牺牲核心测试之间的并行性,因为超过功率阈值可能会损坏正在测试的芯片。因此,降低SOC核心的测试功率可以增加可以并行测试的核心数量,从而显着提高SOC测试应用时间。在本文中,我们提出了一种在扫描路径上插入逻辑门的扫描链修改技术。利用由此产生的有益的测试数据转换来减少移位周期期间的扫描链转换,从而减少测试功率。我们引入了一个矩阵带代数来模拟扫描单元间逻辑门插入对测试刺激和响应转换的影响。由于我们已经成功地对响应转换进行了建模,因此我们提出的方法能够真正地最小化总体测试功率。测试向量和响应以一种交织的方式进行分析,确定以最小面积成本实现的最佳扫描链修改。实验结果也证明了该方法的有效性。
{"title":"Scan power minimization through stimulus and response transformations","authors":"O. Sinanoglu, A. Orailoglu","doi":"10.1109/DATE.2004.1268880","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268880","url":null,"abstract":"Scan-based cores impose considerable test power challenges due to excessive switching activity during shift cycles. The consequent test power constraints force SOC designers to sacrifice parallelism among core tests, as exceeding power thresholds may damage the chip being tested. Reduction of test power for SOC cores can thus increase the number of cores that can be tested in parallel, improving significantly SOC test application time. In this paper, we propose a scan chain modification technique that inserts logic gates on the scan path. The consequent beneficial test data transformations are utilized to reduce the scan chain transitions during shift cycles and hence test power. We introduce a matrix band algebra that models the impact of logic gate insertion between scan cells on the test stimulus and response transformations realized. As we have successfully modeled the response transformations as well, the methodology we propose is capable of truly minimizing the overall test power. The test vectors and responses are analyzed in an intertwined manner, identifying the best possible scan chain modification, which is realized at minimal area cost. Experimental results justify the efficacy of the proposed methodology as well.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133606543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
A new approach to timing analysis using event propagation and temporal logic 一种利用事件传播和时间逻辑进行时间分析的新方法
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269055
Arijit Mondal, P. Chakrabarti, C. Mandal
Present day designers require deep reasoning methods to analyze circuit timing. This includes analysis of effects of dynamic behavior (like glitches) on critical paths, simultaneous switching and identification of specific patterns and their timings. This paper proposes a novel approach that uses a combination of symbolic event propagation and temporal reasoning to extract timing properties of gate-level circuits. The formulation captures complex situations like triggering of traditional false paths and simultaneous switching in a unified symbolic representation in addition to identifying false paths, critical paths as well as conditions for such situations. This information is then represented as an event-time graph. A simple temporal logic on events is proposed that can be used to formulate a wide class of useful queries for various input scenarios. These include maximum/minimum delays, transition times, duration of patterns, etc. An algorithm is developed that retrieves answers to such queries from the event-time graph. A complete BDD based implementation of this system has been made. Results on the ISCAS85 benchmarks indicate very interesting properties of these circuits.
现在的设计人员需要深入的推理方法来分析电路时序。这包括分析关键路径上动态行为(如故障)的影响,同时切换和识别特定模式及其时间。本文提出了一种结合符号事件传播和时间推理提取门级电路时序特性的新方法。该公式在统一的符号表示中捕获了传统假路径触发和同时切换等复杂情况,并识别了假路径、关键路径以及这些情况的条件。然后将此信息表示为事件时间图。提出了一种简单的事件时态逻辑,可用于为各种输入场景制定各种有用的查询。这些包括最大/最小延迟、转换时间、模式持续时间等。开发了一种算法,从事件时间图中检索此类查询的答案。基于BDD的系统实现已经完成。ISCAS85基准测试的结果显示了这些电路非常有趣的特性。
{"title":"A new approach to timing analysis using event propagation and temporal logic","authors":"Arijit Mondal, P. Chakrabarti, C. Mandal","doi":"10.1109/DATE.2004.1269055","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269055","url":null,"abstract":"Present day designers require deep reasoning methods to analyze circuit timing. This includes analysis of effects of dynamic behavior (like glitches) on critical paths, simultaneous switching and identification of specific patterns and their timings. This paper proposes a novel approach that uses a combination of symbolic event propagation and temporal reasoning to extract timing properties of gate-level circuits. The formulation captures complex situations like triggering of traditional false paths and simultaneous switching in a unified symbolic representation in addition to identifying false paths, critical paths as well as conditions for such situations. This information is then represented as an event-time graph. A simple temporal logic on events is proposed that can be used to formulate a wide class of useful queries for various input scenarios. These include maximum/minimum delays, transition times, duration of patterns, etc. An algorithm is developed that retrieves answers to such queries from the event-time graph. A complete BDD based implementation of this system has been made. Results on the ISCAS85 benchmarks indicate very interesting properties of these circuits.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133773454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
/spl times/pipesCompiler: a tool for instantiating application specific networks on chip /spl times/pipesCompiler:一个在芯片上实例化应用特定网络的工具
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268999
A. Jalabert, S. Murali, L. Benini, G. Micheli
Future systems on chips (SoCs) will integrate a large number of processor and storage cores onto a single chip and require networks on chip (NoC) to support the heavy communication demands of the system. The individual components of the SoCs will be heterogeneous in nature with widely varying functionality and communication requirements. The communication infrastructure should optimally match communication patterns among these components accounting for the individual component needs. In this paper we present /spl times/pipesCompiler, a tool for automatically instantiating an application-specific NoC for heterogeneous multi-processor SoCs. The /spl times/pipesCompiler instantiates a network of building blocks from a library of composable soft macros (switches, network interfaces and links) described in SystemC at the cycle-accurate level. The network components are optimized for that particular network and support reliable, latency-insensitive operation. Example systems with application-specific NoCs built using the /spl times/pipesCompiler show large savings in area (factor of 6.5), power (factor of 2.4) and latency (factor of 1.42) when compared to a general-purpose mesh-based NoC architecture.
未来的片上系统(soc)将把大量的处理器和存储核心集成到单个芯片上,并且需要片上网络(NoC)来支持系统的繁重通信需求。soc的各个组件本质上是异构的,具有广泛不同的功能和通信需求。通信基础设施应该最优地匹配这些组件之间的通信模式,以满足各个组件的需求。在本文中,我们介绍了/spl times/pipesCompiler,这是一个为异构多处理器soc自动实例化特定于应用程序的NoC的工具。/spl times/pipesCompiler从SystemC中描述的可组合软宏(开关、网络接口和链接)库中实例化一个构建块网络,在周期精确级别上进行描述。网络组件针对特定的网络进行了优化,并支持可靠的、对延迟不敏感的操作。与基于网格的通用NoC架构相比,使用/spl times/pipesCompiler构建的具有特定应用程序NoC的示例系统在面积(6.5倍)、功耗(2.4倍)和延迟(1.42倍)方面节省了很多。
{"title":"/spl times/pipesCompiler: a tool for instantiating application specific networks on chip","authors":"A. Jalabert, S. Murali, L. Benini, G. Micheli","doi":"10.1109/DATE.2004.1268999","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268999","url":null,"abstract":"Future systems on chips (SoCs) will integrate a large number of processor and storage cores onto a single chip and require networks on chip (NoC) to support the heavy communication demands of the system. The individual components of the SoCs will be heterogeneous in nature with widely varying functionality and communication requirements. The communication infrastructure should optimally match communication patterns among these components accounting for the individual component needs. In this paper we present /spl times/pipesCompiler, a tool for automatically instantiating an application-specific NoC for heterogeneous multi-processor SoCs. The /spl times/pipesCompiler instantiates a network of building blocks from a library of composable soft macros (switches, network interfaces and links) described in SystemC at the cycle-accurate level. The network components are optimized for that particular network and support reliable, latency-insensitive operation. Example systems with application-specific NoCs built using the /spl times/pipesCompiler show large savings in area (factor of 6.5), power (factor of 2.4) and latency (factor of 1.42) when compared to a general-purpose mesh-based NoC architecture.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116620063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 243
System-level performance analysis in SystemC SystemC中的系统级性能分析
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268876
H. Posadas, F. Herrera, P. Sánchez, E. Villar, Francisco Blasco
As both the ITRS and the Medea+ DA Roadmaps have highlighted, early performance estimation is an essential step in any SoC design methodology based on International Technology Roadmap for Semiconductors (2001) and The MEDEA+ Design Automation Roadmap (2002). This paper presents a C++ library for timing estimation at system level. The library is based on a general and systematic methodology that takes as input the original SystemC source code without any modification and provides the estimation parameters by simply including the library within a usual simulation. As a consequence, the same models of computation used during system design are preserved and all simulation conditions are maintained. The method exploits the advantages of dynamic analysis, that is, easy management of unpredictable data-dependent conditions and computational efficiency compared with other alternatives (ISS or RT simulation, without the need for SW generation and compilation and HW synthesis). Results obtained on several examples show the accuracy of the method. In addition to the fundamental parameters needed for system-level design exploration, the proposed methodology allows the designer to include capture points at any place in the code. The user can process the corresponding captured events for unrestricted timing constraint verification.
正如ITRS和Medea+ DA路线图所强调的那样,基于国际半导体技术路线图(2001年)和Medea+设计自动化路线图(2002年),早期性能评估是任何SoC设计方法的重要步骤。本文提出了一个用于系统级时序估计的c++库。该库基于一种通用的系统方法,该方法将原始SystemC源代码作为输入,而不进行任何修改,并通过简单地将库包含在通常的模拟中来提供估计参数。因此,保留了系统设计期间使用的相同计算模型,并保持了所有仿真条件。该方法利用了动态分析的优点,即与其他替代方法(ISS或RT模拟,不需要生成和编译软件以及硬件合成)相比,易于管理不可预测的数据依赖条件和计算效率。算例表明了该方法的准确性。除了系统级设计探索所需的基本参数之外,所建议的方法还允许设计人员在代码中的任何位置包含捕获点。用户可以处理相应的捕获事件,以进行不受限制的时间约束验证。
{"title":"System-level performance analysis in SystemC","authors":"H. Posadas, F. Herrera, P. Sánchez, E. Villar, Francisco Blasco","doi":"10.1109/DATE.2004.1268876","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268876","url":null,"abstract":"As both the ITRS and the Medea+ DA Roadmaps have highlighted, early performance estimation is an essential step in any SoC design methodology based on International Technology Roadmap for Semiconductors (2001) and The MEDEA+ Design Automation Roadmap (2002). This paper presents a C++ library for timing estimation at system level. The library is based on a general and systematic methodology that takes as input the original SystemC source code without any modification and provides the estimation parameters by simply including the library within a usual simulation. As a consequence, the same models of computation used during system design are preserved and all simulation conditions are maintained. The method exploits the advantages of dynamic analysis, that is, easy management of unpredictable data-dependent conditions and computational efficiency compared with other alternatives (ISS or RT simulation, without the need for SW generation and compilation and HW synthesis). Results obtained on several examples show the accuracy of the method. In addition to the fundamental parameters needed for system-level design exploration, the proposed methodology allows the designer to include capture points at any place in the code. The user can process the corresponding captured events for unrestricted timing constraint verification.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117005244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
Efficient implementations of mobile video computations on domain-specific reconfigurable arrays 针对特定领域可重构阵列的移动视频计算的高效实现
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269064
S. Khawam, S. Baloch, A. Pai, Imran Ahmed, N. Aydin, T. Arslan, F. Westall
Mobile video processing as defined in standards like MPEG-4 and H.263 contains a number of timeconsuming computations that cannot be efficiently executed on current hardware architectures. The authors recently introduced a reconfigurable SoC platform that permits a low-power, high-throughput and flexible implementation of the motion estimation and DCT algorithms. The computations are done using domainspecific reconfigurable arrays that have demonstrated up to 75% reduction in power consumption when compared to generic FPGA architecture, which makes them suitable for portable devices. This paper presents and compares different configurations of the arrays to efficiently implementing DCT and motion estimation algorithms. A number of algorithms are mapped into the various reconfigurable fabrics demonstrating the flexibility of the new reconfigurable SoC architecture and its ability to support a number of implementations having different performance characteristics.
在MPEG-4和H.263等标准中定义的移动视频处理包含大量耗时的计算,在当前的硬件架构上无法有效执行。作者最近推出了一种可重构的SoC平台,该平台允许低功耗,高吞吐量和灵活地实现运动估计和DCT算法。计算是使用特定领域的可重构阵列完成的,与通用FPGA架构相比,该阵列的功耗降低了75%,这使得它们适用于便携式设备。本文提出并比较了阵列的不同配置,以有效地实现DCT和运动估计算法。许多算法被映射到各种可重构结构中,展示了新的可重构SoC架构的灵活性,以及它支持具有不同性能特征的许多实现的能力。
{"title":"Efficient implementations of mobile video computations on domain-specific reconfigurable arrays","authors":"S. Khawam, S. Baloch, A. Pai, Imran Ahmed, N. Aydin, T. Arslan, F. Westall","doi":"10.1109/DATE.2004.1269064","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269064","url":null,"abstract":"Mobile video processing as defined in standards like MPEG-4 and H.263 contains a number of timeconsuming computations that cannot be efficiently executed on current hardware architectures. The authors recently introduced a reconfigurable SoC platform that permits a low-power, high-throughput and flexible implementation of the motion estimation and DCT algorithms. The computations are done using domainspecific reconfigurable arrays that have demonstrated up to 75% reduction in power consumption when compared to generic FPGA architecture, which makes them suitable for portable devices. This paper presents and compares different configurations of the arrays to efficiently implementing DCT and motion estimation algorithms. A number of algorithms are mapped into the various reconfigurable fabrics demonstrating the flexibility of the new reconfigurable SoC architecture and its ability to support a number of implementations having different performance characteristics.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114711740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A new optimized implementation of the SystemC engine using acyclic scheduling 使用无循环调度的SystemC引擎的新优化实现
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268903
D. G. Pérez, Gilles Mouchard, O. Temam
SystemC is rapidly gaining wide acceptance as a simulation framework for SoC and embedded processors. While its main assets are modularity and the very fact it is becoming a de facto standard, the evolution of the SystemC framework (from version 0.9 to version 2.0.1) suggests the environment is particularly geared toward increasing the framework functionalities rather than improving simulation speed. For cycle-level simulation, speed is a critical factor as simulation can be extremely slow, affecting the extent of design space exploration. In this article, we present a fast SystemC engine that, in our experience, can speed up simulations by a factor of 1.93 to 3.56 over SystemC 2.0.1. This SystemC engine is designed for cycle-level simulators and for the moment, it only supports the subset of the SystemC syntax (signals, methods) that is most often used for such simulators. We achieved greater speed (1) by completely rewriting the SystemC engine and improving the implementation software engineering, and (2) by proposing a new scheduling technique, intermediate between SystemC dynamic scheduling technique and existing static scheduling schemes. Unlike SystemC dynamic scheduling, our technique removes many if not all useless process wake-ups, while using a simpler scheduling algorithm than in existing static scheduling techniques.
作为SoC和嵌入式处理器的仿真框架,SystemC正迅速获得广泛的接受。虽然它的主要优点是模块化,而且它正在成为事实上的标准,但SystemC框架的演变(从0.9版本到2.0.1版本)表明,环境特别倾向于增加框架功能,而不是提高模拟速度。对于循环级仿真,速度是一个关键因素,因为仿真可能非常缓慢,影响设计空间探索的程度。在本文中,我们介绍了一个快速的SystemC引擎,根据我们的经验,它可以将模拟速度提高到SystemC 2.0.1的1.93到3.56倍。这个SystemC引擎是为循环级模拟器设计的,目前,它只支持最常用于此类模拟器的SystemC语法子集(信号、方法)。我们实现了更高的速度(1)通过完全重写SystemC引擎和改进实现软件工程;(2)通过提出一种新的调度技术,介于SystemC动态调度技术和现有静态调度方案之间。与SystemC动态调度不同,我们的技术删除了许多(如果不是全部的话)无用的进程唤醒,同时使用比现有静态调度技术更简单的调度算法。
{"title":"A new optimized implementation of the SystemC engine using acyclic scheduling","authors":"D. G. Pérez, Gilles Mouchard, O. Temam","doi":"10.1109/DATE.2004.1268903","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268903","url":null,"abstract":"SystemC is rapidly gaining wide acceptance as a simulation framework for SoC and embedded processors. While its main assets are modularity and the very fact it is becoming a de facto standard, the evolution of the SystemC framework (from version 0.9 to version 2.0.1) suggests the environment is particularly geared toward increasing the framework functionalities rather than improving simulation speed. For cycle-level simulation, speed is a critical factor as simulation can be extremely slow, affecting the extent of design space exploration. In this article, we present a fast SystemC engine that, in our experience, can speed up simulations by a factor of 1.93 to 3.56 over SystemC 2.0.1. This SystemC engine is designed for cycle-level simulators and for the moment, it only supports the subset of the SystemC syntax (signals, methods) that is most often used for such simulators. We achieved greater speed (1) by completely rewriting the SystemC engine and improving the implementation software engineering, and (2) by proposing a new scheduling technique, intermediate between SystemC dynamic scheduling technique and existing static scheduling schemes. Unlike SystemC dynamic scheduling, our technique removes many if not all useless process wake-ups, while using a simpler scheduling algorithm than in existing static scheduling techniques.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115074123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Building the hierarchy from a flat netlist for a fast and accurate post-layout simulation with parasitic components 从平面网表构建层次结构,用于快速准确的寄生组件布局后仿真
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269268
P. Daglio, D. Iezzi, Danilo Rimondi, C. Roma, Salvatore Santapa
Main concerns related to post-layout simulation, today, are about the format of the netlist coming out from the parasitic extractor. In fact, such a netlist is usually flat so that readability, whether compared to the pre-layout hierarchical one, is very poor due to device and net names which often change and to the difficulty to compare pre-layout and post-layout output signals. Furthermore, simulating such large flat netlists is frequently time consuming because it is not possible to exploit algorithms like hierarchical array reduction (HAR) and isomorphic matching (IM), strength points of state-of-the-art full chip simulators. In this paper, we present a new approach that, starting from a flat netlist with parasitic components and a pre-layout hierarchical one, allows to create a fully hierarchical post-layout netlist containing device parameters and parasitic components directly extracted from the layout. In this way, a fast and accurate post-layout simulation is made possible by the use of look-up table simulators, taking advantages from the HAR and IM algorithms as mentioned before. This methodology has been integrated in a complete design flow to guarantee first silicon success, cut down time-to-design, improve time-to-market and streamline design quality.
今天,与布局后仿真相关的主要问题是寄生提取器产生的网表格式。事实上,这样的网络列表通常是扁平的,因此,无论与布局前的分层列表相比,由于设备和网络名称经常变化,并且难以比较布局前和布局后的输出信号,因此可读性非常差。此外,模拟如此大的平面网络列表通常非常耗时,因为不可能利用分层阵列缩减(HAR)和同构匹配(IM)等算法,这些算法是最先进的全芯片模拟器的优势点。在本文中,我们提出了一种新的方法,从具有寄生组件的平面网络列表和布局前分层网络列表开始,允许创建一个完全分层的布局后网络列表,其中包含直接从布局中提取的设备参数和寄生组件。这样,通过使用查找表模拟器,利用前面提到的HAR和IM算法的优势,可以实现快速准确的布局后仿真。该方法已集成在一个完整的设计流程中,以保证第一个硅的成功,缩短设计时间,提高上市时间并简化设计质量。
{"title":"Building the hierarchy from a flat netlist for a fast and accurate post-layout simulation with parasitic components","authors":"P. Daglio, D. Iezzi, Danilo Rimondi, C. Roma, Salvatore Santapa","doi":"10.1109/DATE.2004.1269268","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269268","url":null,"abstract":"Main concerns related to post-layout simulation, today, are about the format of the netlist coming out from the parasitic extractor. In fact, such a netlist is usually flat so that readability, whether compared to the pre-layout hierarchical one, is very poor due to device and net names which often change and to the difficulty to compare pre-layout and post-layout output signals. Furthermore, simulating such large flat netlists is frequently time consuming because it is not possible to exploit algorithms like hierarchical array reduction (HAR) and isomorphic matching (IM), strength points of state-of-the-art full chip simulators. In this paper, we present a new approach that, starting from a flat netlist with parasitic components and a pre-layout hierarchical one, allows to create a fully hierarchical post-layout netlist containing device parameters and parasitic components directly extracted from the layout. In this way, a fast and accurate post-layout simulation is made possible by the use of look-up table simulators, taking advantages from the HAR and IM algorithms as mentioned before. This methodology has been integrated in a complete design flow to guarantee first silicon success, cut down time-to-design, improve time-to-market and streamline design quality.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115133464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Proceedings Design, Automation and Test in Europe Conference and Exhibition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1