首页 > 最新文献

2007 Design, Automation & Test in Europe Conference & Exhibition最新文献

英文 中文
An Efficient Hardware Architecture for H.264 Intra Prediction Algorithm H.264帧内预测算法的高效硬件架构
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364588
E. Sahin, Ilker Hamzaoglu
In this paper, we present an efficient hardware architecture for real-time implementation of intra prediction algorithm used in H.264/MPEG4 part 10 video coding standard. The hardware design is based on a novel organization of the intra prediction equations. This hardware is designed to be used as part of a complete H.264 video coding system for portable applications. The proposed architecture is implemented in Verilog HDL. The Verilog RTL code is verified to work at 90 MHz in a Xilinx Virtex II FPGA. The FPGA implementation can process 27 VGA frames (640times480) per second
本文提出了H.264/MPEG4第10部分视频编码标准中实时实现帧内预测算法的高效硬件架构。硬件设计是基于一种新的内部预测方程组织。该硬件被设计为用于便携式应用程序的完整H.264视频编码系统的一部分。提出的体系结构在Verilog HDL中实现。Verilog RTL代码在Xilinx Virtex II FPGA中工作在90 MHz。FPGA实现可以每秒处理27个VGA帧(640times480)
{"title":"An Efficient Hardware Architecture for H.264 Intra Prediction Algorithm","authors":"E. Sahin, Ilker Hamzaoglu","doi":"10.1109/DATE.2007.364588","DOIUrl":"https://doi.org/10.1109/DATE.2007.364588","url":null,"abstract":"In this paper, we present an efficient hardware architecture for real-time implementation of intra prediction algorithm used in H.264/MPEG4 part 10 video coding standard. The hardware design is based on a novel organization of the intra prediction equations. This hardware is designed to be used as part of a complete H.264 video coding system for portable applications. The proposed architecture is implemented in Verilog HDL. The Verilog RTL code is verified to work at 90 MHz in a Xilinx Virtex II FPGA. The FPGA implementation can process 27 VGA frames (640times480) per second","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125027054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Executable system-level specification models containing UML-based behavioral patterns 包含基于uml的行为模式的可执行系统级规范模型
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364608
L. Indrusiak, A. Thuy, M. Glesner
Behavioral patterns are useful abstractions to simplify the design of the communication-centric systems. Such patterns are traditionally described using UML diagrams, but the lack of execution semantics in UML prevents the co-validation of the patterns together with simulation models and executable specifications which are the mainstream in today's system level design flows. This paper proposes a method to validate UML-based behavioral patterns within executable system models. The method is based on actor orientation and was implemented as an extension of the Ptolemy II framework. A case study is presented and potential applications and extensions of the proposed method are discussed
行为模式是简化以通信为中心的系统设计的有用抽象。这种模式传统上是使用UML图来描述的,但是UML中缺乏执行语义,这阻碍了模式与仿真模型和可执行规范的共同验证,而这些是当今系统级设计流的主流。本文提出了一种在可执行系统模型中验证基于uml的行为模式的方法。该方法基于actor取向,并作为托勒密II框架的扩展而实现。最后给出了一个实例,并讨论了该方法的潜在应用和扩展
{"title":"Executable system-level specification models containing UML-based behavioral patterns","authors":"L. Indrusiak, A. Thuy, M. Glesner","doi":"10.1109/DATE.2007.364608","DOIUrl":"https://doi.org/10.1109/DATE.2007.364608","url":null,"abstract":"Behavioral patterns are useful abstractions to simplify the design of the communication-centric systems. Such patterns are traditionally described using UML diagrams, but the lack of execution semantics in UML prevents the co-validation of the patterns together with simulation models and executable specifications which are the mainstream in today's system level design flows. This paper proposes a method to validate UML-based behavioral patterns within executable system models. The method is based on actor orientation and was implemented as an extension of the Ptolemy II framework. A case study is presented and potential applications and extensions of the proposed method are discussed","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125074719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Working with Process Variation Aware Caches 使用过程变化感知缓存
Pub Date : 2007-04-16 DOI: 10.1145/1266366.1266615
M. Mutyam, N. Vijaykrishnan
Deep-submicron designs have to take care of process variation effects as variations in critical process parameters result in large variations in access latencies of hardware components. This is severe in the case of memory components as minimum sized transistors are used in their design. In this work, by considering on-chip data caches, we study the effect of access latency variations on performance. We discuss performance losses due to the worst-case design, wherein the entire cache operates with the worst-case process variation delay, followed by process variation aware cache designs which work at set-level granularity. We then propose a technique called block rearrangement to minimize performance loss incurred by a process variation aware cache which works at set-level granularity. Using block rearrangement technique, we rearrange the physical locations of cache blocks such that a cache set can have its "n" blocks (assuming a n-way set-associative cache) in multiple rows instead of a single row as in the case of a cache with conventional addressing scheme. By distributing blocks of a cache set over multiple sets, we minimize the number of sets being affected by process variation. We evaluate our technique using SPEC2000 CPU benchmarks and show that our technique achieves significant performance benefits over caches with conventional addressing scheme
深亚微米设计必须考虑到工艺变化的影响,因为关键工艺参数的变化会导致硬件组件访问延迟的巨大变化。这在存储器元件的情况下是严重的,因为它们的设计中使用了最小尺寸的晶体管。在这项工作中,通过考虑片上数据缓存,我们研究了访问延迟变化对性能的影响。我们讨论了由于最坏情况设计造成的性能损失,其中整个缓存以最坏情况进程变化延迟运行,然后是在设置级粒度下工作的进程变化感知缓存设计。然后,我们提出了一种称为块重排的技术,以最大限度地减少在设置级粒度下工作的进程变化感知缓存所带来的性能损失。使用块重排技术,我们重新排列缓存块的物理位置,这样一个缓存集可以在多行中拥有它的“n”块(假设有一个n路集关联缓存),而不是像传统寻址方案的缓存那样在单行中。通过将一个缓存集的块分布到多个集上,我们可以最小化受进程变化影响的集的数量。我们使用SPEC2000 CPU基准测试来评估我们的技术,并表明我们的技术比使用传统寻址方案的缓存实现了显着的性能优势
{"title":"Working with Process Variation Aware Caches","authors":"M. Mutyam, N. Vijaykrishnan","doi":"10.1145/1266366.1266615","DOIUrl":"https://doi.org/10.1145/1266366.1266615","url":null,"abstract":"Deep-submicron designs have to take care of process variation effects as variations in critical process parameters result in large variations in access latencies of hardware components. This is severe in the case of memory components as minimum sized transistors are used in their design. In this work, by considering on-chip data caches, we study the effect of access latency variations on performance. We discuss performance losses due to the worst-case design, wherein the entire cache operates with the worst-case process variation delay, followed by process variation aware cache designs which work at set-level granularity. We then propose a technique called block rearrangement to minimize performance loss incurred by a process variation aware cache which works at set-level granularity. Using block rearrangement technique, we rearrange the physical locations of cache blocks such that a cache set can have its \"n\" blocks (assuming a n-way set-associative cache) in multiple rows instead of a single row as in the case of a cache with conventional addressing scheme. By distributing blocks of a cache set over multiple sets, we minimize the number of sets being affected by process variation. We evaluate our technique using SPEC2000 CPU benchmarks and show that our technique achieves significant performance benefits over caches with conventional addressing scheme","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131899147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Temperature-aware NBTI modeling and the impact of input vector control on performance degradation 温度感知的NBTI建模及输入矢量控制对性能退化的影响
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364650
Yu Wang, Hong Luo, Ku He, Rong Luo, Huazhong Yang, Yuan Xie
As technology scales, negative bias temperature instability (NBTI), which causes temporal performance degradation in digital circuits by affecting PMOS threshold voltage, is emerging as one of the major circuit reliability concerns. In this paper, the authors first investigate the impact of NBTI on PMOS devices and propose a novel temporal performance degradation model for digital circuits considering the temperature difference between active and standby mode. For the first time, the impact of input vector control (to minimize standby leakage) on the NBTI is investigated. Minimum leakage vectors, which lead to minimum circuit performance degradation and remains maximum leakage reduction rate, are selected and used during the standby mode. Furthermore, the potential to save the circuit performance degradation by internal node control techniques during circuit standby mode is discussed. Our simulation results show that: 1) the active and standby time ratio and the standby mode temperature have considerable impact on the circuit performance degradation; 2) the NBTI-aware IVC technique leads to an average 3% savings of the total circuit degradation; while the potential of internal node control may lead to 10% savings of the total circuit degradation
随着技术的发展,负偏置温度不稳定性(NBTI)通过影响PMOS阈值电压而导致数字电路的时间性能下降,成为电路可靠性的主要问题之一。在本文中,作者首先研究了NBTI对PMOS器件的影响,并提出了一种考虑主机和待机模式温差的数字电路时间性能退化模型。首次研究了输入矢量控制(最小化待机泄漏)对NBTI的影响。在待机模式中,选择导致电路性能下降最小并保持最大泄漏减少率的最小泄漏向量。此外,还讨论了在电路待机模式下通过内部节点控制技术来避免电路性能下降的可能性。仿真结果表明:1)主、待机时间比和待机温度对电路性能下降有较大影响;2) nbti感知IVC技术导致总电路退化平均节省3%;而内部节点控制的潜力可能导致10%的总电路退化节省
{"title":"Temperature-aware NBTI modeling and the impact of input vector control on performance degradation","authors":"Yu Wang, Hong Luo, Ku He, Rong Luo, Huazhong Yang, Yuan Xie","doi":"10.1109/DATE.2007.364650","DOIUrl":"https://doi.org/10.1109/DATE.2007.364650","url":null,"abstract":"As technology scales, negative bias temperature instability (NBTI), which causes temporal performance degradation in digital circuits by affecting PMOS threshold voltage, is emerging as one of the major circuit reliability concerns. In this paper, the authors first investigate the impact of NBTI on PMOS devices and propose a novel temporal performance degradation model for digital circuits considering the temperature difference between active and standby mode. For the first time, the impact of input vector control (to minimize standby leakage) on the NBTI is investigated. Minimum leakage vectors, which lead to minimum circuit performance degradation and remains maximum leakage reduction rate, are selected and used during the standby mode. Furthermore, the potential to save the circuit performance degradation by internal node control techniques during circuit standby mode is discussed. Our simulation results show that: 1) the active and standby time ratio and the standby mode temperature have considerable impact on the circuit performance degradation; 2) the NBTI-aware IVC technique leads to an average 3% savings of the total circuit degradation; while the potential of internal node control may lead to 10% savings of the total circuit degradation","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130571687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 90
Pulse propagation for the detection of small delay defects 小延迟缺陷的脉冲传播检测
Pub Date : 2007-04-16 DOI: 10.1145/1266366.1266649
M. Favalli, C. Metra
This paper addresses the problems related to resistive opens and bridging faults which cannot be detected using delay fault testing because they lie out of the most critical paths. Even if the induced defect is not large enough to result in timing violations, these faults may give rise to reliability problems. To detect them, the paper proposes a testing method that is based on the propagation of pulses within the faulty circuit and that exploits the degraded capability of faulty paths to propagate pulses. The effectiveness of the proposed method is analyzed at the electrical level and compared with the use of reduced clock period which can detect the same class of faults. Results show similar performance in the case of resistive opens and better performance in the case of bridgings. Moreover, the proposed approach is not affected by problems on the clock distribution network
本文解决了电阻性断路和桥接故障的相关问题,这些故障由于位于最关键的路径之外而无法通过延迟故障测试检测到。即使诱导缺陷的大小不足以导致时序违规,这些故障也可能导致可靠性问题。为了检测它们,本文提出了一种基于故障电路中脉冲传播的测试方法,利用故障路径的退化能力来传播脉冲。在电气水平上分析了该方法的有效性,并与缩短时钟周期检测同类故障的方法进行了比较。结果表明,在电阻打开情况下性能相似,而在桥接情况下性能更好。此外,该方法不受时钟分配网络问题的影响
{"title":"Pulse propagation for the detection of small delay defects","authors":"M. Favalli, C. Metra","doi":"10.1145/1266366.1266649","DOIUrl":"https://doi.org/10.1145/1266366.1266649","url":null,"abstract":"This paper addresses the problems related to resistive opens and bridging faults which cannot be detected using delay fault testing because they lie out of the most critical paths. Even if the induced defect is not large enough to result in timing violations, these faults may give rise to reliability problems. To detect them, the paper proposes a testing method that is based on the propagation of pulses within the faulty circuit and that exploits the degraded capability of faulty paths to propagate pulses. The effectiveness of the proposed method is analyzed at the electrical level and compared with the use of reduced clock period which can detect the same class of faults. Results show similar performance in the case of resistive opens and better performance in the case of bridgings. Moreover, the proposed approach is not affected by problems on the clock distribution network","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124696116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Register Pointer Architecture for Efficient Embedded Processors 高效嵌入式处理器的寄存器指针体系结构
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364659
Jongsoo Park, Sung-Boem Park, J. Balfour, D. Black-Schaffer, C. Kozyrakis, W. Dally
Conventional register file architectures cannot optimally exploit temporal locality in data references due to their limited capacity and static encoding of register addresses in instructions. In conventional embedded architectures, the register file capacity cannot be increased without resorting to longer instruction words. Similarly, loop unrolling is often required to exploit locality in the register file accesses across iterations because naming registers statically is inflexible. Both optimizations lead to significant code size increases, which is undesirable in embedded systems. In this paper, the authors introduce the register pointer architecture (RPA), which allows registers to be accessed indirectly through register pointers. Indirection allows a larger register file to be used without increasing the length of instruction words. Additional register file capacity allows many loads and stores, such as those introduced by spill code, to be eliminated, which improves performance and reduces energy consumption. Moreover, indirection affords additional flexibility in naming registers, which reduces the need to apply loop unrolling in order to maximize reuse of register allocated variables
传统的寄存器文件体系结构由于其有限的容量和指令中寄存器地址的静态编码而不能最佳地利用数据引用中的时间局部性。在传统的嵌入式体系结构中,如果不使用更长的指令字,寄存器文件的容量就无法增加。类似地,循环展开通常需要利用跨迭代的寄存器文件访问中的局部性,因为静态命名寄存器是不灵活的。这两种优化都会导致代码大小的显著增加,这在嵌入式系统中是不希望出现的。在本文中,作者介绍了寄存器指针体系结构(RPA),它允许通过寄存器指针间接访问寄存器。间接允许在不增加指令字长度的情况下使用更大的寄存器文件。额外的寄存器文件容量允许消除许多负载和存储,例如由溢出代码引入的负载和存储,从而提高性能并降低能耗。此外,间接在命名寄存器方面提供了额外的灵活性,这减少了为了最大限度地重用寄存器分配的变量而应用循环展开的需要
{"title":"Register Pointer Architecture for Efficient Embedded Processors","authors":"Jongsoo Park, Sung-Boem Park, J. Balfour, D. Black-Schaffer, C. Kozyrakis, W. Dally","doi":"10.1109/DATE.2007.364659","DOIUrl":"https://doi.org/10.1109/DATE.2007.364659","url":null,"abstract":"Conventional register file architectures cannot optimally exploit temporal locality in data references due to their limited capacity and static encoding of register addresses in instructions. In conventional embedded architectures, the register file capacity cannot be increased without resorting to longer instruction words. Similarly, loop unrolling is often required to exploit locality in the register file accesses across iterations because naming registers statically is inflexible. Both optimizations lead to significant code size increases, which is undesirable in embedded systems. In this paper, the authors introduce the register pointer architecture (RPA), which allows registers to be accessed indirectly through register pointers. Indirection allows a larger register file to be used without increasing the length of instruction words. Additional register file capacity allows many loads and stores, such as those introduced by spill code, to be eliminated, which improves performance and reduces energy consumption. Moreover, indirection affords additional flexibility in naming registers, which reduces the need to apply loop unrolling in order to maximize reuse of register allocated variables","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"500 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120933205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Incremental ABV for Functional Validation of TL-to-RTL Design Refinement 增量ABV用于从tl到rtl设计细化的功能验证
Pub Date : 2007-04-16 DOI: 10.5555/1266366.1266557
N. Bombieri, F. Fummi, G. Pravadelli
Transaction-level modeling (TLM) has been proposed as the leading strategy to address the always increasing complexity of digital systems. However, its introduction arouses a new challenge for designers and verification engineers, since there are no mature tools to automatically synthesize an RTL implementation from a transaction-level (TL) design, thus manual refinements are mandatory. In this context, the paper presents an incremental assertion-based verification (ABV) methodology to check the correctness of the TL-to-RTL refinement. The methodology relies on reusing assertions and already checked code, and it is guided by an assertion coverage metrics
事务级建模(TLM)已被提出作为解决数字系统日益增加的复杂性的主要策略。然而,它的引入给设计人员和验证工程师带来了新的挑战,因为没有成熟的工具可以从事务级(TL)设计中自动合成RTL实现,因此必须手工改进。在这种情况下,本文提出了一种基于断言的增量验证(ABV)方法来检查从tl到rtl改进的正确性。该方法依赖于重用断言和已经检查过的代码,并由断言覆盖率指标指导
{"title":"Incremental ABV for Functional Validation of TL-to-RTL Design Refinement","authors":"N. Bombieri, F. Fummi, G. Pravadelli","doi":"10.5555/1266366.1266557","DOIUrl":"https://doi.org/10.5555/1266366.1266557","url":null,"abstract":"Transaction-level modeling (TLM) has been proposed as the leading strategy to address the always increasing complexity of digital systems. However, its introduction arouses a new challenge for designers and verification engineers, since there are no mature tools to automatically synthesize an RTL implementation from a transaction-level (TL) design, thus manual refinements are mandatory. In this context, the paper presents an incremental assertion-based verification (ABV) methodology to check the correctness of the TL-to-RTL refinement. The methodology relies on reusing assertions and already checked code, and it is guided by an assertion coverage metrics","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123133732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Experimental Validation of a Tuning Algorithm for High-Speed Filters 一种高速滤波器调谐算法的实验验证
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364628
G. Matarrese, C. Marzocca, F. Corsi, S. D’Amico, A. Baschirotto
We report here the results of some laboratory experiments performed to validate the effectiveness of a technique for the self tuning of integrated continuous-time, high-speed active filters. The tuning algorithm is based on the application of a pseudo-random input sequence of rectangular pulses to the device to be tuned and on the evaluation of a few samples of the input-output cross-correlation function which constitute the filter signature. The key advantages of this technique are the ease of the input test pattern generation and the simplicity of the output circuitry which consists of a digital cross-correlator. The technique allows achieving a tuning error mainly dominated by the value of the elementary capacitors employed in the tuning circuitry. The time required to perform the tuning is kept within a few microseconds. This appears particularly interesting for applications to telecommunication multi-standard terminals. The experiments regarding the application of the proposed tuning algorithm to a baseband multi-standard filter confirm most of the simulation results and show the robustness of the technique against practical operating conditions and noise
我们在这里报告了一些实验室实验的结果,以验证集成连续时间高速有源滤波器自调谐技术的有效性。该调谐算法是基于将矩形脉冲的伪随机输入序列应用于待调谐器件,并对构成滤波器特征的输入-输出互相关函数的几个样本进行评估。该技术的主要优点是易于生成输入测试模式和由数字互关器组成的简单输出电路。该技术允许实现主要由调谐电路中所采用的基本电容器的值所控制的调谐误差。执行调优所需的时间保持在几微秒内。这对于电信多标准终端的应用显得特别有趣。将所提出的调谐算法应用于基带多标准滤波器的实验验证了大部分仿真结果,并显示了该技术对实际工作条件和噪声的鲁棒性
{"title":"Experimental Validation of a Tuning Algorithm for High-Speed Filters","authors":"G. Matarrese, C. Marzocca, F. Corsi, S. D’Amico, A. Baschirotto","doi":"10.1109/DATE.2007.364628","DOIUrl":"https://doi.org/10.1109/DATE.2007.364628","url":null,"abstract":"We report here the results of some laboratory experiments performed to validate the effectiveness of a technique for the self tuning of integrated continuous-time, high-speed active filters. The tuning algorithm is based on the application of a pseudo-random input sequence of rectangular pulses to the device to be tuned and on the evaluation of a few samples of the input-output cross-correlation function which constitute the filter signature. The key advantages of this technique are the ease of the input test pattern generation and the simplicity of the output circuitry which consists of a digital cross-correlator. The technique allows achieving a tuning error mainly dominated by the value of the elementary capacitors employed in the tuning circuitry. The time required to perform the tuning is kept within a few microseconds. This appears particularly interesting for applications to telecommunication multi-standard terminals. The experiments regarding the application of the proposed tuning algorithm to a baseband multi-standard filter confirm most of the simulation results and show the robustness of the technique against practical operating conditions and noise","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131224085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A One-Shot Configurable-Cache Tuner for Improved Energy and Performance 一次性配置缓存调谐器,提高能源和性能
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364686
A. Gordon-Ross, Pablo Viana, F. Vahid, W. Najjar, E. Barros
We introduce a new non-intrusive on-chip cache-tuning hardware module capable of accurately predicting the best configuration of a configurable cache for an executing application. Previous dynamic cache tuning approaches change the cache configuration several times as part of the tuning search process, executing the application using inferior configurations and temporarily causing energy and performance overhead. The introduced tuner uses a different approach, which non-intrusively collects data on addresses issued by the microprocessor, analyzes that data to predict the best cache configuration, and then updates the cache to the new best configuration in "one-shot", without ever having to examine inferior configurations. The result is less energy and less performance overhead, meaning that cache tuning can be applied more frequently. We show through experiments that the one-shot cache tuner can reduce memory-access related energy for instructions by 35% and comes within 4% of a previous intrusive approach, and results in 4.6 times less energy overhead and a 7.7 times speedup in tuning time compared to a previous intrusive approach, at the main expense of 12% larger size
我们引入了一种新的非侵入式片上缓存调优硬件模块,能够准确预测执行应用程序的可配置缓存的最佳配置。以前的动态缓存调优方法在调优搜索过程中多次更改缓存配置,使用较差的配置执行应用程序,并暂时造成能量和性能开销。引入的调谐器使用了一种不同的方法,它非侵入性地收集微处理器发出的地址数据,分析该数据以预测最佳缓存配置,然后“一次性”将缓存更新为新的最佳配置,而不必检查较差的配置。其结果是更少的能量和更少的性能开销,这意味着可以更频繁地应用缓存调优。我们通过实验证明,一次性缓存调谐器可以将指令的内存访问相关能量减少35%,比以前的侵入式方法减少4%,并且与以前的侵入式方法相比,能量开销减少4.6倍,调优时间加快7.7倍,主要代价是尺寸增加12%
{"title":"A One-Shot Configurable-Cache Tuner for Improved Energy and Performance","authors":"A. Gordon-Ross, Pablo Viana, F. Vahid, W. Najjar, E. Barros","doi":"10.1109/DATE.2007.364686","DOIUrl":"https://doi.org/10.1109/DATE.2007.364686","url":null,"abstract":"We introduce a new non-intrusive on-chip cache-tuning hardware module capable of accurately predicting the best configuration of a configurable cache for an executing application. Previous dynamic cache tuning approaches change the cache configuration several times as part of the tuning search process, executing the application using inferior configurations and temporarily causing energy and performance overhead. The introduced tuner uses a different approach, which non-intrusively collects data on addresses issued by the microprocessor, analyzes that data to predict the best cache configuration, and then updates the cache to the new best configuration in \"one-shot\", without ever having to examine inferior configurations. The result is less energy and less performance overhead, meaning that cache tuning can be applied more frequently. We show through experiments that the one-shot cache tuner can reduce memory-access related energy for instructions by 35% and comes within 4% of a previous intrusive approach, and results in 4.6 times less energy overhead and a 7.7 times speedup in tuning time compared to a previous intrusive approach, at the main expense of 12% larger size","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"208 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132816656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Low Power Design on Algorithmic and Architectural Level: A Case Study of an HSDPA Baseband Digital Signal Processing System 基于算法和体系结构的低功耗设计——以HSDPA基带数字信号处理系统为例
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364495
Marcus Schämann, S. Hessel, U. Langmann, Martin Bücker
The optimization of power consumption plays a key role in the design of a cellular system: Increasing data rates together with high mobility represent a constantly growing design challenge because advanced algorithms are required with a higher complexity, more chip area and increased power consumption which contrast with limited power supply. In this contribution, digital baseband components for a high speed downlink packet access (HSDPA) system are optimized on algorithmic and architectural level. Three promising algorithms for the equalization of the propagation channel are compared regarding performance, complexity and power consumption using fixed-point SystemC models. On architectural level an adaptive control unit is introduced together with an output interference analyzer. The presented strategy reduces the arithmetic operations for convenient propagation conditions up to 70 % which relates to an estimated power reduction of up to 40 % while the overall performance is not affected
功耗的优化在蜂窝系统的设计中起着关键作用:增加数据速率和高移动性代表着不断增长的设计挑战,因为与有限的电源相比,需要更高复杂性的先进算法,更大的芯片面积和更高的功耗。在这篇贡献中,高速下行分组接入(HSDPA)系统的数字基带组件在算法和体系结构层面进行了优化。利用定点系统模型,比较了三种有前途的传播信道均衡算法的性能、复杂度和功耗。在结构上引入了自适应控制单元和输出干扰分析仪。该策略在不影响整体性能的情况下,将方便传播条件下的算术运算减少了70%,这意味着估计功耗降低了40%
{"title":"Low Power Design on Algorithmic and Architectural Level: A Case Study of an HSDPA Baseband Digital Signal Processing System","authors":"Marcus Schämann, S. Hessel, U. Langmann, Martin Bücker","doi":"10.1109/DATE.2007.364495","DOIUrl":"https://doi.org/10.1109/DATE.2007.364495","url":null,"abstract":"The optimization of power consumption plays a key role in the design of a cellular system: Increasing data rates together with high mobility represent a constantly growing design challenge because advanced algorithms are required with a higher complexity, more chip area and increased power consumption which contrast with limited power supply. In this contribution, digital baseband components for a high speed downlink packet access (HSDPA) system are optimized on algorithmic and architectural level. Three promising algorithms for the equalization of the propagation channel are compared regarding performance, complexity and power consumption using fixed-point SystemC models. On architectural level an adaptive control unit is introduced together with an output interference analyzer. The presented strategy reduces the arithmetic operations for convenient propagation conditions up to 70 % which relates to an estimated power reduction of up to 40 % while the overall performance is not affected","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132849508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2007 Design, Automation & Test in Europe Conference & Exhibition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1