首页 > 最新文献

The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.最新文献

英文 中文
An evolutionary approach to configuring an embedded system based on power consumption 一种基于功耗配置嵌入式系统的进化方法
J. Northern, M. Shanblatt
Power consumption and portability issues are increasingly significant in system-on-a-chip applications. As a result, it is important that power and performance tradeoffs are made more visible to chip architects and circuit designers. Next generation tools are being developed to achieve high accuracy by estimating power consumption earlier in the design process. These tools also allow the designer to explore different configurations in a given design space. As the design space continues to expand, more efficient search methods are needed. This paper presents a framework for an evolutionary approach to configuring an ideal embedded processor based on power consumption.
功耗和可移植性问题在片上系统应用中越来越重要。因此,对于芯片架构师和电路设计师来说,功率和性能权衡变得更加明显是很重要的。下一代工具正在开发中,通过在设计过程的早期估计功耗来实现高精度。这些工具还允许设计师在给定的设计空间中探索不同的配置。随着设计空间的不断扩大,需要更有效的搜索方法。本文提出了一种基于功耗的理想嵌入式处理器配置进化方法框架。
{"title":"An evolutionary approach to configuring an embedded system based on power consumption","authors":"J. Northern, M. Shanblatt","doi":"10.1109/IWSOC.2003.1213035","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213035","url":null,"abstract":"Power consumption and portability issues are increasingly significant in system-on-a-chip applications. As a result, it is important that power and performance tradeoffs are made more visible to chip architects and circuit designers. Next generation tools are being developed to achieve high accuracy by estimating power consumption earlier in the design process. These tools also allow the designer to explore different configurations in a given design space. As the design space continues to expand, more efficient search methods are needed. This paper presents a framework for an evolutionary approach to configuring an ideal embedded processor based on power consumption.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125397148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Analysis of coupling noise in dynamic circuit 动态电路中的耦合噪声分析
M. Chowdhury, Y. Ismail
Noise has become an important metric of deep submicron digital integrated circuit performance, and is becoming even more prominent due to the increasing usage of noise sensitive dynamic circuits for speed and area requirements. This paper presents closed form analytical solutions for noise as well as noise tolerance metrics for dynamic circuits to analyze the effects of coupling, which is considered as the dominant source of noise. These solutions are within 5% of dynamic simulations. It is shown that not all the scaling trends are negative for noise, and that the scaling down of supply voltage and increasing frequency help improve certain aspects of the noise immunity of dynamic circuit. Most of the work treated noise immunity and the noise content separately. This paper introduces a positive analysis of noise scalability by looking at the noise immunity and the noise content simultaneously.
噪声已经成为深亚微米数字集成电路性能的重要指标,并且由于越来越多地使用噪声敏感动态电路来满足速度和面积要求,噪声变得更加突出。本文提出了噪声的封闭解析解和动态电路的噪声容限度量,以分析耦合的影响,而耦合被认为是主要的噪声源。这些解决方案在动态模拟的5%以内。结果表明,并非所有的标度趋势对噪声都是负的,电源电压的标度降低和频率的增加有助于提高动态电路的抗噪性。大多数工作将噪声抗扰度和噪声含量分开处理。本文从噪声抗扰度和噪声含量两方面对噪声可扩展性进行了实证分析。
{"title":"Analysis of coupling noise in dynamic circuit","authors":"M. Chowdhury, Y. Ismail","doi":"10.1109/IWSOC.2003.1213056","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213056","url":null,"abstract":"Noise has become an important metric of deep submicron digital integrated circuit performance, and is becoming even more prominent due to the increasing usage of noise sensitive dynamic circuits for speed and area requirements. This paper presents closed form analytical solutions for noise as well as noise tolerance metrics for dynamic circuits to analyze the effects of coupling, which is considered as the dominant source of noise. These solutions are within 5% of dynamic simulations. It is shown that not all the scaling trends are negative for noise, and that the scaling down of supply voltage and increasing frequency help improve certain aspects of the noise immunity of dynamic circuit. Most of the work treated noise immunity and the noise content separately. This paper introduces a positive analysis of noise scalability by looking at the noise immunity and the noise content simultaneously.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129067412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Optimized datapath design by evolutionary computation 采用进化计算优化数据路径设计
Sérgio G. Araújo, A. C. M. Filho, A. Pedroza
High-level design entry tools offer a nice framework to deal with today's complex systems while shortening the design cycle. Nevertheless, such tools provide poor quality results both in area usage and timing performance issues. This paper presents a methodology to design optimized datapaths based on evolutionary techniques and HLS tools. VHDL descriptions of the system are automatically generated by Genetic Programming. To improve the design of the structural quality of such descriptions, a two-stage multi-objective optimization algorithm is used to ensure both desired functionality and area constraints.
高级设计入门工具提供了一个很好的框架来处理当今复杂的系统,同时缩短了设计周期。然而,这些工具在区域使用和计时性能问题上提供的结果质量很差。本文提出了一种基于进化技术和HLS工具的优化数据路径设计方法。系统的VHDL描述由遗传编程自动生成。为了提高此类描述的结构质量设计,采用了两阶段多目标优化算法,以确保所需的功能和面积约束。
{"title":"Optimized datapath design by evolutionary computation","authors":"Sérgio G. Araújo, A. C. M. Filho, A. Pedroza","doi":"10.1109/IWSOC.2003.1212996","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1212996","url":null,"abstract":"High-level design entry tools offer a nice framework to deal with today's complex systems while shortening the design cycle. Nevertheless, such tools provide poor quality results both in area usage and timing performance issues. This paper presents a methodology to design optimized datapaths based on evolutionary techniques and HLS tools. VHDL descriptions of the system are automatically generated by Genetic Programming. To improve the design of the structural quality of such descriptions, a two-stage multi-objective optimization algorithm is used to ensure both desired functionality and area constraints.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134444460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A robust handshake for asynchronous system 异步系统的健壮握手
Kuo-Hsing Cheng, Wei-chun Chang, C. Tu
In this paper, a new handshake methodology to enhance the performance of the asynchronous systems is proposed. The proposed handshake methodology has more flexibility to design an asymmetric asynchronous system. The proposed handshake methodology also has some advantages, like latch free, robust, high throughput, very short pre-charge time, less transistors, and more flexibility in asymmetry data path. A technique that combines a single-rail dynamic circuit with a dual-rail dynamic circuit was proposed and used to design in the data path. In the critical delay data paths, the dual-rail dynamic circuits were used to improve the operating speed. In others the single-rail dynamic circuits were used. It brings some advantages that reduce power consumption and die area while maintaining the calculation speed. Finally, an asynchronous array multiplier was designed and implemented by the new robust handshake methodology. Based on the TSMC 0.35 /spl mu/m CMOS technology, the simulation result of the maximum throughput is about 2.5 ns.
本文提出了一种新的握手方法来提高异步系统的性能。提出的握手方法在设计非对称异步系统时具有更大的灵活性。该方法还具有无锁存器、鲁棒性好、吞吐量高、预充电时间短、晶体管少、非对称数据路径更灵活等优点。提出了一种将单轨动态电路与双轨动态电路相结合的技术,并将其用于数据路径的设计。在关键时延数据路径上,采用双轨动态电路提高运算速度。在其他地方使用单轨动态电路。在保持计算速度的同时,降低了功耗和模具面积。最后,利用新的鲁棒握手方法设计并实现了异步阵列乘法器。基于TSMC 0.35 /spl mu/m CMOS技术,仿真结果显示最大吞吐量约为2.5 ns。
{"title":"A robust handshake for asynchronous system","authors":"Kuo-Hsing Cheng, Wei-chun Chang, C. Tu","doi":"10.1109/IWSOC.2003.1212998","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1212998","url":null,"abstract":"In this paper, a new handshake methodology to enhance the performance of the asynchronous systems is proposed. The proposed handshake methodology has more flexibility to design an asymmetric asynchronous system. The proposed handshake methodology also has some advantages, like latch free, robust, high throughput, very short pre-charge time, less transistors, and more flexibility in asymmetry data path. A technique that combines a single-rail dynamic circuit with a dual-rail dynamic circuit was proposed and used to design in the data path. In the critical delay data paths, the dual-rail dynamic circuits were used to improve the operating speed. In others the single-rail dynamic circuits were used. It brings some advantages that reduce power consumption and die area while maintaining the calculation speed. Finally, an asynchronous array multiplier was designed and implemented by the new robust handshake methodology. Based on the TSMC 0.35 /spl mu/m CMOS technology, the simulation result of the maximum throughput is about 2.5 ns.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"2 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132870585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Symbolic simulation as a simplifying strategy for SoC verification 符号仿真作为SoC验证的简化策略
E. Dumitrescu, D. Borrione
The successful application of model-checking to industrial designs requires methods for reducing the complexity of the model. This paper presents an original strategy, for a well identified class of circuit behaviors; by running an appropriate symbolic simulation pattern before the actual proof of a temporal formula, an important FSM model simplification can be obtained. The actual model reduction step is formalized and illustrated. This method has been implemented within the CMU version of the SMV model checking tool and validated on a large industrial design.
模型检验在工业设计中的成功应用需要降低模型复杂性的方法。本文提出了一种新颖的策略,用于识别一类电路行为;通过在时间公式的实际证明之前运行适当的符号模拟模式,可以得到一个重要的FSM模型简化。对实际的模型简化步骤进行了形式化描述和说明。该方法已在SMV模型检查工具的CMU版本中实现,并在大型工业设计中得到验证。
{"title":"Symbolic simulation as a simplifying strategy for SoC verification","authors":"E. Dumitrescu, D. Borrione","doi":"10.1109/IWSOC.2003.1213066","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213066","url":null,"abstract":"The successful application of model-checking to industrial designs requires methods for reducing the complexity of the model. This paper presents an original strategy, for a well identified class of circuit behaviors; by running an appropriate symbolic simulation pattern before the actual proof of a temporal formula, an important FSM model simplification can be obtained. The actual model reduction step is formalized and illustrated. This method has been implemented within the CMU version of the SMV model checking tool and validated on a large industrial design.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114687775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Free-space optical interconnect for high-performance MCM systems 用于高性能MCM系统的自由空间光互连
C.-S. Seo, A. Chatterjee
An efficient co-optimization algorithm of placement and routing for high-performance multichip module (MCM) systems utilizing free-space optical interconnect technology is introduced. A computer-aided design (CAD) tool is developed for optimizing placement of modules and routing of electro-optic interconnects simultaneously without exceeding the routing capacity of the optical interconnect. About 48% saving in total routing cost is achieved through the use of 600 optical interconnects with 100 modules. It translates to doubling of the MCM performance.
介绍了一种利用自由空间光互连技术实现高性能多芯片模块(MCM)系统布局和路由的高效协同优化算法。开发了一种计算机辅助设计(CAD)工具,用于在不超过光互连布线容量的情况下同时优化电光互连的模块布局和布线。通过使用600个光互连和100个模块,可以节省约48%的总路由成本。它转化为MCM性能的两倍。
{"title":"Free-space optical interconnect for high-performance MCM systems","authors":"C.-S. Seo, A. Chatterjee","doi":"10.1109/IWSOC.2003.1213051","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213051","url":null,"abstract":"An efficient co-optimization algorithm of placement and routing for high-performance multichip module (MCM) systems utilizing free-space optical interconnect technology is introduced. A computer-aided design (CAD) tool is developed for optimizing placement of modules and routing of electro-optic interconnects simultaneously without exceeding the routing capacity of the optical interconnect. About 48% saving in total routing cost is achieved through the use of 600 optical interconnects with 100 modules. It translates to doubling of the MCM performance.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128177246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
RtrASSoc: an adaptable superscalar reconfigurable system-on-chip. The simulator RtrASSoc:一种自适应的超标量可重构片上系统。模拟器
J. L. Silva, R. Costa, G. Jorge
This paper describes the purpose of development for RtrASSoc, an Adaptable, Superscalar and Reconfigurable System on Chip. The RtrASSoc will be used in Embedded Systems that need capacity, performance, and low cost, based on Programmable System-on-Chip (PSOC), where part of the system will be a embedded superscalar processor (ESP), another part will be a Embedded Operating System (EOS), and finally a reconfigurable part where a reconfigurable routines (RR) can be reconfigured, extracted from the application program. A C-compiler extracts the reconfigurable routines from the application program and fix the parameters for reconfiguration that will be used during the execution of the application. The system will be tested in recognition pattern applications in a FPGA Virtex from Xilinx.
本文介绍了自适应、超标量、可重构片上系统RtrASSoc的开发目的。RtrASSoc将用于需要容量,性能和低成本的嵌入式系统,基于可编程片上系统(PSOC),其中系统的一部分将是嵌入式标量处理器(ESP),另一部分将是嵌入式操作系统(EOS),最后是可重构部分,其中可重构例程(RR)可以重新配置,从应用程序中提取。c编译器从应用程序中提取可重构例程,并为在应用程序执行期间使用的可重构参数进行修复。该系统将在Xilinx公司的FPGA Virtex上进行模式识别应用测试。
{"title":"RtrASSoc: an adaptable superscalar reconfigurable system-on-chip. The simulator","authors":"J. L. Silva, R. Costa, G. Jorge","doi":"10.1109/IWSOC.2003.1213034","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213034","url":null,"abstract":"This paper describes the purpose of development for RtrASSoc, an Adaptable, Superscalar and Reconfigurable System on Chip. The RtrASSoc will be used in Embedded Systems that need capacity, performance, and low cost, based on Programmable System-on-Chip (PSOC), where part of the system will be a embedded superscalar processor (ESP), another part will be a Embedded Operating System (EOS), and finally a reconfigurable part where a reconfigurable routines (RR) can be reconfigured, extracted from the application program. A C-compiler extracts the reconfigurable routines from the application program and fix the parameters for reconfiguration that will be used during the execution of the application. The system will be tested in recognition pattern applications in a FPGA Virtex from Xilinx.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121109668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An efficient mechanism for debugging RTL description 一个有效的调试RTL描述的机制
J. Rau, Yi-Yuan Chang, Chia-Hung Lin
In this paper, an efficient algorithm to diagnose design errors in RTL description is proposed. The diagnosis algorithm exploits the hierarchy available in RTL designs to locate design errors. Using data-path to reduce the number of error candidates and ensure that true errors are included in. According to the estimated probability, the most suspected error candidates would be reported first in the display. The advantages of the proposed method are simple and available.
本文提出了一种有效的RTL描述设计错误诊断算法。诊断算法利用RTL设计中的层次结构来定位设计错误。使用数据路径来减少候选错误的数量,并确保包含真正的错误。根据估计的概率,最可疑的候选错误将首先在显示中报告。该方法具有简单易行的优点。
{"title":"An efficient mechanism for debugging RTL description","authors":"J. Rau, Yi-Yuan Chang, Chia-Hung Lin","doi":"10.1109/IWSOC.2003.1213064","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213064","url":null,"abstract":"In this paper, an efficient algorithm to diagnose design errors in RTL description is proposed. The diagnosis algorithm exploits the hierarchy available in RTL designs to locate design errors. Using data-path to reduce the number of error candidates and ensure that true errors are included in. According to the estimated probability, the most suspected error candidates would be reported first in the display. The advantages of the proposed method are simple and available.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123804678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
The implementation of 100MHz data acquisition based on FPGA 基于FPGA的100MHz数据采集的实现
Tao Lin, Zhou Zhengou
A high-speed data acquisition based on FPGA and implemented in VHDL is presented in this paper. According to the requirement of a new radar system, several new technologies are adopted in the design and implementation such as time compression storage and memory rewriting. As a result, the system performs well with low dissipation of power, simple circuit layout and high efficient utilization of memory. The acquisition system comprises four parts: ADC circuit, data package and interface, sampling data memory and data flag memory. To implement large circuit, FPGA is adopted in this data acquisition system with reconfigurable ability and constant delay feature according to Z.G. Vranesic (1999).
本文介绍了一种基于FPGA的高速数据采集系统,并采用VHDL语言实现。根据一种新型雷达系统的要求,在设计和实现中采用了时间压缩存储和存储器重写等新技术。结果表明,该系统具有功耗低、电路布局简单、内存利用率高等优点。该采集系统由四部分组成:ADC电路、数据包和接口、采样数据存储器和数据标志存储器。为了实现大电路,根据Z.G. Vranesic(1999)的理论,本数据采集系统采用FPGA,具有可重构性和恒延时特性。
{"title":"The implementation of 100MHz data acquisition based on FPGA","authors":"Tao Lin, Zhou Zhengou","doi":"10.1109/IWSOC.2003.1213050","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213050","url":null,"abstract":"A high-speed data acquisition based on FPGA and implemented in VHDL is presented in this paper. According to the requirement of a new radar system, several new technologies are adopted in the design and implementation such as time compression storage and memory rewriting. As a result, the system performs well with low dissipation of power, simple circuit layout and high efficient utilization of memory. The acquisition system comprises four parts: ADC circuit, data package and interface, sampling data memory and data flag memory. To implement large circuit, FPGA is adopted in this data acquisition system with reconfigurable ability and constant delay feature according to Z.G. Vranesic (1999).","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131197405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Incorporating pattern prediction technique for energy efficient filter cache design 结合模式预测技术的高效节能滤波器缓存设计
K. Vivekanandarajah, T. Srikanthan, S. Bhattacharyya, Prasanna Venkatesh Kannan
A filter cache is proposed at a higher level than the L1 (main) cache in the memory hierarchy and is much smaller. The typical size of filter cache is of the order of 256 Bytes. Prediction algorithms popularly based upon the Next Fetch Prediction Table (NFPT) help making the choice between the filter cache and the main cache. In this paper we introduce a new prediction mechanism for predicting filter cache access, which relies on the hit or miss pattern of the instruction access stream over the past filter cache lines accesses. While NFPT makes predominantly incorrect hit-predictions, the proposed Pattern Table based approach reduces this. Predominantly correct prediction achieves efficient cache access, and eliminates cache-miss penalties. Our extensive simulations across a wide range of benchmark applications illustrate that the new prediction scheme is efficient as it results in improved prediction accuracy. Moreover, it reduces energy consumption of the filter cache by as much as 25% compared to NFPT based approaches. Further, the technique implemented is elegant in the form of hardware implementation as it consists only of a shift register and a Look up Table (LUT) and is hence area and energy efficient in contrast to the published prediction techniques.
在内存层次结构中,过滤器缓存被建议在比L1(主)缓存更高的级别上,并且要小得多。过滤器缓存的典型大小为256字节。通常基于下一个读取预测表(NFPT)的预测算法有助于在过滤器缓存和主缓存之间做出选择。本文介绍了一种新的预测过滤器缓存访问的机制,该机制依赖于指令访问流在过去过滤器缓存线访问中的命中或未命中模式。虽然NFPT主要导致错误的命中预测,但是基于模式表的建议方法减少了这种情况。主要正确的预测实现了高效的缓存访问,并消除了缓存丢失的惩罚。我们在广泛的基准应用中进行了广泛的模拟,表明新的预测方案是有效的,因为它可以提高预测精度。此外,与基于NFPT的方法相比,它将过滤器缓存的能耗降低了25%。此外,所实现的技术以硬件实现的形式是优雅的,因为它仅由移位寄存器和查找表(LUT)组成,因此与已发布的预测技术相比,它具有面积和能源效率。
{"title":"Incorporating pattern prediction technique for energy efficient filter cache design","authors":"K. Vivekanandarajah, T. Srikanthan, S. Bhattacharyya, Prasanna Venkatesh Kannan","doi":"10.1109/IWSOC.2003.1213003","DOIUrl":"https://doi.org/10.1109/IWSOC.2003.1213003","url":null,"abstract":"A filter cache is proposed at a higher level than the L1 (main) cache in the memory hierarchy and is much smaller. The typical size of filter cache is of the order of 256 Bytes. Prediction algorithms popularly based upon the Next Fetch Prediction Table (NFPT) help making the choice between the filter cache and the main cache. In this paper we introduce a new prediction mechanism for predicting filter cache access, which relies on the hit or miss pattern of the instruction access stream over the past filter cache lines accesses. While NFPT makes predominantly incorrect hit-predictions, the proposed Pattern Table based approach reduces this. Predominantly correct prediction achieves efficient cache access, and eliminates cache-miss penalties. Our extensive simulations across a wide range of benchmark applications illustrate that the new prediction scheme is efficient as it results in improved prediction accuracy. Moreover, it reduces energy consumption of the filter cache by as much as 25% compared to NFPT based approaches. Further, the technique implemented is elegant in the form of hardware implementation as it consists only of a shift register and a Look up Table (LUT) and is hence area and energy efficient in contrast to the published prediction techniques.","PeriodicalId":259178,"journal":{"name":"The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114540416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
The 3rd IEEE International Workshop on System-on-Chip for Real-Time Applications, 2003. Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1