首页 > 最新文献

Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems最新文献

英文 中文
Transient fault sensitivity analysis of analog-to-digital converters (ADCs) 模数转换器(adc)暂态故障灵敏度分析
Mandeep Singh, R. Rachala, I. Koren
Reliability of systems used in space, avionic and biomedical applications is highly critical. Such systems consist of an analog front-end to collect data, an ADC to convert the collected data to digital form and a digital unit to process it. It is important to analyze the fault sensitivities of each of these to effectively gauge and improve the reliability of the system. This paper addresses the issue of fault sensitivity of ADCs. A generic methodology for analyzing the fault sensitivity of ADCs is presented. A novel concept of "node weights" specific to /spl alpha/-particle induced transient faults is introduced to increase the accuracy of such an analysis.
用于空间、航空电子和生物医学应用的系统的可靠性是非常关键的。这样的系统包括一个用于采集数据的模拟前端、一个用于将采集到的数据转换为数字形式的ADC和一个用于处理数据的数字单元。分析每一种故障的灵敏度对于有效地测量和提高系统的可靠性是非常重要的。本文研究了adc的故障灵敏度问题。提出了一种分析adc故障灵敏度的通用方法。为了提高分析的准确性,引入了针对/spl α /粒子诱发瞬态故障的“节点权重”的新概念。
{"title":"Transient fault sensitivity analysis of analog-to-digital converters (ADCs)","authors":"Mandeep Singh, R. Rachala, I. Koren","doi":"10.1109/IWV.2001.923153","DOIUrl":"https://doi.org/10.1109/IWV.2001.923153","url":null,"abstract":"Reliability of systems used in space, avionic and biomedical applications is highly critical. Such systems consist of an analog front-end to collect data, an ADC to convert the collected data to digital form and a digital unit to process it. It is important to analyze the fault sensitivities of each of these to effectively gauge and improve the reliability of the system. This paper addresses the issue of fault sensitivity of ADCs. A generic methodology for analyzing the fault sensitivity of ADCs is presented. A novel concept of \"node weights\" specific to /spl alpha/-particle induced transient faults is introduced to increase the accuracy of such an analysis.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117290927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A heterogeneous multiprocessor architecture for low-power audio signal processing applications 用于低功耗音频信号处理应用的异构多处理器体系结构
O. Paker, Jens Sparsø, Niels Haandbæk, Mogens Isager, L. S. Nielsen
This paper describes a low-power programmable DSP architecture that targets audio signal processing. The architecture can be characterized as a heterogeneous multiprocessor consisting of small and simple instruction set processors called mini-cores that communicate using message passing. The processors are tailored for different classes of filtering algorithms (FIR, IIR, N-LMS etc.), and in a typical system the communication among processors occurs at the sampling rate only. The processors are parameterized in word-size, memory-size, etc. and can be instantiated according to the needs of the application at hand using a normal synthesis based ASIC design flow. To give an impression of the size of a processor we mention that one of the FIR processors in a prototype design has 16 instructions, a 32 word/spl times/16 bit program memory, a 64 word/spl times/16 bit data memory and a 25 word/spl times/16 bit coefficient memory. Early results obtained from the design of a prototype chip containing filter processors for a hearing aid application, indicate a power consumption that is an order of magnitude better than current state of the art low-power audio DSPs implemented using full-custom techniques. This is due to: (1) the small size of the processors and (2) a smaller instruction count for a given task.
本文介绍了一种针对音频信号处理的低功耗可编程DSP体系结构。该体系结构可以被描述为一个异构多处理器,由称为微核的小而简单的指令集处理器组成,这些处理器使用消息传递进行通信。处理器是为不同类别的滤波算法(FIR, IIR, N-LMS等)量身定制的,在典型的系统中,处理器之间的通信仅以采样率发生。处理器在字长、内存大小等方面是参数化的,并且可以根据手头应用程序的需要使用基于ASIC设计流程的正常合成来实例化。为了对处理器的大小有一个印象,我们提到一个原型设计中的FIR处理器有16条指令,一个32字/spl次/16位程序存储器,一个64字/spl次/16位数据存储器和一个25字/spl次/16位系数存储器。从包含用于助听器应用的滤波器处理器的原型芯片设计中获得的早期结果表明,功耗比目前使用全定制技术实现的最先进的低功耗音频dsp好一个数量级。这是由于:(1)处理器的尺寸较小,(2)给定任务的指令计数较小。
{"title":"A heterogeneous multiprocessor architecture for low-power audio signal processing applications","authors":"O. Paker, Jens Sparsø, Niels Haandbæk, Mogens Isager, L. S. Nielsen","doi":"10.1109/IWV.2001.923139","DOIUrl":"https://doi.org/10.1109/IWV.2001.923139","url":null,"abstract":"This paper describes a low-power programmable DSP architecture that targets audio signal processing. The architecture can be characterized as a heterogeneous multiprocessor consisting of small and simple instruction set processors called mini-cores that communicate using message passing. The processors are tailored for different classes of filtering algorithms (FIR, IIR, N-LMS etc.), and in a typical system the communication among processors occurs at the sampling rate only. The processors are parameterized in word-size, memory-size, etc. and can be instantiated according to the needs of the application at hand using a normal synthesis based ASIC design flow. To give an impression of the size of a processor we mention that one of the FIR processors in a prototype design has 16 instructions, a 32 word/spl times/16 bit program memory, a 64 word/spl times/16 bit data memory and a 25 word/spl times/16 bit coefficient memory. Early results obtained from the design of a prototype chip containing filter processors for a hearing aid application, indicate a power consumption that is an order of magnitude better than current state of the art low-power audio DSPs implemented using full-custom techniques. This is due to: (1) the small size of the processors and (2) a smaller instruction count for a given task.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"8 Suppl 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130341961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A memory management approach for efficient implementation of multimedia kernels on programmable architectures 一种在可编程架构上有效实现多媒体内核的内存管理方法
M. Dasigenis, N. Kroupis, A. Argyriou, K. Tatas, D. Soudris, A. Thanailakis, N. Zervas
A methodology for power optimization of the data memory hierarchy and instruction memory, is introduced. The impact of the methodology on a set of widely used multimedia application kernels, namely Full Search (FS), Hierarchical Search (HS), Parallel Hierarchical One Dimension Search (PHODS), and Three Step Logarithmic Search (3SLS), is demonstrated. We find the power optimal data memory hierarchy applying the appropriate data-use transformation, while the instruction power optimization is done using suitable cache memory. Using data-reuse transformations, performance optimizations techniques, and instruction-level transformations, we perform exhaustive exploration of an the possible alternatives to reach power efficient solutions. Concerning the embedded processor ARM, the experimental results prove the efficiency of the methodology in terms of power for all the multimedia kernels.
介绍了一种数据存储器和指令存储器的功耗优化方法。本文还演示了该方法对一组广泛使用的多媒体应用程序内核的影响,即全搜索(FS)、分层搜索(HS)、并行分层一维搜索(PHODS)和三步对数搜索(3SLS)。我们通过适当的数据使用转换找到功率最优的数据存储器层次结构,而指令功率优化则使用合适的缓存存储器完成。通过使用数据重用转换、性能优化技术和指令级转换,我们对实现节能解决方案的可能替代方案进行了详尽的探索。在嵌入式处理器ARM上,实验结果证明了该方法在功耗方面对所有多媒体内核的有效性。
{"title":"A memory management approach for efficient implementation of multimedia kernels on programmable architectures","authors":"M. Dasigenis, N. Kroupis, A. Argyriou, K. Tatas, D. Soudris, A. Thanailakis, N. Zervas","doi":"10.1109/IWV.2001.923157","DOIUrl":"https://doi.org/10.1109/IWV.2001.923157","url":null,"abstract":"A methodology for power optimization of the data memory hierarchy and instruction memory, is introduced. The impact of the methodology on a set of widely used multimedia application kernels, namely Full Search (FS), Hierarchical Search (HS), Parallel Hierarchical One Dimension Search (PHODS), and Three Step Logarithmic Search (3SLS), is demonstrated. We find the power optimal data memory hierarchy applying the appropriate data-use transformation, while the instruction power optimization is done using suitable cache memory. Using data-reuse transformations, performance optimizations techniques, and instruction-level transformations, we perform exhaustive exploration of an the possible alternatives to reach power efficient solutions. Concerning the embedded processor ARM, the experimental results prove the efficiency of the methodology in terms of power for all the multimedia kernels.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131176731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Electronic nanotechnology and reconfigurable computing 电子纳米技术和可重构计算
S. Goldstein
Chemically assembled electronic nanotechnology (CAEN) is a promising alternative to CMOS for constructing circuits with feature sizes in the tens of nanometers range. In this paper we describe some of the recent advances in CAEN and how they influence the design of digital circuits. We show how reconfigurability supports inexpensive manufacturing. Finally, we describe a molecular latch that overcomes the lack of a viable CAEN-based transistor.
化学组装电子纳米技术(CAEN)是一种很有前途的替代CMOS,用于构建特征尺寸在几十纳米范围内的电路。在本文中,我们描述了CAEN的一些最新进展以及它们如何影响数字电路的设计。我们将展示可重构性如何支持廉价制造。最后,我们描述了一种分子锁存器,克服了缺乏可行的基于caen的晶体管。
{"title":"Electronic nanotechnology and reconfigurable computing","authors":"S. Goldstein","doi":"10.1109/IWV.2001.923133","DOIUrl":"https://doi.org/10.1109/IWV.2001.923133","url":null,"abstract":"Chemically assembled electronic nanotechnology (CAEN) is a promising alternative to CMOS for constructing circuits with feature sizes in the tens of nanometers range. In this paper we describe some of the recent advances in CAEN and how they influence the design of digital circuits. We show how reconfigurability supports inexpensive manufacturing. Finally, we describe a molecular latch that overcomes the lack of a viable CAEN-based transistor.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133180161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A hybrid wave-pipelined network router 混合波管道网络路由器
J. Delgado-Frías, J. Nyathi
In this paper a novel hybrid wave-pipelined bit-pattern associative router is presented. A router is an important component in communication network systems. The bit-pattern associative router (BPAR) allows for flexibility and can accommodate a large number of routing algorithms. Wave-pipelining is a high performance approach which implements pipelining in logic without using intermediate registers. In this study a hybrid wave-pipelined approach has been proposed and implemented. Hybrid wave-pipelining allows for the reduction of the delay difference between the maximum and minimum delays by narrowing the gap between each stage of the system. This approach yields narrow "computing cones" that allow faster clocks to be run. This is the first study in wave-pipelining that deals with a system that has a substantially different set of pipeline stages. The bit-pattern associative router has three stages: condition match, selection function, and port assignment. Each stage's data delay paths are tightly controlled to optimize the proper propagation of signals. The simulation results show that using hybrid wave-pipelining significantly reduces the clock period and circuit delays become the limiting factor, preventing further clock cycle time reduction.
本文提出了一种新型的混合波管道式位模式关联路由器。路由器是通信网络系统的重要组成部分。位模式关联路由器(BPAR)具有灵活性,可以容纳大量的路由算法。波形流水线是一种高性能的方法,它在逻辑上实现流水线,而不使用中间寄存器。本文提出并实现了一种混合波管道方法。混合波管道允许通过缩小系统每个阶段之间的差距来减少最大和最小延迟之间的延迟差异。这种方法产生狭窄的“计算锥”,允许更快的时钟运行。这是第一次对波浪管道系统进行研究,该系统具有完全不同的管道级集。位模式关联路由器有三个阶段:条件匹配、选择功能和端口分配。每个阶段的数据延迟路径被严格控制,以优化信号的适当传播。仿真结果表明,采用混合波管道可以显著缩短时钟周期,电路延迟成为限制因素,阻止了时钟周期时间的进一步缩短。
{"title":"A hybrid wave-pipelined network router","authors":"J. Delgado-Frías, J. Nyathi","doi":"10.1109/IWV.2001.923156","DOIUrl":"https://doi.org/10.1109/IWV.2001.923156","url":null,"abstract":"In this paper a novel hybrid wave-pipelined bit-pattern associative router is presented. A router is an important component in communication network systems. The bit-pattern associative router (BPAR) allows for flexibility and can accommodate a large number of routing algorithms. Wave-pipelining is a high performance approach which implements pipelining in logic without using intermediate registers. In this study a hybrid wave-pipelined approach has been proposed and implemented. Hybrid wave-pipelining allows for the reduction of the delay difference between the maximum and minimum delays by narrowing the gap between each stage of the system. This approach yields narrow \"computing cones\" that allow faster clocks to be run. This is the first study in wave-pipelining that deals with a system that has a substantially different set of pipeline stages. The bit-pattern associative router has three stages: condition match, selection function, and port assignment. Each stage's data delay paths are tightly controlled to optimize the proper propagation of signals. The simulation results show that using hybrid wave-pipelining significantly reduces the clock period and circuit delays become the limiting factor, preventing further clock cycle time reduction.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122384327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
System design of low-energy wearable computers with wireless networking 具有无线网络的低能耗可穿戴计算机系统设计
A. Smailagic, D. Siewiorek, M. Ettus
The paper describes a system level design approach to the wearable computers and wireless networks project at Carnegie Mellon University (CMU). Over the last almost ten years we have designed and fabricated twenty new generations of wearable computers, with most of them using wireless network infrastructure. We emphasize the importance of wireless communication and the amount of energy it requires. A system-level approach to power/performance optimization is going to be a crucial catalyst for making wearable computers an everyday tool for the general public.
本文介绍了卡耐基梅隆大学(CMU)可穿戴计算机和无线网络项目的系统级设计方法。在过去的近十年里,我们设计并制造了20代新一代可穿戴计算机,其中大多数使用无线网络基础设施。我们强调无线通信的重要性和它所需要的能量。功率/性能优化的系统级方法将成为使可穿戴计算机成为普通大众日常工具的关键催化剂。
{"title":"System design of low-energy wearable computers with wireless networking","authors":"A. Smailagic, D. Siewiorek, M. Ettus","doi":"10.1109/IWV.2001.923135","DOIUrl":"https://doi.org/10.1109/IWV.2001.923135","url":null,"abstract":"The paper describes a system level design approach to the wearable computers and wireless networks project at Carnegie Mellon University (CMU). Over the last almost ten years we have designed and fabricated twenty new generations of wearable computers, with most of them using wireless network infrastructure. We emphasize the importance of wireless communication and the amount of energy it requires. A system-level approach to power/performance optimization is going to be a crucial catalyst for making wearable computers an everyday tool for the general public.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"359 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122763190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Evaluating metastability in electronic circuits for random number generation 评估电子电路中用于随机数产生的亚稳态
S. Walker, S. Foo
This paper presents a method for evaluating the metastability of a flip-flop circuit for random number generation applications. It is well known that digital circuits can exhibit metastable behavior when the input to a flip-flop is asynchronous to the system clock. In the past, extensive research has been focused on eliminating metastability in digital systems. Here, we present some preliminary results of our research to exploit metastable behavior in sequential logic circuits to produce random bit streams for random number generation. In particular, we explore the idea of tapping the electronic noise present in D-type flip-flops to produce random bit streams for use as a one-time cryptographic key-pad for encryption algorithms. This research will serve as a basis for further research into the very-large-scale-integration (VLSI) of random number generators (RNGs).
本文提出了一种用于随机数产生的触发器电路亚稳性的评估方法。众所周知,当触发器的输入与系统时钟不同步时,数字电路可以表现出亚稳态行为。过去,广泛的研究集中在消除数字系统中的亚稳态上。在这里,我们提出了一些初步的研究结果,利用时序逻辑电路的亚稳行为来产生随机数生成的随机比特流。特别是,我们探索了利用d型触发器中存在的电子噪声来产生随机比特流的想法,作为加密算法的一次性密码键盘。这项研究将为进一步研究随机数发生器(rng)的超大规模集成(VLSI)奠定基础。
{"title":"Evaluating metastability in electronic circuits for random number generation","authors":"S. Walker, S. Foo","doi":"10.1109/IWV.2001.923146","DOIUrl":"https://doi.org/10.1109/IWV.2001.923146","url":null,"abstract":"This paper presents a method for evaluating the metastability of a flip-flop circuit for random number generation applications. It is well known that digital circuits can exhibit metastable behavior when the input to a flip-flop is asynchronous to the system clock. In the past, extensive research has been focused on eliminating metastability in digital systems. Here, we present some preliminary results of our research to exploit metastable behavior in sequential logic circuits to produce random bit streams for random number generation. In particular, we explore the idea of tapping the electronic noise present in D-type flip-flops to produce random bit streams for use as a one-time cryptographic key-pad for encryption algorithms. This research will serve as a basis for further research into the very-large-scale-integration (VLSI) of random number generators (RNGs).","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122897288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A linear threshold gate implementation in single electron technology 单电子技术中线性阈值门的实现
C. Lageweg, S. Cotofana, S. Vassiliadis
In this paper we focus on the design of threshold logic functions in Single Electron Tunneling (SET) technology, using the tunnel junction's specific behavior i.e., the ability to control the transport of individual electrons. We introduce a novel design of an n-input linear threshold gate which can accommodate both positive and negative weights and built-in signal amplification, using 1 tunnel junction and n+2 true capacitors. As an example we present a 4-input threshold gate with both positive and negative weights.
本文重点研究了单电子隧道(SET)技术中阈值逻辑函数的设计,利用隧道结的特定行为,即控制单个电子输运的能力。我们介绍了一种新颖的n输入线性阈值门的设计,它可以容纳正负权值和内置信号放大,使用1个隧道结和n+2个真电容。作为一个例子,我们提出了一个具有正负权的4输入阈值门。
{"title":"A linear threshold gate implementation in single electron technology","authors":"C. Lageweg, S. Cotofana, S. Vassiliadis","doi":"10.1109/IWV.2001.923145","DOIUrl":"https://doi.org/10.1109/IWV.2001.923145","url":null,"abstract":"In this paper we focus on the design of threshold logic functions in Single Electron Tunneling (SET) technology, using the tunnel junction's specific behavior i.e., the ability to control the transport of individual electrons. We introduce a novel design of an n-input linear threshold gate which can accommodate both positive and negative weights and built-in signal amplification, using 1 tunnel junction and n+2 true capacitors. As an example we present a 4-input threshold gate with both positive and negative weights.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127772785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 116
LUT-based FPGA technology mapping for power minimization with optimal depth 基于lut的FPGA技术映射的最优深度功耗最小化
Hao Li, Wai-Kei Mak, S. Katkoori
In this paper, we study the technology mapping problem for LUT-based FPGAs targeting power minimization. We present the PowerMap algorithm to generate a mapping solution to minimize power consumption while keeping the delay optimal. We compute min-height K-feasible cuts for critical nodes to optimize the depth and compute min-weight K-feasible cuts for noncritical nodes to minimize the power consumption of the mapping solution. We have implemented PowerMap in C and tested it on a number of MCNC benchmark circuits. Compared to FlowMap, a delay-optimal mapper, our algorithm reduces the power consumption by 17.8% and uses 9.4% less LUTs without any depth penalty.
本文研究了以功率最小化为目标的基于lut的fpga的技术映射问题。我们提出了PowerMap算法来生成一个映射解决方案,以最小化功耗,同时保持最优的延迟。我们计算关键节点的最小高度k可行切割以优化深度,计算非关键节点的最小权重k可行切割以最小化映射解的功耗。我们已经用C语言实现了PowerMap,并在一些MCNC基准电路上进行了测试。与延迟最优映射器FlowMap相比,我们的算法在没有任何深度损失的情况下减少了17.8%的功耗和9.4%的lut使用。
{"title":"LUT-based FPGA technology mapping for power minimization with optimal depth","authors":"Hao Li, Wai-Kei Mak, S. Katkoori","doi":"10.1109/IWV.2001.923150","DOIUrl":"https://doi.org/10.1109/IWV.2001.923150","url":null,"abstract":"In this paper, we study the technology mapping problem for LUT-based FPGAs targeting power minimization. We present the PowerMap algorithm to generate a mapping solution to minimize power consumption while keeping the delay optimal. We compute min-height K-feasible cuts for critical nodes to optimize the depth and compute min-weight K-feasible cuts for noncritical nodes to minimize the power consumption of the mapping solution. We have implemented PowerMap in C and tested it on a number of MCNC benchmark circuits. Compared to FlowMap, a delay-optimal mapper, our algorithm reduces the power consumption by 17.8% and uses 9.4% less LUTs without any depth penalty.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128602148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Reducing register and phase requirements for synchronous circuits derived using software pipelining techniques 利用软件流水线技术,减少同步电路的寄存器和相位要求
N. Chabini, E. Aboulhamid, Y. Savaria
A method based on a modulo scheduling algorithm for software pipelining has been recently proposed to optimize clocked circuits. The resulting circuits are multi-phase clocked circuits, where all clocks have the same period. To preserve the functionality of the original circuit, registers must be placed after minimizing the clock period. The placement of these registers is derived from an arbitrary schedule determined during a clock period minimization step. A good schedule may allow one to decrease the number of registers and the number of phases needed in the final circuit. Decreasing the number of registers contributes to minimizing the area occupied by the circuit and reduces its power consumption; while decreasing the number of phases reduces the complexity of the clock generation and distribution task. In this paper, we propose polynomial-time-solvable methods to choose a good schedule once the clock period is minimized. The methods have been tested on a subject of the ISCAS89 benchmarks. Experimental results show that the number of registers which must be inserted in the final circuit, and the number of phases, have been significantly decreased compared to the case where an arbitrary schedule is chosen.
最近提出了一种基于软件流水线的模调度算法来优化时钟电路。由此产生的电路是多相时钟电路,其中所有时钟都具有相同的周期。为了保持原始电路的功能,必须在最小化时钟周期之后放置寄存器。这些寄存器的位置来自于在时钟周期最小化步骤中确定的任意调度。一个好的调度可以减少寄存器的数量和最终电路所需的相数。减少寄存器的数量有助于减少电路占用的面积并降低其功耗;同时减少相位数也降低了时钟生成和分配任务的复杂性。在本文中,我们提出多项式时间可解的方法来选择一个好的调度,一旦时钟周期是最小的。这些方法已经在ISCAS89基准的一个主题上进行了测试。实验结果表明,与任意时序的情况相比,在最终电路中必须插入的寄存器数和相数明显减少。
{"title":"Reducing register and phase requirements for synchronous circuits derived using software pipelining techniques","authors":"N. Chabini, E. Aboulhamid, Y. Savaria","doi":"10.1109/IWV.2001.923142","DOIUrl":"https://doi.org/10.1109/IWV.2001.923142","url":null,"abstract":"A method based on a modulo scheduling algorithm for software pipelining has been recently proposed to optimize clocked circuits. The resulting circuits are multi-phase clocked circuits, where all clocks have the same period. To preserve the functionality of the original circuit, registers must be placed after minimizing the clock period. The placement of these registers is derived from an arbitrary schedule determined during a clock period minimization step. A good schedule may allow one to decrease the number of registers and the number of phases needed in the final circuit. Decreasing the number of registers contributes to minimizing the area occupied by the circuit and reduces its power consumption; while decreasing the number of phases reduces the complexity of the clock generation and distribution task. In this paper, we propose polynomial-time-solvable methods to choose a good schedule once the clock period is minimized. The methods have been tested on a subject of the ISCAS89 benchmarks. Experimental results show that the number of registers which must be inserted in the final circuit, and the number of phases, have been significantly decreased compared to the case where an arbitrary schedule is chosen.","PeriodicalId":114059,"journal":{"name":"Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124358563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
Proceedings IEEE Computer Society Workshop on VLSI 2001. Emerging Technologies for VLSI Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1