首页 > 最新文献

2011 IEEE 29th International Conference on Computer Design (ICCD)最新文献

英文 中文
Energy-aware Standby-Sparing Technique for periodic real-time applications 周期性实时应用的能量感知备用节省技术
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081396
M. A. Haque, Hakan Aydin, Dakai Zhu
In this paper, we present an energy-aware standby-sparing technique for periodic real-time applications. A standby-sparing system consists of a primary processor where the application tasks are executed using Dynamic Voltage Scaling (DVS) to save energy, and a spare processor where the backup tasks are executed at maximum voltage/frequency, should there be a need. In our framework, we employ Earliest-Deadline-First (EDF) and Earliest-Deadline-Late (EDL) scheduling policies on the primary and spare CPUs, respectively. The use of EDL on the spare CPU allows delaying the backup tasks on the spare CPU as much as possible, enabling energy savings. We develop static and dynamic algorithms based on these principles, and evaluate their performance experimentally. Our simulation results show significant energy savings compared to existing reliability-aware power management (RAPM) techniques for most execution scenarios.
在本文中,我们提出了一种周期性实时应用的能量感知备用节省技术。备用备用系统由一个主处理器组成,主处理器使用动态电压缩放(DVS)来执行应用程序任务,以节省能源;如果有需要,备用处理器在最大电压/频率下执行备份任务。在我们的框架中,我们分别在主cpu和备用cpu上采用了最早截止日期优先(EDF)和最早截止日期晚(EDL)调度策略。在备用CPU上使用EDL,可以尽可能地延迟备用CPU上的备份任务,从而节省能源。我们基于这些原理开发了静态和动态算法,并通过实验评估了它们的性能。我们的模拟结果显示,在大多数执行场景中,与现有的可靠性感知电源管理(RAPM)技术相比,该技术显著节省了能源。
{"title":"Energy-aware Standby-Sparing Technique for periodic real-time applications","authors":"M. A. Haque, Hakan Aydin, Dakai Zhu","doi":"10.1109/ICCD.2011.6081396","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081396","url":null,"abstract":"In this paper, we present an energy-aware standby-sparing technique for periodic real-time applications. A standby-sparing system consists of a primary processor where the application tasks are executed using Dynamic Voltage Scaling (DVS) to save energy, and a spare processor where the backup tasks are executed at maximum voltage/frequency, should there be a need. In our framework, we employ Earliest-Deadline-First (EDF) and Earliest-Deadline-Late (EDL) scheduling policies on the primary and spare CPUs, respectively. The use of EDL on the spare CPU allows delaying the backup tasks on the spare CPU as much as possible, enabling energy savings. We develop static and dynamic algorithms based on these principles, and evaluate their performance experimentally. Our simulation results show significant energy savings compared to existing reliability-aware power management (RAPM) techniques for most execution scenarios.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123884063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
Output process of variable bit-rate flows in on-chip networks based on aggregate scheduling 基于聚合调度的片上网络可变比特率流输出过程
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081442
Fahimeh Jafari, A. Jantsch, Zhonghai Lu
In NoCs often several flows are merged into one aggregate flow due to heavy resource sharing. For strengthening formal performance analysis, we propose an improved model for an output flow of a FIFO multiplexer under aggregate scheduling. The model of the aggregate flow is formally proven and can serve as the basis for a stringent worst case delay and buffer analysis.
在noc中,由于大量的资源共享,经常将多个流合并为一个聚合流。为了加强形式化的性能分析,我们提出了一个改进的FIFO多路复用器在聚合调度下的输出流模型。该模型可以作为严格的最坏情况延迟和缓冲分析的基础。
{"title":"Output process of variable bit-rate flows in on-chip networks based on aggregate scheduling","authors":"Fahimeh Jafari, A. Jantsch, Zhonghai Lu","doi":"10.1109/ICCD.2011.6081442","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081442","url":null,"abstract":"In NoCs often several flows are merged into one aggregate flow due to heavy resource sharing. For strengthening formal performance analysis, we propose an improved model for an output flow of a FIFO multiplexer under aggregate scheduling. The model of the aggregate flow is formally proven and can serve as the basis for a stringent worst case delay and buffer analysis.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126386632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Video quality-driven buffer dimensioning in MPSoC platforms via prioritized frame drops 视频质量驱动的MPSoC平台中通过优先帧丢失的缓冲区尺寸
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081404
Deepak Gangadharan, Haiyang Ma, S. Chakraborty, Roger Zimmermann
We study the impact of a novel prioritized frame dropping scheme in buffer-constrained multiprocessor system-on-chip (MPSoC) platforms. Accurate buffer dimensioning has attracted lot of research interest as large on-chip buffers result in increased silicon area and higher costs. Multimedia applications present the flexibility of trading off quality for buffer space without any noticeable deterioration in video quality. The frame dropping scheme is crucial here to drop frames appropriately such that the required buffer size is reduced and target quality requirement is satisfied. Towards this, we propose a simple prioritized frame dropping mechanism which reduces the required buffer space more than existing frame dropping policies. We also provide a fast iterative procedure to find the minimum buffer size for a video clip with O(log(Ndrop)) number of iterations, where Ndrop is the maximum number of frames that can be dropped for a video clip so that a prespecified quality in terms of peak signal to noise ratio (PSNR) value is satisfied.
我们研究了一种新的优先帧丢弃方案在缓冲受限的多处理器片上系统(MPSoC)平台中的影响。由于大的片上缓冲器会导致硅面积的增加和成本的提高,精确的缓冲器尺寸引起了人们的广泛关注。多媒体应用程序提供了在缓冲空间中权衡质量的灵活性,而不会造成视频质量的明显下降。帧丢弃方案在这里是至关重要的,以适当地丢弃帧,以减少所需的缓冲区大小,并满足目标质量要求。为此,我们提出了一种简单的优先级丢帧机制,它比现有的丢帧策略更能减少所需的缓冲空间。我们还提供了一个快速迭代过程,用于找到具有O(log(Ndrop))迭代次数的视频剪辑的最小缓冲区大小,其中Ndrop是视频剪辑可以丢弃的最大帧数,以便在峰值信噪比(PSNR)值方面满足预先指定的质量。
{"title":"Video quality-driven buffer dimensioning in MPSoC platforms via prioritized frame drops","authors":"Deepak Gangadharan, Haiyang Ma, S. Chakraborty, Roger Zimmermann","doi":"10.1109/ICCD.2011.6081404","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081404","url":null,"abstract":"We study the impact of a novel prioritized frame dropping scheme in buffer-constrained multiprocessor system-on-chip (MPSoC) platforms. Accurate buffer dimensioning has attracted lot of research interest as large on-chip buffers result in increased silicon area and higher costs. Multimedia applications present the flexibility of trading off quality for buffer space without any noticeable deterioration in video quality. The frame dropping scheme is crucial here to drop frames appropriately such that the required buffer size is reduced and target quality requirement is satisfied. Towards this, we propose a simple prioritized frame dropping mechanism which reduces the required buffer space more than existing frame dropping policies. We also provide a fast iterative procedure to find the minimum buffer size for a video clip with O(log(Ndrop)) number of iterations, where Ndrop is the maximum number of frames that can be dropped for a video clip so that a prespecified quality in terms of peak signal to noise ratio (PSNR) value is satisfied.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126815201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
3D vs. 2D analysis of FinFET logic gates under process variations 工艺变化下FinFET逻辑门的3D与2D分析
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081437
S. Chaudhuri, N. Jha
Among various multi-gate structures, FinFETs have emerged dominant owing to their ease of fabrication. Thus, characterization of FinFET devices/gates needs immediate attention for them to become the industry driver in this decade. Ideally, 3D device simulation should be done to enable accurate circuit synthesis. However, this is impractical due to the huge CPU times required. Simulating a 2D cross-section of the device yields 100–1000× reduction in CPU time. However, this introduces significant error, in the range of 10% to 50%, while evaluating the on/off current (ION/IOFF) for a single device and leakage current or propagation delay (ILEAK/tD) for logic gates. In this work, we develop accurate 2D models of FinFET devices to capture 3D simulation accuracy with 2D simulation efficiency. We report results for the 22nm FinFET technology node. As far as we know, this is the first such attempt. We establish the validity of the model even under process variations. We target variations in gate length (LG), workfunction (ΦG) and fin thickness (TSI) that are known to have the most impact on leakage and delay. We adjust their values in the 2D model in order to mimic the actual 3D device behavior. When the 2D models are employed in mixed-mode simulation of FinFET logic gates, the error in the evaluation of ILEAK/tD is quite small.
在各种多栅极结构中,finfet因其易于制造而占据主导地位。因此,FinFET器件/栅极的特性需要立即得到关注,以便在这十年中成为行业的驱动力。理想情况下,应该进行3D器件仿真以实现精确的电路合成。然而,这是不切实际的,因为需要大量的CPU时间。模拟该器件的二维截面可使CPU时间减少100 - 1000倍。然而,在评估单个器件的开/关电流(ION/IOFF)和逻辑门的漏电流或传播延迟(ILEAK/tD)时,这会引入10%至50%的显著误差。在这项工作中,我们开发了精确的FinFET器件的2D模型,以获得2D仿真效率的3D仿真精度。我们报告了22nm FinFET技术节点的结果。据我们所知,这是第一次此类尝试。我们建立了即使在工艺变化下模型的有效性。我们的目标是栅极长度(LG)、工作函数(ΦG)和鳍片厚度(TSI)的变化,这些变化已知对泄漏和延迟影响最大。为了模拟实际的3D设备行为,我们在2D模型中调整了它们的值。将二维模型用于FinFET逻辑门的混合模式仿真时,ILEAK/tD的评估误差相当小。
{"title":"3D vs. 2D analysis of FinFET logic gates under process variations","authors":"S. Chaudhuri, N. Jha","doi":"10.1109/ICCD.2011.6081437","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081437","url":null,"abstract":"Among various multi-gate structures, FinFETs have emerged dominant owing to their ease of fabrication. Thus, characterization of FinFET devices/gates needs immediate attention for them to become the industry driver in this decade. Ideally, 3D device simulation should be done to enable accurate circuit synthesis. However, this is impractical due to the huge CPU times required. Simulating a 2D cross-section of the device yields 100–1000× reduction in CPU time. However, this introduces significant error, in the range of 10% to 50%, while evaluating the on/off current (ION/IOFF) for a single device and leakage current or propagation delay (ILEAK/tD) for logic gates. In this work, we develop accurate 2D models of FinFET devices to capture 3D simulation accuracy with 2D simulation efficiency. We report results for the 22nm FinFET technology node. As far as we know, this is the first such attempt. We establish the validity of the model even under process variations. We target variations in gate length (LG), workfunction (ΦG) and fin thickness (TSI) that are known to have the most impact on leakage and delay. We adjust their values in the 2D model in order to mimic the actual 3D device behavior. When the 2D models are employed in mixed-mode simulation of FinFET logic gates, the error in the evaluation of ILEAK/tD is quite small.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"298 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134265279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Using analog circuit behavior to generate SystemC events for an acceleration of mixed-signal simulation 使用模拟电路行为来生成SystemC事件,以加速混合信号仿真
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081384
Stefan Hoelldampf, D. Zaum, M. Olbrich, E. Barke
Analysis of mixed-signal circuits on system level demands for an accelerated simulation of analog blocks. We precompute state space representations that can be used to evaluate the circuit response and predict the occurrence of events triggered by analog components. Interfacing such models to SystemC digital simulation allows for very fast mixed-signal transient analysis. The key contribution of this paper is the dynamic generation of SystemC events from analog descriptions. We evaluate the performance of our approach using PWM (pulse width modulation) example circuits from the automotive domain which are interfaced to a SystemC microcontroller model. Accuracy and speed is compared to reference simulations utilizing traditional SPICE-like analog simulators. Our approach allows for a speedup of up to 70 for mixed-signal simulations compared to traditional analog and digital simulators.
混合信号电路的系统级分析要求模拟模块的加速仿真。我们预先计算状态空间表示,可用于评估电路响应和预测由模拟组件触发的事件的发生。将这些模型与SystemC数字仿真相结合,可以实现非常快速的混合信号瞬态分析。本文的主要贡献是根据模拟描述动态生成SystemC事件。我们使用来自汽车领域的PWM(脉宽调制)示例电路来评估我们的方法的性能,这些电路与SystemC微控制器模型接口。精度和速度与使用传统spice类模拟模拟器的参考模拟进行了比较。与传统的模拟和数字模拟器相比,我们的方法允许混合信号模拟的加速高达70。
{"title":"Using analog circuit behavior to generate SystemC events for an acceleration of mixed-signal simulation","authors":"Stefan Hoelldampf, D. Zaum, M. Olbrich, E. Barke","doi":"10.1109/ICCD.2011.6081384","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081384","url":null,"abstract":"Analysis of mixed-signal circuits on system level demands for an accelerated simulation of analog blocks. We precompute state space representations that can be used to evaluate the circuit response and predict the occurrence of events triggered by analog components. Interfacing such models to SystemC digital simulation allows for very fast mixed-signal transient analysis. The key contribution of this paper is the dynamic generation of SystemC events from analog descriptions. We evaluate the performance of our approach using PWM (pulse width modulation) example circuits from the automotive domain which are interfaced to a SystemC microcontroller model. Accuracy and speed is compared to reference simulations utilizing traditional SPICE-like analog simulators. Our approach allows for a speedup of up to 70 for mixed-signal simulations compared to traditional analog and digital simulators.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"39 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115568455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Leveraging torus topology with deadlock recovery for cost-efficient on-chip network 利用环面拓扑与死锁恢复为成本效益的片上网络
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081371
Minjeong Shin, John Kim
On-chip networks are becoming more important as the number of on-chip components continue to increase. 2D mesh topology is a commonly assumed topology for on-chip networks but in this work, we make the argument that 2D torus can provide a more cost-efficient on-chip network since the on-chip network datapath is reduced by 2× while providing the same bisection bandwidth as a mesh network. Our results show that 2D torus can achieve an improvement of up to 1.9× over a 2D mesh in performance per watt metric. However, routing deadlock can occur in a torus network with the wrap-around channel and requires additional virtual channels for deadlock avoidance. In this work, we propose deadlock recovery with tokens (DRT) in on-chip networks that exploits on-chip networks - exploiting the abundant wires available while minimizing the need for additional buffers. As a result, deadlocks can be exactly detected without having to rely on a timeout mechanism and when needed, recover from the deadlock. We show how DRT results in minimal loss in performance, compared with deadlock avoidance using virtual channels, while reducing the on-chip network complexity.
随着片上元件数量的不断增加,片上网络变得越来越重要。二维网格拓扑是片上网络通常假设的拓扑结构,但在这项工作中,我们认为二维环面可以提供更具成本效益的片上网络,因为片上网络数据路径减少了2倍,同时提供与网格网络相同的等分带宽。我们的研究结果表明,二维环面在每瓦性能方面可以比二维网格提高1.9倍。但是,路由死锁可能发生在具有环绕通道的环面网络中,并且需要额外的虚拟通道来避免死锁。在这项工作中,我们提出在利用片上网络的片上网络中使用令牌(DRT)进行死锁恢复-利用丰富的可用线路,同时最大限度地减少对额外缓冲区的需求。因此,可以准确地检测死锁,而不必依赖于超时机制,并在需要时从死锁中恢复。与使用虚拟通道避免死锁相比,我们展示了DRT如何在降低片上网络复杂性的同时,将性能损失降到最低。
{"title":"Leveraging torus topology with deadlock recovery for cost-efficient on-chip network","authors":"Minjeong Shin, John Kim","doi":"10.1109/ICCD.2011.6081371","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081371","url":null,"abstract":"On-chip networks are becoming more important as the number of on-chip components continue to increase. 2D mesh topology is a commonly assumed topology for on-chip networks but in this work, we make the argument that 2D torus can provide a more cost-efficient on-chip network since the on-chip network datapath is reduced by 2× while providing the same bisection bandwidth as a mesh network. Our results show that 2D torus can achieve an improvement of up to 1.9× over a 2D mesh in performance per watt metric. However, routing deadlock can occur in a torus network with the wrap-around channel and requires additional virtual channels for deadlock avoidance. In this work, we propose deadlock recovery with tokens (DRT) in on-chip networks that exploits on-chip networks - exploiting the abundant wires available while minimizing the need for additional buffers. As a result, deadlocks can be exactly detected without having to rely on a timeout mechanism and when needed, recover from the deadlock. We show how DRT results in minimal loss in performance, compared with deadlock avoidance using virtual channels, while reducing the on-chip network complexity.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"238 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116176321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A novel cryptographic key exchange scheme using resistors 一种利用电阻器的新型密码密钥交换方案
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081445
P. Lin, A. Ivanov, Bradley Johnson, S. Khatri
Recently, a secure key exchange technique was developed, in which both communicators (Alice and Bob) randomly select between two known resistors. By measuring the resulting thermal noise on a shared wire, they can each determine the resistor chosen by their counterpart, while the eavesdropper (Eve) cannot determine this. By repeating this transaction, they can create a common secure key, one bit a time. Although theoretically elegant, this approach is difficult to realize in practice. In this paper, we present a practical realization of a secure key exchange technique, intended for use over the Ethernet. Our approach is inspired by the above scheme with significant differences. In our approach, Alice and Bob utilize programmable resistors and exchange their resistance values securely. Our technique has been implemented in a hardware FPGA based platform, and was found to be able to exchange 4 secure bits per transaction over a 100ft CAT5 cable.
最近,开发了一种安全的密钥交换技术,其中通信双方(Alice和Bob)随机选择两个已知的电阻。通过测量共享导线上产生的热噪声,它们可以各自确定对方选择的电阻,而窃听者(夏娃)无法确定这一点。通过重复这个交易,他们可以创建一个通用的安全密钥,每次一个比特。虽然理论上很优雅,但这种方法在实践中很难实现。在本文中,我们提出了一种安全密钥交换技术的实际实现,旨在通过以太网使用。我们的方法受到上述方案的启发,但有很大的不同。在我们的方法中,Alice和Bob利用可编程电阻并安全地交换其电阻值。我们的技术已经在基于硬件FPGA的平台上实现,并且发现能够在100英尺的CAT5电缆上每笔交易交换4个安全比特。
{"title":"A novel cryptographic key exchange scheme using resistors","authors":"P. Lin, A. Ivanov, Bradley Johnson, S. Khatri","doi":"10.1109/ICCD.2011.6081445","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081445","url":null,"abstract":"Recently, a secure key exchange technique was developed, in which both communicators (Alice and Bob) randomly select between two known resistors. By measuring the resulting thermal noise on a shared wire, they can each determine the resistor chosen by their counterpart, while the eavesdropper (Eve) cannot determine this. By repeating this transaction, they can create a common secure key, one bit a time. Although theoretically elegant, this approach is difficult to realize in practice. In this paper, we present a practical realization of a secure key exchange technique, intended for use over the Ethernet. Our approach is inspired by the above scheme with significant differences. In our approach, Alice and Bob utilize programmable resistors and exchange their resistance values securely. Our technique has been implemented in a hardware FPGA based platform, and was found to be able to exchange 4 secure bits per transaction over a 100ft CAT5 cable.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123392617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using stochastic computing to implement digital image processing algorithms 利用随机计算实现数字图像处理算法
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081391
Peng Li, D. Lilja
As device scaling continues to nanoscale dimensions, circuit reliability will continue to become an ever greater problem. Stochastic computing, which performs computing with random bits (stochastic bits streams), can be used to enable reliable computation using those unreliable devices. However, one of the major issues of stochastic computing is that applications implemented with this technique are limited by the available computational elements. In this paper, first we will introduce and prove a stochastic absolute value function. Second, we will demonstrate a mathematical analysis of a stochastic tanh function, which is a key component used in a stochastic comparator. Third, we will present a quantitative analysis of a one-parameter linear gain function, and propose a new two-parameter version. The validity of the present stochastic computational elements is demonstrated through four basic digital image processing algorithms: edge detection, frame difference based image segmentation, median filter based noise reduction, and image contrast stretching. Our experimental results show that stochastic implementations tolerate more noise and consume less hardware than their conventional counterparts.
随着器件规模不断扩大到纳米级,电路的可靠性将继续成为一个更大的问题。随机计算使用随机比特(随机比特流)进行计算,可以使用那些不可靠的设备实现可靠的计算。然而,随机计算的一个主要问题是,用这种技术实现的应用程序受到可用计算元素的限制。本文首先介绍并证明了一个随机绝对值函数。其次,我们将演示随机tanh函数的数学分析,这是随机比较器中使用的关键组件。第三,我们将给出一个单参数线性增益函数的定量分析,并提出一个新的双参数版本。通过四种基本的数字图像处理算法:边缘检测、基于帧差的图像分割、基于中值滤波的降噪和图像对比度拉伸,证明了随机计算元素的有效性。我们的实验结果表明,随机实现比传统实现容忍更多的噪声和消耗更少的硬件。
{"title":"Using stochastic computing to implement digital image processing algorithms","authors":"Peng Li, D. Lilja","doi":"10.1109/ICCD.2011.6081391","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081391","url":null,"abstract":"As device scaling continues to nanoscale dimensions, circuit reliability will continue to become an ever greater problem. Stochastic computing, which performs computing with random bits (stochastic bits streams), can be used to enable reliable computation using those unreliable devices. However, one of the major issues of stochastic computing is that applications implemented with this technique are limited by the available computational elements. In this paper, first we will introduce and prove a stochastic absolute value function. Second, we will demonstrate a mathematical analysis of a stochastic tanh function, which is a key component used in a stochastic comparator. Third, we will present a quantitative analysis of a one-parameter linear gain function, and propose a new two-parameter version. The validity of the present stochastic computational elements is demonstrated through four basic digital image processing algorithms: edge detection, frame difference based image segmentation, median filter based noise reduction, and image contrast stretching. Our experimental results show that stochastic implementations tolerate more noise and consume less hardware than their conventional counterparts.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114754071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 145
Modeling and design of a nanoscale memory cell for hardening to a single event with multiple node upset 多节点扰动下单事件硬化纳米级存储单元建模与设计
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081418
Sheng Lin, Yong-Bin Kim, F. Lombardi
The occurrence of a multiple node upset is likely to increase significantly in nanoscale CMOS due to reduced device size and power supply voltage scaling. This paper presents a comprehensive treatment (model, analysis and design) for hardening a memory cell against a soft error resulting in a multiple node upset at 32nm feature size in CMOS. A novel 13T memory cell configuration is proposed, analyzed, and simulated to show a better tolerance to the likely multiple node upset, i.e. a transient or soft fault affecting any two nodes in a cell. The proposed hardened memory cell utilizes a Schmitt trigger design; simulation shows that the multiple node upset tolerance is improved by nearly twice as much over existing designs. Moreover the 13T cell achieves a 33% reduction in write delay and only a 5% increase in power consumption compared to the DICE cell (consisting of 12 transistors). Simulation results are provided using the predictive technology file for 32nm feature size in CMOS. Monte Carlo simulation confirms the excellent multiple node upset tolerance of the proposed memory cell in the presence of process, voltage, and temperature variations in their designs.
在纳米级CMOS中,由于器件尺寸减小和电源电压缩放,多节点翻转的发生可能会显著增加。本文提出了一种综合处理方法(模型、分析和设计),用于硬化存储单元,以防止在CMOS中32nm特征尺寸下导致多节点破坏的软错误。提出了一种新的13T存储单元配置,分析并模拟了它对可能的多节点故障(即影响单元中任意两个节点的瞬态或软故障)有更好的容忍度。所提出的硬化存储单元采用施密特触发器设计;仿真结果表明,与现有设计相比,多节点扰动容忍度提高了近两倍。此外,与DICE单元(由12个晶体管组成)相比,13T单元的写入延迟减少了33%,功耗仅增加了5%。利用CMOS中32nm特征尺寸的预测技术文件,给出了仿真结果。蒙特卡罗模拟证实了所提出的存储单元在其设计中存在工艺,电压和温度变化的情况下具有出色的多节点干扰容忍度。
{"title":"Modeling and design of a nanoscale memory cell for hardening to a single event with multiple node upset","authors":"Sheng Lin, Yong-Bin Kim, F. Lombardi","doi":"10.1109/ICCD.2011.6081418","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081418","url":null,"abstract":"The occurrence of a multiple node upset is likely to increase significantly in nanoscale CMOS due to reduced device size and power supply voltage scaling. This paper presents a comprehensive treatment (model, analysis and design) for hardening a memory cell against a soft error resulting in a multiple node upset at 32nm feature size in CMOS. A novel 13T memory cell configuration is proposed, analyzed, and simulated to show a better tolerance to the likely multiple node upset, i.e. a transient or soft fault affecting any two nodes in a cell. The proposed hardened memory cell utilizes a Schmitt trigger design; simulation shows that the multiple node upset tolerance is improved by nearly twice as much over existing designs. Moreover the 13T cell achieves a 33% reduction in write delay and only a 5% increase in power consumption compared to the DICE cell (consisting of 12 transistors). Simulation results are provided using the predictive technology file for 32nm feature size in CMOS. Monte Carlo simulation confirms the excellent multiple node upset tolerance of the proposed memory cell in the presence of process, voltage, and temperature variations in their designs.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114425806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Analysis of on-chip interconnection network interface reliability in multicore systems 多核系统片上互连网络接口可靠性分析
Pub Date : 2011-10-09 DOI: 10.1109/ICCD.2011.6081433
Yong Zou, Yi Xiang, S. Pasricha
In Networks-on-Chip (NoC), with ever-increasing complexity and technology scaling, transient single-event upsets (SEUs) have become a key design challenge. In this work, we extend the concept of architectural vulnerability factor (AVF) from the microprocessor domain and propose a network vulnerability factor (NVF) to characterize the susceptibility of NoC components such as the Network Interface (NI) to transient faults. Our studies reveal that different NI buffers behave quite differently on transient faults and each buffer can have different levels of inherent fault-tolerant capability. Our analysis also considers the impact of thermal hotspot mitigation techniques such as frequency throttling on the NVF estimation.
在片上网络(NoC)中,随着复杂性和技术规模的不断增加,瞬态单事件干扰(seu)已成为一个关键的设计挑战。在这项工作中,我们从微处理器领域扩展了架构脆弱性因子(AVF)的概念,并提出了网络脆弱性因子(NVF)来表征NoC组件(如网络接口(NI))对瞬态故障的敏感性。我们的研究表明,不同的NI缓冲区对瞬态故障的行为完全不同,每个缓冲区可以具有不同级别的固有容错能力。我们的分析还考虑了热热点缓解技术(如频率节流)对NVF估计的影响。
{"title":"Analysis of on-chip interconnection network interface reliability in multicore systems","authors":"Yong Zou, Yi Xiang, S. Pasricha","doi":"10.1109/ICCD.2011.6081433","DOIUrl":"https://doi.org/10.1109/ICCD.2011.6081433","url":null,"abstract":"In Networks-on-Chip (NoC), with ever-increasing complexity and technology scaling, transient single-event upsets (SEUs) have become a key design challenge. In this work, we extend the concept of architectural vulnerability factor (AVF) from the microprocessor domain and propose a network vulnerability factor (NVF) to characterize the susceptibility of NoC components such as the Network Interface (NI) to transient faults. Our studies reveal that different NI buffers behave quite differently on transient faults and each buffer can have different levels of inherent fault-tolerant capability. Our analysis also considers the impact of thermal hotspot mitigation techniques such as frequency throttling on the NVF estimation.","PeriodicalId":354015,"journal":{"name":"2011 IEEE 29th International Conference on Computer Design (ICCD)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124158953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2011 IEEE 29th International Conference on Computer Design (ICCD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1