首页 > 最新文献

2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)最新文献

英文 中文
Process Variation-Aware Analytical Modeling of Subthreshold Leakage Power 亚阈值泄漏功率过程变化感知分析建模
M. Anala, B. Harish
Leakage current is making a substantial contribution to the power dissipation in nanometer regime due to continued technology scaling. The problem is further accentuated with the increasing levels of unpredictability in process parameters. Consequently, accurate and reliable modeling of leakage current is critical for the prediction of static power, especially for ultra low power applications. In contrast to gate leakage and Band-to-Band-Tunneling (BTBT) leakage, subthreshold leakage is the most sensitive to parameter variations and hence has been considered for variability modeling. The variations in electrical and geometry parameters of the device drastically impact the sub-threshold leakage current. In this paper, a subthreshold leakage power estimation model in the presence of process variations, with Drain-Induced Barrier Lowering (DIBL) considerations, is proposed. The model focuses on the subthreshold leakage variations induced by the simultaneous effect of threshold voltage variability and variations in gate length and width. The variation in the subthreshold leakage power is characterized by using an extensive Monte Carlo analysis. In order to demonstrate the efficacy of the proposed model, the model generated distributions of a static CMOS inverter are overlaid on the SPICE generated distributions in 32 nm PTM technology. The results demonstrate that, in the presence of process variations, the proposed model offers better predictability with a mean error in the range of 0.09% to 0.45% and reduction in the standard deviation of 3.3% to 34%, resulting in tighter distributions, thereby ensuring better predictability and design robustness. Further, the proposed model is about 700X computationally faster than SPICE simulations.
由于技术的不断扩展,泄漏电流对纳米级功率损耗的影响越来越大。随着工艺参数不可预测性的增加,问题进一步加剧。因此,准确可靠的泄漏电流建模对于静态功率的预测至关重要,特别是对于超低功耗应用。与栅极泄漏和带到带隧道(BTBT)泄漏相比,亚阈值泄漏对参数变化最敏感,因此已被考虑用于变异性建模。器件的电气和几何参数的变化极大地影响了亚阈值泄漏电流。本文提出了一种考虑漏极势垒降低(DIBL)的过程变化下的亚阈值泄漏功率估计模型。该模型主要研究阈值电压变化和栅极长度和宽度变化共同作用下的亚阈泄漏变化。亚阈值泄漏功率的变化通过使用广泛的蒙特卡罗分析来表征。为了验证该模型的有效性,将静态CMOS逆变器的模型生成分布叠加在32nm PTM技术的SPICE生成分布上。结果表明,在存在过程变化的情况下,所提出的模型具有更好的可预测性,平均误差在0.09%至0.45%之间,标准差降低3.3%至34%,导致分布更紧密,从而确保了更好的可预测性和设计稳健性。此外,该模型的计算速度比SPICE模拟快约700倍。
{"title":"Process Variation-Aware Analytical Modeling of Subthreshold Leakage Power","authors":"M. Anala, B. Harish","doi":"10.1109/PATMOS.2019.8862039","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862039","url":null,"abstract":"Leakage current is making a substantial contribution to the power dissipation in nanometer regime due to continued technology scaling. The problem is further accentuated with the increasing levels of unpredictability in process parameters. Consequently, accurate and reliable modeling of leakage current is critical for the prediction of static power, especially for ultra low power applications. In contrast to gate leakage and Band-to-Band-Tunneling (BTBT) leakage, subthreshold leakage is the most sensitive to parameter variations and hence has been considered for variability modeling. The variations in electrical and geometry parameters of the device drastically impact the sub-threshold leakage current. In this paper, a subthreshold leakage power estimation model in the presence of process variations, with Drain-Induced Barrier Lowering (DIBL) considerations, is proposed. The model focuses on the subthreshold leakage variations induced by the simultaneous effect of threshold voltage variability and variations in gate length and width. The variation in the subthreshold leakage power is characterized by using an extensive Monte Carlo analysis. In order to demonstrate the efficacy of the proposed model, the model generated distributions of a static CMOS inverter are overlaid on the SPICE generated distributions in 32 nm PTM technology. The results demonstrate that, in the presence of process variations, the proposed model offers better predictability with a mean error in the range of 0.09% to 0.45% and reduction in the standard deviation of 3.3% to 34%, resulting in tighter distributions, thereby ensuring better predictability and design robustness. Further, the proposed model is about 700X computationally faster than SPICE simulations.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116782133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Graph-Based STA for Asynchronous Controllers 异步控制器的基于图的STA
N. Xiromeritis, S. Simoglou, C. Sotiriou, Nikolaos Sketopoulos
In this work, we present an Asynchronous Static Timing Analysis (ASTA) EDA methodology for cyclic, Asynchronous Control Circuits. Our methodology operates using Graph-based Analysis (GBA) principles, as conventional synchronous GBA STA, is fast, and pessimistically computes Critical Cycle(s), instead of Critical Paths, without cycle cutting. Our ASTA flow supports industrial Timing Libraries, Verilog input and multiple PVT corners. Gate timing arc delay/slew computation, input/output environment constraints, and path delay propagation, are implemented based on GBA STA principles. To perform ASTA, both gate-level netlist and a graph-based Event Model, Marked Graph (MG) or PeTri Net (PTnet), is required. The pair is used to construct the Event Timing Graph (ETG), an MG with annotated netlist extracted delays, for Event Model Transition to Transition (T2T) arcs. ETG delays are computed automatically, based on cyclic equilibrium slews, and GBA critical path identification between relevant T2T netlist gate pins. GBA T2T paths may be manually overridden. As GBA is non-functional, we illustrate a mapping between an Event Model, where choice places may be allowed, and the ETG, where places are collapsed to their corresponding timing annotated T2T arcs. The resultant ETG is live and 1-bounded, making it suitable for Period analysis using Burns Primal-Dual Algorithm. Our methodology has been successfully tested on 23 asynchronous benchmarks, and validated via timing simulation. We compare results against an industrial, synchronous STA tool with cycle cutting, and illustrate significant timing errors, when synchronous STA is used for delay annotation, as well as a 50% delta in Critical Cycle Delay.
在这项工作中,我们提出了一种用于循环异步控制电路的异步静态时序分析(ASTA) EDA方法。我们的方法使用基于图的分析(GBA)原理,与传统的同步GBA STA一样,速度快,并且悲观地计算关键周期,而不是关键路径,没有周期切割。我们的ASTA流支持工业时序库,Verilog输入和多个PVT角。栅极时序电弧延迟/摆计算、输入/输出环境约束和路径延迟传播都是基于GBA STA原理实现的。要执行ASTA,需要门级网络列表和基于图的事件模型,标记图(MG)或PeTri网(PTnet)。该对用于构建事件时序图(ETG),这是一个带有带注释的网络列表提取延迟的MG,用于事件模型过渡到过渡(T2T)弧。基于循环平衡slews和相关T2T网表门脚之间的GBA关键路径识别,自动计算ETG延迟。GBA T2T路径可能被手动覆盖。由于GBA是非功能性的,我们说明了事件模型和ETG之间的映射,事件模型允许选择位置,ETG将位置折叠到相应的时序注释的T2T弧线。由此产生的ETG是活的和1有界的,使其适合使用Burns原始对偶算法进行周期分析。我们的方法已经成功地在23个异步基准测试上进行了测试,并通过时序模拟进行了验证。我们将结果与具有周期切割的工业同步STA工具进行比较,并说明了当同步STA用于延迟注释时的显着时序错误,以及临界周期延迟的50%增量。
{"title":"Graph-Based STA for Asynchronous Controllers","authors":"N. Xiromeritis, S. Simoglou, C. Sotiriou, Nikolaos Sketopoulos","doi":"10.1109/PATMOS.2019.8862081","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862081","url":null,"abstract":"In this work, we present an Asynchronous Static Timing Analysis (ASTA) EDA methodology for cyclic, Asynchronous Control Circuits. Our methodology operates using Graph-based Analysis (GBA) principles, as conventional synchronous GBA STA, is fast, and pessimistically computes Critical Cycle(s), instead of Critical Paths, without cycle cutting. Our ASTA flow supports industrial Timing Libraries, Verilog input and multiple PVT corners. Gate timing arc delay/slew computation, input/output environment constraints, and path delay propagation, are implemented based on GBA STA principles. To perform ASTA, both gate-level netlist and a graph-based Event Model, Marked Graph (MG) or PeTri Net (PTnet), is required. The pair is used to construct the Event Timing Graph (ETG), an MG with annotated netlist extracted delays, for Event Model Transition to Transition (T2T) arcs. ETG delays are computed automatically, based on cyclic equilibrium slews, and GBA critical path identification between relevant T2T netlist gate pins. GBA T2T paths may be manually overridden. As GBA is non-functional, we illustrate a mapping between an Event Model, where choice places may be allowed, and the ETG, where places are collapsed to their corresponding timing annotated T2T arcs. The resultant ETG is live and 1-bounded, making it suitable for Period analysis using Burns Primal-Dual Algorithm. Our methodology has been successfully tested on 23 asynchronous benchmarks, and validated via timing simulation. We compare results against an industrial, synchronous STA tool with cycle cutting, and illustrate significant timing errors, when synchronous STA is used for delay annotation, as well as a 50% delta in Critical Cycle Delay.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132180141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A High-Performance Neuron for Artificial Neural Network based on Izhikevich model 基于Izhikevich模型的高性能人工神经网络神经元
Maria Sapounaki, A. Kakarountas
Neuromorphic circuits have gained a lot of interest through the last decades since they may be deployed in a large spectrum of scientific research. In this paper a hardware realization of a single neuron targeting Field Programmable Gate Arrays (FPGA) with 6 levels of pipeline is presented. The proposed circuit implements the Izhikevich’s model and is presenting better performance compared to a previous pipelined design. The proposed implementation is based on fixed-point arithmetic, allowing faster computations on values related to the membrane potential and the membrane recovery variable of the neuron. The exploitation of balanced and reduced stages of pipeline, in combination to the fixed point arithmetic, offers two significant characteristics. The circuits characteristics are higher performance up to 14%, achieving also parallel computation, better simulation of the actual operation of a neuron, while area requirements of the FPGA implementation remain low as the initial reference design. The proposed circuit is the first of its kind, in an effort to minimize area and at the same time improve performance of an artificial neuron.
在过去的几十年里,神经形态电路获得了很多的兴趣,因为它们可能被部署在广泛的科学研究中。本文介绍了一种针对6级流水线的现场可编程门阵列(FPGA)单神经元的硬件实现。所提出的电路实现了Izhikevich的模型,并且与以前的流水线设计相比具有更好的性能。提出的实现基于定点算法,允许更快地计算与膜电位和神经元的膜恢复变量相关的值。利用平衡级和简化级的管道,结合不动点算法,具有两个显著的特点。该电路的特点是性能提高了14%,还实现了并行计算,更好地模拟了神经元的实际操作,同时FPGA实现的面积要求与初始参考设计一样低。该电路是同类电路中的第一个,旨在尽量减少面积,同时提高人工神经元的性能。
{"title":"A High-Performance Neuron for Artificial Neural Network based on Izhikevich model","authors":"Maria Sapounaki, A. Kakarountas","doi":"10.1109/PATMOS.2019.8862154","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862154","url":null,"abstract":"Neuromorphic circuits have gained a lot of interest through the last decades since they may be deployed in a large spectrum of scientific research. In this paper a hardware realization of a single neuron targeting Field Programmable Gate Arrays (FPGA) with 6 levels of pipeline is presented. The proposed circuit implements the Izhikevich’s model and is presenting better performance compared to a previous pipelined design. The proposed implementation is based on fixed-point arithmetic, allowing faster computations on values related to the membrane potential and the membrane recovery variable of the neuron. The exploitation of balanced and reduced stages of pipeline, in combination to the fixed point arithmetic, offers two significant characteristics. The circuits characteristics are higher performance up to 14%, achieving also parallel computation, better simulation of the actual operation of a neuron, while area requirements of the FPGA implementation remain low as the initial reference design. The proposed circuit is the first of its kind, in an effort to minimize area and at the same time improve performance of an artificial neuron.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"351 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131230301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Stochastic Radial Basis Neural Networks 随机径向基神经网络
Fabio Galán-Prado, Alejandro Morán, J. Font-Rosselló, M. Roca, J. Rosselló
Stochastic spiking Neural Networks (SNN) is a new neural modeling oriented to include the intrinsic stochastic processes present in the brain. One of the main advantages of this kind of modeling is that they can be easily implemented in a digital circuit, thus taking advantage of this mature technology. In this paper we propose a digital design for stochastic spiking neurons oriented to high-density hardware implementation. We compare the proposal with other neural models, comparing in terms of speed, area and precision. As is shown, the circuit proposal is able to provide competitive results when comparing with other works present in the literature.
随机脉冲神经网络(SNN)是一种新的神经建模方法,旨在将大脑中存在的固有随机过程包括在内。这种建模的主要优点之一是它们可以很容易地在数字电路中实现,从而利用了这种成熟的技术。本文提出了一种面向高密度硬件实现的随机尖峰神经元的数字化设计。我们将该方法与其他神经模型进行了比较,在速度、面积和精度方面进行了比较。如图所示,与文献中的其他作品相比,电路提案能够提供有竞争力的结果。
{"title":"Stochastic Radial Basis Neural Networks","authors":"Fabio Galán-Prado, Alejandro Morán, J. Font-Rosselló, M. Roca, J. Rosselló","doi":"10.1109/PATMOS.2019.8862129","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862129","url":null,"abstract":"Stochastic spiking Neural Networks (SNN) is a new neural modeling oriented to include the intrinsic stochastic processes present in the brain. One of the main advantages of this kind of modeling is that they can be easily implemented in a digital circuit, thus taking advantage of this mature technology. In this paper we propose a digital design for stochastic spiking neurons oriented to high-density hardware implementation. We compare the proposal with other neural models, comparing in terms of speed, area and precision. As is shown, the circuit proposal is able to provide competitive results when comparing with other works present in the literature.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116180437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Involution Tool for Accurate Digital Timingand Power Analysis 用于精确数字定时和功率分析的对合工具
Daniel Öhlinger, Jürgen Maier, Matthias Függer, U. Schmid
We introduce the prototype of a digital timing simulation and power analysis tool for integrated circuit (Involution Tool) which employs the involution delay model introduced by Fugger et al. at DATE’15. Unlike the pure and inertial delay¨ models typically used in digital timing analysis tools, the involution model faithfully captures pulse propagation. The presented tool is able to quantify for the first time the accuracy of the latter by facilitating comparisons of its timing and power predictions with both SPICE-generated results and results achieved by standard timing analysis tools. It is easily customizable, both w.r.t. different instances of the involution model and different circuits, and supports automatic test case generation, including parameter sweeping. We demonstrate its capabilities by providing timing and power analysis results for three circuits in varying technologies: an inverter tree, the clock tree of an open-source processor, and a combinational circuit that involves multi-input NAND gates. It turns out that the timing and power predictions of two natural types of involution models are significantly better than the predictions obtained by standard digital simulations for the inverter tree and the clock tree. For the NAND circuit, the performance is comparable but not significantly better. Our simulations thus confirm the benefits of the involution model, but also demonstrate shortcomings for multi-input gates.
我们介绍了集成电路的数字时序仿真和功率分析工具(Involution tool)的原型,该工具采用了Fugger等人在DATE ' 15上介绍的Involution延迟模型。与数字时序分析工具中通常使用的纯延迟模型和惯性延迟模型不同,对合模型忠实地捕获脉冲传播。该工具能够通过将其时序和功率预测与spice生成的结果和标准时序分析工具获得的结果进行比较,首次量化后者的准确性。它很容易定制,既可以使用对合模型的不同实例,也可以使用不同的电路,并且支持自动生成测试用例,包括参数扫描。我们通过提供三种不同技术的电路的时序和功率分析结果来展示其功能:逆变器树、开源处理器的时钟树和涉及多输入NAND门的组合电路。结果表明,两种自然对合模型的时序和功率预测结果明显优于逆变器树和时钟树的标准数字仿真结果。对于NAND电路,性能是相当的,但没有明显更好。因此,我们的仿真证实了对合模型的优点,但也表明了多输入门的缺点。
{"title":"The Involution Tool for Accurate Digital Timingand Power Analysis","authors":"Daniel Öhlinger, Jürgen Maier, Matthias Függer, U. Schmid","doi":"10.1109/PATMOS.2019.8862165","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862165","url":null,"abstract":"We introduce the prototype of a digital timing simulation and power analysis tool for integrated circuit (Involution Tool) which employs the involution delay model introduced by Fugger et al. at DATE’15. Unlike the pure and inertial delay¨ models typically used in digital timing analysis tools, the involution model faithfully captures pulse propagation. The presented tool is able to quantify for the first time the accuracy of the latter by facilitating comparisons of its timing and power predictions with both SPICE-generated results and results achieved by standard timing analysis tools. It is easily customizable, both w.r.t. different instances of the involution model and different circuits, and supports automatic test case generation, including parameter sweeping. We demonstrate its capabilities by providing timing and power analysis results for three circuits in varying technologies: an inverter tree, the clock tree of an open-source processor, and a combinational circuit that involves multi-input NAND gates. It turns out that the timing and power predictions of two natural types of involution models are significantly better than the predictions obtained by standard digital simulations for the inverter tree and the clock tree. For the NAND circuit, the performance is comparable but not significantly better. Our simulations thus confirm the benefits of the involution model, but also demonstrate shortcomings for multi-input gates.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128234829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
PATMOS 2019 Technical Papers PATMOS 2019技术论文
{"title":"PATMOS 2019 Technical Papers","authors":"","doi":"10.1109/patmos.2019.8862116","DOIUrl":"https://doi.org/10.1109/patmos.2019.8862116","url":null,"abstract":"","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127422626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulation-based Verification of the Youngest-First Round-Robin Core Gating Pattern 基于仿真的最年轻优先轮询核心门控模式验证
A. Simevski, M. Krstic
Integrated circuit aging becomes a major concern with technology downscaling. Long-life systems therefore require better mechanisms for improving their lifetime. Here we present an implementation of the Youngest-First Round-Robin (YFRR) core gating pattern as a mean for reduction of aging in a four core multiprocessor. The pattern is optimal in respect to achieving the maximal possible system lifetime and significantly outperforms the simple Round-Robin (RR) pattern. For the purposes of simulation, a Verilog model of the circuit aging is developed and integrated in the "all-digital" simulation environment. The relative wear-out of the cores is obtained by using aging monitors which direct the core selection process of the YFRR pattern. The results confirm that YFRR excels when the initial age of the cores is uneven. Even greater than 32% increase in lifetime is obtained, predicted by the theoretical model which is based on a Weibul distribution of the lifetime reliability function. Here, we further find out that YFRR is far better than RR when the aging rate is high.
集成电路老化成为技术小型化的主要问题。因此,长寿命系统需要更好的机制来改善其寿命。在这里,我们提出了一种实现最年轻优先轮询(YFRR)核心门控模式,作为减少四核多处理器老化的平均方法。该模式在实现最大可能的系统生命周期方面是最优的,并且显著优于简单的轮询(RR)模式。为了仿真的目的,开发了电路老化的Verilog模型,并将其集成到“全数字”仿真环境中。利用老化监测仪获得了芯的相对磨损量,从而指导了YFRR模式芯的选择过程。结果表明,当岩心的初始年龄不均匀时,YFRR表现优异。根据基于寿命可靠性函数威布尔分布的理论模型预测,寿命的提高幅度甚至超过32%。这里,我们进一步发现,当老化率较高时,YFRR远优于RR。
{"title":"Simulation-based Verification of the Youngest-First Round-Robin Core Gating Pattern","authors":"A. Simevski, M. Krstic","doi":"10.1109/PATMOS.2019.8862045","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862045","url":null,"abstract":"Integrated circuit aging becomes a major concern with technology downscaling. Long-life systems therefore require better mechanisms for improving their lifetime. Here we present an implementation of the Youngest-First Round-Robin (YFRR) core gating pattern as a mean for reduction of aging in a four core multiprocessor. The pattern is optimal in respect to achieving the maximal possible system lifetime and significantly outperforms the simple Round-Robin (RR) pattern. For the purposes of simulation, a Verilog model of the circuit aging is developed and integrated in the \"all-digital\" simulation environment. The relative wear-out of the cores is obtained by using aging monitors which direct the core selection process of the YFRR pattern. The results confirm that YFRR excels when the initial age of the cores is uneven. Even greater than 32% increase in lifetime is obtained, predicted by the theoretical model which is based on a Weibul distribution of the lifetime reliability function. Here, we further find out that YFRR is far better than RR when the aging rate is high.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122700235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Morphological Reservoir Computing Hardware 形态库计算硬件
Fabio Galán-Prado, J. Font-Rosselló, J. Rosselló
In the recent years, Reservoir Computing arises as an emerging machine-learning technique that is highly suitable for time-series processing. In this work, we propose the implementation of reservoir computing systems in hardware via morphological neurons which make use of tropical algebra concepts that allow us to reduce the area cost in the neural synapses. The main consequence of using tropical algebra is that synapses multipliers are substituted by adders, with lower hardware requirements. The proposed design is synthesized in a Field-Programmable Gate Array (FPGA) and benchmarked against a time-series prediction task. The current approach achieves significant savings in terms of power and hardware, as well as an appreciable higher precision if compared to classical reservoir systems.
近年来,油藏计算作为一种新兴的机器学习技术兴起,它非常适合于时间序列处理。在这项工作中,我们提出通过形态学神经元在硬件中实现储层计算系统,形态学神经元利用热带代数概念,使我们能够减少神经突触的面积成本。使用热带代数的主要结果是突触乘数被加法器取代,对硬件的要求更低。该设计在现场可编程门阵列(FPGA)中进行了综合,并针对时间序列预测任务进行了基准测试。与传统的储层系统相比,目前的方法在电力和硬件方面节省了大量成本,而且精度明显更高。
{"title":"Morphological Reservoir Computing Hardware","authors":"Fabio Galán-Prado, J. Font-Rosselló, J. Rosselló","doi":"10.1109/PATMOS.2019.8862100","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862100","url":null,"abstract":"In the recent years, Reservoir Computing arises as an emerging machine-learning technique that is highly suitable for time-series processing. In this work, we propose the implementation of reservoir computing systems in hardware via morphological neurons which make use of tropical algebra concepts that allow us to reduce the area cost in the neural synapses. The main consequence of using tropical algebra is that synapses multipliers are substituted by adders, with lower hardware requirements. The proposed design is synthesized in a Field-Programmable Gate Array (FPGA) and benchmarked against a time-series prediction task. The current approach achieves significant savings in terms of power and hardware, as well as an appreciable higher precision if compared to classical reservoir systems.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123143512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design of a flexible multi-source energy harvesting system for autonomously powered IoT : The PERPS project 自主供电物联网的灵活多源能量收集系统设计:PERPS项目
S. Siskos, V. Gogolou, C. Tsamis, A. Kerasidou, G. Doumenis, Konstantine Tsiapali, S. Katsikas, Andreas Sakellariou
A desired property of an autonomous system is the capability to operate and survive in unforeseen conditions. Wireless IoT (formerly Wireless Sensor Network) applications, pose a series of limitations regarding an embedded system’s power consumption and energy autonomy. The PERPS project proposes an innovative approach to energy harvesting systems, aiming to perpetual operation of WSN nodes and portable electronics. A state-of-the-art energy conversion integrated circuit (ENC IC), with real-time S/W algorithms is implemented, to allow the predictive estimation of energy availability at the system’s installation site. A multi-source input is employed combining parallel harvesters for various energy sources including (ambient) light, (micro) vibrations and (small) temperature differences, to upgrade the topology’s efficiency and versatility. In addition, the newly-introduced concept of harvesting energy via triboelectric microgenerators is studied. Ultra-low power consumption microelectronic circuitries and novel storage structure techniques are employed in order the overall architecture to present optimum energy utilization, therefore maximized efficiency. The final version of the system will be tested on a ship’s engine room thus the verification of the PERPS project operational principle will be based on real and demanding environmental conditions.
自主系统的一个理想特性是能够在不可预见的条件下运行和生存。无线物联网(以前称为无线传感器网络)应用,对嵌入式系统的功耗和能源自主性提出了一系列限制。PERPS项目提出了一种能量收集系统的创新方法,旨在WSN节点和便携式电子设备的永久运行。采用最先进的能量转换集成电路(ENC IC),采用实时S/W算法,可以预测系统安装现场的能量可用性。采用多源输入组合并联采集器,用于各种能量源,包括(环境)光、(微)振动和(小)温差,以提高拓扑结构的效率和多功能性。此外,本文还研究了通过微摩擦发电机收集能量的新概念。采用超低功耗微电子电路和新颖的存储结构技术,使整体结构具有最佳的能量利用,从而实现效率最大化。该系统的最终版本将在一艘船的机舱进行测试,因此PERPS项目操作原理的验证将基于真实和苛刻的环境条件。
{"title":"Design of a flexible multi-source energy harvesting system for autonomously powered IoT : The PERPS project","authors":"S. Siskos, V. Gogolou, C. Tsamis, A. Kerasidou, G. Doumenis, Konstantine Tsiapali, S. Katsikas, Andreas Sakellariou","doi":"10.1109/PATMOS.2019.8862078","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862078","url":null,"abstract":"A desired property of an autonomous system is the capability to operate and survive in unforeseen conditions. Wireless IoT (formerly Wireless Sensor Network) applications, pose a series of limitations regarding an embedded system’s power consumption and energy autonomy. The PERPS project proposes an innovative approach to energy harvesting systems, aiming to perpetual operation of WSN nodes and portable electronics. A state-of-the-art energy conversion integrated circuit (ENC IC), with real-time S/W algorithms is implemented, to allow the predictive estimation of energy availability at the system’s installation site. A multi-source input is employed combining parallel harvesters for various energy sources including (ambient) light, (micro) vibrations and (small) temperature differences, to upgrade the topology’s efficiency and versatility. In addition, the newly-introduced concept of harvesting energy via triboelectric microgenerators is studied. Ultra-low power consumption microelectronic circuitries and novel storage structure techniques are employed in order the overall architecture to present optimum energy utilization, therefore maximized efficiency. The final version of the system will be tested on a ship’s engine room thus the verification of the PERPS project operational principle will be based on real and demanding environmental conditions.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132506924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
High Performance Accelerator for CNN Applications CNN应用的高性能加速器
A. Kyriakos, V. Kitsakis, Alexandros Louropoulos, E. Papatheofanous, I. Patronas, D. Reisis
The continuing advancement of the neural networks based techniques led to their exploitation in many applications, such as computer vision and the natural language processing systems where they provide high accuracy results at the cost of their high computational complexity. Hardware implemented AI accelerators provide the needed performance improvement for applications in specific areas, including robotics, autonomous systems and internet of things. The current study presents an FPGA based accelerator for Convolutional Neural Networks (CNN). The CNN model is trained for the MNIST dataset and the VHDL design targets high throughput, low power while using only on chip memory. The architecture uses parallel computations at the convolutional and fully connected layers and it has a highly pipelined output layer. The architecture implementation on a Xilinx Virtex VC707 validates the results.
基于神经网络的技术的不断进步导致了它们在许多应用中的开发,例如计算机视觉和自然语言处理系统,它们以高计算复杂性为代价提供高精度的结果。硬件实现的人工智能加速器为特定领域的应用提供了所需的性能改进,包括机器人、自主系统和物联网。本研究提出了一种基于FPGA的卷积神经网络(CNN)加速器。CNN模型是针对MNIST数据集进行训练的,VHDL设计的目标是高吞吐量、低功耗,同时只使用芯片内存。该架构在卷积层和全连接层上使用并行计算,并具有高度流水线化的输出层。在Xilinx Virtex VC707上的体系结构实现验证了结果。
{"title":"High Performance Accelerator for CNN Applications","authors":"A. Kyriakos, V. Kitsakis, Alexandros Louropoulos, E. Papatheofanous, I. Patronas, D. Reisis","doi":"10.1109/PATMOS.2019.8862166","DOIUrl":"https://doi.org/10.1109/PATMOS.2019.8862166","url":null,"abstract":"The continuing advancement of the neural networks based techniques led to their exploitation in many applications, such as computer vision and the natural language processing systems where they provide high accuracy results at the cost of their high computational complexity. Hardware implemented AI accelerators provide the needed performance improvement for applications in specific areas, including robotics, autonomous systems and internet of things. The current study presents an FPGA based accelerator for Convolutional Neural Networks (CNN). The CNN model is trained for the MNIST dataset and the VHDL design targets high throughput, low power while using only on chip memory. The architecture uses parallel computations at the convolutional and fully connected layers and it has a highly pipelined output layer. The architecture implementation on a Xilinx Virtex VC707 validates the results.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131384828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1