首页 > 最新文献

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)最新文献

英文 中文
GraphRSim: A Joint Device-Algorithm Reliability Analysis for ReRAM-based Graph Processing GraphRSim:基于reram的图形处理的联合设备-算法可靠性分析
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116232
Chin-Fu Nien, Yi-Jou Hsiao, Hsiang-Yun Cheng, Cheng-Yu Wen, Ya-Cheng Ko, Chen-Ching Lin
Graph processing has attracted a lot of interests in recent years as it plays a key role to analyze huge datasets. ReRAM-based accelerators provide a promising solution to accelerate graph processing. However, the intrinsic stochastic behavior of ReRAM devices makes its computation results unreliable. In this paper, we build a simulation platform to analyze the impact of non-ideal ReRAM devices on the error rates of various graph algorithms. We show that the characteristic of the targeted graph algorithm and the type of ReRAM computations employed greatly affect the error rates. Using representative graph algorithms as case studies, we demonstrate that our simulation platform can guide chip designers to select better design options and develop new techniques to improve reliability.
图处理在分析海量数据集方面发挥着关键作用,近年来引起了人们的广泛关注。基于rerram的加速器为加速图形处理提供了一个很有前途的解决方案。然而,ReRAM器件固有的随机特性使其计算结果不可靠。在本文中,我们建立了一个仿真平台来分析非理想的ReRAM器件对各种图算法错误率的影响。结果表明,目标图算法的特性和所采用的ReRAM计算类型对错误率有很大影响。使用代表性的图形算法作为案例研究,我们证明了我们的仿真平台可以指导芯片设计人员选择更好的设计方案,并开发新技术来提高可靠性。
{"title":"GraphRSim: A Joint Device-Algorithm Reliability Analysis for ReRAM-based Graph Processing","authors":"Chin-Fu Nien, Yi-Jou Hsiao, Hsiang-Yun Cheng, Cheng-Yu Wen, Ya-Cheng Ko, Chen-Ching Lin","doi":"10.23919/DATE48585.2020.9116232","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116232","url":null,"abstract":"Graph processing has attracted a lot of interests in recent years as it plays a key role to analyze huge datasets. ReRAM-based accelerators provide a promising solution to accelerate graph processing. However, the intrinsic stochastic behavior of ReRAM devices makes its computation results unreliable. In this paper, we build a simulation platform to analyze the impact of non-ideal ReRAM devices on the error rates of various graph algorithms. We show that the characteristic of the targeted graph algorithm and the type of ReRAM computations employed greatly affect the error rates. Using representative graph algorithms as case studies, we demonstrate that our simulation platform can guide chip designers to select better design options and develop new techniques to improve reliability.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128487782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient Optical Power Delivery System for Hybrid Electronic-Photonic Manycore Processors 电子-光子混合多核处理器的高效光功率传输系统
Pub Date : 2020-03-01 DOI: 10.23919/date48585.2020.9116328
Shixi Chen, Jiang Xu, Xuanqi Chen, Zhifei Wang, Jun Feng, Jiaxu Zhang, Zhongyuan Tian, Xiao Li
A lot of efforts have been devoted to optically enabled high-performance communication infrastructures for future manycore processors. Silicon photonic network promises high bandwidth, high energy efficiency and low latency. However, the ever-increasing network complexity results in high optical power demands, which stress the optical power delivery and affect delivery efficiency. Facing these challenges, we propose Ring-based Optical Active Delivery (ROAD) system, to effectively manage and efficiently deliver high optical power throughout photonic-electronic hybrid systems. Experimental results demonstrate up to 5.49X energy efficiency improvement compared to traditional design without affecting processor performance.
为未来多核处理器的光学高性能通信基础设施已经投入了大量的努力。硅光子网络具有高带宽、高能效和低延迟的特点。但是,随着网络复杂度的不断提高,对光功率的需求也越来越大,这给光功率的传输带来了压力,影响了传输效率。面对这些挑战,我们提出了基于环形的光主动传输(ROAD)系统,以有效地管理和高效地在光电子混合系统中提供高光功率。实验结果表明,与传统设计相比,在不影响处理器性能的情况下,能效提高了5.49倍。
{"title":"Efficient Optical Power Delivery System for Hybrid Electronic-Photonic Manycore Processors","authors":"Shixi Chen, Jiang Xu, Xuanqi Chen, Zhifei Wang, Jun Feng, Jiaxu Zhang, Zhongyuan Tian, Xiao Li","doi":"10.23919/date48585.2020.9116328","DOIUrl":"https://doi.org/10.23919/date48585.2020.9116328","url":null,"abstract":"A lot of efforts have been devoted to optically enabled high-performance communication infrastructures for future manycore processors. Silicon photonic network promises high bandwidth, high energy efficiency and low latency. However, the ever-increasing network complexity results in high optical power demands, which stress the optical power delivery and affect delivery efficiency. Facing these challenges, we propose Ring-based Optical Active Delivery (ROAD) system, to effectively manage and efficiently deliver high optical power throughout photonic-electronic hybrid systems. Experimental results demonstrate up to 5.49X energy efficiency improvement compared to traditional design without affecting processor performance.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"30 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128186543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PCM: Precision-Controlled Memory System for Energy Efficient Deep Neural Network Training 用于高能效深度神经网络训练的精确控制记忆系统
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116530
Boyeal Kim, Sang Hyun Lee, Hyun Kim, Duy-Thanh Nguyen, Minh-Son Le, I. Chang, Do-Wan Kwon, J. Yoo, J. Choi, Hyuk-Jae Lee
Deep neural network (DNN) training suffers from the significant energy consumption in memory system, and most existing energy reduction techniques for memory system have focused on introducing low precision that is compatible with computing unit (e.g., FP16, FP8). These researches have shown that even in learning the networks with FP16 data precision, it is possible to provide training accuracy as good as FP32, de facto standard of the DNN training. However, our extensive experiments show that we can further reduce the data precision while maintaining the training accuracy of DNNs, which can be obtained by truncating some least significant bits (LSBs) of FP16, named as hard approximation. Nevertheless, the existing hard-ware structures for DNN training cannot efficiently support such low precision. In this work, we propose a novel memory system architecture for GPUs, named as precision-controlled memory system (PCM), which allows for flexible management at the level of hard approximation. PCM provides high DRAM bandwidth by distributing each precision to different channels with as transposed data mapping on DRAM. In addition, PCM supports fine-grained hard approximation in the L1 data cache using software-controlled registers, which can reduce data movement and thereby improve energy saving and system performance. Furthermore, PCM facilitates the reduction of data maintenance energy, which accounts for a considerable portion of memory energy consumption, by controlling refresh period of DRAM. The experimental results show that in training CIFAR-100 dataset on Resnet-20 with precision tuning, PCM achieves energy saving and performance enhancement by 66% and 20%, respectively, without loss of accuracy.
深度神经网络(Deep neural network, DNN)训练存在存储系统能耗大的问题,现有的存储系统能耗降低技术主要集中在引入与计算单元(如FP16、FP8)兼容的低精度。这些研究表明,即使在学习具有FP16数据精度的网络时,也有可能提供与DNN训练事实上的标准FP32一样好的训练精度。然而,我们的大量实验表明,我们可以进一步降低数据精度,同时保持dnn的训练精度,这可以通过截断FP16的一些最低有效位(LSBs)来获得,称为硬近似。然而,现有的DNN训练硬件结构无法有效支持如此低的精度。在这项工作中,我们提出了一种新的gpu存储系统架构,称为精确控制存储系统(PCM),它允许在硬近似水平上灵活管理。PCM通过将每个精度分配到不同的通道,并在DRAM上进行转置数据映射,从而提供高DRAM带宽。此外,PCM使用软件控制的寄存器在L1数据缓存中支持细粒度的硬近似,这可以减少数据移动,从而提高节能和系统性能。此外,PCM通过控制DRAM的刷新周期,有利于减少占内存能耗相当大一部分的数据维护能量。实验结果表明,在Resnet-20上对CIFAR-100数据集进行精确调优训练时,PCM在不损失精度的情况下,分别实现了66%和20%的节能和性能提升。
{"title":"PCM: Precision-Controlled Memory System for Energy Efficient Deep Neural Network Training","authors":"Boyeal Kim, Sang Hyun Lee, Hyun Kim, Duy-Thanh Nguyen, Minh-Son Le, I. Chang, Do-Wan Kwon, J. Yoo, J. Choi, Hyuk-Jae Lee","doi":"10.23919/DATE48585.2020.9116530","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116530","url":null,"abstract":"Deep neural network (DNN) training suffers from the significant energy consumption in memory system, and most existing energy reduction techniques for memory system have focused on introducing low precision that is compatible with computing unit (e.g., FP16, FP8). These researches have shown that even in learning the networks with FP16 data precision, it is possible to provide training accuracy as good as FP32, de facto standard of the DNN training. However, our extensive experiments show that we can further reduce the data precision while maintaining the training accuracy of DNNs, which can be obtained by truncating some least significant bits (LSBs) of FP16, named as hard approximation. Nevertheless, the existing hard-ware structures for DNN training cannot efficiently support such low precision. In this work, we propose a novel memory system architecture for GPUs, named as precision-controlled memory system (PCM), which allows for flexible management at the level of hard approximation. PCM provides high DRAM bandwidth by distributing each precision to different channels with as transposed data mapping on DRAM. In addition, PCM supports fine-grained hard approximation in the L1 data cache using software-controlled registers, which can reduce data movement and thereby improve energy saving and system performance. Furthermore, PCM facilitates the reduction of data maintenance energy, which accounts for a considerable portion of memory energy consumption, by controlling refresh period of DRAM. The experimental results show that in training CIFAR-100 dataset on Resnet-20 with precision tuning, PCM achieves energy saving and performance enhancement by 66% and 20%, respectively, without loss of accuracy.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129578986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Fast and Accurate DRAM Simulation: Can we Further Accelerate it? 快速准确的DRAM仿真:能否进一步加速?
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116275
Johannes Feldmann, Kira Kraft, Lukas Steiner, N. Wehn, Matthias Jung
The simulation of Dynamic Random Access Memories (DRAMs) in a system context requires highly accurate models due to the complex timing and power behavior of DRAMs. However, cycle accurate DRAM models often become the bottleneck regarding the overall simulation time. Therefore, fast but accurate DRAM simulation models are mandatory. This paper proposes two new performance optimized DRAM models that further accelerate the simulation speed with only a negligible degradation in accuracy. The first model is an enhanced Transaction Level Model (TLM), which uses a look-up table to accelerate parts of the simulation that feature a high memory access density for online scenarios. The second model is a neural network based simulator for offline trace analysis. We show a mathematical methodology to generate the inputs for the Look-Up Table (LUT) and an optimized artificial training set for the neural network. The enhanced TLM model is up to 5 times faster compared to a state-of-the-art TLM DRAM simulator. The neural network is able to speed up the simulation up to a factor of 10×, while inferring on a GPU. Both solutions provide only a slight decrease in accuracy of approximately 5%.
动态随机存取存储器(dram)在系统环境中的仿真需要高度精确的模型,因为dram的时序和功率行为非常复杂。然而,周期精确的DRAM模型往往成为整体仿真时间的瓶颈。因此,快速而准确的DRAM仿真模型是必不可少的。本文提出了两种新的性能优化的DRAM模型,进一步加快了仿真速度,而精度的下降可以忽略不计。第一个模型是一个增强的事务级模型(TLM),它使用一个查找表来加速部分模拟,这些模拟以在线场景的高内存访问密度为特征。第二个模型是一个基于神经网络的离线跟踪分析模拟器。我们展示了一种数学方法来生成查找表(LUT)的输入和神经网络的优化人工训练集。增强的TLM模型比最先进的TLM DRAM模拟器快5倍。当在GPU上进行推理时,神经网络能够将模拟速度提高到10倍。这两种解决方案的精度都只有大约5%的轻微下降。
{"title":"Fast and Accurate DRAM Simulation: Can we Further Accelerate it?","authors":"Johannes Feldmann, Kira Kraft, Lukas Steiner, N. Wehn, Matthias Jung","doi":"10.23919/DATE48585.2020.9116275","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116275","url":null,"abstract":"The simulation of Dynamic Random Access Memories (DRAMs) in a system context requires highly accurate models due to the complex timing and power behavior of DRAMs. However, cycle accurate DRAM models often become the bottleneck regarding the overall simulation time. Therefore, fast but accurate DRAM simulation models are mandatory. This paper proposes two new performance optimized DRAM models that further accelerate the simulation speed with only a negligible degradation in accuracy. The first model is an enhanced Transaction Level Model (TLM), which uses a look-up table to accelerate parts of the simulation that feature a high memory access density for online scenarios. The second model is a neural network based simulator for offline trace analysis. We show a mathematical methodology to generate the inputs for the Look-Up Table (LUT) and an optimized artificial training set for the neural network. The enhanced TLM model is up to 5 times faster compared to a state-of-the-art TLM DRAM simulator. The neural network is able to speed up the simulation up to a factor of 10×, while inferring on a GPU. Both solutions provide only a slight decrease in accuracy of approximately 5%.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129830967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
ABC: Abstract prediction Before Concreteness ABC:抽象预测先于具体
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116479
Jung-Eun Kim, Richard M. Bradford, Man-Ki Yoon, Zhong Shao
Learning techniques are advancing the utility and capability of modern embedded systems. However, the challenge of incorporating learning modules into embedded systems is that computing resources are scarce. For such a resource-constrained environment, we have developed a framework for learning abstract information early and learning more concretely as time allows. The intermediate results can be utilized to prepare for early decisions/actions as needed. To apply this framework to a classification task, the datasets are categorized in an abstraction hierarchy. Then the framework classifies intermediate labels from the most abstract level to the most concrete. Our proposed method outperforms the existing approaches and reference baselines in terms of accuracy. We show our framework with different architectures and on various benchmark datasets CIFAR-10, CIFAR-100, and GTSRB. We measure prediction times on GPUequipped embedded computing platforms as well.
学习技术正在提高现代嵌入式系统的实用性和性能。然而,将学习模块整合到嵌入式系统的挑战在于计算资源是稀缺的。对于这样一个资源受限的环境,我们已经开发了一个框架,用于早期学习抽象信息,并在时间允许的情况下学习更具体的信息。中间结果可用于根据需要为早期决策/行动做准备。为了将此框架应用于分类任务,数据集在抽象层次结构中进行分类。然后,该框架将中间标签从最抽象的层次分类到最具体的层次。我们提出的方法在精度方面优于现有的方法和参考基线。我们在不同的架构和不同的基准数据集CIFAR-10、CIFAR-100和GTSRB上展示了我们的框架。我们还在配备gpu的嵌入式计算平台上测量预测时间。
{"title":"ABC: Abstract prediction Before Concreteness","authors":"Jung-Eun Kim, Richard M. Bradford, Man-Ki Yoon, Zhong Shao","doi":"10.23919/DATE48585.2020.9116479","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116479","url":null,"abstract":"Learning techniques are advancing the utility and capability of modern embedded systems. However, the challenge of incorporating learning modules into embedded systems is that computing resources are scarce. For such a resource-constrained environment, we have developed a framework for learning abstract information early and learning more concretely as time allows. The intermediate results can be utilized to prepare for early decisions/actions as needed. To apply this framework to a classification task, the datasets are categorized in an abstraction hierarchy. Then the framework classifies intermediate labels from the most abstract level to the most concrete. Our proposed method outperforms the existing approaches and reference baselines in terms of accuracy. We show our framework with different architectures and on various benchmark datasets CIFAR-10, CIFAR-100, and GTSRB. We measure prediction times on GPUequipped embedded computing platforms as well.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127213566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Blockchain Technology Enabled Pay Per Use Licensing Approach for Hardware IPs 区块链技术支持硬件ip的按次付费许可方法
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116526
Krishnendu Guha, Debasri Saha, A. Chakrabarti
The present era is witnessing a reuse of hardware IPs to reduce cost. As trustworthiness is an essential factor, designers prefer to use hardware IPs which performed effectively in the past, but at the same time, are still active and did not age. In such scenarios, pay per use licensing schemes suit best for both producers and users. Existing pay per use licensing mechanisms consider a centralized third party, which may not be trustworthy. Hence, we seek refuge to blockchain technology to eradicate such third parties and facilitate a transparent and automated pay per use licensing mechanism. A blockchain is a distributed public ledger whose records are added based on peer review and majority consensus of its participants, that cannot be tampered or modified later. Smart contracts are deployed to facilitate the mechanism. Even dynamic pricing of the hardware IPs based on the factors of trustworthiness and aging have been focused in this work, which are not associated in existing literature. Security analysis of the proposed mechanism has been provided. Performance evaluation is carried based on the gas usage of Ethereum Solidity test environment, along with cost analysis based on lifetime and related user ratings.
当前的时代见证了硬件ip的重用以降低成本。由于可信度是必不可少的因素,设计师更倾向于使用过去运行有效,但同时仍然活跃且不老化的硬件ip。在这种情况下,按使用付费的许可方案最适合生产者和用户。现有的按次付费许可机制考虑的是中心化的第三方,这可能不值得信任。因此,我们寻求区块链技术的庇护,以消除此类第三方,并促进透明和自动的按次付费许可机制。区块链是一个分布式的公共分类账,其记录是根据同行评审和参与者的多数共识添加的,以后不能篡改或修改。智能合约的部署是为了促进这一机制。甚至基于可信度和老化因素的硬件ip动态定价在本工作中也得到了关注,这在现有文献中没有关联。对所提议的机制进行了安全性分析。性能评估基于以太坊稳定性测试环境的气体使用情况,以及基于寿命和相关用户评级的成本分析。
{"title":"Blockchain Technology Enabled Pay Per Use Licensing Approach for Hardware IPs","authors":"Krishnendu Guha, Debasri Saha, A. Chakrabarti","doi":"10.23919/DATE48585.2020.9116526","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116526","url":null,"abstract":"The present era is witnessing a reuse of hardware IPs to reduce cost. As trustworthiness is an essential factor, designers prefer to use hardware IPs which performed effectively in the past, but at the same time, are still active and did not age. In such scenarios, pay per use licensing schemes suit best for both producers and users. Existing pay per use licensing mechanisms consider a centralized third party, which may not be trustworthy. Hence, we seek refuge to blockchain technology to eradicate such third parties and facilitate a transparent and automated pay per use licensing mechanism. A blockchain is a distributed public ledger whose records are added based on peer review and majority consensus of its participants, that cannot be tampered or modified later. Smart contracts are deployed to facilitate the mechanism. Even dynamic pricing of the hardware IPs based on the factors of trustworthiness and aging have been focused in this work, which are not associated in existing literature. Security analysis of the proposed mechanism has been provided. Performance evaluation is carried based on the gas usage of Ethereum Solidity test environment, along with cost analysis based on lifetime and related user ratings.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127334198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
CPS-oriented Modeling and Control of Traffic Signals Using Adaptive Back Pressure 基于cps的自适应背压交通信号建模与控制
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116403
Wanli Chang, Debayan Roy, Shuai Zhao, A. Annaswamy, S. Chakraborty
Modeling and design of automotive systems from a cyber-physical system (CPS) perspective have lately attracted extensive attention. As the trend towards automated driving and connectivity accelerates, strong interactions between vehicles and the infrastructure are expected. This requires modeling and control of the traffic network in a similarly formal manner. Modeling of such networks involves a tradeoff between expressivity of the appropriate features and tractability of the control problem. Back-pressure control of traffic signals is gaining ground due to its decentralized implementation, low computational complexity, and no requirements on prior traffic information. It guarantees maximum stability under idealistic assumptions. However, when deployed in real traffic intersections, the existing back-pressure control algorithms may result in poor junction utilization due to (i) fixed-length control phases; (ii) stability as the only objective; and (iii) obliviousness to finite road capacities and empty roads. In this paper, we propose a CPS-oriented model of traffic intersections and control of traffic signals, aiming to address the utilization issue of the back-pressure algorithms. We consider a more realistic model with transition phases and dedicated turning lanes, the latter influencing computation of the pressure and subsequently the utilization. The main technical contribution is an adaptive controller that enables varying-length control phases and considers both stability and utilization, while taking both cases of full roads and empty roads into account. We implement a mechanism to prevent frequent changes of control phases and thus limit the number of transition phases, which have negative impact on the junction utilization. Microscopic simulation results with SUMO on a 3×3 traffic network under various traffic patterns show that the proposed algorithm is at least about 13% better in performance than the existing fixed-length backpressure control algorithms reported in previous works. This is a significant improvement in the context of traffic signal control.
基于信息物理系统(CPS)的汽车系统建模与设计近年来受到了广泛关注。随着自动驾驶和互联趋势的加速,车辆与基础设施之间的强烈互动有望实现。这需要以类似的正式方式对交通网络进行建模和控制。这种网络的建模涉及到适当特征的表达性和控制问题的可跟踪性之间的权衡。交通信号背压控制因其分散实现、计算复杂度低、不需要预先获取交通信息等优点而得到广泛应用。它保证了理想主义假设下的最大稳定性。然而,当部署在实际交通路口时,现有的背压控制算法可能会由于(1)固定长度的控制阶段而导致交叉口利用率低下;稳定作为唯一目标;(三)对有限的道路容量和空旷道路的遗忘。在本文中,我们提出了一个面向cps的交通路口和交通信号控制模型,旨在解决背压算法的利用问题。我们考虑了一个具有过渡阶段和专用转弯车道的更现实的模型,后者影响压力的计算和随后的利用。主要的技术贡献是一个自适应控制器,它可以实现变长控制阶段,同时考虑稳定性和利用率,同时考虑满路和空路两种情况。我们实现了一种机制来防止控制阶段的频繁变化,从而限制过渡阶段的数量,这对结的利用率有负面影响。在3×3交通网络上的SUMO微观仿真结果表明,与已有的定长背压控制算法相比,本文算法的性能至少提高了13%左右。这在交通信号控制方面是一个重大的改进。
{"title":"CPS-oriented Modeling and Control of Traffic Signals Using Adaptive Back Pressure","authors":"Wanli Chang, Debayan Roy, Shuai Zhao, A. Annaswamy, S. Chakraborty","doi":"10.23919/DATE48585.2020.9116403","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116403","url":null,"abstract":"Modeling and design of automotive systems from a cyber-physical system (CPS) perspective have lately attracted extensive attention. As the trend towards automated driving and connectivity accelerates, strong interactions between vehicles and the infrastructure are expected. This requires modeling and control of the traffic network in a similarly formal manner. Modeling of such networks involves a tradeoff between expressivity of the appropriate features and tractability of the control problem. Back-pressure control of traffic signals is gaining ground due to its decentralized implementation, low computational complexity, and no requirements on prior traffic information. It guarantees maximum stability under idealistic assumptions. However, when deployed in real traffic intersections, the existing back-pressure control algorithms may result in poor junction utilization due to (i) fixed-length control phases; (ii) stability as the only objective; and (iii) obliviousness to finite road capacities and empty roads. In this paper, we propose a CPS-oriented model of traffic intersections and control of traffic signals, aiming to address the utilization issue of the back-pressure algorithms. We consider a more realistic model with transition phases and dedicated turning lanes, the latter influencing computation of the pressure and subsequently the utilization. The main technical contribution is an adaptive controller that enables varying-length control phases and considers both stability and utilization, while taking both cases of full roads and empty roads into account. We implement a mechanism to prevent frequent changes of control phases and thus limit the number of transition phases, which have negative impact on the junction utilization. Microscopic simulation results with SUMO on a 3×3 traffic network under various traffic patterns show that the proposed algorithm is at least about 13% better in performance than the existing fixed-length backpressure control algorithms reported in previous works. This is a significant improvement in the context of traffic signal control.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129986971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Long-term Continuous Assessment of SRAM PUF and Source of Random Numbers SRAM PUF和随机数来源的长期持续评估
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116353
Rui Wang, G. Selimis, Roel Maes, Sven Goossens
The qualities of Physical Unclonable Functions (PUFs) suffer from several noticeable degradations due to silicon aging. In this paper, we investigate the long-term effects of silicon aging on PUFs derived from the start-up behavior of Static Random Access Memories (SRAM). Previous research on SRAM aging is based on transistor-level simulation or accelerated aging test at high temperature and voltage to observe aging effects within a short period of time. In contrast, we have run a long-term continuous power-up test on 16 Arduino Leonardo boards under nominal conditions for two years. In total, we collected around 175 million measurements for reliability, uniqueness and randomness evaluations. Analysis shows that the number of bits that flip with respect to the reference increased by 19.3% while min-entropy of SRAM PUF noise improves by 19.3% on average after two years of aging. The impact of aging on reliability is smaller under nominal conditions than was previously assessed by the accelerated aging test. The test we conduct in this work more closely resembles the conditions of a device in the field, and therefore we more accurately evaluate how silicon aging affects SRAM PUFs.
由于硅老化,物理不可克隆功能(puf)的质量受到几个明显的降低。在本文中,我们研究了硅老化对静态随机存取存储器(SRAM)启动行为衍生的puf的长期影响。以往对SRAM老化的研究是基于晶体管级模拟或高温高压加速老化试验,观察短时间内的老化效应。相比之下,我们在标称条件下对16块Arduino Leonardo板进行了为期两年的长期连续通电测试。我们总共收集了大约1.75亿个测量值,用于可靠性、唯一性和随机性评估。分析表明,经过两年的老化,SRAM PUF噪声的最小熵平均提高了19.3%,相对于基准的翻转比特数增加了19.3%。在名义条件下,老化对可靠性的影响比先前通过加速老化试验评估的影响要小。我们在这项工作中进行的测试更接近于现场设备的条件,因此我们更准确地评估硅老化如何影响SRAM puf。
{"title":"Long-term Continuous Assessment of SRAM PUF and Source of Random Numbers","authors":"Rui Wang, G. Selimis, Roel Maes, Sven Goossens","doi":"10.23919/DATE48585.2020.9116353","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116353","url":null,"abstract":"The qualities of Physical Unclonable Functions (PUFs) suffer from several noticeable degradations due to silicon aging. In this paper, we investigate the long-term effects of silicon aging on PUFs derived from the start-up behavior of Static Random Access Memories (SRAM). Previous research on SRAM aging is based on transistor-level simulation or accelerated aging test at high temperature and voltage to observe aging effects within a short period of time. In contrast, we have run a long-term continuous power-up test on 16 Arduino Leonardo boards under nominal conditions for two years. In total, we collected around 175 million measurements for reliability, uniqueness and randomness evaluations. Analysis shows that the number of bits that flip with respect to the reference increased by 19.3% while min-entropy of SRAM PUF noise improves by 19.3% on average after two years of aging. The impact of aging on reliability is smaller under nominal conditions than was previously assessed by the accelerated aging test. The test we conduct in this work more closely resembles the conditions of a device in the field, and therefore we more accurately evaluate how silicon aging affects SRAM PUFs.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130034081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Parallel Implementation of Iterative Learning Controllers on Multi-core Platforms 多核平台上迭代学习控制器的并行实现
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116241
Mojtaba Haghi, Yusheng Yao, Dip Goswami, K. Goossens
This paper presents design and implementation techniques for iterative learning controllers (ILCs) targeting predictable multi-core embedded platforms. Implementation on embedded platforms results in a number of timing artifacts. Sensor-to-actuator delay (referred to as delay) is an important timing artifact which influences the control performance by changing the dynamic behavior of the system. We propose a delay-based design for ILCs that identifies and operates in the performance-optimal delay region. We then propose two implementation methods – sequential and parallel – for ILCs targeting the predictable multi-core platforms. The proposed methods enable the designer to carefully adjust the scheduling to achieve the optimal delay region in the resulting control system. We validate our results by the hardware-in-the-loop (HIL) simulation, considering a motion system as a case-study. Index Terms—Embedded control, Iterative learning control, Sensor-to-actuator-delay, Predictable multi-core platform.
本文介绍了针对可预测多核嵌入式平台的迭代学习控制器(ILCs)的设计和实现技术。在嵌入式平台上的实现会产生许多计时工件。传感器到执行器的延迟(简称延迟)是一种重要的定时伪影,它通过改变系统的动态行为来影响控制性能。我们提出了一种基于延迟的ilc设计,可以识别并在性能最佳的延迟区域中工作。然后,我们提出了针对可预测的多核平台的ilc的顺序和并行两种实现方法。所提出的方法使设计人员能够仔细地调整调度,以在得到的控制系统中达到最优延迟区域。我们通过硬件在环(HIL)仿真验证了我们的结果,并以运动系统为例进行了研究。索引术语:嵌入式控制,迭代学习控制,传感器到执行器延迟,可预测的多核平台。
{"title":"Parallel Implementation of Iterative Learning Controllers on Multi-core Platforms","authors":"Mojtaba Haghi, Yusheng Yao, Dip Goswami, K. Goossens","doi":"10.23919/DATE48585.2020.9116241","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116241","url":null,"abstract":"This paper presents design and implementation techniques for iterative learning controllers (ILCs) targeting predictable multi-core embedded platforms. Implementation on embedded platforms results in a number of timing artifacts. Sensor-to-actuator delay (referred to as delay) is an important timing artifact which influences the control performance by changing the dynamic behavior of the system. We propose a delay-based design for ILCs that identifies and operates in the performance-optimal delay region. We then propose two implementation methods – sequential and parallel – for ILCs targeting the predictable multi-core platforms. The proposed methods enable the designer to carefully adjust the scheduling to achieve the optimal delay region in the resulting control system. We validate our results by the hardware-in-the-loop (HIL) simulation, considering a motion system as a case-study. Index Terms—Embedded control, Iterative learning control, Sensor-to-actuator-delay, Predictable multi-core platform.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131033558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Securing Programmable Analog ICs Against Piracy 保护可编程模拟ic免受盗版
Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116520
M. Elshamy, A. Sayed, M. Louërat, A. Rhouni, H. Aboushady, H. Stratigopoulos
In this paper, we demonstrate a security approach for the class of highly-programmable analog Integrated Circuits (ICs) that can be used as a countermeasure for unauthorized chip use and piracy. The approach relies on functionality locking, i.e. a lock mechanism is introduced into the design such that unless the correct key is provided the functionality breaks. We show that for highly-programmable analog ICs the programmable fabric can naturally be used as the lock mechanism. We demonstrate the approach on a multi-standard RF receiver with configuration settings of 64-bit words.
在本文中,我们展示了一种高度可编程模拟集成电路(ic)类的安全方法,可以用作未经授权的芯片使用和盗版的对策。该方法依赖于功能锁定,即在设计中引入锁机制,除非提供正确的密钥,否则功能将中断。我们表明,对于高度可编程的模拟ic,可编程结构可以自然地用作锁定机制。我们在具有64位字配置设置的多标准射频接收器上演示了该方法。
{"title":"Securing Programmable Analog ICs Against Piracy","authors":"M. Elshamy, A. Sayed, M. Louërat, A. Rhouni, H. Aboushady, H. Stratigopoulos","doi":"10.23919/DATE48585.2020.9116520","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116520","url":null,"abstract":"In this paper, we demonstrate a security approach for the class of highly-programmable analog Integrated Circuits (ICs) that can be used as a countermeasure for unauthorized chip use and piracy. The approach relies on functionality locking, i.e. a lock mechanism is introduced into the design such that unless the correct key is provided the functionality breaks. We show that for highly-programmable analog ICs the programmable fabric can naturally be used as the lock mechanism. We demonstrate the approach on a multi-standard RF receiver with configuration settings of 64-bit words.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130200252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1