首页 > 最新文献

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)最新文献

英文 中文
NeuADC: Neural Network-Inspired RRAM-Based Synthesizable Analog-to-Digital Conversion with Reconfigurable Quantization Support NeuADC:神经网络启发的基于随机存储器的可合成模数转换,具有可重构量化支持
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714933
Weidong Cao, Xin He, Ayan Chakrabarti, Xuan Zhang
Traditional analog-to-digital converters (ADCs) employ dedicated analog and mixed-signal (AMS) circuits and require time-consuming manual design process. They also exhibit limited reconfigurability and are unable to support diverse quantization schemes using the same circuitry. In this paper, we propose NeuADC — an automated design approach to synthesizing an analog-to-digital (A/D) interface that can approximate the desired quantization function using a neural network (NN) with a single hidden layer. Our design leverages the mixed-signal resistive random-access memory (RRAM) crossbar architecture in a novel dual-path configuration to realize basic NN operations at the circuit level and exploits smooth bit-encoding scheme to improve the training accuracy. Results obtained from SPICE simulations based on 130nm technology suggest that not only can NeuADC deliver promising performance compared to the state-of-art ADC designs across comprehensive design metrics, but also it can intrinsically support multiple reconfigurable quantization schemes using the same hardware substrate, paving the ways for future adaptable application-driven signal conversion. The robustness of NeuADC’s quantization quality under moderate RRAM resistance precision is also evaluated using SPICE simulations.
传统的模数转换器(adc)采用专用的模拟和混合信号(AMS)电路,需要耗时的人工设计过程。它们还表现出有限的可重构性,并且无法使用相同的电路支持多种量化方案。在本文中,我们提出了NeuADC -一种自动化设计方法,用于合成模拟-数字(A/D)接口,该接口可以使用具有单个隐藏层的神经网络(NN)近似所需的量化函数。我们的设计利用新的双路径配置中的混合信号电阻随机存取存储器(RRAM)交叉结构来实现电路级的基本神经网络操作,并利用平滑位编码方案来提高训练精度。基于130nm技术的SPICE仿真结果表明,与最先进的ADC设计相比,NeuADC不仅在综合设计指标上具有良好的性能,而且它可以使用相同的硬件衬底支持多种可重构量化方案,为未来自适应应用驱动的信号转换铺平了道路。通过SPICE仿真对NeuADC量化质量在中等RRAM电阻精度下的鲁棒性进行了评价。
{"title":"NeuADC: Neural Network-Inspired RRAM-Based Synthesizable Analog-to-Digital Conversion with Reconfigurable Quantization Support","authors":"Weidong Cao, Xin He, Ayan Chakrabarti, Xuan Zhang","doi":"10.23919/DATE.2019.8714933","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714933","url":null,"abstract":"Traditional analog-to-digital converters (ADCs) employ dedicated analog and mixed-signal (AMS) circuits and require time-consuming manual design process. They also exhibit limited reconfigurability and are unable to support diverse quantization schemes using the same circuitry. In this paper, we propose NeuADC — an automated design approach to synthesizing an analog-to-digital (A/D) interface that can approximate the desired quantization function using a neural network (NN) with a single hidden layer. Our design leverages the mixed-signal resistive random-access memory (RRAM) crossbar architecture in a novel dual-path configuration to realize basic NN operations at the circuit level and exploits smooth bit-encoding scheme to improve the training accuracy. Results obtained from SPICE simulations based on 130nm technology suggest that not only can NeuADC deliver promising performance compared to the state-of-art ADC designs across comprehensive design metrics, but also it can intrinsically support multiple reconfigurable quantization schemes using the same hardware substrate, paving the ways for future adaptable application-driven signal conversion. The robustness of NeuADC’s quantization quality under moderate RRAM resistance precision is also evaluated using SPICE simulations.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125573897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Model Checking is Possible to Verify Large-scale Vehicle Distributed Application Systems 模型检验是验证大规模车辆分布式应用系统的可能
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714795
Haitao Zhang, Ayang Tuo, Guoqiang Li
OSEK/VDX is a specification for vehicle-mounted systems. Currently, the specification has been widely adopted by many automotive companies to develop a distributed vehicle application system. However, the ever increasing complexity of the developed distributed application system has created a challenge for exhaustively ensuring its reliability. Model checking as an exhaustive technique has been applied to verify OSEK/VDX distributed application systems to discover subtle errors. Unfortunately, it faces a poor scalability for practical systems because the verification models derived from such systems are highly complex. This paper presents an efficient approach that addresses this problem by reducing the complexity of the verification model such that model checking can easily complete the verification.
OSEK/VDX是车载系统的规范。目前,该规范已被许多汽车公司广泛采用,用于开发分布式汽车应用系统。然而,开发的分布式应用系统的复杂性不断增加,这对彻底保证其可靠性提出了挑战。模型检查作为一种详尽的技术已被应用于验证OSEK/VDX分布式应用系统,以发现细微的错误。不幸的是,它在实际系统中面临着较差的可伸缩性,因为从此类系统派生的验证模型非常复杂。本文提出了一种有效的方法,通过降低验证模型的复杂性来解决这个问题,从而使模型检查可以很容易地完成验证。
{"title":"Model Checking is Possible to Verify Large-scale Vehicle Distributed Application Systems","authors":"Haitao Zhang, Ayang Tuo, Guoqiang Li","doi":"10.23919/DATE.2019.8714795","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714795","url":null,"abstract":"OSEK/VDX is a specification for vehicle-mounted systems. Currently, the specification has been widely adopted by many automotive companies to develop a distributed vehicle application system. However, the ever increasing complexity of the developed distributed application system has created a challenge for exhaustively ensuring its reliability. Model checking as an exhaustive technique has been applied to verify OSEK/VDX distributed application systems to discover subtle errors. Unfortunately, it faces a poor scalability for practical systems because the verification models derived from such systems are highly complex. This paper presents an efficient approach that addresses this problem by reducing the complexity of the verification model such that model checking can easily complete the verification.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126609768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
HotR: Alleviating Read/Write Interference with Hot Read Data Replication for Flash Storage HotR:缓解Flash存储热读数据复制的读写干扰
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715100
Suzhen Wu, Weiwei Zhang, Bo Mao, Hong Jiang
The read/write interference problem of flash storage remains a critical concern under workloads with a mixture of read and write requests. To significantly improve the read performance in face of read/write interference, we propose a Hot Data Replication scheme for flash storage, called HotR. HotR utilizes the asymmetric read and write performance characteristics of flash-based SSDs and outsources the popular read data to a surrogate space such as a dedicated spare flash chip or an over-provisioned space within an SSD. By servicing some conflicted read requests on the surrogate flash space, HotR can alleviate, if not entirely eliminate, the contention between the read requests and the on-going write requests. The evaluation results show that HotR improves the state-of-the-art scheme in the system performance and cost efficiency significantly. Consequently, the tail-latency of the flash-based storage systems is also reduced.
在读写请求混合的工作负载下,闪存的读/写干扰问题仍然是一个关键问题。为了显著提高读写干扰下的读取性能,我们提出了一种用于闪存的热数据复制方案,称为HotR。HotR利用了基于闪存的SSD的非对称读写性能特征,并将流行的读取数据外包给代理空间,例如专用的备用闪存芯片或SSD内的超额供应空间。通过在代理闪存空间上处理一些冲突的读请求,HotR可以缓解(如果不能完全消除的话)读请求和正在进行的写请求之间的争用。评估结果表明,HotR在系统性能和成本效率方面显著提高了现有方案。因此,基于闪存的存储系统的尾部延迟也降低了。
{"title":"HotR: Alleviating Read/Write Interference with Hot Read Data Replication for Flash Storage","authors":"Suzhen Wu, Weiwei Zhang, Bo Mao, Hong Jiang","doi":"10.23919/DATE.2019.8715100","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715100","url":null,"abstract":"The read/write interference problem of flash storage remains a critical concern under workloads with a mixture of read and write requests. To significantly improve the read performance in face of read/write interference, we propose a Hot Data Replication scheme for flash storage, called HotR. HotR utilizes the asymmetric read and write performance characteristics of flash-based SSDs and outsources the popular read data to a surrogate space such as a dedicated spare flash chip or an over-provisioned space within an SSD. By servicing some conflicted read requests on the surrogate flash space, HotR can alleviate, if not entirely eliminate, the contention between the read requests and the on-going write requests. The evaluation results show that HotR improves the state-of-the-art scheme in the system performance and cost efficiency significantly. Consequently, the tail-latency of the flash-based storage systems is also reduced.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122178539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Specifying and Evaluating Quality Metrics for Vision-based Perception Systems 指定和评估基于视觉的感知系统的质量指标
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715114
Anand Balakrishnan, Aniruddh Gopinath Puranic, Xin Qin, Adel Dokhanchi, Jyotirmoy V. Deshmukh, H. B. Amor, Georgios Fainekos
Robust perception algorithms are a vital ingredient for autonomous systems such as self-driving vehicles. Checking the correctness of perception algorithms such as those based on deep convolutional neural networks (CNN) is a formidable challenge problem. In this paper, we suggest the use of Timed Quality Temporal Logic (TQTL) as a formal language to express desirable spatio-temporal properties of a perception algorithm processing a video. While perception algorithms are traditionally tested by comparing their performance to ground truth labels, we show how TQTL can be a useful tool to determine quality of perception, and offers an alternative metric that can give useful information, even in the absence of ground truth labels. We demonstrate TQTL monitoring on two popular CNNs: YOLO and SqueezeDet, and give a comparative study of the results obtained for each architecture.
强大的感知算法是自动驾驶汽车等自动系统的重要组成部分。检测基于深度卷积神经网络(CNN)的感知算法的正确性是一个艰巨的挑战问题。在本文中,我们建议使用时间质量时间逻辑(TQTL)作为形式语言来表达处理视频的感知算法所需的时空特性。虽然传统上通过将感知算法的性能与基础真值标签进行比较来测试感知算法,但我们展示了TQTL如何成为确定感知质量的有用工具,并提供了一种替代度量,即使在没有基础真值标签的情况下也可以提供有用的信息。我们在两种流行的cnn: YOLO和SqueezeDet上演示了TQTL监控,并对每种架构获得的结果进行了比较研究。
{"title":"Specifying and Evaluating Quality Metrics for Vision-based Perception Systems","authors":"Anand Balakrishnan, Aniruddh Gopinath Puranic, Xin Qin, Adel Dokhanchi, Jyotirmoy V. Deshmukh, H. B. Amor, Georgios Fainekos","doi":"10.23919/DATE.2019.8715114","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715114","url":null,"abstract":"Robust perception algorithms are a vital ingredient for autonomous systems such as self-driving vehicles. Checking the correctness of perception algorithms such as those based on deep convolutional neural networks (CNN) is a formidable challenge problem. In this paper, we suggest the use of Timed Quality Temporal Logic (TQTL) as a formal language to express desirable spatio-temporal properties of a perception algorithm processing a video. While perception algorithms are traditionally tested by comparing their performance to ground truth labels, we show how TQTL can be a useful tool to determine quality of perception, and offers an alternative metric that can give useful information, even in the absence of ground truth labels. We demonstrate TQTL monitoring on two popular CNNs: YOLO and SqueezeDet, and give a comparative study of the results obtained for each architecture.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122457202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
RiskiM: Toward Complete Kernel Protection with Hardware Support RiskiM:通过硬件支持实现完整的内核保护
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715277
Dongil Hwang, Myonghoon Yang, Seongil Jeon, Younghan Lee, Donghyun Kwon, Y. Paek
The OS kernel is typically the assumed trusted computing base in a system. Consequently, when they try to protect the kernel, developers often build their solutions in a separate secure execution environment externally located and protected by special hardware. Due to limited visibility into the host system, the external solutions basically all entail the semantic gap problem which can be easily exploited by an adversary to circumvent them. Thus, for complete kernel protection against such adversarial exploits, previous solutions resorted to aggressive techniques that usually come with various adverse side effects, such as high performance overhead, kernel code modifications and/or excessively complicated hardware designs. In this paper, we introduce RiskiM, our new hardware-based monitoring platform to ensure kernel integrity from outside the host system. To overcome the semantic gap problem, we have devised a hardware interface architecture, called PEMI, by which RiskiM is supplied with all internal states of the host system essential for fulfilling its monitoring task to protect the kernel even in the presence of attacks exploiting the semantic gap between the host and RiskiM. To empirically validate the security strength and performance of our monitoring platform in existing systems, we have fully implemented RiskiM in a RISC-V system. Our experiments show that RiskiM succeeds in the host kernel protection by detecting even the advanced attacks which could circumvent previous solutions, yet suffering from virtually no aforementioned side effects.
操作系统内核通常是系统中假定的可信计算基础。因此,当他们试图保护内核时,开发人员通常在一个独立的安全执行环境中构建他们的解决方案,该环境位于外部,并受到特殊硬件的保护。由于对主机系统的可见性有限,外部解决方案基本上都包含语义缺口问题,这很容易被攻击者利用来绕过它们。因此,为了完整地保护内核免受此类对抗性攻击,以前的解决方案采用了侵略性技术,这些技术通常会带来各种不利的副作用,例如高性能开销、内核代码修改和/或过于复杂的硬件设计。在本文中,我们介绍了RiskiM,一个新的基于硬件的监控平台,从主机系统外部确保内核完整性。为了克服语义间隙问题,我们设计了一种称为PEMI的硬件接口体系结构,通过该体系结构,RiskiM提供了主机系统的所有内部状态,这些状态对于完成其监视任务至关重要,即使存在利用主机和RiskiM之间的语义间隙的攻击,也可以保护内核。为了在现有系统中经验验证我们的监控平台的安全强度和性能,我们在RISC-V系统中完全实现了RiskiM。我们的实验表明,RiskiM通过检测甚至可以绕过先前解决方案的高级攻击,成功地保护了主机内核,但几乎没有上述副作用。
{"title":"RiskiM: Toward Complete Kernel Protection with Hardware Support","authors":"Dongil Hwang, Myonghoon Yang, Seongil Jeon, Younghan Lee, Donghyun Kwon, Y. Paek","doi":"10.23919/DATE.2019.8715277","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715277","url":null,"abstract":"The OS kernel is typically the assumed trusted computing base in a system. Consequently, when they try to protect the kernel, developers often build their solutions in a separate secure execution environment externally located and protected by special hardware. Due to limited visibility into the host system, the external solutions basically all entail the semantic gap problem which can be easily exploited by an adversary to circumvent them. Thus, for complete kernel protection against such adversarial exploits, previous solutions resorted to aggressive techniques that usually come with various adverse side effects, such as high performance overhead, kernel code modifications and/or excessively complicated hardware designs. In this paper, we introduce RiskiM, our new hardware-based monitoring platform to ensure kernel integrity from outside the host system. To overcome the semantic gap problem, we have devised a hardware interface architecture, called PEMI, by which RiskiM is supplied with all internal states of the host system essential for fulfilling its monitoring task to protect the kernel even in the presence of attacks exploiting the semantic gap between the host and RiskiM. To empirically validate the security strength and performance of our monitoring platform in existing systems, we have fully implemented RiskiM in a RISC-V system. Our experiments show that RiskiM succeeds in the host kernel protection by detecting even the advanced attacks which could circumvent previous solutions, yet suffering from virtually no aforementioned side effects.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122483164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
TypeCNN: CNN Development Framework With Flexible Data Types TypeCNN:具有灵活数据类型的CNN开发框架
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714855
Petr Rek, L. Sekanina
The rapid progress in artificial intelligence technologies based on deep and convolutional neural networks (CNN) has led to an enormous interest in efficient implementations of neural networks in embedded devices and hardware. We present a new software framework for the development of (approximate) convolutional neural networks in which the user can define and use various data types for forward (inference) procedure, backward (training) procedure and weights. Moreover, non-standard arithmetic operations such as approximate multipliers can easily be integrated into the CNN under design. This flexibility enables to analyze the impact of chosen data types and non-standard arithmetic operations on CNN training and inference efficiency. The framework was implemented in C++ and evaluated using several case studies.
基于深度和卷积神经网络(CNN)的人工智能技术的快速发展引起了人们对在嵌入式设备和硬件中高效实现神经网络的极大兴趣。我们提出了一种用于开发(近似)卷积神经网络的新软件框架,其中用户可以定义和使用各种数据类型用于前向(推理)过程,后向(训练)过程和权重。此外,近似乘数等非标准算术运算可以很容易地集成到设计中的CNN中。这种灵活性可以分析所选数据类型和非标准算术运算对CNN训练和推理效率的影响。该框架是用c++实现的,并通过几个案例研究进行了评估。
{"title":"TypeCNN: CNN Development Framework With Flexible Data Types","authors":"Petr Rek, L. Sekanina","doi":"10.23919/DATE.2019.8714855","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714855","url":null,"abstract":"The rapid progress in artificial intelligence technologies based on deep and convolutional neural networks (CNN) has led to an enormous interest in efficient implementations of neural networks in embedded devices and hardware. We present a new software framework for the development of (approximate) convolutional neural networks in which the user can define and use various data types for forward (inference) procedure, backward (training) procedure and weights. Moreover, non-standard arithmetic operations such as approximate multipliers can easily be integrated into the CNN under design. This flexibility enables to analyze the impact of chosen data types and non-standard arithmetic operations on CNN training and inference efficiency. The framework was implemented in C++ and evaluated using several case studies.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133645725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Trading Digital Accuracy for Power in an RSSI Computation of a Sensor Network Transceiver 传感器网络收发器RSSI计算中数字精度与功率的交换
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715146
Paul Detterer, Cumhur Erdin, Majid Nabi, J. P. D. Gyvez, T. Basten, Hailong Jiao
To handle the rigid power and energy constraints in the Digital BaseBand (DBB) of Wireless Sensor Networks (WSN)s, we introduce approximate computing as a new power reduction method. The Received Signal Strength Indicator (RSSI) computation is a key element in DBB processing. We evaluate the trade-off in RSSI computation between Quality-of-Service (QoS) and power consumption through circuit-level approximation. RSSI elements are approximated in such a way that error propagation is minimized. In an industrial 40-nm CMOS technology, substantial energy savings up to 24% are achieved for every successfully transferred bit in DBB processing in a low- power listening WSN scenario.
为了解决无线传感器网络(WSN)中数字基带(DBB)的刚性功率和能量限制,我们引入近似计算作为一种新的功耗降低方法。接收信号强度指标(RSSI)的计算是DBB处理中的一个关键因素。我们通过电路级近似来评估RSSI计算中服务质量(QoS)和功耗之间的权衡。RSSI元素以这样一种最小化错误传播的方式进行近似。在工业40纳米CMOS技术中,在低功耗监听WSN场景中,DBB处理中每成功传输一个比特,可节省高达24%的大量能源。
{"title":"Trading Digital Accuracy for Power in an RSSI Computation of a Sensor Network Transceiver","authors":"Paul Detterer, Cumhur Erdin, Majid Nabi, J. P. D. Gyvez, T. Basten, Hailong Jiao","doi":"10.23919/DATE.2019.8715146","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715146","url":null,"abstract":"To handle the rigid power and energy constraints in the Digital BaseBand (DBB) of Wireless Sensor Networks (WSN)s, we introduce approximate computing as a new power reduction method. The Received Signal Strength Indicator (RSSI) computation is a key element in DBB processing. We evaluate the trade-off in RSSI computation between Quality-of-Service (QoS) and power consumption through circuit-level approximation. RSSI elements are approximated in such a way that error propagation is minimized. In an industrial 40-nm CMOS technology, substantial energy savings up to 24% are achieved for every successfully transferred bit in DBB processing in a low- power listening WSN scenario.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117350139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Energy-Efficient Convolutional Neural Networks with Deterministic Bit-Stream Processing 具有确定性比特流处理的高效卷积神经网络
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714937
S. R. Faraji, M. Najafi, Bingzhe Li, D. Lilja, K. Bazargan
Stochastic computing (SC) has been used for lowcost and low power implementation of neural networks. Inherent inaccuracy and long latency of processing random bit-streams have made prior SC-based implementations inefficient compared to conventional fixed-point designs. Random or pseudo-random bitstreams often need to be processed for a very long time to produce acceptable results. This long latency leads to a significantly higher energy consumption than binary design counterparts. Low-discrepancy sequences have been recently used for fast-converging deterministic computation with stochastic constructs. In this work, we propose a low-cost, low-latency, and energy-efficient implementation of convolutional neural networks based on low-discrepancy deterministic bit-streams. Experimental results show a significant reduction in the energy consumption compared to previous random bitstream-based implementations and to the optimized fixed-point design with no quality degradation.
随机计算(SC)已被用于低成本、低功耗的神经网络实现。与传统的定点设计相比,处理随机比特流固有的不准确性和长延迟使得先前基于sc的实现效率低下。随机或伪随机比特流通常需要处理很长时间才能产生可接受的结果。这种长延迟导致的能耗明显高于二进制设计。低差异序列最近被用于随机结构的快速收敛确定性计算。在这项工作中,我们提出了一种基于低差异确定性比特流的低成本,低延迟和节能的卷积神经网络实现。实验结果表明,与之前基于随机比特流的实现和优化的定点设计相比,能耗显著降低,且没有质量下降。
{"title":"Energy-Efficient Convolutional Neural Networks with Deterministic Bit-Stream Processing","authors":"S. R. Faraji, M. Najafi, Bingzhe Li, D. Lilja, K. Bazargan","doi":"10.23919/DATE.2019.8714937","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714937","url":null,"abstract":"Stochastic computing (SC) has been used for lowcost and low power implementation of neural networks. Inherent inaccuracy and long latency of processing random bit-streams have made prior SC-based implementations inefficient compared to conventional fixed-point designs. Random or pseudo-random bitstreams often need to be processed for a very long time to produce acceptable results. This long latency leads to a significantly higher energy consumption than binary design counterparts. Low-discrepancy sequences have been recently used for fast-converging deterministic computation with stochastic constructs. In this work, we propose a low-cost, low-latency, and energy-efficient implementation of convolutional neural networks based on low-discrepancy deterministic bit-streams. Experimental results show a significant reduction in the energy consumption compared to previous random bitstream-based implementations and to the optimized fixed-point design with no quality degradation.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115154093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Using Machine Learning for Quality Configurable Approximate Computing 使用机器学习进行高质量可配置近似计算
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714957
Mahmoud Masadeh, O. Hasan, S. Tahar
Approximate computing (AC) is a nascent energy-efficient computing paradigm for error-resilient applications. However, the quality control of AC is quite challenging due to its input-dependent nature. Existing solutions fail to address fine-grained input-dependent controlled approximation. In this paper, we propose an input-aware machine learning based approach for the quality control of AC. For illustration purposes, we use 20 configurations of 8-bit approximate multipliers. We evaluate these designs for all combinations of possible input data. Then, we use machine learning algorithms to efficiently make predictive decisions for the quality control of the target approximate application, based on experimentally collected training data. The key benefits of the proposed approach include: (1) fine-grained input-dependent approximation, (2) no missed approximation opportunities, (3) no rollback recovery overhead, (4) applicable to any approximate computation with error-tolerant components, and (5) flexibility in adapting various error metrics.
近似计算(AC)是一种用于容错应用的新兴节能计算范式。然而,由于AC的输入依赖性,其质量控制相当具有挑战性。现有的解决方案无法解决细粒度的输入依赖控制近似。在本文中,我们提出了一种基于输入感知机器学习的AC质量控制方法。为了说明目的,我们使用了20种8位近似乘法器配置。我们针对所有可能的输入数据组合来评估这些设计。然后,基于实验收集的训练数据,我们使用机器学习算法有效地为目标近似应用的质量控制做出预测决策。该方法的主要优点包括:(1)细粒度输入依赖近似,(2)不会错过近似机会,(3)没有回滚恢复开销,(4)适用于任何带有容错组件的近似计算,以及(5)适应各种错误度量的灵活性。
{"title":"Using Machine Learning for Quality Configurable Approximate Computing","authors":"Mahmoud Masadeh, O. Hasan, S. Tahar","doi":"10.23919/DATE.2019.8714957","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714957","url":null,"abstract":"Approximate computing (AC) is a nascent energy-efficient computing paradigm for error-resilient applications. However, the quality control of AC is quite challenging due to its input-dependent nature. Existing solutions fail to address fine-grained input-dependent controlled approximation. In this paper, we propose an input-aware machine learning based approach for the quality control of AC. For illustration purposes, we use 20 configurations of 8-bit approximate multipliers. We evaluate these designs for all combinations of possible input data. Then, we use machine learning algorithms to efficiently make predictive decisions for the quality control of the target approximate application, based on experimentally collected training data. The key benefits of the proposed approach include: (1) fine-grained input-dependent approximation, (2) no missed approximation opportunities, (3) no rollback recovery overhead, (4) applicable to any approximate computation with error-tolerant components, and (5) flexibility in adapting various error metrics.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116223664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Mitigating Power Supply Glitch based Fault Attacks with Fast All-Digital Clock Modulation Circuit 基于快速全数字时钟调制电路减轻电源故障攻击
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715058
Arvind Singh, Monodeep Kar, Nikhil Chawla, S. Mukhopadhyay
This paper experimentally demonstrates that an on-chip integrated fast all-digital clock modulation (F-ADCM) circuit can be used as a countermeasure against supply glitch and temperature variations-based fault injection attacks (FIA). The F-ADCM circuit modulates clock edges in presence of DC/transient supply glitches and temperature variations to ensure correct operation of the underlying cryptographic circuit. With a testchip manufactured in 130nm CMOS process, we first demonstrate an inexpensive methodology to conduct a fault attack on hardware implementation of a 128-bit advanced encryption standard (AES) engine using externally controlled supply glitches. Next, we show that with F-ADCM circuit, it is no longer possible to inject supply/temperature glitch-based faults even after 10 million encryptions across varying operating conditions. Moreover, in extreme operating conditions, the F-ADCM circuit doesn’t generate any clock edges, leading to complete failure of the AES encryption, indicating no exploitable faults are present.
实验证明了片上集成的快速全数字时钟调制(F-ADCM)电路可用于对抗基于电源故障和温度变化的故障注入攻击(FIA)。F-ADCM电路在存在DC/瞬态电源故障和温度变化的情况下调制时钟边,以确保底层加密电路的正确操作。利用130nm CMOS工艺制造的测试芯片,我们首先展示了一种廉价的方法,利用外部控制的电源故障对128位高级加密标准(AES)引擎的硬件实现进行故障攻击。接下来,我们证明了使用F-ADCM电路,即使在不同操作条件下进行1000万次加密后,也不再可能注入基于电源/温度故障的故障。此外,在极端的操作条件下,F-ADCM电路不会产生任何时钟边,导致AES加密完全失败,表明没有可利用的故障存在。
{"title":"Mitigating Power Supply Glitch based Fault Attacks with Fast All-Digital Clock Modulation Circuit","authors":"Arvind Singh, Monodeep Kar, Nikhil Chawla, S. Mukhopadhyay","doi":"10.23919/DATE.2019.8715058","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715058","url":null,"abstract":"This paper experimentally demonstrates that an on-chip integrated fast all-digital clock modulation (F-ADCM) circuit can be used as a countermeasure against supply glitch and temperature variations-based fault injection attacks (FIA). The F-ADCM circuit modulates clock edges in presence of DC/transient supply glitches and temperature variations to ensure correct operation of the underlying cryptographic circuit. With a testchip manufactured in 130nm CMOS process, we first demonstrate an inexpensive methodology to conduct a fault attack on hardware implementation of a 128-bit advanced encryption standard (AES) engine using externally controlled supply glitches. Next, we show that with F-ADCM circuit, it is no longer possible to inject supply/temperature glitch-based faults even after 10 million encryptions across varying operating conditions. Moreover, in extreme operating conditions, the F-ADCM circuit doesn’t generate any clock edges, leading to complete failure of the AES encryption, indicating no exploitable faults are present.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114656049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1