2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)最新文献

英文中文

Production Recipe Validation through Formalization and Digital Twin Generation 通过形式化和数字孪生生成来验证生产配方

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116343

Stefano Spellini, Roberta Chirico, M. Panato, M. Lora, F. Fummi

The advent of Industry 4.0 is making production processes every day more complicated. As such, early process validation is becoming crucial to avoid production errors thus decreasing costs. In this paper, we present an approach to validate production recipes. Initially, the recipe is specified according to the ISA-95 standard, while the production plant is described using AutomationML. These specifications are formalized into a hierarchy of assume-guarantee contracts. Each contract specifies a set of temporal behaviors, characterizing the different machines composing the production line, their actions and interaction. Then, the formal specifications provided by the contracts are systematically synthesized to automatically generate a digital twin for the production line. Finally, the digital twin is used to evaluate, and validate, both the functional and the extra-functional characteristics of the system.The methodology has been applied to validate the production of a product requiring additive manufacturing, robotic assembling and transportation.

工业4.0的出现使得每天的生产过程变得更加复杂。因此，早期工艺验证对于避免生产错误从而降低成本变得至关重要。在本文中，我们提出了一种验证生产配方的方法。最初，配方是根据ISA-95标准指定的，而生产工厂是使用AutomationML描述的。这些规范被形式化为假设-担保契约的层次结构。每个契约指定了一组临时行为，描述了组成生产线的不同机器、它们的动作和交互。然后，系统地综合合同提供的正式规范，自动生成生产线的数字孪生。最后，使用数字孪生来评估和验证系统的功能和功能外特性。该方法已被应用于验证需要增材制造、机器人组装和运输的产品的生产。

引用次数: 2

High-Density, Low-Power Voltage-Control Spin Orbit Torque Memory with Synchronous Two-Step Write and Symmetric Read Techniques 采用同步两步写入和对称读取技术的高密度、低功耗电压控制自旋轨道转矩存储器

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116576

Haotian Wang, W. Kang, Liuyang Zhang, He Zhang, B. Kaushik, Weisheng Zhao

Voltage-control spin orbit torque (VC-SOT) magnetic tunnel junction (MTJ) has the potential to achieve high-speed and low-power spintronic memory, owing to the adaptive voltage modulated energy barrier of the MTJ. However, the three-terminal device structure needs two access transistors (one for write operation and the other one for read operation) and thus occupies larger bit-cell area compared to two terminal MTJs. A feasible method to reduce area overhead is to stack multiple VC-SOT MTJs on a common antiferromagnetic strip to share the write access transistors. In this structure, high density can be achieved. However, write and read operations face problems and the design space is not sure given a strip length. In this paper, we propose a synchronous two-step multi-bit write and symmetric read method by exploiting the selective VC-SOT driven MTJ switching mechanism. Then hybrid circuits are designed and evaluated based a physics-based VC-SOT MTJ model and a 40nm CMOS design-kit to show the feasibility and performance of our method. Our work enables high-density, low-power, high-speed voltage-control SOT memory.

电压控制自旋轨道转矩(VC-SOT)磁隧道结(MTJ)具有自适应电压调制能垒，具有实现高速低功耗自旋电子存储的潜力。然而，三端器件结构需要两个访问晶体管(一个用于写操作，另一个用于读操作)，因此与两个端mtj相比，占用更大的位元面积。减少面积开销的一种可行方法是将多个VC-SOT mtj堆叠在一个共同的反铁磁条带上，以共享写存取晶体管。在这种结构中，可以实现高密度。但是，写入和读取操作面临问题，并且给定条带长度的设计空间不确定。在本文中，我们利用选择性VC-SOT驱动的MTJ切换机制，提出了一种同步的两步多比特写入和对称读取方法。然后基于VC-SOT MTJ模型和40nm CMOS设计套件对混合电路进行了设计和评估，以证明该方法的可行性和性能。我们的工作使高密度、低功耗、高速电压控制的SOT存储器成为可能。

{"title":"High-Density, Low-Power Voltage-Control Spin Orbit Torque Memory with Synchronous Two-Step Write and Symmetric Read Techniques","authors":"Haotian Wang, W. Kang, Liuyang Zhang, He Zhang, B. Kaushik, Weisheng Zhao","doi":"10.23919/DATE48585.2020.9116576","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116576","url":null,"abstract":"Voltage-control spin orbit torque (VC-SOT) magnetic tunnel junction (MTJ) has the potential to achieve high-speed and low-power spintronic memory, owing to the adaptive voltage modulated energy barrier of the MTJ. However, the three-terminal device structure needs two access transistors (one for write operation and the other one for read operation) and thus occupies larger bit-cell area compared to two terminal MTJs. A feasible method to reduce area overhead is to stack multiple VC-SOT MTJs on a common antiferromagnetic strip to share the write access transistors. In this structure, high density can be achieved. However, write and read operations face problems and the design space is not sure given a strip length. In this paper, we propose a synchronous two-step multi-bit write and symmetric read method by exploiting the selective VC-SOT driven MTJ switching mechanism. Then hybrid circuits are designed and evaluated based a physics-based VC-SOT MTJ model and a 40nm CMOS design-kit to show the feasibility and performance of our method. Our work enables high-density, low-power, high-speed voltage-control SOT memory.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117088753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Wafer-Level Test Path Pattern Recognition and Test Characteristics for Test-Induced Defect Diagnosis 基于测试缺陷诊断的晶圆级测试路径模式识别与测试特性

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116546

Ken Chau-Cheung Cheng, Katherine Shu-Min Li, Andrew Yi-Ann Huang, Ji-Wei Li, L. Chen, Nova Cheng-Yen Tsai, Sying-Jyan Wang, Chen-Shiun Lee, Leon Chou, Peter Yi-Yu Liao, Hsing-Chung Liang, Jwu E. Chen

Wafer defect maps provide precious information of fabrication and test process defects, so they can be used as valuable sources to improve fabrication and test yield. This paper applies artificial intelligence based pattern recognition techniques to distinguish fab-induced defects from test-induced ones. As a result, test quality, reliability and yield could be improved accordingly. Wafer test data contain site-dependent information regarding test configurations in automatic test equipment, including effective load push force, gap between probe and load-board, probe tip size, probe-cleaning stress, etc. Our method analyzes both the test paths and site-dependent test characteristics to identify test-induced defects. Experimental results achieve 96.83% prediction accuracy of six NXP products, which show that our methods are both effective and efficient.

晶圆缺陷图提供了制造和测试过程缺陷的宝贵信息，因此它们可以作为提高制造和测试良率的有价值的来源。本文应用基于人工智能的模式识别技术来区分晶圆厂缺陷和测试缺陷。从而提高测试质量、可靠性和良率。晶圆测试数据包含有关自动测试设备测试配置的现场相关信息，包括有效负载推力、探头与负载板之间的间隙、探头尖端尺寸、探头清洗应力等。我们的方法分析了测试路径和与站点相关的测试特征，以识别测试引起的缺陷。实验结果表明，对6种恩智浦产品的预测准确率达到96.83%，表明本文方法的有效性和高效性。

引用次数: 2

Pitfalls in Machine Learning-based Adversary Modeling for Hardware Systems 基于机器学习的硬件系统对手建模中的缺陷

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116316

F. Ganji, Sarah Amir, Shahin Tajik, Domenic Forte, Jean-Pierre Seifert

The concept of the adversary model has been widely applied in the context of cryptography. When designing a cryptographic scheme or protocol, the adversary model plays a crucial role in the formalization of the capabilities and limitations of potential attackers. These models further enable the designer to verify the security of the scheme or protocol under investigation. Although being well established for conventional cryptanalysis attacks, adversary models associated with attackers enjoying the advantages of machine learning techniques have not yet been developed thoroughly. In particular, when it comes to composed hardware, often being security-critical, the lack of such models has become increasingly noticeable in the face of advanced, machine learning-enabled attacks. This paper aims at exploring the adversary models from the machine learning perspective. In this regard, we provide examples of machine learning-based attacks against hardware primitives, e.g., obfuscation schemes and hardware root-of-trust, claimed to be infeasible. We demonstrate that this assumption becomes however invalid as inaccurate adversary models have been considered in the literature.

对手模型的概念在密码学中得到了广泛的应用。在设计加密方案或协议时，对手模型在形式化潜在攻击者的能力和限制方面起着至关重要的作用。这些模型进一步使设计人员能够验证所研究的方案或协议的安全性。尽管对于传统的密码分析攻击已经很好地建立了，但与享受机器学习技术优势的攻击者相关的对手模型尚未得到彻底的开发。特别是，当涉及到通常是安全关键的组合硬件时，在面对高级的、支持机器学习的攻击时，缺乏此类模型变得越来越明显。本文旨在从机器学习的角度探讨对手模型。在这方面，我们提供了针对硬件原语的基于机器学习的攻击的示例，例如，混淆方案和硬件信任根，声称是不可行的。我们证明，这种假设变得无效，但不准确的对手模型已被认为在文献中。

{"title":"Pitfalls in Machine Learning-based Adversary Modeling for Hardware Systems","authors":"F. Ganji, Sarah Amir, Shahin Tajik, Domenic Forte, Jean-Pierre Seifert","doi":"10.23919/DATE48585.2020.9116316","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116316","url":null,"abstract":"The concept of the adversary model has been widely applied in the context of cryptography. When designing a cryptographic scheme or protocol, the adversary model plays a crucial role in the formalization of the capabilities and limitations of potential attackers. These models further enable the designer to verify the security of the scheme or protocol under investigation. Although being well established for conventional cryptanalysis attacks, adversary models associated with attackers enjoying the advantages of machine learning techniques have not yet been developed thoroughly. In particular, when it comes to composed hardware, often being security-critical, the lack of such models has become increasingly noticeable in the face of advanced, machine learning-enabled attacks. This paper aims at exploring the adversary models from the machine learning perspective. In this regard, we provide examples of machine learning-based attacks against hardware primitives, e.g., obfuscation schemes and hardware root-of-trust, claimed to be infeasible. We demonstrate that this assumption becomes however invalid as inaccurate adversary models have been considered in the literature.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116743367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

LightBulb: A Photonic-Nonvolatile-Memory-based Accelerator for Binarized Convolutional Neural Networks 灯泡:基于光子-非易失性记忆的二值化卷积神经网络加速器

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116494

Farzaneh Zokaee, Qian Lou, N. Youngblood, Weichen Liu, Yiyuan Xie, Lei Jiang

Although Convolutional Neural Networks (CNNs) have demonstrated the state-of-the-art inference accuracy in various intelligent applications, each CNN inference involves millions of expensive floating point multiply-accumulate (MAC) operations. To energy-efficiently process CNN inferences, prior work proposes an electro-optical accelerator to process power-of-2 quantized CNNs by electro-optical ripple-carry adders and optical binary shifters. The electro-optical accelerator also uses SRAM registers to store intermediate data. However, electro-optical ripple-carry adders and SRAMs seriously limit the operating frequency and inference throughput of the electro-optical accelerator, due to the long critical path of the adder and the long access latency of SRAMs. In this paper, we propose a photonic nonvolatile memory (NVM)-based accelerator, Light-Bulb, to process binarized CNNs by high frequency photonic XNOR gates and popcount units. LightBulb also adopts photonic racetrack memory to serve as input/output registers to achieve high operating frequency. Compared to prior electro-optical accelerators, on average, LightBulb improves the CNN inference throughput by 17× ~ 173× and the inference throughput per Watt by 17.5 × ~ 660×.

尽管卷积神经网络(CNN)已经在各种智能应用中展示了最先进的推理精度，但每次CNN推理都涉及数百万次昂贵的浮点乘法累加(MAC)运算。为了高效地处理CNN推断，先前的工作提出了一种光电加速器，通过光电纹波进位加法器和光学二进制移位器来处理2次方量子化的CNN。光电加速器也使用SRAM寄存器来存储中间数据。然而，由于加法器的关键路径长，而sram的访问延迟长，光电纹波携带加法器和sram严重限制了光电加速器的工作频率和推理吞吐量。在本文中，我们提出了一种基于光子非易失性存储器(NVM)的加速器Light-Bulb，通过高频光子XNOR门和popcount单元来处理二值化cnn。灯泡还采用光子赛道存储器作为输入/输出寄存器，实现高工作频率。与现有的电光加速器相比，LightBulb的CNN推理吞吐量平均提高了17× ~ 173×，每瓦推理吞吐量平均提高了17.5 × ~ 660×。

{"title":"LightBulb: A Photonic-Nonvolatile-Memory-based Accelerator for Binarized Convolutional Neural Networks","authors":"Farzaneh Zokaee, Qian Lou, N. Youngblood, Weichen Liu, Yiyuan Xie, Lei Jiang","doi":"10.23919/DATE48585.2020.9116494","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116494","url":null,"abstract":"Although Convolutional Neural Networks (CNNs) have demonstrated the state-of-the-art inference accuracy in various intelligent applications, each CNN inference involves millions of expensive floating point multiply-accumulate (MAC) operations. To energy-efficiently process CNN inferences, prior work proposes an electro-optical accelerator to process power-of-2 quantized CNNs by electro-optical ripple-carry adders and optical binary shifters. The electro-optical accelerator also uses SRAM registers to store intermediate data. However, electro-optical ripple-carry adders and SRAMs seriously limit the operating frequency and inference throughput of the electro-optical accelerator, due to the long critical path of the adder and the long access latency of SRAMs. In this paper, we propose a photonic nonvolatile memory (NVM)-based accelerator, Light-Bulb, to process binarized CNNs by high frequency photonic XNOR gates and popcount units. LightBulb also adopts photonic racetrack memory to serve as input/output registers to achieve high operating frequency. Compared to prior electro-optical accelerators, on average, LightBulb improves the CNN inference throughput by 17× ~ 173× and the inference throughput per Watt by 17.5 × ~ 660×.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"133 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127367156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 25

System-level Evaluation of Chip-Scale Silicon Photonic Networks for Emerging Data-Intensive Applications 用于新兴数据密集型应用的芯片级硅光子网络的系统级评估

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116496

A. Narayan, Y. Thonnart, P. Vivet, A. Joshi, A. Coskun

Emerging data-driven applications such as graph processing applications are characterized by their excessive memory footprint and abundant parallelism, resulting in high memory bandwidth demand. As the scale of datasets for applications is reaching orders of TBs, performance limitation due to bandwidth demands is a major concern. Traditional on-chip electrical networks fail to meet such high bandwidth demands due to increased energy-per-bit or physical limitations with pin counts. Silicon photonic networks have emerged as a promising alternative to electrical interconnects, owing to their high bandwidth density and low energy-per-bit communication with negligible data-dependent power. Wide-scale adoption of silicon photonics at chip level, however, is hampered by their high sensitivity to process and thermal variations, high laser power due to losses along the network, and power consumption of the electrical-optical conversion. Device-level technological innovations to mitigate these issues are promising, yet they do not consider the system-level implications of the applications running on manycore systems with photonic networks. This work aims to bridge the gap between the system-level attributes of applications with the underlying architectural and device-level characteristics of silicon photonic networks to achieve energy-efficient computing. We particularly focus on graph applications, which involve unstructured yet abundant parallel memory accesses that stress the on-chip communication networks, and develop a cross-layer framework to evaluate 2.5D systems with silicon photonic networks. We demonstrate 38% power savings through system-level management using wavelength selection policies with only 1% loss in system performance and further evaluate architectural design choices on 2.5D systems with photonic networks.

新兴的数据驱动应用程序，如图形处理应用程序，其特点是内存占用过多，并行性强，导致内存带宽需求高。随着应用程序的数据集规模达到tb数量级，带宽需求导致的性能限制是一个主要问题。由于每比特能量的增加或引脚数的物理限制，传统的片上网络无法满足如此高的带宽需求。硅光子网络由于其高带宽密度和低每比特能量通信以及可忽略不计的数据依赖功率而成为电互连的有前途的替代品。然而，硅光子学在芯片级的广泛采用受到其对工艺和热变化的高灵敏度，沿网络损耗引起的高激光功率以及电光转换的功耗的阻碍。缓解这些问题的设备级技术创新是有希望的，但他们没有考虑到运行在多核系统上的应用程序对光子网络的系统级影响。这项工作旨在弥合应用程序的系统级属性与硅光子网络的底层架构和设备级特性之间的差距，以实现节能计算。我们特别关注图形应用程序，它涉及非结构化但丰富的并行存储器访问，强调片上通信网络，并开发一个跨层框架来评估带有硅光子网络的2.5D系统。我们展示了通过使用波长选择策略的系统级管理节省38%的功率，而系统性能仅损失1%，并进一步评估了具有光子网络的2.5D系统的架构设计选择。

{"title":"System-level Evaluation of Chip-Scale Silicon Photonic Networks for Emerging Data-Intensive Applications","authors":"A. Narayan, Y. Thonnart, P. Vivet, A. Joshi, A. Coskun","doi":"10.23919/DATE48585.2020.9116496","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116496","url":null,"abstract":"Emerging data-driven applications such as graph processing applications are characterized by their excessive memory footprint and abundant parallelism, resulting in high memory bandwidth demand. As the scale of datasets for applications is reaching orders of TBs, performance limitation due to bandwidth demands is a major concern. Traditional on-chip electrical networks fail to meet such high bandwidth demands due to increased energy-per-bit or physical limitations with pin counts. Silicon photonic networks have emerged as a promising alternative to electrical interconnects, owing to their high bandwidth density and low energy-per-bit communication with negligible data-dependent power. Wide-scale adoption of silicon photonics at chip level, however, is hampered by their high sensitivity to process and thermal variations, high laser power due to losses along the network, and power consumption of the electrical-optical conversion. Device-level technological innovations to mitigate these issues are promising, yet they do not consider the system-level implications of the applications running on manycore systems with photonic networks. This work aims to bridge the gap between the system-level attributes of applications with the underlying architectural and device-level characteristics of silicon photonic networks to achieve energy-efficient computing. We particularly focus on graph applications, which involve unstructured yet abundant parallel memory accesses that stress the on-chip communication networks, and develop a cross-layer framework to evaluate 2.5D systems with silicon photonic networks. We demonstrate 38% power savings through system-level management using wavelength selection policies with only 1% loss in system performance and further evaluate architectural design choices on 2.5D systems with photonic networks.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125712857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Approximation Trade Offs in an Image-Based Control System 基于图像的控制系统中的近似权衡

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116552

S. De, S. Mohamed, Konstantinos Bimpisidis, Dip Goswami, T. Basten, H. Corporaal

Image-based control (IBC) systems use camera sensor(s) to perceive the environment. The inherent compute-heavy nature of image processing causes long processing delay that negatively influences the performance of the IBC systems. Our idea is to reduce the long delay using coarse-grained approximation of the image signal processing pipeline without affecting the functionality and performance of the IBC system. The question is: how is the degree of approximation related to the closed-loop quality-of-control (QoC), memory utilization and energy consumption? We present a software-in-the-loop (SiL) evaluation framework for the above approximation-in-the-loop system. We identify the error resilient stages and the corresponding coarse-grained approximation settings for the IBC system. We perform trade off analysis between the QoC, memory utilisation and energy consumption for varying degrees of coarse-grained approximation. We demonstrate the effectiveness of our approach using a concrete case study of a lane keeping assist system (LKAS). We obtain energy and memory reduction of upto 84% and 29% respectively, for 28% QoC improvements.

基于图像的控制(IBC)系统使用相机传感器来感知环境。图像处理固有的计算量大的特性导致了较长的处理延迟，这对IBC系统的性能产生了负面影响。我们的想法是在不影响IBC系统功能和性能的情况下，使用图像信号处理管道的粗粒度近似来减少长延迟。问题是:近似程度如何与闭环控制质量(QoC)、内存利用率和能耗相关?我们提出了一个软件在环(SiL)评估框架，用于上述在环近似系统。我们确定了IBC系统的误差弹性阶段和相应的粗粒度近似设置。我们对不同程度的粗粒度近似执行QoC、内存利用率和能耗之间的权衡分析。我们使用车道保持辅助系统(LKAS)的具体案例研究来证明我们方法的有效性。我们获得能量和内存分别减少高达84%和29%，改善28%的QoC。

{"title":"Approximation Trade Offs in an Image-Based Control System","authors":"S. De, S. Mohamed, Konstantinos Bimpisidis, Dip Goswami, T. Basten, H. Corporaal","doi":"10.23919/DATE48585.2020.9116552","DOIUrl":"https://doi.org/10.23919/DATE48585.2020.9116552","url":null,"abstract":"Image-based control (IBC) systems use camera sensor(s) to perceive the environment. The inherent compute-heavy nature of image processing causes long processing delay that negatively influences the performance of the IBC systems. Our idea is to reduce the long delay using coarse-grained approximation of the image signal processing pipeline without affecting the functionality and performance of the IBC system. The question is: how is the degree of approximation related to the closed-loop quality-of-control (QoC), memory utilization and energy consumption? We present a software-in-the-loop (SiL) evaluation framework for the above approximation-in-the-loop system. We identify the error resilient stages and the corresponding coarse-grained approximation settings for the IBC system. We perform trade off analysis between the QoC, memory utilisation and energy consumption for varying degrees of coarse-grained approximation. We demonstrate the effectiveness of our approach using a concrete case study of a lane keeping assist system (LKAS). We obtain energy and memory reduction of upto 84% and 29% respectively, for 28% QoC improvements.","PeriodicalId":289525,"journal":{"name":"2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128048969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Lightweight Anonymous Routing in NoC based SoCs 基于NoC的soc中的轻量级匿名路由

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116572

Subodha Charles, Megan Logan, P. Mishra

System-on-Chip (SoC) supply chain is widely acknowledged as a major source of security vulnerabilities. Potentially malicious third-party IPs integrated on the same Network-on-Chip (NoC) with the trusted components can lead to security and trust concerns. While secure communication is a well studied problem in computer networks domain, it is not feasible to implement those solutions on resource-constrained SoCs. In this paper, we present a lightweight anonymous routing protocol for communication between IP cores in NoC based SoCs. Our method eliminates the major overhead associated with traditional anonymous routing protocols while ensuring that the desired security goals are met. Experimental results demonstrate that existing security solutions on NoC can introduce significant (1.5X) performance degradation, whereas our approach provides the same security features with minor (4%) impact on performance.

片上系统(SoC)供应链被广泛认为是安全漏洞的主要来源。与受信任组件集成在同一片上网络(NoC)上的潜在恶意第三方ip可能会导致安全和信任问题。安全通信是计算机网络领域研究较多的一个问题，但在资源受限的soc上实现这些解决方案并不可行。在本文中，我们提出了一种轻量级匿名路由协议，用于基于NoC的soc中IP核之间的通信。我们的方法消除了与传统匿名路由协议相关的主要开销，同时确保满足所需的安全目标。实验结果表明，现有的NoC安全解决方案可能会导致显著的(1.5倍)性能下降，而我们的方法提供了相同的安全特性，对性能的影响很小(4%)。

引用次数: 34

Optimising Resource Management for Embedded Machine Learning 优化嵌入式机器学习的资源管理

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116235

Lei Xun, Long Tran-Thanh, B. Al-Hashimi, G. Merrett

Machine learning inference is increasingly being executed locally on mobile and embedded platforms, due to the clear advantages in latency, privacy and connectivity. In this paper, we present approaches for online resource management in heterogeneous multi-core systems and show how they can be applied to optimise the performance of machine learning work-loads. Performance can be defined using platform-dependent (e.g. speed, energy) and platform-independent (accuracy, confidence) metrics. In particular, we show how a Deep Neural Network (DNN) can be dynamically scalable to trade-off these various performance metrics. Achieving consistent performance when executing on different platforms is necessary yet challenging, due to the different resources provided and their capability, and their time-varying availability when executing alongside other workloads. Managing the interface between available hardware resources (often numerous and heterogeneous in nature), software requirements, and user experience is increasingly complex.

由于在延迟、隐私和连接性方面的明显优势，机器学习推理越来越多地在移动和嵌入式平台上本地执行。在本文中，我们提出了异构多核系统中在线资源管理的方法，并展示了如何将它们应用于优化机器学习工作负载的性能。性能可以使用平台相关(例如速度、能量)和平台无关(准确性、置信度)指标来定义。特别是，我们展示了深度神经网络(DNN)如何动态扩展以权衡这些不同的性能指标。在不同的平台上执行时实现一致的性能是必要的，但也具有挑战性，因为所提供的资源和它们的能力不同，并且在与其他工作负载一起执行时，它们的可用性随时间变化。管理可用硬件资源(通常数量众多且性质各异)、软件需求和用户体验之间的接口变得越来越复杂。

引用次数: 8

An Efficient SRAM yield Analysis Using Scaled-Sigma Adaptive Importance Sampling 基于尺度sigma自适应重要性抽样的SRAM成品率分析

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2020-03-01 DOI: 10.23919/DATE48585.2020.9116233

L. Pang, Mengyun Yao, Yifan Chai

Statistical SRAM yield analysis has become a growing concern for the requirement of high integration density and reliability of SRAM under process variations. It is a challenge to estimate the SRAM failure probability efficiently and accurately because the circuit failure is a "rare-event". Existing methods are still not efficient enough to solve the problem, especially in high dimensions. In this paper, we develop a scaled-sigma adaptive importance sampling (SSAIS) which is an extension of the adaptive importance sampling. This method changes not only the location parameters but the shape parameters by searching the failure region iteratively. The 40nm SRAM cell experiment validated that our method outperforms Monte Carlo method by 1500x and is 2.3x~5.2x faster than the state-of-art methods with reasonable accuracy. Another experiment on sense amplifier shows our method achieves 1811x speedup over the Monte Carlo method and 2x~11x speedup over the other methods.

由于对SRAM在工艺变化下的高集成度和可靠性的要求，SRAM的良率统计分析日益受到关注。由于电路故障是一种“罕见事件”，如何有效准确地估计SRAM的故障概率是一个挑战。现有的方法仍然不足以有效地解决问题，特别是在高维情况下。本文提出了一种尺度sigma自适应重要性抽样(SSAIS)，它是自适应重要性抽样的扩展。该方法通过对失效区域的迭代搜索，不仅改变了位置参数，而且改变了形状参数。40nm SRAM电池实验验证了我们的方法比蒙特卡罗方法快1500倍，在合理的精度下比目前的方法快2.3 ~5.2倍。在传感放大器上进行的实验表明，该方法比蒙特卡罗方法提高了1811倍，比其他方法提高了2 ~11倍。

引用次数: 3

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀