首页 > 最新文献

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)最新文献

英文 中文
CARS: A Multi-layer Conflict-Aware Request Scheduler for NVMe SSDs CARS: NVMe ssd的多层冲突感知请求调度器
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715070
Tianming Yang, Ping Huang, Weiying Zhang, Haitao Wu, Longxin Lin
NVMe SSDs are nowadays widely deployed in various computing platforms due to its high performance and low power consumption, especially in data centers to support modern latency-sensitive applications. NVMe SSDs improve on SATA and SAS interfaced SSDs by providing a large number of device I/O queues at the host side and applications can directly manage the queues to concurrently issue requests to the device. However, the currently deployed request scheduling approach is oblivious to the states of the various device internal components and thus may lead to suboptimal decisions due to various resource contentions at different layers inside the SSD device. In this work, we propose a Conflict Aware Request Scheduling policy named CARS for NVMe SSDs to maximally leverage the rich parallelism available in modern NVMe SSDs. The central idea is to check possible conflicts that a fetched request might be associated with before dispatching that request. If there exists a conflict, it refrains from issuing the request and move to check a request in the next submission queue. In doing so, our scheduler can evenly distribute the requests among the parallel idle components in the flash chips, improving performance. Our evaluations have shown that our scheduler can reduce the slowdown metric by up to 46% relative to the de facto round-robin scheduling policy for a variety of patterned workloads.
NVMe ssd由于其高性能和低功耗的特点,目前被广泛部署在各种计算平台中,特别是在数据中心中,以支持现代对延迟敏感的应用。NVMe ssd是对SATA和SAS接口ssd的改进,在主机端提供大量的设备I/O队列,应用程序可以直接管理这些队列,并发地向设备发出请求。然而,当前部署的请求调度方法忽略了各种设备内部组件的状态,因此可能会由于SSD设备内部不同层的各种资源争用而导致次优决策。在这项工作中,我们提出了一种名为CARS的NVMe ssd冲突感知请求调度策略,以最大限度地利用现代NVMe ssd中可用的丰富并行性。其核心思想是在调度请求之前检查获取的请求可能与之关联的冲突。如果存在冲突,它将避免发出请求,并在下一个提交队列中检查请求。这样,我们的调度器可以在闪存芯片中的并行空闲组件之间均匀地分配请求,从而提高性能。我们的评估表明,对于各种模式工作负载,相对于事实上的循环调度策略,我们的调度器可以将速度指标降低46%。
{"title":"CARS: A Multi-layer Conflict-Aware Request Scheduler for NVMe SSDs","authors":"Tianming Yang, Ping Huang, Weiying Zhang, Haitao Wu, Longxin Lin","doi":"10.23919/DATE.2019.8715070","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715070","url":null,"abstract":"NVMe SSDs are nowadays widely deployed in various computing platforms due to its high performance and low power consumption, especially in data centers to support modern latency-sensitive applications. NVMe SSDs improve on SATA and SAS interfaced SSDs by providing a large number of device I/O queues at the host side and applications can directly manage the queues to concurrently issue requests to the device. However, the currently deployed request scheduling approach is oblivious to the states of the various device internal components and thus may lead to suboptimal decisions due to various resource contentions at different layers inside the SSD device. In this work, we propose a Conflict Aware Request Scheduling policy named CARS for NVMe SSDs to maximally leverage the rich parallelism available in modern NVMe SSDs. The central idea is to check possible conflicts that a fetched request might be associated with before dispatching that request. If there exists a conflict, it refrains from issuing the request and move to check a request in the next submission queue. In doing so, our scheduler can evenly distribute the requests among the parallel idle components in the flash chips, improving performance. Our evaluations have shown that our scheduler can reduce the slowdown metric by up to 46% relative to the de facto round-robin scheduling policy for a variety of patterned workloads.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127345286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
New method for the automated massive characterization of Bias Temperature Instability in CMOS transistors CMOS晶体管偏置温度不稳定性自动大规模表征的新方法
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715029
P. Saraza-Canflanca, J. Diaz-Fortuny, R. Castro-López, Elisenda Roca Moreno, J. Martín-Martínez, R. Rodríguez, M. Nafría, F. Fernández
Bias Temperature Instability has become a critical issue for circuit reliability. This phenomenon has been found to have a stochastic and discrete nature in nanometer-scale CMOS technologies. To account for this random nature, massive experimental characterization is necessary so that the extracted model parameters are accurate enough. However, there is a lack of automated analysis tools for the extraction of the BTI parameters from the extensive amount of generated data in those massive characterization tests. In this paper, a novel algorithm that allows the precise and fully automated parameter extraction from experimental BTI recovery current traces is presented. This algorithm is based on the Maximum Likelihood Estimation principles, and is able to extract, in a robust and exact manner, the threshold voltage shifts and emission times associated to oxide trap emissions during BTI recovery, required to properly model the phenomenon.
偏置温度的不稳定性已经成为影响电路可靠性的关键问题。这种现象在纳米级CMOS技术中具有随机性和离散性。为了解释这种随机性,大量的实验表征是必要的,以便提取的模型参数足够准确。然而,在这些大规模表征测试中,缺乏用于从大量生成的数据中提取BTI参数的自动化分析工具。本文提出了一种从实验BTI恢复电流迹线中精确、全自动提取参数的新算法。该算法基于最大似然估计原理,能够以鲁棒和精确的方式提取与BTI恢复过程中氧化阱排放相关的阈值电压位移和排放时间,从而正确地模拟该现象。
{"title":"New method for the automated massive characterization of Bias Temperature Instability in CMOS transistors","authors":"P. Saraza-Canflanca, J. Diaz-Fortuny, R. Castro-López, Elisenda Roca Moreno, J. Martín-Martínez, R. Rodríguez, M. Nafría, F. Fernández","doi":"10.23919/DATE.2019.8715029","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715029","url":null,"abstract":"Bias Temperature Instability has become a critical issue for circuit reliability. This phenomenon has been found to have a stochastic and discrete nature in nanometer-scale CMOS technologies. To account for this random nature, massive experimental characterization is necessary so that the extracted model parameters are accurate enough. However, there is a lack of automated analysis tools for the extraction of the BTI parameters from the extensive amount of generated data in those massive characterization tests. In this paper, a novel algorithm that allows the precise and fully automated parameter extraction from experimental BTI recovery current traces is presented. This algorithm is based on the Maximum Likelihood Estimation principles, and is able to extract, in a robust and exact manner, the threshold voltage shifts and emission times associated to oxide trap emissions during BTI recovery, required to properly model the phenomenon.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125596013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Realization of Four-Terminal Switching Lattices: Technology Development and Circuit Modeling 四端开关格的实现:技术开发与电路建模
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715123
Şerzat Safaltın, Oguz Gencer, Muhammed Ceylan Morgül, L. Aksoy, Sebahattin Gurmen, C. A. Moritz, M. Altun
Our European Union’s Horizon-2020 project aims to develop a complete synthesis and performance optimization methodology for switching nano-crossbar arrays that leads to the design and construction of an emerging nanocomputer. Within the project, we investigate different computing models based on either two-terminal switches, realized with field effect transistors, resistive and diode devices, or four-terminal switches. Although a four-terminal switch based model offers a significant area advantage, its realization at the technology level needs further justifications and raises a number of questions about its feasibility. In this study, we answer these questions. First, by using three dimensional technology computer-aided design (TCAD) simulations, we show that four-terminal switches can be directly implemented with the CMOS technology. For this purpose, we try different semiconductor gate materials in different formations of geometric shapes. Then, by fitting the TCAD simulation data to the standard CMOS current-voltage equations, we develop a Spice model of a four-terminal switch. Finally, we successfully perform Spice circuit simulations on four-terminal switches with different sizes. As a follow-up work within the project, we will proceed to the fabrication step.
我们的欧盟Horizon-2020项目旨在为开关纳米交叉棒阵列开发一种完整的合成和性能优化方法,从而设计和构建新兴的纳米计算机。在项目中,我们研究了基于双端开关的不同计算模型,由场效应晶体管,电阻和二极管器件实现,或四端开关。尽管基于四端开关的模型提供了显著的面积优势,但其在技术层面的实现需要进一步的论证,并提出了一些关于其可行性的问题。在这项研究中,我们回答了这些问题。首先,通过三维计算机辅助设计(TCAD)仿真,我们证明了四端开关可以直接使用CMOS技术实现。为此,我们尝试了不同几何形状的半导体栅极材料。然后,通过将TCAD仿真数据拟合到标准CMOS电流-电压方程中,我们建立了四端开关的Spice模型。最后,我们成功地对不同尺寸的四端开关进行Spice电路仿真。作为项目的后续工作,我们将继续进行制作步骤。
{"title":"Realization of Four-Terminal Switching Lattices: Technology Development and Circuit Modeling","authors":"Şerzat Safaltın, Oguz Gencer, Muhammed Ceylan Morgül, L. Aksoy, Sebahattin Gurmen, C. A. Moritz, M. Altun","doi":"10.23919/DATE.2019.8715123","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715123","url":null,"abstract":"Our European Union’s Horizon-2020 project aims to develop a complete synthesis and performance optimization methodology for switching nano-crossbar arrays that leads to the design and construction of an emerging nanocomputer. Within the project, we investigate different computing models based on either two-terminal switches, realized with field effect transistors, resistive and diode devices, or four-terminal switches. Although a four-terminal switch based model offers a significant area advantage, its realization at the technology level needs further justifications and raises a number of questions about its feasibility. In this study, we answer these questions. First, by using three dimensional technology computer-aided design (TCAD) simulations, we show that four-terminal switches can be directly implemented with the CMOS technology. For this purpose, we try different semiconductor gate materials in different formations of geometric shapes. Then, by fitting the TCAD simulation data to the standard CMOS current-voltage equations, we develop a Spice model of a four-terminal switch. Finally, we successfully perform Spice circuit simulations on four-terminal switches with different sizes. As a follow-up work within the project, we will proceed to the fabrication step.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126746902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Embedded Systems’ Automation following OMG’s Model Driven Architecture Vision 遵循OMG模型驱动架构愿景的嵌入式系统自动化
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715154
W. Ecker, Keerthikumara Devarajegowda, Michael Werner, Zhao Han, Lorenzo Servadei
This paper presents an automated process for end-to-end embedded system design following OMG’s model driven architecture (MDA) vision. It tackles a major challenge in automation: bridging the large semantic gap between the specification and the target code. The shown MDA adaption proposes an uniform and systematic way by splitting the translation process into multiple layers and introducing design platform independent and implementation independent views.In our adaption of MDA, we start with a formalized specification and we end with code (view) generation. The code is then compiled (software) or synthesized (hardware) and finally assembled to the embedded system design. We split the translation process in Model-of-Thing (MoT), Model-of-Design (MoD) and Model-of-View (MoV) layers. MoTs represent the formalized specification, MoDs contain the implementation architecture in a view independent way, and MoVs are implementation dependent and view dependent, i.e., specific details in target language.MoT is translated to MoD, MoD is translated to MoV and MoV is finally used to generate views. The translation between the Models is based on templates, that reflect design and coding blueprints. The final step of the view generation is itself part of generation. The Model MoV and the unparse method are generated from a view language description.The approach has been successfully adapted for generating digital hardware (RTL), properties for verification (SVA), and snippets of firmware that have been successfully synthesized to an FPGA.
本文提出了一个遵循OMG模型驱动体系结构(MDA)愿景的端到端嵌入式系统设计的自动化过程。它解决了自动化中的一个主要挑战:弥合规范和目标代码之间巨大的语义差距。所示的MDA自适应通过将转换过程拆分为多个层,并引入独立于设计平台和独立于实现的视图,提出了一种统一和系统的方法。在我们对MDA的改编中,我们从形式化的规范开始,并以代码(视图)生成结束。然后对代码进行编译(软件)或合成(硬件),最后组装到嵌入式系统设计中。我们将翻译过程分为物模型(MoT)、设计模型(MoD)和视图模型(MoV)层。mot代表形式化的规范,mod以一种与视图无关的方式包含实现架构,mov依赖于实现和视图,即目标语言的特定细节。MoT被转换成MoD, MoD被转换成MoV, MoV最后被用来生成视图。模型之间的转换基于反映设计和编码蓝图的模板。视图生成的最后一步本身就是生成的一部分。Model MoV和unparse方法是从视图语言描述中生成的。该方法已成功用于生成数字硬件(RTL)、验证属性(SVA)和已成功合成到FPGA的固件片段。
{"title":"Embedded Systems’ Automation following OMG’s Model Driven Architecture Vision","authors":"W. Ecker, Keerthikumara Devarajegowda, Michael Werner, Zhao Han, Lorenzo Servadei","doi":"10.23919/DATE.2019.8715154","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715154","url":null,"abstract":"This paper presents an automated process for end-to-end embedded system design following OMG’s model driven architecture (MDA) vision. It tackles a major challenge in automation: bridging the large semantic gap between the specification and the target code. The shown MDA adaption proposes an uniform and systematic way by splitting the translation process into multiple layers and introducing design platform independent and implementation independent views.In our adaption of MDA, we start with a formalized specification and we end with code (view) generation. The code is then compiled (software) or synthesized (hardware) and finally assembled to the embedded system design. We split the translation process in Model-of-Thing (MoT), Model-of-Design (MoD) and Model-of-View (MoV) layers. MoTs represent the formalized specification, MoDs contain the implementation architecture in a view independent way, and MoVs are implementation dependent and view dependent, i.e., specific details in target language.MoT is translated to MoD, MoD is translated to MoV and MoV is finally used to generate views. The translation between the Models is based on templates, that reflect design and coding blueprints. The final step of the view generation is itself part of generation. The Model MoV and the unparse method are generated from a view language description.The approach has been successfully adapted for generating digital hardware (RTL), properties for verification (SVA), and snippets of firmware that have been successfully synthesized to an FPGA.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129967838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Better Late Than Never : Verification of Embedded Systems After Deployment 迟做总比不做好:嵌入式系统部署后的验证
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714967
Martin Ring, Fritjof Bornebusch, Christoph Lüth, R. Wille, R. Drechsler
This paper investigates the benefits of verifying embedded systems after deployment. We argue that one reason for the huge state spaces of contemporary embedded and cyber-physical systems is the large variety of operating contexts, which are unknown during design. Once the system is deployed, these contexts become observable, confining several variables. By this, the search space is dramatically reduced, making verification possible even on the limited resources of a deployed system. In this paper, we propose a design and verification flow which exploits this observation. We show how specifications are transferred to the deployed system and verified there. Evaluations on a number of case studies demonstrate the reduction of the search space, and we sketch how the proposed approach can be employed in practice.
本文研究了嵌入式系统部署后验证的好处。我们认为,当代嵌入式和网络物理系统的巨大状态空间的一个原因是各种各样的操作环境,这是在设计期间未知的。一旦系统被部署,这些上下文就变成了可观察的,限制了几个变量。通过这种方式,搜索空间大大减少,甚至可以在已部署系统的有限资源上进行验证。在本文中,我们提出了一个利用这一观察结果的设计和验证流程。我们将展示如何将规范转移到已部署的系统并在那里进行验证。对许多案例研究的评估证明了搜索空间的减少,并且我们概述了所提出的方法如何在实践中使用。
{"title":"Better Late Than Never : Verification of Embedded Systems After Deployment","authors":"Martin Ring, Fritjof Bornebusch, Christoph Lüth, R. Wille, R. Drechsler","doi":"10.23919/DATE.2019.8714967","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714967","url":null,"abstract":"This paper investigates the benefits of verifying embedded systems after deployment. We argue that one reason for the huge state spaces of contemporary embedded and cyber-physical systems is the large variety of operating contexts, which are unknown during design. Once the system is deployed, these contexts become observable, confining several variables. By this, the search space is dramatically reduced, making verification possible even on the limited resources of a deployed system. In this paper, we propose a design and verification flow which exploits this observation. We show how specifications are transferred to the deployed system and verified there. Evaluations on a number of case studies demonstrate the reduction of the search space, and we sketch how the proposed approach can be employed in practice.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"2509 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131299013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Thermal-Awareness in a Soft Error Tolerant Architecture 软容错架构中的热感知
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715105
Sajjad Hussain, M. Shafique, J. Henkel
It is crucial to provide soft error reliability in a power-efficient manner such that the maximum chip temperature remains within the safe operating limits. Different execution phases of an application have diverse performance, power, temperature and vulnerability behavior that can be leveraged to fulfill the resiliency requirements within the allowed thermal constraints. We propose a soft error tolerant architecture with fine-grained redundancy for different architectural components, such that their reliable operations can be activated selectively at fine-granularity to maximize the reliability under a given thermal constraint. When compared with state-of-the-art, our temperature-aware fine-grained reliability manager provides up to 30% reliability within the thermal budget.
以一种节能的方式提供软误差可靠性是至关重要的,这样芯片的最高温度就保持在安全的工作范围内。应用程序的不同执行阶段具有不同的性能、功率、温度和漏洞行为,可以利用这些行为在允许的热约束范围内满足弹性需求。我们针对不同的架构组件提出了一种具有细粒度冗余的软容错架构,使得它们的可靠操作可以在细粒度上选择性地激活,从而在给定的热约束下最大化可靠性。与最先进的产品相比,我们的温度感知细粒度可靠性管理器在热预算范围内提供高达30%的可靠性。
{"title":"Thermal-Awareness in a Soft Error Tolerant Architecture","authors":"Sajjad Hussain, M. Shafique, J. Henkel","doi":"10.23919/DATE.2019.8715105","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715105","url":null,"abstract":"It is crucial to provide soft error reliability in a power-efficient manner such that the maximum chip temperature remains within the safe operating limits. Different execution phases of an application have diverse performance, power, temperature and vulnerability behavior that can be leveraged to fulfill the resiliency requirements within the allowed thermal constraints. We propose a soft error tolerant architecture with fine-grained redundancy for different architectural components, such that their reliable operations can be activated selectively at fine-granularity to maximize the reliability under a given thermal constraint. When compared with state-of-the-art, our temperature-aware fine-grained reliability manager provides up to 30% reliability within the thermal budget.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"245 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131569168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Mixed-Height Standard Cell Placement Flow for Digital Circuit Blocks* 数字电路块的混合高度标准单元放置流*
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714849
Yi-Cheng Zhao, Yu-Chieh Lin, Ting-Chi Wang, Ting-Hsiung Wang, Yun-Ru Wu, Hsin-Chang Lin, Shu-Yi Kao
In this paper, we present a mixed-height standard cell placement flow for digital circuit blocks. To our best knowledge, commercial tools currently do not support this type of flow in a fully automated manner. In our placement flow, we leverage a commercial placement tool and integrate it with several new point tools. Promising experimental results are reported to demonstrate the efficacy of our placement flow.
在本文中,我们提出了一种数字电路块的混合高度标准单元放置流程。据我们所知,商业工具目前还不能完全自动化地支持这种类型的流。在我们的放置流程中,我们利用商业放置工具并将其与几个新的点工具集成。实验结果证明了我们的安置流程的有效性。
{"title":"A Mixed-Height Standard Cell Placement Flow for Digital Circuit Blocks*","authors":"Yi-Cheng Zhao, Yu-Chieh Lin, Ting-Chi Wang, Ting-Hsiung Wang, Yun-Ru Wu, Hsin-Chang Lin, Shu-Yi Kao","doi":"10.23919/DATE.2019.8714849","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714849","url":null,"abstract":"In this paper, we present a mixed-height standard cell placement flow for digital circuit blocks. To our best knowledge, commercial tools currently do not support this type of flow in a fully automated manner. In our placement flow, we leverage a commercial placement tool and integrate it with several new point tools. Promising experimental results are reported to demonstrate the efficacy of our placement flow.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130987805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Enabling Energy-Efficient Unsupervised Monocular Depth Estimation on ARMv7-Based Platforms 在armv7平台上实现节能无监督单目深度估计
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714893
Valentino Peluso, A. Cipolletta, A. Calimera, Matteo Poggi, F. Tosi, S. Mattoccia
This work deals with the implementation of energy-efficient monocular depth estimation using a low-cost CPU for low-power embedded systems. It first describes the PyD-Net depth estimation network, which consists of a lightweight CNN able to approach state-of-the-art accuracy with ultra-low resource usage. Then it proposes an accuracy-driven complexity reduction strategy based on a hardware-friendly fixed-point quantization. Finally, it introduces the low-level optimization enabling effective use of integer neural kernels. The objective is threefold: (i) prove the efficiency of the new quantization flow on a depth estimation network, that is, the capability to retaining the accuracy reached by floating-point arithmetic using 16- and 8-bit integers, (ii) demonstrate the portability of the quantized model into a general-purpose 32-bit RISC architecture of the ARM Cortex family, (iii) quantify the accuracy-energy tradeoff of unsupervised monocular estimation to establish its use in the embedded domain. The experiments have been run on a Raspberry PI board powered by a Broadcom BCM2837 chipset. A parametric analysis conducted over the KITTI date-set shows marginal accuracy loss with 16-bit (8-bit) integers and energy savings up to 6.55× (9.23×) w.r.t. floating-point. Compared to high-end CPU and GPU the proposed solution improves scalability.
本文研究了在低功耗嵌入式系统中使用低成本CPU实现节能的单目深度估计。它首先描述了PyD-Net深度估计网络,该网络由一个轻量级的CNN组成,能够以超低的资源使用达到最先进的精度。然后提出了一种基于硬件友好的定点量化的精度驱动的复杂性降低策略。最后,介绍了能够有效利用整数神经核的低级优化。目标有三个:(i)证明新的量化流在深度估计网络上的效率,即使用16位和8位整数保持浮点运算达到的精度的能力,(ii)证明量化模型可移植到ARM Cortex家族的通用32位RISC架构中,(iii)量化无监督单目估计的精度-能量权衡,以建立其在嵌入式领域的使用。实验在由Broadcom BCM2837芯片组供电的树莓派板上运行。对KITTI日期集进行的参数分析显示,16位(8位)整数的边际精度损失和高达6.55× (9.23×) w.r.t.浮点的能量节省。与高端CPU和GPU相比,该解决方案提高了可扩展性。
{"title":"Enabling Energy-Efficient Unsupervised Monocular Depth Estimation on ARMv7-Based Platforms","authors":"Valentino Peluso, A. Cipolletta, A. Calimera, Matteo Poggi, F. Tosi, S. Mattoccia","doi":"10.23919/DATE.2019.8714893","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714893","url":null,"abstract":"This work deals with the implementation of energy-efficient monocular depth estimation using a low-cost CPU for low-power embedded systems. It first describes the PyD-Net depth estimation network, which consists of a lightweight CNN able to approach state-of-the-art accuracy with ultra-low resource usage. Then it proposes an accuracy-driven complexity reduction strategy based on a hardware-friendly fixed-point quantization. Finally, it introduces the low-level optimization enabling effective use of integer neural kernels. The objective is threefold: (i) prove the efficiency of the new quantization flow on a depth estimation network, that is, the capability to retaining the accuracy reached by floating-point arithmetic using 16- and 8-bit integers, (ii) demonstrate the portability of the quantized model into a general-purpose 32-bit RISC architecture of the ARM Cortex family, (iii) quantify the accuracy-energy tradeoff of unsupervised monocular estimation to establish its use in the embedded domain. The experiments have been run on a Raspberry PI board powered by a Broadcom BCM2837 chipset. A parametric analysis conducted over the KITTI date-set shows marginal accuracy loss with 16-bit (8-bit) integers and energy savings up to 6.55× (9.23×) w.r.t. floating-point. Compared to high-end CPU and GPU the proposed solution improves scalability.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134132629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Secure Intermittent Computing Protocol: Protecting State Across Power Loss 安全间歇计算协议:跨失电保护状态
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714997
Archanaa S. Krishnan, Charles Suslowicz, Daniel Dinu, P. Schaumont
Intermittent computing systems execute long-running tasks under a transient power supply such as an energy harvesting power source. During a power loss, they save intermediate program state as a checkpoint into write-efficient non-volatile memory. When the power is restored, the system state is reconstructed from the checkpoint, and the long-running computation continues. We analyze the security risks when power interruption is used as an attack vector, and we demonstrate the need to protect the integrity, authenticity, confidentiality, continuity, and freshness of checkpointed data. We propose a secure checkpointing technique called the Se-cure Intermittent Computing Protocol (SICP). The proposed protocol has the following properties. First, it associates every checkpoint with a unique power-on state to checkpoint replay. Second, every checkpoint is cryptographically chained to its predecessor, providing continuity, which enables the programmer to carry run-time security properties such as attested program images across power loss events. Third, SICP is atomic and resistant to power loss. We demonstrate a prototype implementation of SICP on an MSP430 microcontroller, and we investigate the overhead of SICP for several cryptographic kernels. To the best of our knowledge, this is the first work to provide a robust solution to secure intermittent computing.
间歇计算系统在瞬态电源(如能量收集电源)下执行长时间运行的任务。在断电期间,它们将中间程序状态作为检查点保存到写效率高的非易失性存储器中。当电源恢复后,从检查点重构系统状态,并继续长时间运行的计算。我们分析了当电源中断被用作攻击向量时的安全风险,并演示了保护检查点数据的完整性、真实性、机密性、连续性和新鲜度的必要性。我们提出了一种安全的检查点技术,称为Se-cure间歇计算协议(SICP)。提议的协议具有以下属性。首先,它将每个检查点与检查点重播的唯一通电状态关联起来。其次,每个检查点都以加密方式链接到它的前身,从而提供连续性,这使程序员能够携带运行时安全性属性,例如跨断电事件的已证明的程序映像。第三,SICP是原子的,可以抵抗功率损失。我们在MSP430微控制器上演示了SICP的原型实现,并研究了几种加密内核的SICP开销。据我们所知,这是第一个为确保间歇性计算提供健壮解决方案的工作。
{"title":"Secure Intermittent Computing Protocol: Protecting State Across Power Loss","authors":"Archanaa S. Krishnan, Charles Suslowicz, Daniel Dinu, P. Schaumont","doi":"10.23919/DATE.2019.8714997","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714997","url":null,"abstract":"Intermittent computing systems execute long-running tasks under a transient power supply such as an energy harvesting power source. During a power loss, they save intermediate program state as a checkpoint into write-efficient non-volatile memory. When the power is restored, the system state is reconstructed from the checkpoint, and the long-running computation continues. We analyze the security risks when power interruption is used as an attack vector, and we demonstrate the need to protect the integrity, authenticity, confidentiality, continuity, and freshness of checkpointed data. We propose a secure checkpointing technique called the Se-cure Intermittent Computing Protocol (SICP). The proposed protocol has the following properties. First, it associates every checkpoint with a unique power-on state to checkpoint replay. Second, every checkpoint is cryptographically chained to its predecessor, providing continuity, which enables the programmer to carry run-time security properties such as attested program images across power loss events. Third, SICP is atomic and resistant to power loss. We demonstrate a prototype implementation of SICP on an MSP430 microcontroller, and we investigate the overhead of SICP for several cryptographic kernels. To the best of our knowledge, this is the first work to provide a robust solution to secure intermittent computing.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"14 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129747881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A Fine-Grained Soft Error Resilient Architecture under Power Considerations 考虑功耗的细粒度软错误弹性体系结构
Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714797
Sajjad Hussain, M. Shafique, J. Henkel
Besides the limited power budgets and the darksilicon issue, soft error is one of the most critical reliability issues in computing systems fabricated using nano-scale devices. During the execution, different applications have varying performance, power/energy consumption and vulnerability properties. Different trade-offs can be devised to provide required resiliency within the allowed power constraints. To exploit this behavior, we propose a novel soft error resilient architecture and the corresponding run-time system that enables power-aware finegrained resiliency for different processor components. It selectively determines the reliability state of various components, such that the overall application reliability is improved under a given power budget. Our architecture saves power up to 16% and reliability degradation up to 11% compared to state-of-the-art techniques.
除了有限的功率预算和暗硅问题外,软误差是使用纳米级器件制造的计算系统中最关键的可靠性问题之一。在执行过程中,不同的应用程序具有不同的性能、功耗/能耗和漏洞属性。可以设计不同的折衷方案,以在允许的功率限制范围内提供所需的弹性。为了利用这种行为,我们提出了一种新的软错误弹性体系结构和相应的运行时系统,该系统可以为不同的处理器组件提供功率感知的细粒度弹性。它有选择地确定各个组件的可靠性状态,从而在给定的功率预算下提高整个应用程序的可靠性。与最先进的技术相比,我们的架构可节省高达16%的电力,可靠性降低高达11%。
{"title":"A Fine-Grained Soft Error Resilient Architecture under Power Considerations","authors":"Sajjad Hussain, M. Shafique, J. Henkel","doi":"10.23919/DATE.2019.8714797","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714797","url":null,"abstract":"Besides the limited power budgets and the darksilicon issue, soft error is one of the most critical reliability issues in computing systems fabricated using nano-scale devices. During the execution, different applications have varying performance, power/energy consumption and vulnerability properties. Different trade-offs can be devised to provide required resiliency within the allowed power constraints. To exploit this behavior, we propose a novel soft error resilient architecture and the corresponding run-time system that enables power-aware finegrained resiliency for different processor components. It selectively determines the reliability state of various components, such that the overall application reliability is improved under a given power budget. Our architecture saves power up to 16% and reliability degradation up to 11% compared to state-of-the-art techniques.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126215910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1