Modern automotive systems require increased performance to implement Advanced Driving Assistance Systems (ADAS). GPU-powered platforms are promising candidates for such computational tasks; however, current low-level programming models complicate the accelerator software certification process and limit the hardware selection to a fraction of the available platforms. In this paper we present Brook Auto, a high-level programming language for automotive GPU systems which removes these limitations. We describe the challenges we faced in its implementation and our solutions, as well as a complete evaluation in terms of performance and productivity, which shows the effectiveness of our method.
{"title":"Brook Auto: High-Level Certification-Friendly Programming for GPU-powered Automotive Systems","authors":"Matina Maria Trompouki, Leonidas Kosmidis","doi":"10.1145/3195970.3196002","DOIUrl":"https://doi.org/10.1145/3195970.3196002","url":null,"abstract":"Modern automotive systems require increased performance to implement Advanced Driving Assistance Systems (ADAS). GPU-powered platforms are promising candidates for such computational tasks, however current low-level programming models challenge the accelerator software certification process, while they limit the hardware selection to a fraction of the available platforms. In this paper we present Brook Auto, a high-level programming language for automotive GPU systems which removes these limitations. We describe the challenges and solutions we faced in its implementation, as well as a complete evaluation in terms of performance and productivity, which shows the effectiveness of our method.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"17 3 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76990721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Performance failure has become a growing concern for the robustness and reliability of memory circuits. It is challenging to accurately estimate the extremely small failure probability when failed samples are distributed in multiple disjoint failure regions. In this paper, we develop an adaptive importance sampling (AIS) method. AIS performs several iterations of sampling-region adjustment, whereas existing methods pre-decide a static sampling distribution. By iteratively searching for failure regions, AIS may lead to better efficiency and accuracy. This is validated by our experiments. For an SRAM cell with a single failure region, AIS uses 5–10X fewer samples and reaches better accuracy compared to several recent methods. For a sense amplifier circuit with multiple failure regions, AIS is 4369X faster than Monte Carlo (MC) without compromising accuracy, while other methods fail to cover all failure regions in our experiment.
{"title":"A Fast and Robust Failure Analysis of Memory Circuits Using Adaptive Importance Sampling Method","authors":"Xiao Shi, Jun Yang, Fengyuan Liu, Lei He","doi":"10.1145/3195970.3195972","DOIUrl":"https://doi.org/10.1145/3195970.3195972","url":null,"abstract":"Performance failure has become a growing concern for the robustness and reliability of memory circuits. It is challenging to accurately estimate the extremely small failure probability when failed samples are distributed in multiple disjoint failure regions. In this paper, we develop an adaptive importance sampling (AlS) method. AIS has several iterations of sampling region adjusbnents, while existing methods pre-decide a static sampling distribution. By iteratively searching for failure regions, AIS may lead to better efficiency and accuracy. This is validated by our experiments. For SRAM cell with single failure region, AIS uses 5–10X fewer samples and reaches better accuracy when compared to several recent methods. For sense amplifier circuit with multiple failure regions, AIS is 4369X faster than MC without compromising accuracy, while other methods fail to cover all failure regions in our experiment","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"97 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76188637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
With the trend toward specialization, an efficient memory-path design is vital to capitalize on customization in the data-path. A monolithic memory hierarchy is often highly inefficient for irregular applications, traditionally targeted at CPUs. New approaches and tools are required to offer application-specific memory customization that combines the benefits of cache and scratchpad memory simultaneously. This paper introduces a novel approach for automated application-specific on-chip memory assignment and tiling. The approach offers two major tools: (1) static memory access analysis and (2) variable-level memory assignment. Static memory analysis is performed at the LLVM abstraction level; it extracts target-independent pointer behaviors, measures access strides, and analyzes the prefetchability of variables. Variable-level memory assignment creates a memory allocation graph for memory assignment (cache vs. scratchpad) based on the variables' sizes and their estimated locality; it also explores opportunities for tiling memory accesses. For the exploration and results, this paper uses the MachSuite benchmarks (with both regular and irregular memory access behaviors) and the gem5-Aladdin tool for performance and power evaluation. The proposed approach optimizes the memory hierarchy by automatically combining the benefits of cache and (tiled) scratchpad at variable-level granularity per individual application. The results demonstrate more than a 45% improvement in the power-stall product, on average, over a monolithic cache or scratchpad design.
{"title":"Locality Aware Memory Assignment and Tiling","authors":"Samuel Rogers, H. Tabkhi","doi":"10.1145/3195970.3196070","DOIUrl":"https://doi.org/10.1145/3195970.3196070","url":null,"abstract":"With the trend toward specialization, an efficient memory-path design is vital to capitalize customization in data-path. A monolithic memory hierarchy is often highly inefficient for irregular applications, traditionally targeted for CPUs. New approaches and tools are required to offer application-specific memory customization combining the benefits of cache and scratchpad memory simultaneously.This paper introduces a novel approach for automated application-specific on-chip memory assignment and tiling. The approach offers two major tools: (1) static memory access analysis and (2) variable-level memory assignment. Static memory analysis performs at the LLVM abstraction. It extracts target-independent pointer behaviors, measures the access strides and analyze the prefetchability of variables. (2) variable-level memory assignment creates a memory allocation graph for memory assignment (cache vs. scratchpad) based on the variables size and their estimated locality. It also explores the opportunity for tiling memory access. For the exploration and results, this paper uses Machsuite benchmarks (with both regular & irregular memory access behaviors), and gem5-Aladdin tool for performance & power evaluation. The proposed approach optimizes the memory hierarchy by automatically combining the benefits of cache, (tiled-) scratchpad at variable level granularity per individual applications. The results demonstrate more than 45% improvement in our power-stall product, on average, over the monolithic cache or scratchpad design.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88374071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data stored in an off-chip memory, such as DRAM or non-volatile main memory, can potentially be extracted or tampered with by an attacker with physical access to a device. Protecting against such attacks requires storing message authentication codes (MACs) and counters, which incur a 22% storage overhead. In this work, we propose techniques for reducing these overheads. We first present a scheme that leverages ECC DRAMs to reduce MAC verification and storage overheads. We replace the parity bits in standard ECC with a combination of MAC and parity bits to provide both authentication and error correction. This eliminates the extra MAC storage and minimizes the verification overhead, as MACs can be read in parallel with data through the ECC bus. Next, we use efficient integer encodings to reduce counter storage overhead by 6x while enhancing application performance.
{"title":"Reducing the Overhead of Authenticated Memory Encryption Using Delta Encoding and ECC Memory","authors":"Salessawi Ferede Yitbarek, T. Austin","doi":"10.1145/3195970.3196102","DOIUrl":"https://doi.org/10.1145/3195970.3196102","url":null,"abstract":"Data stored in an off-chip memory, such as DRAM or non-volatile main memory, can potentially be extracted or tampered by an attacker with physical access to a device. Protecting such attacks requires storing message authentication codes and counters - which incur a 22% storage overhead. In this work, we propose techniques for reducing these overheads.We first present a scheme that leverages ECC DRAMs to reduce MAC verification & storage overheads. We replace the parity bits in standard ECC by a combination of MAC and parity bits to provide both authentication and error correction. This eliminates the extra MAC storage and minimizes the verification overhead as MACs can be read in parallel with data through the ECC bus. Next, we use efficient integer encodings to reduce counter storage overhead by 6 × while enhancing application performance.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"74 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80564750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Energy-efficient microprocessors are essential for a wide range of applications. While near-threshold computing is a promising technique for improving energy efficiency, the optimal supply demands of the logic core and the on-chip memory conflict. In this paper, we perform a reliability analysis of 6T SRAM and discover imbalanced minimum voltage requirements between read and write operations. We leverage this imbalance in near-threshold processors equipped with voltage-boosting capability by proposing an opportunistic dual-supply switching scheme with a write aggregation buffer. Our results show that the proposed technique improves energy efficiency by more than 18% with an approximately 8.54% performance speed-up.
{"title":"SRAM based Opportunistic Energy Efficiency Improvement in Dual-Supply Near-Threshold Processors","authors":"Yunfei Gu, Dengxue Yan, Vaibhav Verma, M. Stan, Xuan Zhang","doi":"10.1145/3195970.3196121","DOIUrl":"https://doi.org/10.1145/3195970.3196121","url":null,"abstract":"Energy-efficient microprocessors are essential for a wide range of applications. While near-threshold computing is a promising technique to improve energy efficiency, optimal supply demands from logic core and on-chip memory are confiicting. In this paper, we perform reliability analysis of 6T SRAM and discover imbalanced minimum voltage requirements between read and write operations. We leverage this imbalance property in near-threshold processors equipped with voltage boosting capability by proposing an opportunistic dual-supply switching scheme with a write aggregation buffer. Our results show that proposed technique improves energy efficiency by more than 18% with approximate 8.54% performance speed-up.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"26 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83689313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
To store the desired data in MLC and TLC flash memories, conventional programming strategies need to divide a fixed threshold-voltage (Vt) window into several parts. The narrowly partitioned Vt window in turn limits the design of the programming strategy and becomes the main cause of flash-memory defects, i.e., longer read/write latency and worse data reliability. This motivates us to explore an innovative programming design that addresses these flash-memory defects. To achieve defect-free 3D NAND flash memory, this paper presents and realizes a one-shot program design that significantly eliminates the negative impacts caused by conventional programming strategies. The proposed one-shot program design includes two strategies, i.e., prophetic and classification programming, for MLC flash memories, and the idea is extended to TLC flash memories. The measurement results show that it can accelerate programming speed by 31x and reduce the RBER by 1000x for MLC flash memory, and it can broaden the available threshold-voltage window by up to 5.1x for TLC flash memory.
{"title":"Achieving Defect-Free Multilevel 3D Flash Memories with One-Shot Program Design","authors":"Chien-Chung Ho, Yung-Chun Li, Yuan-Hao Chang, Yu-Ming Chang","doi":"10.1145/3195970.3195982","DOIUrl":"https://doi.org/10.1145/3195970.3195982","url":null,"abstract":"To store the desired data on MLC and TLC flash memories, the conventional programming strategies need to divide a fixed range of threshold voltage (Vt) window into several parts. The narrowly partitioned Vt window in turn limits the design of programming strategy and becomes the main reason to cause flash-memory defects, i.e., the longer read/write latency and worse data reliability. This motivates this work to explore the innovative programming design for solving the flash-memory defects. Thus, to achieve the defect-free 3D NAND flash memory, this paper presents and realizes a one-shot program design to significantly eliminate the negative impacts caused by conventional programming strategies. The proposed one-shot program design includes two strategies, i.e., prophetic and classification programming, for MLC flash memories, and the idea is extended to TLC flash memories. The measurement results show that it can accelerate programming speed by 31x and reduce RBER by 1000x for the MLC flash memory, and it can broaden the available window of threshold voltage up to 5.1x for the TLC flash memory.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83551396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Engineering Change Orders (ECOs) modify a synthesized netlist after its specification has changed. An ECO is divided into two major tasks: finding the target signals whose functions should be updated, and synthesizing the patch that produces the desired change. This paper proposes an efficient SAT-based solution for the second task: resource-aware computation of multi-output patch functions. The solution is based on several new algorithms and outperforms the top three winners of the 2017 ICCAD CAD Contest (Problem A).
{"title":"Efficient Computation of ECO Patch Functions","authors":"A. Dao, Nian-Ze Lee, Li-Cheng Chen, Mark Po-Hung Lin, J. H. Jiang, A. Mishchenko, R. Brayton","doi":"10.1145/3195970.3196039","DOIUrl":"https://doi.org/10.1145/3195970.3196039","url":null,"abstract":"Engineering Change Orders (ECO) modify a synthesized netlist after its specification has changed. ECO is divided into two major tasks: finding target signals whose functions should be updated and synthesizing the patch that produces the desired change. This paper proposes an efficient SAT-based solution for the second task: resource-aware computation of multi-output patch functions. The solution is based on several new algorithms and outperforms the top three winners of the 2017 ICCAD CAD Contest (Problem A).","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"47 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90706485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A DRAM chip requires periodic refresh operations to prevent data loss due to charge leakage in DRAM cells. Refresh operations incur significant performance overhead, as a DRAM bank/rank becomes unavailable to service access requests while being refreshed. In this work, our goal is to reduce the performance overhead of DRAM refresh by reducing the latency of a refresh operation. We observe that a significant number of DRAM cells can retain their data for longer than the worst-case refresh period of 64 ms. Such cells do not always need to be fully refreshed; a low-latency partial refresh is sufficient for them. We propose Variable Refresh Latency DRAM (VRL-DRAM), a mechanism that fully refreshes a DRAM cell only when necessary and otherwise ensures data integrity by issuing low-latency partial refresh operations. We develop a new, detailed analytical model to estimate the minimum latency of a refresh operation that ensures the data integrity of a cell with a given retention-time profile. We evaluate VRL-DRAM with memory traces from real workloads and show that it reduces the average refresh performance overhead by 34% compared to the state-of-the-art approach.
{"title":"VRL-DRAM: Improving DRAM Performance via Variable Refresh Latency","authors":"Anup Das, Hasan Hassan, O. Mutlu","doi":"10.1145/3195970.3196136","DOIUrl":"https://doi.org/10.1145/3195970.3196136","url":null,"abstract":"A DRAM chip requires periodic refresh operations to prevent data loss due to charge leakage in DRAM cells. Refresh operations incur significant performance overhead as a DRAM bank/rank becomes unavailable to service access requests while being refreshed. In this work, our goal is to reduce the performance overhead of DRAM refresh by reducing the latency of a refresh operation. We observe that a significant number of DRAM cells can retain their data for longer than the worst-case refresh period of 64ms. Such cells do not always need to be fully refreshed; a low-latency partial refresh is sufficient for them.We propose Variable Refresh Latency DRAM (VRL-DRAM), a mechanism that fully refreshes a DRAM cell only when necessary, and otherwise ensures data integrity by issuing low-latency partial refresh operations. We develop a new detailed analytical model to estimate the minimum latency of a refresh operation that ensures data integrity of a cell with a given retention time profile. We evaluate VRL-DRAM with memory traces from real workloads, and show that it reduces the average refresh performance overhead by 34% compared to the state-of-the-art approach.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"37 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89955223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The emergence of IoT poses new challenges for authenticating numerous, highly heterogeneous IoT devices to their respective trust domains. Using passwords or pre-defined keys has drawbacks that limit their use in IoT scenarios. Recent works propose using contextual information about the ambient physical properties of devices' surroundings as a shared secret to mutually authenticate devices that are co-located, e.g., in the same room. In this paper, we analyze these context-based authentication solutions with regard to their security and their requirements on context quality. We quantify their achievable security based on empirical real-world data from context measurements in typical IoT environments.
{"title":"Revisiting Context-Based Authentication in IoT","authors":"Markus Miettinen, T. D. Nguyen, A. Sadeghi, N. Asokan","doi":"10.1145/3195970.3196106","DOIUrl":"https://doi.org/10.1145/3195970.3196106","url":null,"abstract":"The emergence of IoT poses new challenges towards solutions for authenticating numerous very heterogeneous IoT devices to their respective trust domains. Using passwords or pre-defined keys have drawbacks that limit their use in IoT scenarios. Recent works propose to use contextual information about ambient physical properties of devices' surroundings as a shared secret to mutually authenticate devices that are co-located, e.g., the same room. In this paper, we analyze these context-based authentication solutions with regard to their security and requirements on context quality. We quantify their achievable security based on empirical real-world data from context measurements in typical IoT environments.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"18 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72880805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}