A Scheduling Framework for Decomposable Kernels on Energy Harvesting IoT Edge Nodes
Sethu Jose, J. Sampson, N. Vijaykrishnan, M. Kandemir
DOI: 10.1145/3526241.3530350

With the growing popularity of the Internet of Things (IoT), emerging applications demand that edge nodes provide higher computational capability and long operating times while requiring minimal maintenance. Ambient energy harvesting is a promising alternative to batteries, but only if the hardware and software are optimized for the intermittent nature of the power source. At the same time, many compute tasks in IoT workloads involve executing decomposable kernels that may have application-dependent accuracy requirements. In this work, we introduce a hardware-software co-optimization framework for such kernels that aims to achieve maximum forward progress while running on energy-harvesting Non-Volatile Processors (NVPs). Using this framework, we develop an FFT accelerator and a convolution accelerator that compute up to 3.2x faster, while consuming 5.4x less energy, than a baseline energy-harvesting system. With our accuracy-aware scheduling strategy, the approximate computing enabled by this framework delivers, on average, a 6.2x energy reduction and a 3.2x speedup while sacrificing at most 6.9% accuracy.

HDnn-PIM: Efficient in Memory Design of Hyperdimensional Computing with Feature Extraction
Arpan Dutta, Saransh Gupta, Behnam Khaleghi, Rishikanth Chandrasekaran, Weihong Xu, T. Simunic
DOI: 10.1145/3526241.3530331

Brain-inspired Hyperdimensional (HD) computing is a new machine learning approach that leverages simple and highly parallelizable operations. Unfortunately, none of the HD computing algorithms published to date has been able to accurately classify more complex image datasets, such as CIFAR100. In this work, we propose HDnn-PIM, which implements both feature extraction and HD-based classification for complex images using processing-in-memory. We compare HDnn-PIM with HD-only and CNN implementations on various image datasets. HDnn-PIM achieves 52.4% higher accuracy than pure HD computing. It also gains a 1.2% accuracy improvement over state-of-the-art CNNs, with a 3.63x smaller memory footprint and 1.53x fewer MAC operations. Furthermore, HDnn-PIM is 3.6x-223x faster than an RTX 3090 GPU and 3.7x more energy efficient than the state-of-the-art FloatPIM.

A Scalable, Deterministic Approach to Stochastic Computing
Y. Kiran, Marc D. Riedel
DOI: 10.1145/3526241.3530344

Stochastic computing is a paradigm in which logical operations are performed on randomly generated bit streams. Complex arithmetic operations can be performed by simple logic circuits with a much smaller area footprint than their conventional binary counterparts. However, the random or pseudorandom sources required to generate the bit streams are costly in terms of area and offset these gains. Also, due to the randomness, the computation is not precise, which limits the applicability of the paradigm. Most importantly, high latency is required to achieve reasonable accuracy. Recently, deterministic approaches to stochastic computing have been proposed, demonstrating that randomness is not a requirement: by structuring the computation deterministically, the result is exact and the latency is greatly reduced. However, despite being an improvement over conventional stochastic techniques, the latency still increases quadratically with each level of logic; beyond a few levels, it becomes unmanageable. In this paper, we present a method for approximating the results of the deterministic method with latency that increases only linearly with each level. The improvement comes at the cost of additional logic, but we demonstrate that the increase in area scales with √n, where n is the equivalent number of binary bits of precision. The new approach is general, efficient, composable, and applicable to all arithmetic operations performed with stochastic logic.
{"title":"Session details: Session 7A: Special Session - 3: Machine Learning-Aided Computer-Aided Design","authors":"Sai Manoj Pudukotai Dinakarrao","doi":"10.1145/3542694","DOIUrl":"https://doi.org/10.1145/3542694","url":null,"abstract":"","PeriodicalId":188228,"journal":{"name":"Proceedings of the Great Lakes Symposium on VLSI 2022","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125136345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Session 3A: VLSI Design + VLSI Circuits and Power Aware Design 1","authors":"S. Mohanty","doi":"10.1145/3542686","DOIUrl":"https://doi.org/10.1145/3542686","url":null,"abstract":"","PeriodicalId":188228,"journal":{"name":"Proceedings of the Great Lakes Symposium on VLSI 2022","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121750945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

On Attacking Locking SIB based IJTAG Architecture
G. Kumar, Anjum Riaz, Yamuna Prasad, Satyadev Ahlawat
DOI: 10.1145/3526241.3530370

The IEEE 1687 standard, which is commonly used for efficient access to on-chip instruments, can be exploited by an intruder and thus needs to be secured. One technique to alleviate the vulnerability of the 1687 network is a secure access protocol based on licensed access software, a chip ID, and locking SIBs. The licensed access software is used to gain control of the embedded instruments and operate them as required. In this paper, a successful attack on this secure access protocol scheme is mounted using various machine learning algorithms. It is demonstrated that machine learning algorithms can breach the secure communication between the access software and the board and thereby access the sensitive instruments. Furthermore, Random Forest significantly outperforms the other models at breaking the security.

Ran$Net: An Anti-Ransomware Methodology based on Cache Monitoring and Deep Learning
Xiang Zhang, Ziyue Zhang, Ruyi Ding, Gongye Cheng, A. Ding, Yunsi Fei
DOI: 10.1145/3526241.3530830

Ransomware has become a serious threat in cyberspace. Existing software pattern-based malware detectors are specific to certain ransomware and may not capture new variants. Recognizing a common essential behavior of ransomware - employing local cryptographic software for malicious encryption and therefore leaving footprints in the victim machine's caches - this work proposes an anti-ransomware methodology, Ran$Net, based on hardware activities. It consists of a passive cache monitor that logs suspicious cache activities and a follow-on non-profiled deep learning analysis strategy that retrieves the secret cryptographic key from the timing traces generated by the monitor. We implement the first tool of its kind to combat an open-source ransomware and successfully recover the secret key.

Two 0.8 V, Highly Reliable RHBD 10T and 12T SRAM Cells for Aerospace Applications
Aibin Yan, Zhihui He, Jing Xiang, Jie Cui, Yong Zhou, Zhengfeng Huang, P. Girard, X. Wen
DOI: 10.1145/3526241.3530312

Aggressive scaling of CMOS technologies requires attention to the reliability of circuits. This paper presents two highly reliable RHBD 10T and 12T SRAM cells that can protect against single-node upsets (SNUs) and double-node upsets (DNUs). The 10T cell mainly consists of two cross-coupled input-split inverters, and it can robustly retain stored values through a feedback mechanism among its internal nodes. It also has low area and power costs, since it uses only a few transistors. Based on the 10T cell, a 12T cell is proposed that uses four parallel access transistors. The 12T cell has a reduced read/write access time with the same soft-error tolerance as the 10T cell. Simulation results demonstrate that the proposed cells can recover from SNUs and from a subset of DNUs. Moreover, compared with state-of-the-art hardened SRAM cells, the proposed 10T cell saves, on average, 28.59% write access time, 55.83% read access time, and 4.46% power dissipation at the cost of 4.04% more silicon area.

LEAD: Logarithmic Exponent Approximate Divider For Image Quantization Application
Omkar G. Ratnaparkhi, M. Rao
DOI: 10.1145/3526241.3530323

Most modern VLSI applications are moving toward energy-efficient, high-speed computing solutions. Approximate computing is considered a suitable design methodology that satisfies current hardware and performance requirements without significantly compromising the outcome. Many arithmetic operations have been realized using approximate computing techniques, and many successful implementations are reported at the system level. However, dividers are rarely realized in hardware, and they deserve more attention given the surge in hardware implementations of neural networks. In this paper, a novel approximate divider is proposed that offers better accuracy and hardware efficiency than other reported dividers. The proposed divider builds on a logarithmic divider and approximates the exponent part to achieve the desired hardware characteristics. The proposed 8-bit and 16-bit divider designs were realized in 45-nm CMOS technology for different input and output data formats, including integer, fixed-point, and floating-point. The proposed divider was characterized for error and hardware metrics and compared with other dividers. The novel divider was validated on the K-means color quantization algorithm, showcasing improved quantization results.
{"title":"Session details: Session 7B: Microelectronic Systems Education","authors":"B. Skromme","doi":"10.1145/3542695","DOIUrl":"https://doi.org/10.1145/3542695","url":null,"abstract":"","PeriodicalId":188228,"journal":{"name":"Proceedings of the Great Lakes Symposium on VLSI 2022","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122436734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}