首页 > 最新文献

2022 11th International Conference on Modern Circuits and Systems Technologies (MOCAST)最新文献

英文 中文
A Low-Latency Syndrome-based Deep Learning Decoder Architecture and its FPGA Implementation 一种基于低延迟综合征的深度学习解码器架构及其FPGA实现
Pub Date : 2022-06-08 DOI: 10.1109/mocast54814.2022.9837752
E. Kavvousanos, Vassilis Paliouras
Recently, Machine Learning has been considered as an alternative design paradigm for various communications sub-systems. However, the works that have assessed the performance of these methods beyond the algorithmic level are limited. In this paper, we implement in hardware and evaluate the performance of the Syndrome-based Deep Learning Decoder for a BCH(63,45) code in terms of throughput rate and latency. The implemented Neural Network is compressed by applying pruning, clustering and quantization to an 8-bit fixed-point representation, with no significant loss in its BER performance, while achieving 90% weight sparsity in each layer. An FPGA architecture is designed for the decoder which exploits the compressed structure of the Neural Network in order to accelerate the underlying computations with moderate hardware requirements. Experimental results are provided which show that the decoder achieves latency less than a tenth of a millisecond and a throughput rate up to 5 Mbps, substantially outperforming previous implementations by 30×.
最近,机器学习被认为是各种通信子系统的另一种设计范式。然而,在算法层面之外评估这些方法性能的工作是有限的。在本文中,我们在硬件上实现了基于综合征的深度学习解码器,并从吞吐量和延迟方面评估了BCH(63,45)代码的性能。所实现的神经网络通过对8位定点表示应用剪枝、聚类和量化来压缩,其误码率性能没有明显损失,同时每层的权重稀疏度达到90%。为解码器设计了一种FPGA架构,利用神经网络的压缩结构,在对硬件要求适中的情况下加快底层计算速度。实验结果表明,该解码器的延迟小于十分之一毫秒,吞吐率高达5 Mbps,大大优于以前的实现30倍。
{"title":"A Low-Latency Syndrome-based Deep Learning Decoder Architecture and its FPGA Implementation","authors":"E. Kavvousanos, Vassilis Paliouras","doi":"10.1109/mocast54814.2022.9837752","DOIUrl":"https://doi.org/10.1109/mocast54814.2022.9837752","url":null,"abstract":"Recently, Machine Learning has been considered as an alternative design paradigm for various communications sub-systems. However, the works that have assessed the performance of these methods beyond the algorithmic level are limited. In this paper, we implement in hardware and evaluate the performance of the Syndrome-based Deep Learning Decoder for a BCH(63,45) code in terms of throughput rate and latency. The implemented Neural Network is compressed by applying pruning, clustering and quantization to an 8-bit fixed-point representation, with no significant loss in its BER performance, while achieving 90% weight sparsity in each layer. An FPGA architecture is designed for the decoder which exploits the compressed structure of the Neural Network in order to accelerate the underlying computations with moderate hardware requirements. Experimental results are provided which show that the decoder achieves latency less than a tenth of a millisecond and a throughput rate up to 5 Mbps, substantially outperforming previous implementations by 30×.","PeriodicalId":122414,"journal":{"name":"2022 11th International Conference on Modern Circuits and Systems Technologies (MOCAST)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121901535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A low power low noise 65nm charge pump using mismatch compensation and smoothing capacitor 采用失配补偿和平滑电容的低功耗低噪声65nm电荷泵
Pub Date : 2022-06-08 DOI: 10.1109/mocast54814.2022.9837615
D. Samaras, A. Hatzopoulos
This work presents a Charge Pump circuit using active current mismatch compensation, which exhibits low power, low noise and good linearity for integer or fractional -N PLL. The proposed charge pump has a Vctrl range of 0.3V to 0.75V, 1.135 × 10-24 A2/Hz output current noise and current mismatch below 0.01%. Additionally, it includes a smoothing capacitor to reduce the transient phenomena which generate non-linearity. The layout dimensions of the proposed charge pump is 88um x 80um. Finally, the charge pump is designed using 65nm TSMC process with a supply voltage of 1V and power consumption of 309uW at 150uA current output.
本文提出了一种采用有源电流失配补偿的电荷泵电路,该电路具有低功耗、低噪声和良好的线性性,适用于整数或分数n锁相环。所提出的电荷泵的电压控制范围为0.3V至0.75V,输出电流噪声为1.135 × 10-24 A2/Hz,电流失配低于0.01%。此外,它还包括一个平滑电容器,以减少产生非线性的瞬态现象。所建议的电荷泵的布局尺寸为88um x 80um。最后,采用65nm TSMC工艺设计了充电泵,电源电压为1V,输出150uA电流时功耗为309w。
{"title":"A low power low noise 65nm charge pump using mismatch compensation and smoothing capacitor","authors":"D. Samaras, A. Hatzopoulos","doi":"10.1109/mocast54814.2022.9837615","DOIUrl":"https://doi.org/10.1109/mocast54814.2022.9837615","url":null,"abstract":"This work presents a Charge Pump circuit using active current mismatch compensation, which exhibits low power, low noise and good linearity for integer or fractional -N PLL. The proposed charge pump has a Vctrl range of 0.3V to 0.75V, 1.135 × 10-24 A2/Hz output current noise and current mismatch below 0.01%. Additionally, it includes a smoothing capacitor to reduce the transient phenomena which generate non-linearity. The layout dimensions of the proposed charge pump is 88um x 80um. Finally, the charge pump is designed using 65nm TSMC process with a supply voltage of 1V and power consumption of 309uW at 150uA current output.","PeriodicalId":122414,"journal":{"name":"2022 11th International Conference on Modern Circuits and Systems Technologies (MOCAST)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133141325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
HashPIM: High-Throughput SHA-3 via Memristive Digital Processing-in-Memory HashPIM:基于内存中的记忆数字处理的高吞吐量SHA-3
Batel Oved, Orian Leitersdorf, R. Ronen, Shahar Kvatinsky
Recent research has sought to accelerate cryptographic hash functions as they are at the core of modern cryptography. Traditional designs, however, suffer from the von Neumann bottleneck that originates from the separation of processing and memory units. An emerging solution to overcome this bottleneck is processing-in-memory (PIM): performing logic within the same devices responsible for memory to eliminate data-transfer and simultaneously provide massive computational parallelism. In this paper, we seek to vastly accelerate the state-of-the-art SHA-3 cryptographic function using the memristive memory processing unit (mMPU), a general-purpose memristive PIM architecture. To that end, we propose a novel in-memory algorithm for variable rotation, and utilize an efficient mapping of the SHA-3 state vector for memristive crossbar arrays to efficiently exploit PIM parallelism. We demonstrate a massive energy efficiency of 1, 422 Gbps/W, improving a state-of-the-art memristive SHA-3 accelerator (SHINE-2) by 4.6 ×.
最近的研究一直在寻求加速加密哈希函数,因为它们是现代密码学的核心。然而,传统的设计受到冯·诺依曼瓶颈的困扰,这种瓶颈源于处理单元和存储单元的分离。克服这一瓶颈的一个新兴解决方案是内存中处理(PIM):在负责内存的相同设备中执行逻辑,以消除数据传输,同时提供大量的计算并行性。在本文中,我们试图使用记忆存储器处理单元(mMPU)极大地加速最先进的SHA-3加密功能,mMPU是一种通用的记忆性PIM架构。为此,我们提出了一种新的内存可变旋转算法,并利用记忆交叉棒阵列的SHA-3状态向量的有效映射来有效地利用PIM并行性。我们展示了1422 Gbps/W的巨大能源效率,将最先进的记忆性SHA-3加速器(SHINE-2)提高了4.6倍。
{"title":"HashPIM: High-Throughput SHA-3 via Memristive Digital Processing-in-Memory","authors":"Batel Oved, Orian Leitersdorf, R. Ronen, Shahar Kvatinsky","doi":"10.48550/arXiv.2205.13559","DOIUrl":"https://doi.org/10.48550/arXiv.2205.13559","url":null,"abstract":"Recent research has sought to accelerate cryptographic hash functions as they are at the core of modern cryptography. Traditional designs, however, suffer from the von Neumann bottleneck that originates from the separation of processing and memory units. An emerging solution to overcome this bottleneck is processing-in-memory (PIM): performing logic within the same devices responsible for memory to eliminate data-transfer and simultaneously provide massive computational parallelism. In this paper, we seek to vastly accelerate the state-of-the-art SHA-3 cryptographic function using the memristive memory processing unit (mMPU), a general-purpose memristive PIM architecture. To that end, we propose a novel in-memory algorithm for variable rotation, and utilize an efficient mapping of the SHA-3 state vector for memristive crossbar arrays to efficiently exploit PIM parallelism. We demonstrate a massive energy efficiency of 1, 422 Gbps/W, improving a state-of-the-art memristive SHA-3 accelerator (SHINE-2) by 4.6 ×.","PeriodicalId":122414,"journal":{"name":"2022 11th International Conference on Modern Circuits and Systems Technologies (MOCAST)","volume":"26 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132193337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 11th International Conference on Modern Circuits and Systems Technologies (MOCAST)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1