首页 > 最新文献

2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)最新文献

英文 中文
Secure Communication with Peripherals in NoC-based Many-cores 基于网络的多核外设安全通信
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893244
R. Faccenda, Gustavo Comarú, L. L. Caimi, F. Moraes
Many-core systems-on-chip (MCSoCs) contain pro-cessing elements (PEs), peripherals attached to the system, and an NoC connecting them. These systems have different flows traversing the NoC: PE-PE and PE-peripheral flows. Malicious hardware or software can hinder system security due to the resource sharing feature, such as CPU sharing for multitasking or sharing NoC links for flows belonging to different applications. Methods that isolate applications with security constraints, such as Secure Zones (SZs), protect PE-PE flows against most of the attacks reported in the literature. Proposals with methods to secure the communication with peripherals in the literature are scarce, with most of them focusing on shared memory protection. This paper presents an original approach, Secure Mapping with Access Point - SeMAP, which creates mapping policies for SZs, and communication strategies with IO devices, to protect PE-peripheral flows. Results show that the application execution time is not penalized by applying SeMAP, presenting advantages compared to a state-of-the-art approach. In terms of security, SeMAP successfully resisted an attack campaign, blocking malicious packets attempting to enter the SZ.
多核片上系统(mcsoc)包含处理元件(pe)、连接到系统的外设和连接它们的NoC。这些系统具有穿越NoC的不同流:PE-PE和pe -外设流。由于资源共享特性,恶意的硬件或软件可能会影响系统的安全性,例如多任务时共享CPU或属于不同应用程序的流共享NoC链接。隔离具有安全约束(如安全区域(sz))的应用程序的方法可以保护PE-PE流免受文献中报道的大多数攻击。文献中关于安全与外设通信的方法的建议很少,大多数都集中在共享内存保护上。本文提出了一种新颖的方法,即带接入点的安全映射(SeMAP),它为sz创建映射策略,并与IO设备建立通信策略,以保护pe外设流。结果表明,应用SeMAP不会减少应用程序的执行时间,这与最先进的方法相比具有优势。在安全性方面,SeMAP成功地抵抗了攻击活动,阻止了试图进入SZ的恶意数据包。
{"title":"Secure Communication with Peripherals in NoC-based Many-cores","authors":"R. Faccenda, Gustavo Comarú, L. L. Caimi, F. Moraes","doi":"10.1109/SBCCI55532.2022.9893244","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893244","url":null,"abstract":"Many-core systems-on-chip (MCSoCs) contain pro-cessing elements (PEs), peripherals attached to the system, and an NoC connecting them. These systems have different flows traversing the NoC: PE-PE and PE-peripheral flows. Malicious hardware or software can hinder system security due to the resource sharing feature, such as CPU sharing for multitasking or sharing NoC links for flows belonging to different applications. Methods that isolate applications with security constraints, such as Secure Zones (SZs), protect PE-PE flows against most of the attacks reported in the literature. Proposals with methods to secure the communication with peripherals in the literature are scarce, with most of them focusing on shared memory protection. This paper presents an original approach, Secure Mapping with Access Point - SeMAP, which creates mapping policies for SZs, and communication strategies with IO devices, to protect PE-peripheral flows. Results show that the application execution time is not penalized by applying SeMAP, presenting advantages compared to a state-of-the-art approach. In terms of security, SeMAP successfully resisted an attack campaign, blocking malicious packets attempting to enter the SZ.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123113837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
eSi-BTC: an energy efficient Bitcoin mining core eSi-BTC:高效节能的比特币挖矿核心
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893218
C. Gewehr, Carlis Raupp, J. Leao
Bitcoin is the most well-adopted cryptocurrency today. As of 2022, it is estimated that around US$ 4 billion are transacted daily within the Bitcoin network. Optimizing the energy expenditure of Bitcoin mining is of interest to several parties, such as end-users, who make transactions on the Bitcoin network with lower fees, miners themselves, who increase mining profits due to lower operational costs, and government regulation bodies, who minimize the environmental and power-grid impacts of mining. In this work we present a Bitcoin mining core employing several optimizations and design techniques that aim to maximize energy efficiency. Results in a 6 nm technology node show a 33% improvement in energy efficiency when compared to the state-of-the-art implementation.
比特币是当今最受欢迎的加密货币。截至2022年,据估计,比特币网络每天的交易量约为40亿美元。优化比特币挖矿的能源消耗是各方都感兴趣的,比如终端用户,他们在比特币网络上以较低的费用进行交易,矿工本身,由于较低的运营成本而增加了挖矿利润,以及政府监管机构,他们最大限度地减少了挖矿对环境和电网的影响。在这项工作中,我们提出了一个比特币挖矿核心,采用了几种优化和设计技术,旨在最大限度地提高能源效率。结果显示,与最先进的实现相比,6纳米技术节点的能源效率提高了33%。
{"title":"eSi-BTC: an energy efficient Bitcoin mining core","authors":"C. Gewehr, Carlis Raupp, J. Leao","doi":"10.1109/SBCCI55532.2022.9893218","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893218","url":null,"abstract":"Bitcoin is the most well-adopted cryptocurrency today. As of 2022, it is estimated that around US$ 4 billion are transacted daily within the Bitcoin network. Optimizing the energy expenditure of Bitcoin mining is of interest to several parties, such as end-users, who make transactions on the Bitcoin network with lower fees, miners themselves, who increase mining profits due to lower operational costs, and government regulation bodies, who minimize the environmental and power-grid impacts of mining. In this work we present a Bitcoin mining core employing several optimizations and design techniques that aim to maximize energy efficiency. Results in a 6 nm technology node show a 33% improvement in energy efficiency when compared to the state-of-the-art implementation.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123336337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Error Resilience Evaluation of Approximate Storage in the Intra Prediction of VVC Decoders 近似存储在VVC解码器帧内预测中的容错性评价
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893263
Matheus Isquierdo, Renira Soares, F. Sampaio, B. Zatt, D. Palomino
This paper presents an error resilience evaluation of the intra prediction in VVC decoders when approximate storage is employed in the Reference Line Buffer (RLB). We present an error injection framework to simulate the use of approximate storage in the RLB buffer with commonly used Bit Error Rate (BER) values from literature for SRAM and DRAM technologies. We also perform the resilience evaluation considering different decoding configurations. Our analysis characterizes how the impacts of approximation are dependent on video content and configurations. The results show that approximate storage can be used in some of the evaluated scenarios with very low degradation on the final visual quality of the decoded video sequences.
本文研究了在参考线缓冲器(RLB)中采用近似存储时,VVC解码器的帧内预测的容错性评估。我们提出了一个错误注入框架来模拟RLB缓冲中近似存储的使用,并使用SRAM和DRAM技术的文献中常用的误码率(BER)值。我们还在考虑不同解码配置的情况下进行了弹性评估。我们的分析描述了近似的影响是如何依赖于视频内容和配置的。结果表明,近似存储可以在一些评估场景中使用,并且对解码视频序列的最终视觉质量降低很小。
{"title":"Error Resilience Evaluation of Approximate Storage in the Intra Prediction of VVC Decoders","authors":"Matheus Isquierdo, Renira Soares, F. Sampaio, B. Zatt, D. Palomino","doi":"10.1109/SBCCI55532.2022.9893263","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893263","url":null,"abstract":"This paper presents an error resilience evaluation of the intra prediction in VVC decoders when approximate storage is employed in the Reference Line Buffer (RLB). We present an error injection framework to simulate the use of approximate storage in the RLB buffer with commonly used Bit Error Rate (BER) values from literature for SRAM and DRAM technologies. We also perform the resilience evaluation considering different decoding configurations. Our analysis characterizes how the impacts of approximation are dependent on video content and configurations. The results show that approximate storage can be used in some of the evaluated scenarios with very low degradation on the final visual quality of the decoded video sequences.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116858630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Methodology for an Early Exploration of Embedded Systems using Portable Test and Stimulus Standard 使用便携式测试和激励标准的嵌入式系统早期探索方法
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893231
Frederik Kautz, Holger Blume, C. Sauer
The complexity of modern embedded systems, especially in the areas of health, Internet of Things (IoT), or Cyber Physical Systems is continuously increasing. Due to simultaneously decreasing time to market, methods and techniques for a disciplined early Design Space Exploration (DSE) become mandatory. Recently, the Portable Test and Stimulus Standard (PSS) was introduced as a more abstract level to describe test intend, helping with software driven verification of complex systems. Our work aims at its use early in the design cycle, which still suffers from non-optimal representation of applications namely regarding expressiveness, early availability, reuse and path to implementation. Following an analysis of recent work, we propose an early DSE methodology based on PSS following the Y-chart to tackle before mentioned aspects. From a UML like description of a high-end hearing aid algorithm, we show the first needed steps to create valid application-to-architecture mappings, which can be used for further trade-off analysis and reused within and between common development cycles.
现代嵌入式系统的复杂性,特别是在健康、物联网(IoT)或网络物理系统领域不断增加。由于同时缩短了上市时间,有纪律的早期设计空间探索(DSE)的方法和技术变得必不可少。最近,便携式测试和刺激标准(PSS)作为一个更抽象的层次来描述测试意图,有助于软件驱动复杂系统的验证。我们的工作目标是在设计周期的早期使用它,这仍然受到应用程序的非最佳表示的影响,即在表达性、早期可用性、重用和实现路径方面。在分析了最近的工作之后,我们提出了一种基于y图的PSS的早期DSE方法,以解决前面提到的问题。从高端助听器算法的类似UML的描述中,我们展示了创建有效的应用程序到体系结构映射所需的第一个步骤,这些映射可用于进一步的权衡分析,并可在公共开发周期内和之间重用。
{"title":"Methodology for an Early Exploration of Embedded Systems using Portable Test and Stimulus Standard","authors":"Frederik Kautz, Holger Blume, C. Sauer","doi":"10.1109/SBCCI55532.2022.9893231","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893231","url":null,"abstract":"The complexity of modern embedded systems, especially in the areas of health, Internet of Things (IoT), or Cyber Physical Systems is continuously increasing. Due to simultaneously decreasing time to market, methods and techniques for a disciplined early Design Space Exploration (DSE) become mandatory. Recently, the Portable Test and Stimulus Standard (PSS) was introduced as a more abstract level to describe test intend, helping with software driven verification of complex systems. Our work aims at its use early in the design cycle, which still suffers from non-optimal representation of applications namely regarding expressiveness, early availability, reuse and path to implementation. Following an analysis of recent work, we propose an early DSE methodology based on PSS following the Y-chart to tackle before mentioned aspects. From a UML like description of a high-end hearing aid algorithm, we show the first needed steps to create valid application-to-architecture mappings, which can be used for further trade-off analysis and reused within and between common development cycles.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127580950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Circuit Reliability Analysis with Considerations of Aging Effect 考虑老化效应的电路可靠性分析
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893233
Suoyue Zhan, Chunhong Chen
Aging effect is one of the critical factors causing circuit reliability degradation due to negative bias temperature instability (NBTI) with continuous and intense logic operation. While there is no lack of research work on aging-related reliability analysis at transistor- or gate-level, little has been done to estimate the reliability at circuit-level. This makes it difficult for the designers to predict the circuit lifetime. To fill this gap, this paper proposes a reliability estimation model to target the reliability degradation at the output of integrated circuits. Simulations on benchmark circuits show that the reliability degradation rate ranges from 1.5% to 8.2% over one-year period, depending on specific circuits.
老化效应是连续高强度逻辑运行负偏置温度不稳定性(NBTI)导致电路可靠性下降的关键因素之一。虽然在晶体管或栅极级的老化可靠性分析方面的研究并不缺乏,但在电路级的可靠性评估方面的研究却很少。这使得设计人员很难预测电路的寿命。为了填补这一空白,本文提出了一种针对集成电路输出可靠性退化的可靠性估计模型。在基准电路上的仿真表明,根据具体电路的不同,一年的可靠性退化率在1.5%到8.2%之间。
{"title":"Circuit Reliability Analysis with Considerations of Aging Effect","authors":"Suoyue Zhan, Chunhong Chen","doi":"10.1109/SBCCI55532.2022.9893233","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893233","url":null,"abstract":"Aging effect is one of the critical factors causing circuit reliability degradation due to negative bias temperature instability (NBTI) with continuous and intense logic operation. While there is no lack of research work on aging-related reliability analysis at transistor- or gate-level, little has been done to estimate the reliability at circuit-level. This makes it difficult for the designers to predict the circuit lifetime. To fill this gap, this paper proposes a reliability estimation model to target the reliability degradation at the output of integrated circuits. Simulations on benchmark circuits show that the reliability degradation rate ranges from 1.5% to 8.2% over one-year period, depending on specific circuits.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127636622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Level Design of a 14-bit Continuous-Time Sigma-Delta Modulator with FIR DAC for Low-Voltage Audio Devices 用于低压音频器件的带FIR DAC的14位连续σ - δ调制器的高级设计
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893239
Matheus Cortez, A. Girardi, P. Aguirre
Currently, mobile and wearable devices have digital audio signal processing capabilities. Since the nature of audio signals is analog, there is a need to use analog-to-digital converters (ADCs) with high-resolution for a high signal-to-noise ratio audio acquisition. This paper presents the high-level design of a continuous-time third-order sigma-delta modulator with a FIR DAC for audio devices, using a supply voltage of 0.5 V. The design is carried out using the Delta-sigma toolbox and a discrete-time to continuous-time (DT-CT) transformation. It was estimated that the first-integrator amplifier needs a gain of 50 dB, a GBW of 5 MHz and a slew-rate of at least $3 mathrm{V}/mu mathrm{s}$. By implementing this amplifier in VerilogA and performing a transient simulation with noise, the modulator obtained an SNR of 86.63 dB, an SNDR of 86.46 dB and an ENOB of 14.07 bits. Finally, the extraction of the initial parameters for the amplifier design proved to be satisfactory, since the modulator performance results were within the specified in the design.
目前,移动和可穿戴设备具有数字音频信号处理能力。由于音频信号的性质是模拟的,因此需要使用高分辨率的模数转换器(adc)来实现高信噪比的音频采集。本文介绍了一种带FIR DAC的音频设备连续三阶sigma-delta调制器的高级设计,其电源电压为0.5 V。设计是使用Delta-sigma工具箱和离散时间到连续时间(DT-CT)变换进行的。据估计,第一积分器放大器需要50 dB的增益,5 MHz的GBW和至少$3 mathm {V}/mu mathm {s}$的自旋速率。通过在VerilogA中实现该放大器并进行带噪声的瞬态仿真,调制器的信噪比为86.63 dB,信噪比为86.46 dB, ENOB为14.07 bits。最后,对放大器设计初始参数的提取证明是令人满意的,因为调制器的性能结果在设计中规定的范围内。
{"title":"High-Level Design of a 14-bit Continuous-Time Sigma-Delta Modulator with FIR DAC for Low-Voltage Audio Devices","authors":"Matheus Cortez, A. Girardi, P. Aguirre","doi":"10.1109/SBCCI55532.2022.9893239","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893239","url":null,"abstract":"Currently, mobile and wearable devices have digital audio signal processing capabilities. Since the nature of audio signals is analog, there is a need to use analog-to-digital converters (ADCs) with high-resolution for a high signal-to-noise ratio audio acquisition. This paper presents the high-level design of a continuous-time third-order sigma-delta modulator with a FIR DAC for audio devices, using a supply voltage of 0.5 V. The design is carried out using the Delta-sigma toolbox and a discrete-time to continuous-time (DT-CT) transformation. It was estimated that the first-integrator amplifier needs a gain of 50 dB, a GBW of 5 MHz and a slew-rate of at least $3 mathrm{V}/mu mathrm{s}$. By implementing this amplifier in VerilogA and performing a transient simulation with noise, the modulator obtained an SNR of 86.63 dB, an SNDR of 86.46 dB and an ENOB of 14.07 bits. Finally, the extraction of the initial parameters for the amplifier design proved to be satisfactory, since the modulator performance results were within the specified in the design.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130772972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Design Procedure for Sizing Comparators in Active Rectifiers using $g_{m}/I_{D}$ Technique 采用$g_{m}/ $ I_{D}$技术设计有源整流器中比较器的尺寸
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893249
A. Ballo, A. D. Grasso, M. Privitera
This paper introduces a design strategy for common-gate comparators, as the widely implemented comparators in active rectification systems. The method is based on the $g_{m}/I_{D}$ technique and it embraces both the power conversion efficiency of the whole system and large signal performance parameters, such as the slew-rate. As an example, the proposed strategy has been adopted to design the comparator in a power management integrated circuit for energy harvesting in fully battery-less implantable medical devices. The overall performances of the rectifier are shown, giving out a post-layout power conversion efficiency higher than 92.5%.
本文介绍了在有源整流系统中广泛应用的共门比较器的设计策略。该方法基于$g_{m}/I_{D}$技术,既考虑了整个系统的功率转换效率,又考虑了回转率等大信号性能参数。作为一个实例,采用该策略设计了用于完全无电池植入式医疗设备能量收集的电源管理集成电路中的比较器。整流器的整体性能得到了展示,其布局后功率转换效率高于92.5%。
{"title":"A Design Procedure for Sizing Comparators in Active Rectifiers using $g_{m}/I_{D}$ Technique","authors":"A. Ballo, A. D. Grasso, M. Privitera","doi":"10.1109/SBCCI55532.2022.9893249","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893249","url":null,"abstract":"This paper introduces a design strategy for common-gate comparators, as the widely implemented comparators in active rectification systems. The method is based on the $g_{m}/I_{D}$ technique and it embraces both the power conversion efficiency of the whole system and large signal performance parameters, such as the slew-rate. As an example, the proposed strategy has been adopted to design the comparator in a power management integrated circuit for energy harvesting in fully battery-less implantable medical devices. The overall performances of the rectifier are shown, giving out a post-layout power conversion efficiency higher than 92.5%.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124066979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hardware Design for the Separable Symmetric Normalized Wiener Filter of the AV1 Decoder AV1解码器可分离对称归一化维纳滤波器的硬件设计
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893219
Roberta Palau, Wagner Penny, J. Goebel, Eduardo Zummach, G. Corrêa, M. Porto, L. Agostini
This paper presents the first dedicated hardware design in the literature for the Separable Symmetric Normalized Wiener Filter (SSNWF) targeting the decoder of the AOM Video 1 (AV1) video format. The SSNWF is one of the last filters into the decoding filtering loop, being part of the Switchable Loop Restoration Filter (SLRF). The SLRF is the main novelty introduced by AV1 in the in-loop filtering process. It is used to attenuate the blurring artifacts, improving the subjective video quality and the coding efficiency. The developed hardware design presented in this paper targets the AV1 decoder and it can process Ultra-High Definition (UHD) videos with 3840x2160 pixels per frame at 60 frames per second (fps) when running at 207.03 MHz. The architecture was synthesized to standard cells using the 40 nm TSMC library, reaching an area of 37.78 Kgates and a power dissipation of 26.36 mW.
本文提出了针对AOM Video 1 (AV1)视频格式解码器的可分离对称归一化维纳滤波器(SSNWF)的专用硬件设计。SSNWF是解码滤波环路中最后的滤波器之一,是可切换环路恢复滤波器(SLRF)的一部分。SLRF是AV1在环内滤波过程中引入的主要新颖之处。该算法用于减弱模糊伪影,提高主观视频质量和编码效率。本文所开发的硬件设计以AV1解码器为目标,在207.03 MHz工作频率下,能够以60帧/秒的速度处理3840x2160像素/帧的超高清视频。利用40 nm TSMC库将该结构合成为标准单元,面积达到37.78 Kgates,功耗为26.36 mW。
{"title":"Hardware Design for the Separable Symmetric Normalized Wiener Filter of the AV1 Decoder","authors":"Roberta Palau, Wagner Penny, J. Goebel, Eduardo Zummach, G. Corrêa, M. Porto, L. Agostini","doi":"10.1109/SBCCI55532.2022.9893219","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893219","url":null,"abstract":"This paper presents the first dedicated hardware design in the literature for the Separable Symmetric Normalized Wiener Filter (SSNWF) targeting the decoder of the AOM Video 1 (AV1) video format. The SSNWF is one of the last filters into the decoding filtering loop, being part of the Switchable Loop Restoration Filter (SLRF). The SLRF is the main novelty introduced by AV1 in the in-loop filtering process. It is used to attenuate the blurring artifacts, improving the subjective video quality and the coding efficiency. The developed hardware design presented in this paper targets the AV1 decoder and it can process Ultra-High Definition (UHD) videos with 3840x2160 pixels per frame at 60 frames per second (fps) when running at 207.03 MHz. The architecture was synthesized to standard cells using the 40 nm TSMC library, reaching an area of 37.78 Kgates and a power dissipation of 26.36 mW.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129158887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Enhancing an Asynchronous Circuit Design Flow to Support Complex Digital System Design 增强异步电路设计流程以支持复杂的数字系统设计
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893258
M. Sartori, W. Nunes, Ney Laert Vilar Calazans
Robustness to variations is desirable in current digital circuit design techniques. Sources of variations are many, and the evolution of current integrated circuit fabrication technologies does increase the amount of such sources and the complexity of ensuring circuit robustness against them. Some design paradigms naturally counter variations to one or more variation sources. Asynchronous quasi-delay insensitive design is such a paradigm, providing robustness to process, supply voltage, temperature, ageing and IR drop variations. This paper proposes an enhancement to Pulsar, a recently proposed open source automated flow for the design of quasi-delay insensitive circuits. A new set of abstract components enables the description of choices and decisions about the flow of data tokens inside asynchronous circuits. These components are now available to be used in the design capture phase of Pulsar. To build circuit cells that implement the abstract (synthesis-enabled) components, this paper brings the proposal of the handshaking mutex (HM), a versatile complex gate that eases the design of asynchronous arbiters. Results demonstrate the new flow is more powerful than the baseline version, enabling the automated synthesis of complex asynchronous circuits.
对变化的鲁棒性是当前数字电路设计技术所需要的。变化的来源有很多,当前集成电路制造技术的发展确实增加了这些来源的数量和确保电路对它们的鲁棒性的复杂性。一些设计范例自然地对抗一个或多个变化源的变化。异步准延迟不敏感设计就是这样一种范例,提供了对工艺、电源电压、温度、老化和IR下降变化的鲁棒性。本文提出了对脉冲星的改进,脉冲星是最近提出的一种用于设计准延迟不敏感电路的开源自动化流程。一组新的抽象组件支持对异步电路中数据令牌流的选择和决策进行描述。这些组件现在可以用于脉冲星的设计捕获阶段。为了构建实现抽象(合成)组件的电路单元,本文提出了握手互斥锁(HM),这是一种简化异步仲裁器设计的多功能复杂门。结果表明,新流程比基准版本更强大,可以自动合成复杂的异步电路。
{"title":"Enhancing an Asynchronous Circuit Design Flow to Support Complex Digital System Design","authors":"M. Sartori, W. Nunes, Ney Laert Vilar Calazans","doi":"10.1109/SBCCI55532.2022.9893258","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893258","url":null,"abstract":"Robustness to variations is desirable in current digital circuit design techniques. Sources of variations are many, and the evolution of current integrated circuit fabrication technologies does increase the amount of such sources and the complexity of ensuring circuit robustness against them. Some design paradigms naturally counter variations to one or more variation sources. Asynchronous quasi-delay insensitive design is such a paradigm, providing robustness to process, supply voltage, temperature, ageing and IR drop variations. This paper proposes an enhancement to Pulsar, a recently proposed open source automated flow for the design of quasi-delay insensitive circuits. A new set of abstract components enables the description of choices and decisions about the flow of data tokens inside asynchronous circuits. These components are now available to be used in the design capture phase of Pulsar. To build circuit cells that implement the abstract (synthesis-enabled) components, this paper brings the proposal of the handshaking mutex (HM), a versatile complex gate that eases the design of asynchronous arbiters. Results demonstrate the new flow is more powerful than the baseline version, enabling the automated synthesis of complex asynchronous circuits.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123083530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Study of Motion Coding Schemes for Learned Video Compression 基于学习视频压缩的运动编码方案研究
Pub Date : 2022-08-22 DOI: 10.1109/SBCCI55532.2022.9893226
Peng Chen, C. Lin, Wen-Hsiao Peng
This paper presents a study of motion coding schemes for learned video compression. Most learned video compression systems explicitly signal optical flow maps to characterize motion between video frames for motion compensation. The flow maps, usually of the same size as the video frames, represent a considerable portion of the compressed bitstream. This work studies several schemes to make a non-linear prediction of the flow maps for efficient motion coding. These include signaling an incremental flow map between a coding frame and a motion-compensated frame derived from the flow map predictor. In forming the flow map predictor, we propose a learned motion extrapolation module and a motion forward warping scheme. They are further incorporated into two novel approaches, termed double warping and frame synthesis with motion forward warping, in creating an inter-frame predictor by combining the incremental flow and the flow map predictor. Extensive experiments are conducted to analyze the merits and faults of these variants, and demonstrate their superiority to predictive motion coding and intra motion coding.
本文研究了用于学习视频压缩的运动编码方案。大多数学习过的视频压缩系统明确地发出光流映射来表征视频帧之间的运动,以进行运动补偿。流映射通常与视频帧大小相同,代表了压缩比特流的相当一部分。本文研究了几种对流图进行非线性预测的方案,以实现高效的运动编码。这些包括在编码帧和源自流图预测器的运动补偿帧之间发出递增流图的信号。在形成流图预测器时,我们提出了一个学习运动外推模块和一个运动前向扭曲方案。它们进一步结合到两种新方法中,称为双翘曲和帧合成与运动前向翘曲,通过结合增量流和流图预测器来创建帧间预测器。通过大量的实验分析了这些变体的优缺点,并证明了它们相对于预测运动编码和运动内编码的优越性。
{"title":"A Study of Motion Coding Schemes for Learned Video Compression","authors":"Peng Chen, C. Lin, Wen-Hsiao Peng","doi":"10.1109/SBCCI55532.2022.9893226","DOIUrl":"https://doi.org/10.1109/SBCCI55532.2022.9893226","url":null,"abstract":"This paper presents a study of motion coding schemes for learned video compression. Most learned video compression systems explicitly signal optical flow maps to characterize motion between video frames for motion compensation. The flow maps, usually of the same size as the video frames, represent a considerable portion of the compressed bitstream. This work studies several schemes to make a non-linear prediction of the flow maps for efficient motion coding. These include signaling an incremental flow map between a coding frame and a motion-compensated frame derived from the flow map predictor. In forming the flow map predictor, we propose a learned motion extrapolation module and a motion forward warping scheme. They are further incorporated into two novel approaches, termed double warping and frame synthesis with motion forward warping, in creating an inter-frame predictor by combining the incremental flow and the flow map predictor. Extensive experiments are conducted to analyze the merits and faults of these variants, and demonstrate their superiority to predictive motion coding and intra motion coding.","PeriodicalId":231587,"journal":{"name":"2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"153 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116733097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 35th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1