首页 > 最新文献

2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)最新文献

英文 中文
Exploiting Approximate Computing for Low-Cost Fault Tolerant Architectures 利用近似计算实现低成本容错架构
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339875
G. Rodrigues, J. Fonseca, F. Benevenuti, F. Kastensmidt, A. Bosio
This work investigates how the approximate computing paradigm can be exploited to provide low-cost fault tolerant architectures. In particular, we focus on the implementation of Approximate Triple Modular Redundancy (ATMR) designs using the precision reduction technique. The proposed method is applied to two benchmarks and a multitude of ATMR designs with different degrees of approximation. The benchmarks are implemented on a Xilinx Zynq-7000 APSoC FPGA through high-level synthesis and evaluated concerning area usage and the inaccuracy caused by approximation. Fault injection experiments are performed by flipping bits of the FPGA configuration bitstream. Results show that the proposed approximation method can decrease the DSP usage of the hardware implementation up to 80% and the number of sensitive configuration bits up to 75% while maintaining an accuracy of more than 99.96%.
这项工作研究了如何利用近似计算范式来提供低成本的容错架构。特别是,我们专注于使用精度约简技术实现近似三模冗余(ATMR)设计。该方法应用于两个基准测试和许多具有不同近似程度的ATMR设计。通过高级综合,在Xilinx Zynq-7000 APSoC FPGA上实现了基准测试,并评估了面积使用和近似引起的不准确性。故障注入实验通过翻转FPGA配置位流的位进行。结果表明,该近似方法在保持99.96%以上的精度的同时,可将硬件实现的DSP使用量减少80%,敏感配置位数减少75%。
{"title":"Exploiting Approximate Computing for Low-Cost Fault Tolerant Architectures","authors":"G. Rodrigues, J. Fonseca, F. Benevenuti, F. Kastensmidt, A. Bosio","doi":"10.1145/3338852.3339875","DOIUrl":"https://doi.org/10.1145/3338852.3339875","url":null,"abstract":"This work investigates how the approximate computing paradigm can be exploited to provide low-cost fault tolerant architectures. In particular, we focus on the implementation of Approximate Triple Modular Redundancy (ATMR) designs using the precision reduction technique. The proposed method is applied to two benchmarks and a multitude of ATMR designs with different degrees of approximation. The benchmarks are implemented on a Xilinx Zynq-7000 APSoC FPGA through high-level synthesis and evaluated concerning area usage and the inaccuracy caused by approximation. Fault injection experiments are performed by flipping bits of the FPGA configuration bitstream. Results show that the proposed approximation method can decrease the DSP usage of the hardware implementation up to 80% and the number of sensitive configuration bits up to 75% while maintaining an accuracy of more than 99.96%.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134357550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Comparison between Direct and Indirect Learnings for the Digital Pre-distortion of Concurrent Dual-band Power Amplifiers 并行双频功率放大器数字预失真直接学习与间接学习的比较
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339842
Luis Schuartz, Artur T. Hara, A. Mariano, B. Leite, E. G. Lima
Current radio-communication systems demand high linearity and high efficiency. The digital baseband pre-distorter (DPD) is a cost-effective solution to guarantee the required linearity without compromising the efficiency. In the design of a DPD for a single band power amplifier (PA), the position of the inverse system is exchanged during the identification procedure to avoid the necessity of a PA model within a cumbersome closed-loop process. However, in a practical environment where only an approximation to the inverse is achieved, the linearization capability is affected by shifting the post-inverse placed after the PA to a pre-inverse located before the PA. In DPD intended for concurrent dual-band PAs, an additional advantage of such approach is that the post-inverse identifications for each band are completely independent of each other. This work performs a comparative analysis between two learning architectures applied to the linearization of two concurrent dual-band PAs stimulated by 2.4 GHz Wi-Fi and 3.5 GHz LTE signals. For the first PA, an exact PA model is known and the replacement of a post-inverse to a pre-inverse produces only negligible degradation in linearity. For the second PA, only an approximate PA model is available and the accuracy of such PA model produces a major impact on the linearization capability than the shifting of the inverse.
当前的无线电通信系统要求高线性度和高效率。数字基带预失真器(DPD)是一种经济有效的解决方案,可以在不影响效率的情况下保证所需的线性度。在单波段功率放大器(PA)的DPD设计中,在辨识过程中交换了逆系统的位置,避免了在繁琐的闭环过程中建立PA模型的必要性。然而,在仅实现近似逆的实际环境中,将放置在PA之后的后逆移到位于PA之前的前逆会影响线性化能力。在用于并发双频pa的DPD中,这种方法的另一个优点是每个波段的后逆识别是完全相互独立的。这项工作对两种学习架构进行了比较分析,这些架构应用于2.4 GHz Wi-Fi和3.5 GHz LTE信号刺激下的两个并发双频PAs的线性化。对于第一个PA,精确的PA模型是已知的,并且将后逆替换为前逆只会产生可忽略不计的线性退化。对于第二个PA,只有一个近似的PA模型,这种PA模型的精度比逆移对线性化能力的影响更大。
{"title":"Comparison between Direct and Indirect Learnings for the Digital Pre-distortion of Concurrent Dual-band Power Amplifiers","authors":"Luis Schuartz, Artur T. Hara, A. Mariano, B. Leite, E. G. Lima","doi":"10.1145/3338852.3339842","DOIUrl":"https://doi.org/10.1145/3338852.3339842","url":null,"abstract":"Current radio-communication systems demand high linearity and high efficiency. The digital baseband pre-distorter (DPD) is a cost-effective solution to guarantee the required linearity without compromising the efficiency. In the design of a DPD for a single band power amplifier (PA), the position of the inverse system is exchanged during the identification procedure to avoid the necessity of a PA model within a cumbersome closed-loop process. However, in a practical environment where only an approximation to the inverse is achieved, the linearization capability is affected by shifting the post-inverse placed after the PA to a pre-inverse located before the PA. In DPD intended for concurrent dual-band PAs, an additional advantage of such approach is that the post-inverse identifications for each band are completely independent of each other. This work performs a comparative analysis between two learning architectures applied to the linearization of two concurrent dual-band PAs stimulated by 2.4 GHz Wi-Fi and 3.5 GHz LTE signals. For the first PA, an exact PA model is known and the replacement of a post-inverse to a pre-inverse produces only negligible degradation in linearity. For the second PA, only an approximate PA model is available and the accuracy of such PA model produces a major impact on the linearization capability than the shifting of the inverse.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129717590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FPGA hardware linear regression implementation using fixed-point arithmetic 用FPGA实现硬件线性回归的定点算法
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339853
W. D. A. P. Ferreira, I. Grout, A. M. Silva
In this paper, a hardware design based on the field programmable gate array (FPGA) to implement a linear regression algorithm is presented. The arithmetic operations were optimized by applying a fixed-point number representation for all hardware based computations. A floating-point number training data point was initially created and stored in a personal computer (PC) which was then converted to fixed-point representation and transmitted to the FPGA via a serial communication link. With the proposed VHDL design description synthesized and implemented within the FPGA, the custom hardware architecture performs the linear regression algorithm based on matrix algebra considering a fixed size training data point set. To validate the hardware fixed-point arithmetic operations, the same algorithm was implemented in the Python language and the results of the two computation approaches were compared. The power consumption of the proposed embedded FPGA system was estimated to be 136.82 mW.
本文提出了一种基于现场可编程门阵列(FPGA)实现线性回归算法的硬件设计方案。通过对所有基于硬件的计算应用定点数字表示来优化算术运算。首先创建一个浮点数训练数据点并存储在个人计算机(PC)中,然后将其转换为定点表示并通过串行通信链路传输到FPGA。随着所提出的VHDL设计描述在FPGA内的综合和实现,自定义硬件架构执行基于矩阵代数的线性回归算法,考虑固定大小的训练数据点集。为了验证硬件定点运算,在Python语言中实现了相同的算法,并比较了两种计算方法的结果。该嵌入式FPGA系统的功耗估计为136.82 mW。
{"title":"FPGA hardware linear regression implementation using fixed-point arithmetic","authors":"W. D. A. P. Ferreira, I. Grout, A. M. Silva","doi":"10.1145/3338852.3339853","DOIUrl":"https://doi.org/10.1145/3338852.3339853","url":null,"abstract":"In this paper, a hardware design based on the field programmable gate array (FPGA) to implement a linear regression algorithm is presented. The arithmetic operations were optimized by applying a fixed-point number representation for all hardware based computations. A floating-point number training data point was initially created and stored in a personal computer (PC) which was then converted to fixed-point representation and transmitted to the FPGA via a serial communication link. With the proposed VHDL design description synthesized and implemented within the FPGA, the custom hardware architecture performs the linear regression algorithm based on matrix algebra considering a fixed size training data point set. To validate the hardware fixed-point arithmetic operations, the same algorithm was implemented in the Python language and the results of the two computation approaches were compared. The power consumption of the proposed embedded FPGA system was estimated to be 136.82 mW.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128690087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
New Insight for next Generation SRAM: Tunnel FET versus FinFET for Different Topologies 新一代SRAM:隧道场效应管与不同拓扑结构的FinFET
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339871
Adriana Arevalo, Romain Liautard, Daniel Romero, L. Trojman, L. Prócel
The purpose of this work is to point out the main differences between a Static Random-Access Memory (SRAM) cells implemented by using Tunnel FET (TFET) and FinFET technologies. We have compared the behavior of SRAM cells implemented in both technologies cells for a supply voltage range from 0.4V to 1.2V. Furthermore, for our study, we have chosen different SRAM cell topologies, such as 6T, 8T, 9T and 10T. Therefore, we have simulated all of these topologies for both technologies and extracted the Static Noise Margins (SNM) for the reading and writing processes. In addition, we have determined the power consumption in order to find the best trade-off between stability and power. By analyzing these results, we have determined the best topology for each technology. Finally, we have compared these best topologies for each technology in order to perform a study of advantages and shortcomings. Our results show more advantages using TFET technology instead of FinFET one.
本研究的目的是指出使用隧道场效应管(TFET)和FinFET技术实现的静态随机存取存储器(SRAM)单元之间的主要区别。我们比较了两种技术中SRAM电池在供电电压范围为0.4V至1.2V时的性能。此外,在我们的研究中,我们选择了不同的SRAM单元拓扑,如6T, 8T, 9T和10T。因此,我们模拟了这两种技术的所有这些拓扑结构,并提取了读取和写入过程的静态噪声边界(SNM)。此外,我们还确定了功耗,以便在稳定性和功率之间找到最佳权衡。通过分析这些结果,我们确定了每种技术的最佳拓扑。最后,我们比较了每种技术的最佳拓扑结构,以便研究其优缺点。我们的研究结果表明,使用TFET技术比使用FinFET技术更有优势。
{"title":"New Insight for next Generation SRAM: Tunnel FET versus FinFET for Different Topologies","authors":"Adriana Arevalo, Romain Liautard, Daniel Romero, L. Trojman, L. Prócel","doi":"10.1145/3338852.3339871","DOIUrl":"https://doi.org/10.1145/3338852.3339871","url":null,"abstract":"The purpose of this work is to point out the main differences between a Static Random-Access Memory (SRAM) cells implemented by using Tunnel FET (TFET) and FinFET technologies. We have compared the behavior of SRAM cells implemented in both technologies cells for a supply voltage range from 0.4V to 1.2V. Furthermore, for our study, we have chosen different SRAM cell topologies, such as 6T, 8T, 9T and 10T. Therefore, we have simulated all of these topologies for both technologies and extracted the Static Noise Margins (SNM) for the reading and writing processes. In addition, we have determined the power consumption in order to find the best trade-off between stability and power. By analyzing these results, we have determined the best topology for each technology. Finally, we have compared these best topologies for each technology in order to perform a study of advantages and shortcomings. Our results show more advantages using TFET technology instead of FinFET one.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130142466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Behavioral Modeling of a Control Module for an Energy-investing Piezoelectric Harvester 能量投资型压电采集器控制模块的行为建模
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339855
Tales Luiz Bortolin, A. Aita, J. B. Martins
This work analyzes a piezoelectric energy harvesting system that uses a single inductor and the concept of energy investment. The harvester behavior, with special focus on its control logic module and state machine, is fully described and modeled in Verilog-A. The needed sensors and control variables were also identified and modeled. Simulation results have shown the correct behavioral modeling of the piezoelectric energy harvester system and proposed control, highlighting the harvesting mechanism based on the concept of energy-investment and the effect of the energy invested on the characteristics of the battery charging profile. The speed of the behavioral simulations when compared to electrical ones and the obtained model accuracy, have shown a reliable and prospective higher-level design approach.
本文分析了一种单电感式压电能量收集系统和能量投入的概念。在Verilog-A中对收割机的行为进行了完整的描述和建模,特别关注其控制逻辑模块和状态机。还对所需的传感器和控制变量进行了识别和建模。仿真结果验证了压电能量收集系统的正确行为建模和所提出的控制方法,突出了基于能量投入概念的能量收集机制以及能量投入对电池充电曲线特性的影响。与电模拟相比,行为模拟的速度和模型的精度显示了一种可靠的、有前景的高级设计方法。
{"title":"Behavioral Modeling of a Control Module for an Energy-investing Piezoelectric Harvester","authors":"Tales Luiz Bortolin, A. Aita, J. B. Martins","doi":"10.1145/3338852.3339855","DOIUrl":"https://doi.org/10.1145/3338852.3339855","url":null,"abstract":"This work analyzes a piezoelectric energy harvesting system that uses a single inductor and the concept of energy investment. The harvester behavior, with special focus on its control logic module and state machine, is fully described and modeled in Verilog-A. The needed sensors and control variables were also identified and modeled. Simulation results have shown the correct behavioral modeling of the piezoelectric energy harvester system and proposed control, highlighting the harvesting mechanism based on the concept of energy-investment and the effect of the energy invested on the characteristics of the battery charging profile. The speed of the behavioral simulations when compared to electrical ones and the obtained model accuracy, have shown a reliable and prospective higher-level design approach.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115712610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Exploring Tabu Search Based Algorithms for Mapping and Placement in NoC-based Reconfigurable Systems 探索基于禁忌搜索的可重构系统中映射和放置算法
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339843
Guilherme Apolinario Silva Novaes, L. C. Moreira, W. Chau
Nowadays, the development of systems based on Networks-on-Chip (NoCs) brings big challenges to the designers due to problems of scalability, such as efficient Mapping and Placement, which are NP-hard problems. Several solutions have been proposed to solve this type of problem that is a variation of Quadratic Assignment Problems (QAP), being Tabu Search (TS) algorithms the ones showing most promising results. In NoC-based dynamically reconfigurable systems (NoC-DRSs), both mapping and placement problems present several layers of complexity due the reconfigurable scenarios. A previous work has adopted TS algorithm variations, but the best solution is not achieved with the wished high frequency. This paper introduces the original Forced Inversion (FI) Heuristic over Tabu Search algorithms for 2D-Mesh FPGA NoC-DRSs, in order to avoid local minima. Results with a series of benchmarks are presented and the performances of different approaches are quantitatively and qualitatively compared.
目前,基于片上网络(noc)的系统开发由于其可扩展性问题,如有效的映射和放置,给设计人员带来了很大的挑战,这是NP-hard问题。已经提出了几种解决方案来解决这种类型的问题,它是二次分配问题(QAP)的一种变体,其中禁忌搜索(TS)算法显示出最有希望的结果。在基于noc - drs的动态可重构系统(noc - drs)中,由于可重构场景的存在,映射和布局问题呈现出多层复杂性。以前的工作采用了TS算法的变体,但并没有达到期望的高频率的最佳解决方案。针对二维网格FPGA noc - drs,本文介绍了基于禁忌搜索的强制反演启发式算法,以避免局部最小值。给出了一系列基准测试的结果,并对不同方法的性能进行了定量和定性比较。
{"title":"Exploring Tabu Search Based Algorithms for Mapping and Placement in NoC-based Reconfigurable Systems","authors":"Guilherme Apolinario Silva Novaes, L. C. Moreira, W. Chau","doi":"10.1145/3338852.3339843","DOIUrl":"https://doi.org/10.1145/3338852.3339843","url":null,"abstract":"Nowadays, the development of systems based on Networks-on-Chip (NoCs) brings big challenges to the designers due to problems of scalability, such as efficient Mapping and Placement, which are NP-hard problems. Several solutions have been proposed to solve this type of problem that is a variation of Quadratic Assignment Problems (QAP), being Tabu Search (TS) algorithms the ones showing most promising results. In NoC-based dynamically reconfigurable systems (NoC-DRSs), both mapping and placement problems present several layers of complexity due the reconfigurable scenarios. A previous work has adopted TS algorithm variations, but the best solution is not achieved with the wished high frequency. This paper introduces the original Forced Inversion (FI) Heuristic over Tabu Search algorithms for 2D-Mesh FPGA NoC-DRSs, in order to avoid local minima. Results with a series of benchmarks are presented and the performances of different approaches are quantitatively and qualitatively compared.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116496009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Lightweight Security Mechanisms for MPSoCs mpsoc的轻量级安全机制
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339876
A. Sant'Ana, Henrique Martins Medina, Kevin Boucinha Fiorentin, F. Moraes
Computational systems tend to adopt parallel architectures, by using multiprocessor systems-on-chip (MPSoCs). MPSoCs are vulnerable to software and hardware attacks, as infected applications and Hardware Trojans respectively. These attacks may have the purpose to gain access to sensitive data, interrupt a given application or even damage the system physically. The literature presents countermeasures using dedicated routing algorithms, cryptography, firewalls and secure zones. These approaches present a significant hardware cost (firewalls, cryptography) or are too restrictive regarding the use of MPSoC resources (secure zones). The goal of this paper is to present lightweight security mechanisms for MPSoCs, using four techniques: spatial isolation of applications; dedicated network to send sensitive data; traffic blocking filter; lightweight cryptography. These mechanisms protect the MPSoC against the most common software attacks, as Denial of Service (DoS) and spoofing (man-in-the-middle), and ensures confidentiality and integrity to applications. Results present low area and latency overhead, as well as the effectiveness of using the mechanisms to block malicious traffic.
通过使用多处理器片上系统(mpsoc),计算系统倾向于采用并行架构。mpsoc容易受到软件和硬件攻击,分别是受感染的应用程序和硬件木马。这些攻击的目的可能是获取对敏感数据的访问权限,中断给定的应用程序,甚至在物理上破坏系统。文献提出对策使用专用路由算法,密码学,防火墙和安全区域。这些方法带来了巨大的硬件成本(防火墙、加密),或者对MPSoC资源(安全区)的使用限制太大。本文的目标是介绍mpsoc的轻量级安全机制,使用四种技术:应用程序的空间隔离;专用网络发送敏感数据;流量阻断过滤器;轻量级加密。这些机制保护MPSoC免受最常见的软件攻击,如拒绝服务(DoS)和欺骗(中间人),并确保应用程序的机密性和完整性。结果显示低面积和延迟开销,以及使用该机制阻止恶意流量的有效性。
{"title":"Lightweight Security Mechanisms for MPSoCs","authors":"A. Sant'Ana, Henrique Martins Medina, Kevin Boucinha Fiorentin, F. Moraes","doi":"10.1145/3338852.3339876","DOIUrl":"https://doi.org/10.1145/3338852.3339876","url":null,"abstract":"Computational systems tend to adopt parallel architectures, by using multiprocessor systems-on-chip (MPSoCs). MPSoCs are vulnerable to software and hardware attacks, as infected applications and Hardware Trojans respectively. These attacks may have the purpose to gain access to sensitive data, interrupt a given application or even damage the system physically. The literature presents countermeasures using dedicated routing algorithms, cryptography, firewalls and secure zones. These approaches present a significant hardware cost (firewalls, cryptography) or are too restrictive regarding the use of MPSoC resources (secure zones). The goal of this paper is to present lightweight security mechanisms for MPSoCs, using four techniques: spatial isolation of applications; dedicated network to send sensitive data; traffic blocking filter; lightweight cryptography. These mechanisms protect the MPSoC against the most common software attacks, as Denial of Service (DoS) and spoofing (man-in-the-middle), and ensures confidentiality and integrity to applications. Results present low area and latency overhead, as well as the effectiveness of using the mechanisms to block malicious traffic.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"13 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131754785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Energy efficient fJ/spike LTS e-Neuron using 55-nm node 采用55-nm节点的高效fJ/spike LTS e-Neuron
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339852
Pietro M. Ferreira, Nathan De Carvalho, G. Klisnick, Aziz Benlarbi-Delaï
While CMOS technology is currently reaching its limits in power consumption and circuit density, a challenger is emerging from the analogy between biology and silicon. Hardware-based neural networks may drive a new generation of bio-inspired computers by the urge of a hardware solution for real-time applications. This paper redesigns a previous proposed electronic neuron (e-Neuron) in a higher firing rate to reduce the silicon area and highlight a better energy efficiency trade-off. Besides, an innovative schematic is proposed to state an e-Neuron library based on Izhikevichs model of neural firing patterns. Both e-Neuron circuits are designed using 55 nm technology node. Physical design of transistors in weak inversion are discussed to a minimal leakage. Neural firing pattern behaviors are validated by post-layout simulations, demonstrating the spike frequency adaptation and the rebound spikes due to post-inhibitory effect in LTS e-Neuron. Presented results suggest that the time to rebound spikes is dependent of the excitation current amplitude. Both e-Neurons have presented a fF/spike energy efficiency and a smaller silicon area in comparison to Izhikevichs library propositions in the literature.
当CMOS技术目前在功耗和电路密度方面达到极限时,一个挑战者正在从生物学和硅之间的类比中出现。基于硬件的神经网络可能会在实时应用的硬件解决方案的推动下驱动新一代的生物启发计算机。本文以更高的放电速率重新设计了先前提出的电子神经元(e-Neuron),以减少硅面积并突出更好的能源效率权衡。此外,提出了一种基于Izhikevichs神经放电模式模型的e-Neuron库的创新方案。两种e-Neuron电路均采用55纳米技术设计。讨论了弱反转时晶体管的物理设计,使漏损最小。通过布局后模拟验证了LTS e-Neuron的放电模式行为,证明了LTS e-Neuron的脉冲频率适应和由于后抑制效应而产生的反弹峰值。研究结果表明,回弹尖峰的时间与励磁电流幅值有关。与文献中的Izhikevichs库命题相比,这两种e- neuron都具有fF/spike能量效率和更小的硅面积。
{"title":"Energy efficient fJ/spike LTS e-Neuron using 55-nm node","authors":"Pietro M. Ferreira, Nathan De Carvalho, G. Klisnick, Aziz Benlarbi-Delaï","doi":"10.1145/3338852.3339852","DOIUrl":"https://doi.org/10.1145/3338852.3339852","url":null,"abstract":"While CMOS technology is currently reaching its limits in power consumption and circuit density, a challenger is emerging from the analogy between biology and silicon. Hardware-based neural networks may drive a new generation of bio-inspired computers by the urge of a hardware solution for real-time applications. This paper redesigns a previous proposed electronic neuron (e-Neuron) in a higher firing rate to reduce the silicon area and highlight a better energy efficiency trade-off. Besides, an innovative schematic is proposed to state an e-Neuron library based on Izhikevichs model of neural firing patterns. Both e-Neuron circuits are designed using 55 nm technology node. Physical design of transistors in weak inversion are discussed to a minimal leakage. Neural firing pattern behaviors are validated by post-layout simulations, demonstrating the spike frequency adaptation and the rebound spikes due to post-inhibitory effect in LTS e-Neuron. Presented results suggest that the time to rebound spikes is dependent of the excitation current amplitude. Both e-Neurons have presented a fF/spike energy efficiency and a smaller silicon area in comparison to Izhikevichs library propositions in the literature.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"32 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129148829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Hardware Design of DC/CFL Intra-Prediction Decoder for the AV1 Codec AV1编解码器DC/CFL内预测解码器硬件设计
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339873
J. Goebel, B. Zatt, L. Agostini, M. Porto
This paper presents a dedicated hardware design for the DC and Chroma from Luma (CFL) intra-prediction modes of AV1 decoder. The hardware was designed to reach real-time when processing UHD 4K videos. The AV1 codec is an open-source and royalties-free video coding, which was developed by the AOMedia group, this group is composed of multiple companies like Google, Netflix, AMD, ARM, Intel, Nvidia, Microsoft, Mozilla and others. The proposed solution can support all 19 block sizes allowed in AV1 encoder, being able to process UHD 4K videos at 60 frames per second. The DC/CFL modules were synthesized to the TSMC 40 nm cells library targeting the frequency of 132.1 MHz. Synthesis results show the proposed hardware used 89.39 Kgates and a power dissipation of 7.96mW.
本文介绍了AV1解码器的DC和CFL内预测模式的专用硬件设计。硬件设计是为了在处理超高清4K视频时达到实时。AV1编解码器是一种开源且免版税的视频编码,由AOMedia集团开发,该集团由多家公司组成,如Google, Netflix, AMD, ARM, Intel, Nvidia, Microsoft, Mozilla等。提出的解决方案可以支持AV1编码器允许的所有19个块大小,能够以每秒60帧的速度处理UHD 4K视频。DC/CFL模块合成到TSMC 40 nm单元库中,目标频率为132.1 MHz。综合结果表明,该硬件使用89.39 Kgates,功耗为7.96mW。
{"title":"Hardware Design of DC/CFL Intra-Prediction Decoder for the AV1 Codec","authors":"J. Goebel, B. Zatt, L. Agostini, M. Porto","doi":"10.1145/3338852.3339873","DOIUrl":"https://doi.org/10.1145/3338852.3339873","url":null,"abstract":"This paper presents a dedicated hardware design for the DC and Chroma from Luma (CFL) intra-prediction modes of AV1 decoder. The hardware was designed to reach real-time when processing UHD 4K videos. The AV1 codec is an open-source and royalties-free video coding, which was developed by the AOMedia group, this group is composed of multiple companies like Google, Netflix, AMD, ARM, Intel, Nvidia, Microsoft, Mozilla and others. The proposed solution can support all 19 block sizes allowed in AV1 encoder, being able to process UHD 4K videos at 60 frames per second. The DC/CFL modules were synthesized to the TSMC 40 nm cells library targeting the frequency of 132.1 MHz. Synthesis results show the proposed hardware used 89.39 Kgates and a power dissipation of 7.96mW.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117324001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A new algorithm for an incremental sigma-delta converter reconstruction filter 增量式σ - δ变换器重构滤波器的新算法
Pub Date : 2019-08-26 DOI: 10.1145/3338852.3339839
Li Huang, C. Lelandais-Perrault, A. Kolar, P. Bénabès
Image sensors dedicated for the applications of the Earth observation require medium-speed and high-resolution analog-to-digital converters (ADCs). For that purpose, an incremental sigma-delta analog-to-digital converter (IΣ∆ ADC) has been designed. Post-layout simulations highlighted a degradation in resolution caused by the circuit imperfections. Therefore, a digital correction has been investigated. This paper proposes a new reconstruction filter which takes into account not only the bit values of the modulator output sequence but also the occurrence of certain patterns. This technique has been applied to an incremental sigma-delta analog-to-digital converter in order to correct the conversion errors. Performing with 400 clock periods for each conversion cycle, the theoretical resolution is 15.4 bits. Post-layout simulation shows that a 13.5-bit resolution is obtained by using the classical optimal filter whereas a 14.8-bit resolution is obtained with our reconstruction filter.
用于地球观测应用的图像传感器需要中速和高分辨率的模数转换器(adc)。为此,设计了一种增量式sigma-delta模数转换器(IΣ∆ADC)。布局后的模拟突出了由电路缺陷引起的分辨率下降。因此,研究了数字校正方法。本文提出了一种新的重构滤波器,它不仅考虑了调制器输出序列的位值,而且考虑了某些模式的出现。该技术已应用于增量σ - δ模数转换器,以纠正转换误差。每个转换周期执行400个时钟周期,理论分辨率为15.4位。布局后仿真表明,使用经典最优滤波器可获得13.5位分辨率,而我们的重构滤波器可获得14.8位分辨率。
{"title":"A new algorithm for an incremental sigma-delta converter reconstruction filter","authors":"Li Huang, C. Lelandais-Perrault, A. Kolar, P. Bénabès","doi":"10.1145/3338852.3339839","DOIUrl":"https://doi.org/10.1145/3338852.3339839","url":null,"abstract":"Image sensors dedicated for the applications of the Earth observation require medium-speed and high-resolution analog-to-digital converters (ADCs). For that purpose, an incremental sigma-delta analog-to-digital converter (IΣ∆ ADC) has been designed. Post-layout simulations highlighted a degradation in resolution caused by the circuit imperfections. Therefore, a digital correction has been investigated. This paper proposes a new reconstruction filter which takes into account not only the bit values of the modulator output sequence but also the occurrence of certain patterns. This technique has been applied to an incremental sigma-delta analog-to-digital converter in order to correct the conversion errors. Performing with 400 clock periods for each conversion cycle, the theoretical resolution is 15.4 bits. Post-layout simulation shows that a 13.5-bit resolution is obtained by using the classical optimal filter whereas a 14.8-bit resolution is obtained with our reconstruction filter.","PeriodicalId":184401,"journal":{"name":"2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128133409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2019 32nd Symposium on Integrated Circuits and Systems Design (SBCCI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1