首页 > 最新文献

2014 IEEE 5th Latin American Symposium on Circuits and Systems最新文献

英文 中文
Configurable hardware design for the HEVC-based Adaptive Loop Filter 基于hevc的自适应环路滤波器的可配置硬件设计
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820308
R. Conceição, Fabiane Rediess, B. Zatt, M. Porto, L. Agostini
This paper is focused on the Adaptive Loop Filter (ALF) which is responsible to reduce the distortion between an original image and the encoded image during the video coding process by fixing artifacts from previous stages. It was proposed a novel hardware design for the ALF core which is capable to process all ALF sizes (5×5, 7×7 and 9×9), saving hardware resources consumption through reuse. The design was planned to process QFHD (3840 × 2160 pixels) video sequences in real time at 30 frames per second. The synthesis process was targeted to Altera Cyclone II and Stratix V FPGA devices. The synthesis results show that the designed architecture is capable to process 33 QFHD frames per second, considering the Stratix V implementation.
本文重点研究了自适应环路滤波器(ALF),它通过固定前一阶段的伪影来减少视频编码过程中原始图像与编码图像之间的失真。提出了一种新颖的ALF核硬件设计,能够处理所有大小的ALF (5×5、7×7和9×9),通过重用节省硬件资源消耗。该设计计划以每秒30帧的速度实时处理QFHD (3840 × 2160像素)视频序列。合成过程针对Altera Cyclone II和Stratix V FPGA器件。综合结果表明,考虑到Stratix V的实现,所设计的体系结构能够每秒处理33个QFHD帧。
{"title":"Configurable hardware design for the HEVC-based Adaptive Loop Filter","authors":"R. Conceição, Fabiane Rediess, B. Zatt, M. Porto, L. Agostini","doi":"10.1109/LASCAS.2014.6820308","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820308","url":null,"abstract":"This paper is focused on the Adaptive Loop Filter (ALF) which is responsible to reduce the distortion between an original image and the encoded image during the video coding process by fixing artifacts from previous stages. It was proposed a novel hardware design for the ALF core which is capable to process all ALF sizes (5×5, 7×7 and 9×9), saving hardware resources consumption through reuse. The design was planned to process QFHD (3840 × 2160 pixels) video sequences in real time at 30 frames per second. The synthesis process was targeted to Altera Cyclone II and Stratix V FPGA devices. The synthesis results show that the designed architecture is capable to process 33 QFHD frames per second, considering the Stratix V implementation.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115293745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Exploring more efficient architectures for Multiple Dynamic Supply Voltage designs 探索更有效的多动态电源电压设计架构
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820313
M. Terres, C. Meinhardt, G. Bontorin, R. Reis
Multiple Dynamic Supply Voltage (MDSV) is a technique that focuses on reducing the dynamic power. This technique is an evolution of the Multiple Supply Voltage (MSV). MSV and MDSV introduce some difference on traditional physical synthesis due to the different voltage operations of each region in the design. To convert the voltage among regions supplied by different voltages, these techniques insert a Level Shifter (LS) circuit. As a penalty consequence, traditional LS circuits insert a delay and extra power consumption in the design. The dynamical behavior of MDSV designs has brought a new condition, where LS inserted in the circuit can be in an idle state temporally. This work presents a new architecture to reduce the power consumption and delay, bypassing the LS. The architecture explores an alternative path to current flow in the cases that LS is idle. With this new approach we reduce up to 15% of power consumption and up to 75% and 15% of delay.
多动态电源电压(MDSV)是一种降低动态功率的技术。该技术是多电源电压(MSV)的演变。由于设计中各区域的电压操作不同,MSV和MDSV在传统物理合成上存在一定的差异。为了在不同电压提供的区域之间转换电压,这些技术插入电平移位器(LS)电路。作为惩罚的结果,传统的LS电路在设计中插入了延迟和额外的功耗。MDSV设计的动态特性带来了一种新的情况,即插入电路的LS可以暂时处于空闲状态。本文提出了一种新的架构,以降低功耗和延迟,绕过LS。该体系结构在LS空闲的情况下探索当前流的替代路径。通过这种新方法,我们可以减少高达15%的功耗和高达75%和15%的延迟。
{"title":"Exploring more efficient architectures for Multiple Dynamic Supply Voltage designs","authors":"M. Terres, C. Meinhardt, G. Bontorin, R. Reis","doi":"10.1109/LASCAS.2014.6820313","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820313","url":null,"abstract":"Multiple Dynamic Supply Voltage (MDSV) is a technique that focuses on reducing the dynamic power. This technique is an evolution of the Multiple Supply Voltage (MSV). MSV and MDSV introduce some difference on traditional physical synthesis due to the different voltage operations of each region in the design. To convert the voltage among regions supplied by different voltages, these techniques insert a Level Shifter (LS) circuit. As a penalty consequence, traditional LS circuits insert a delay and extra power consumption in the design. The dynamical behavior of MDSV designs has brought a new condition, where LS inserted in the circuit can be in an idle state temporally. This work presents a new architecture to reduce the power consumption and delay, bypassing the LS. The architecture explores an alternative path to current flow in the cases that LS is idle. With this new approach we reduce up to 15% of power consumption and up to 75% and 15% of delay.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116527935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Automatic complex instruction identification for efficient application mapping onto ASIPs 自动复杂指令识别有效的应用程序映射到api
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820291
A. S. Nery, N. Nedjah, F. França, L. Józwiak, H. Corporaal
Instruction Set Customization is a well-known technique to enhance the performance and efficiency of Application-Specific Processors (ASIPs). An extensive application profiling can indicate which parts of a given application, or class of applications, are most frequently executed, enabling the implementation of such frequently executed parts in hardware as custom instructions. However, a naive ad hoc instruction set customization process may identify and select poor instruction extension candidates, which may not result in a significantly improved performance with low circuit-area and energy footprints. In this paper we propose and discuss an efficient instruction set customization method and automatic tool, which exploit the maximal common subgraphs (common operation patterns) of the most frequently executed basic blocks of a given application. The speed results from our tool for a VLIW ASIP are provided for a set of benchmark applications. The average execution time reduction ranges from 30% to 40%, with only a few custom instructions.
指令集定制是一种众所周知的提高专用处理器(Application-Specific processor, asip)性能和效率的技术。广泛的应用程序分析可以指示给定应用程序或应用程序类的哪些部分最频繁执行,从而支持在硬件中实现这些频繁执行的部分作为自定义指令。然而,一个简单的特别指令集定制过程可能会识别和选择较差的指令扩展候选,这可能不会在低电路面积和能量足迹的情况下显著提高性能。本文提出并讨论了一种有效的指令集定制方法和自动工具,该方法利用给定应用程序中执行频率最高的基本块的最大公共子图(公共操作模式)。我们的VLIW ASIP工具的速度结果提供了一组基准应用程序。平均执行时间减少幅度从30%到40%不等,仅使用少数自定义指令。
{"title":"Automatic complex instruction identification for efficient application mapping onto ASIPs","authors":"A. S. Nery, N. Nedjah, F. França, L. Józwiak, H. Corporaal","doi":"10.1109/LASCAS.2014.6820291","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820291","url":null,"abstract":"Instruction Set Customization is a well-known technique to enhance the performance and efficiency of Application-Specific Processors (ASIPs). An extensive application profiling can indicate which parts of a given application, or class of applications, are most frequently executed, enabling the implementation of such frequently executed parts in hardware as custom instructions. However, a naive ad hoc instruction set customization process may identify and select poor instruction extension candidates, which may not result in a significantly improved performance with low circuit-area and energy footprints. In this paper we propose and discuss an efficient instruction set customization method and automatic tool, which exploit the maximal common subgraphs (common operation patterns) of the most frequently executed basic blocks of a given application. The speed results from our tool for a VLIW ASIP are provided for a set of benchmark applications. The average execution time reduction ranges from 30% to 40%, with only a few custom instructions.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126452092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Characterization of fast-response and low-noise poly si uncooled far infrared sensor 快速响应低噪声非冷却多晶硅远红外传感器的特性研究
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820276
R. Neli, I. Doi, J. A. Diniz
This work has as a main goal the characterization of thermal sensors, described as bolometer, which are dedicated to far infrared radiation detection. These sensors are fabricated using microfabrication techniques and the thin films are selective to wet etching. These mechanical microstructures are formed on silicon wafers using surface wet etching. As these structures are obtained using conventional techniques for integrated circuits manufacturing, it becomes possible perform monolithic integration of electronics and mechanical devices, allowing the integrated microsystems development. The porous gold or “gold black” used as a radiation absorber, showed absorption index greater than 80%.
这项工作的主要目标是表征热传感器,称为热辐射计,专门用于远红外辐射检测。这些传感器是用微加工技术制造的,薄膜对湿蚀刻有选择性。这些机械微观结构是用表面湿法蚀刻在硅片上形成的。由于这些结构是使用传统的集成电路制造技术获得的,因此可以将电子和机械设备进行单片集成,从而允许集成微系统的发展。多孔金或“金黑”用作辐射吸收剂,其吸收指数大于80%。
{"title":"Characterization of fast-response and low-noise poly si uncooled far infrared sensor","authors":"R. Neli, I. Doi, J. A. Diniz","doi":"10.1109/LASCAS.2014.6820276","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820276","url":null,"abstract":"This work has as a main goal the characterization of thermal sensors, described as bolometer, which are dedicated to far infrared radiation detection. These sensors are fabricated using microfabrication techniques and the thin films are selective to wet etching. These mechanical microstructures are formed on silicon wafers using surface wet etching. As these structures are obtained using conventional techniques for integrated circuits manufacturing, it becomes possible perform monolithic integration of electronics and mechanical devices, allowing the integrated microsystems development. The porous gold or “gold black” used as a radiation absorber, showed absorption index greater than 80%.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121702733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Memory energy consumption reduction in video coding systems 视频编码系统中存储器能耗的降低
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820299
Lívia Amaral, D. Silveira, Guilherme Povala, M. Porto, L. Agostini, B. Zatt
A very important component in video coding systems is the Motion Estimation (ME), that excels not only in computational cost, but off-chip memory bandwidth consumption as well. These two issues are considered critical constraints for video coding systems, especially for High Definition (HD) videos, since a large volume of data must be processed. This work presents an energy and memory bandwidth reduction approach by using two techniques for external bandwidth reduction from the literature: data reuse and reference frames compression. The proposed solution achieves high rates of memory bandwidth reduction for both writing and reading operations, reaching 31.69% and 86.32% respectively. Therewith, this scheme is able to reduce 80.67% of the ME energy consumption when compared to the traditional method, which presents no bandwidth reduction scheme. The ME search algorithm Full Search (FS) was considered for all evaluations. The solution presented is considered lossless, with no quality degradation for the coded video, and fully complied with current video coding standards.
运动估计(ME)是视频编码系统中一个非常重要的组成部分,它不仅在计算成本方面具有优势,而且在片外存储带宽消耗方面也具有优势。这两个问题被认为是视频编码系统的关键限制,特别是对于高清晰度(HD)视频,因为必须处理大量数据。这项工作提出了一种能量和内存带宽减少的方法,通过使用文献中的两种外部带宽减少技术:数据重用和参考帧压缩。提出的解决方案在写和读操作上都实现了较高的内存带宽减少率,分别达到31.69%和86.32%。因此,与没有带宽缩减方案的传统方法相比,该方案能够降低80.67%的ME能耗。所有评估都考虑了ME搜索算法Full search (FS)。提出的解决方案被认为是无损的,编码视频没有质量下降,并且完全符合当前的视频编码标准。
{"title":"Memory energy consumption reduction in video coding systems","authors":"Lívia Amaral, D. Silveira, Guilherme Povala, M. Porto, L. Agostini, B. Zatt","doi":"10.1109/LASCAS.2014.6820299","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820299","url":null,"abstract":"A very important component in video coding systems is the Motion Estimation (ME), that excels not only in computational cost, but off-chip memory bandwidth consumption as well. These two issues are considered critical constraints for video coding systems, especially for High Definition (HD) videos, since a large volume of data must be processed. This work presents an energy and memory bandwidth reduction approach by using two techniques for external bandwidth reduction from the literature: data reuse and reference frames compression. The proposed solution achieves high rates of memory bandwidth reduction for both writing and reading operations, reaching 31.69% and 86.32% respectively. Therewith, this scheme is able to reduce 80.67% of the ME energy consumption when compared to the traditional method, which presents no bandwidth reduction scheme. The ME search algorithm Full Search (FS) was considered for all evaluations. The solution presented is considered lossless, with no quality degradation for the coded video, and fully complied with current video coding standards.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114245539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Practical hand tracking solution by alternating the use of a priori information 实用的手部跟踪解决方案,交替使用先验信息
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820277
Pedro Cisneros, P. Rodríguez
Hand tracking is the automatic detection and displaying of the user hands captured by a camera throughout a video. This paper proposes a practical solution to two main hand tracking problems: the mutual occlusion between hands and the recovery from a lost tracking. This is done by alternating the use of a priori information about past hand locations according to the presence of a problematic situation. We propose a deterministic solution capable of working at different video resolutions without the need of any early learning stage or hard-coded initial calibration. Thus, it is practical in the sense of its implementation and use. Our experimental results, which are compared to the state of the art, shows that our solution is computationally efficient and has a good performance in different lighting and motion situations, both in terms of speed and precision error.
手跟踪是在整个视频中通过摄像头自动检测和显示用户的手。本文提出了一种实用的方法来解决两个主要的手部跟踪问题:手之间的相互遮挡和丢失跟踪的恢复。这是通过根据有问题的情况交替使用关于过去手的位置的先验信息来完成的。我们提出了一个确定性的解决方案,能够在不同的视频分辨率下工作,而不需要任何早期学习阶段或硬编码的初始校准。因此,它在实现和使用的意义上是实用的。我们的实验结果与目前的技术水平进行了比较,表明我们的解决方案计算效率高,在不同的照明和运动情况下都有很好的性能,无论是在速度还是精度误差方面。
{"title":"Practical hand tracking solution by alternating the use of a priori information","authors":"Pedro Cisneros, P. Rodríguez","doi":"10.1109/LASCAS.2014.6820277","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820277","url":null,"abstract":"Hand tracking is the automatic detection and displaying of the user hands captured by a camera throughout a video. This paper proposes a practical solution to two main hand tracking problems: the mutual occlusion between hands and the recovery from a lost tracking. This is done by alternating the use of a priori information about past hand locations according to the presence of a problematic situation. We propose a deterministic solution capable of working at different video resolutions without the need of any early learning stage or hard-coded initial calibration. Thus, it is practical in the sense of its implementation and use. Our experimental results, which are compared to the state of the art, shows that our solution is computationally efficient and has a good performance in different lighting and motion situations, both in terms of speed and precision error.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126956957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Simulation in 3D integration and TSV 模拟三维集成和TSV
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820324
K. Weide-Zaage, A. Moujbani, J. Kludt
The development of 3D-silicon integrated circuits is an increasing demand especially regarding to advanced 3D-packages and high performance applications, with the intend to miniaturize and to reduce costs. Through-silicon-vias (TSV), interconnects and landing pads have a strong mismatch in proportions. Due to high temperature as well as high applied currents, the reliability of the systems and components is affected by thermal and thermal-electrical loads. The induced stress leads to degradation effects like electro- and thermomigration (EM, TM). Mismatch in coefficient of thermal expansion (CTE) are causing mechanical induced stress during the manufacturing process. This can lead to failure mechanisms like delamination and cracking around the TSV or in the ICs.
3d硅集成电路的发展是一个不断增长的需求,特别是关于先进的3d封装和高性能应用,旨在小型化和降低成本。通过硅通孔(TSV),互连和着陆垫在比例上有很强的不匹配。由于高温和高电流,系统和组件的可靠性受到热负载和热电负载的影响。诱导的应力导致了电和热迁移(EM, TM)等降解效应。在制造过程中,热膨胀系数的不匹配会引起机械诱发应力。这可能导致失败机制,如TSV周围或ic的分层和开裂。
{"title":"Simulation in 3D integration and TSV","authors":"K. Weide-Zaage, A. Moujbani, J. Kludt","doi":"10.1109/LASCAS.2014.6820324","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820324","url":null,"abstract":"The development of 3D-silicon integrated circuits is an increasing demand especially regarding to advanced 3D-packages and high performance applications, with the intend to miniaturize and to reduce costs. Through-silicon-vias (TSV), interconnects and landing pads have a strong mismatch in proportions. Due to high temperature as well as high applied currents, the reliability of the systems and components is affected by thermal and thermal-electrical loads. The induced stress leads to degradation effects like electro- and thermomigration (EM, TM). Mismatch in coefficient of thermal expansion (CTE) are causing mechanical induced stress during the manufacturing process. This can lead to failure mechanisms like delamination and cracking around the TSV or in the ICs.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128909424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adjusted empirical mode decomposition with improved performance for signal modeling and prediction 调整经验模态分解,提高信号建模和预测的性能
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820259
S. Lahmiri, M. Boukadoum
An adjusted empirical mode decomposition method, built on Student's probability density function is presented. Compared to the original EMD, the new version provides a lower number of intrinsic mode functions and is more accurate in signal modeling and prediction. Using a backpropagation neural network for learning and in-sample prediction, our experimental results on a synthetic signal, an electrocardiogram (ECG), and a financial time series show that the presented tEMD is more efficient and leads to higher prediction accuracy than conventional EMD, regardless of the input time signal.
提出了一种基于学生概率密度函数的调整经验模态分解方法。与原来的EMD相比,新版本提供了更少的内禀模态函数,并且在信号建模和预测方面更加准确。使用反向传播神经网络进行学习和样本内预测,我们在合成信号,心电图(ECG)和金融时间序列上的实验结果表明,无论输入时间信号如何,所提出的tEMD都比传统的EMD更有效,预测精度更高。
{"title":"Adjusted empirical mode decomposition with improved performance for signal modeling and prediction","authors":"S. Lahmiri, M. Boukadoum","doi":"10.1109/LASCAS.2014.6820259","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820259","url":null,"abstract":"An adjusted empirical mode decomposition method, built on Student's probability density function is presented. Compared to the original EMD, the new version provides a lower number of intrinsic mode functions and is more accurate in signal modeling and prediction. Using a backpropagation neural network for learning and in-sample prediction, our experimental results on a synthetic signal, an electrocardiogram (ECG), and a financial time series show that the presented tEMD is more efficient and leads to higher prediction accuracy than conventional EMD, regardless of the input time signal.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128931683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Scalable hardware implementation for Quasi-Dyadic Goppa encoder 准双进Goppa编码器的可扩展硬件实现
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820285
P. Massolino, C. Margi, Paulo L. Barreto, W. Ruggiero
Error correcting codes are useful tools not only for increasing channel reliability in telecommunications systems, but also for the asymmetric encryption cryptographic schemes. Their operation consists basically in encoding messages into codewords for transmission; once received, these messages are decoded and the original message is recovered after the removal of all bit errors introduced either due to interferences in the medium or intentionally during the encryption process. Both encoding and decoding operations can be implemented as software libraries and/or hardware circuits, although the latter is sometimes preferable due to the higher throughput they provide. One particular type of error correcting code has particular interest in the field of cryptography: binary Quasi-Dyadic Goppa (QD-Goppa) codes, a subset of binary Goppa codes. Indeed, these codes were proposed as a replacement for generic binary Goppa codes in cryptographic applications, since their internal structure allows the optimization of execution time and storage requirements for public and private keys. Despite the interest of such codes, to the best of our knowledge the literature does not include hardware implementation and evaluation of such codes. Aiming to close this gap, in this paper we discuss our results when designing and simulating a scalable hardware architecture for the encoding operation of binary QD-Goppa codes. This architecture was described using VHDL and simulated in Synopsys tools for security levels of 80 to 256 bits, using a 90nm cell library. With this architecture, we obtained processing times from 95 ms to 12 μs and an area occupation from 850 GE to 42 kGE, which allow its utilization even in restricted-resource applications.
纠错码不仅是提高通信系统信道可靠性的有效工具,而且对于非对称加密密码方案来说也是非常有用的。它们的工作主要是将信息编码成码字以供传输;一旦接收到这些消息,将对其进行解码,并在去除由于介质中的干扰或加密过程中故意引入的所有误码后恢复原始消息。编码和解码操作都可以作为软件库和/或硬件电路来实现,尽管后者有时更可取,因为它们提供了更高的吞吐量。在密码学领域中,有一种特殊类型的纠错码引起了人们的特别关注:二进制准双进Goppa (QD-Goppa)码,它是二进制Goppa码的一个子集。实际上,这些代码被提议作为加密应用程序中通用二进制Goppa代码的替代品,因为它们的内部结构允许优化公钥和私钥的执行时间和存储需求。尽管对这些代码感兴趣,但据我们所知,文献中没有包括这些代码的硬件实现和评估。为了缩小这一差距,本文讨论了我们在设计和模拟二进制QD-Goppa编码操作的可扩展硬件体系结构时的结果。该架构使用VHDL进行描述,并在Synopsys工具中使用90nm单元库对80至256位的安全级别进行了模拟。使用这种架构,我们获得了处理时间从95 ms到12 μs,面积占用从850 GE到42 kGE,即使在资源有限的应用程序中也可以使用它。
{"title":"Scalable hardware implementation for Quasi-Dyadic Goppa encoder","authors":"P. Massolino, C. Margi, Paulo L. Barreto, W. Ruggiero","doi":"10.1109/LASCAS.2014.6820285","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820285","url":null,"abstract":"Error correcting codes are useful tools not only for increasing channel reliability in telecommunications systems, but also for the asymmetric encryption cryptographic schemes. Their operation consists basically in encoding messages into codewords for transmission; once received, these messages are decoded and the original message is recovered after the removal of all bit errors introduced either due to interferences in the medium or intentionally during the encryption process. Both encoding and decoding operations can be implemented as software libraries and/or hardware circuits, although the latter is sometimes preferable due to the higher throughput they provide. One particular type of error correcting code has particular interest in the field of cryptography: binary Quasi-Dyadic Goppa (QD-Goppa) codes, a subset of binary Goppa codes. Indeed, these codes were proposed as a replacement for generic binary Goppa codes in cryptographic applications, since their internal structure allows the optimization of execution time and storage requirements for public and private keys. Despite the interest of such codes, to the best of our knowledge the literature does not include hardware implementation and evaluation of such codes. Aiming to close this gap, in this paper we discuss our results when designing and simulating a scalable hardware architecture for the encoding operation of binary QD-Goppa codes. This architecture was described using VHDL and simulated in Synopsys tools for security levels of 80 to 256 bits, using a 90nm cell library. With this architecture, we obtained processing times from 95 ms to 12 μs and an area occupation from 850 GE to 42 kGE, which allow its utilization even in restricted-resource applications.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123653920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Dynamic partial reconfiguration manager 动态部分重新配置管理器
Pub Date : 2014-05-26 DOI: 10.1109/LASCAS.2014.6820293
J. Tarrillo, Fernando A. Escobar, F. Kastensmidt, C. Valderrama
Dynamic partial reconfiguration (DPR) is a technique that optimizes resource utilization of SRAM-based FPGAs, since it allows changing, on the fly, the functionality of a portion of its logic. A common DPR development flow requires the use of, at least, a microprocessor and several development tools (EDK, XSDK, PlanAhead); moreover, proposals are mainly based on MicroBlaze, ARM or PowerPC embedded processors, which also require extra memory control blocks. This article presents a generic DPR manager IP core (Intellectual Property), whose versatility allows the use of either any embedded processor or simple control logic. Results in terms of reconfiguration time and resources for Virtex 5 and Virtex 6 SRAM-FPGAs show its advantages and interest over traditional solutions.
动态部分重新配置(DPR)是一种优化基于sram的fpga资源利用的技术,因为它允许动态更改其部分逻辑的功能。一个常见的DPR开发流程至少需要使用一个微处理器和几个开发工具(EDK, XSDK, PlanAhead);此外,提案主要基于MicroBlaze, ARM或PowerPC嵌入式处理器,这也需要额外的内存控制块。本文介绍了一个通用的DPR管理器IP核(知识产权),其多功能性允许使用任何嵌入式处理器或简单的控制逻辑。Virtex 5和Virtex 6 sram - fpga在重新配置时间和资源方面的结果显示了它比传统解决方案的优势和兴趣。
{"title":"Dynamic partial reconfiguration manager","authors":"J. Tarrillo, Fernando A. Escobar, F. Kastensmidt, C. Valderrama","doi":"10.1109/LASCAS.2014.6820293","DOIUrl":"https://doi.org/10.1109/LASCAS.2014.6820293","url":null,"abstract":"Dynamic partial reconfiguration (DPR) is a technique that optimizes resource utilization of SRAM-based FPGAs, since it allows changing, on the fly, the functionality of a portion of its logic. A common DPR development flow requires the use of, at least, a microprocessor and several development tools (EDK, XSDK, PlanAhead); moreover, proposals are mainly based on MicroBlaze, ARM or PowerPC embedded processors, which also require extra memory control blocks. This article presents a generic DPR manager IP core (Intellectual Property), whose versatility allows the use of either any embedded processor or simple control logic. Results in terms of reconfiguration time and resources for Virtex 5 and Virtex 6 SRAM-FPGAs show its advantages and interest over traditional solutions.","PeriodicalId":235336,"journal":{"name":"2014 IEEE 5th Latin American Symposium on Circuits and Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127763594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
期刊
2014 IEEE 5th Latin American Symposium on Circuits and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1