首页 > 最新文献

2011 VII Southern Conference on Programmable Logic (SPL)最新文献

英文 中文
High speed acquisition and storage platform for SDR applications development 用于SDR应用开发的高速采集和存储平台
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782619
Jorge Cogo, Javier G. García, P. A. Roncagliolo, C. Muravchik
In this work we present the design of an FPGA based platform for acquiring and storing signals for SDR applications. The system comprises an embedded RISC processor, an A/D converter, RAM memory chips and a DMA controller core. This last component was designed from scratch to meet the high data rate and bulk requirements.
在这项工作中,我们提出了一个基于FPGA的平台的设计,用于SDR应用的信号采集和存储。该系统由嵌入式RISC处理器、A/D转换器、RAM存储芯片和DMA控制器核心组成。最后一个组件是从头开始设计的,以满足高数据速率和批量需求。
{"title":"High speed acquisition and storage platform for SDR applications development","authors":"Jorge Cogo, Javier G. García, P. A. Roncagliolo, C. Muravchik","doi":"10.1109/SPL.2011.5782619","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782619","url":null,"abstract":"In this work we present the design of an FPGA based platform for acquiring and storing signals for SDR applications. The system comprises an embedded RISC processor, an A/D converter, RAM memory chips and a DMA controller core. This last component was designed from scratch to meet the high data rate and bulk requirements.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"79 1","pages":"19-24"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89686289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Framer design, verification and prototyping for G.709 optical transport networks G.709光传输网络帧设计,验证和原型
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782620
Roman Arenas, J. Finochietto, Ramiro R. Lopez, Ulises Morales
Optical Transport Networks (OTN) have emerged as a key enabler to increase the capacity of current telecommunication infrastructure. ITU-T Recommendation G.709 describes these networks by defining a flexible frame structure capable of carrying different client data signals. Recently, G.709 framer devices have received much attention from the telecommunication industry as next generation 10/40/100G transport equipment demands its integration. This paper proposes a simple framer design based on independent modules which implement G.709 layer specific processes. This architecture enables different integration schemes where some modules can be implemented as ASIC while others in FPGA. The framer verification is also discussed in the context of testing its functionality in a network scenario. Finally, the implementation of a prototype on a FPGA board is described.
光传输网络(OTN)已成为增加当前电信基础设施容量的关键推动者。ITU-T G.709建议书通过定义能够承载不同客户端数据信号的灵活帧结构来描述这些网络。近年来,由于下一代10/40/100G传输设备需要G.709框架设备的集成,因此备受电信行业的关注。本文提出了一种基于独立模块实现G.709层特定流程的简单帧器设计。这种架构支持不同的集成方案,其中一些模块可以作为ASIC实现,而另一些模块可以在FPGA中实现。在测试其在网络场景中的功能的背景下,还讨论了帧验证。最后,描述了一个原型在FPGA板上的实现。
{"title":"Framer design, verification and prototyping for G.709 optical transport networks","authors":"Roman Arenas, J. Finochietto, Ramiro R. Lopez, Ulises Morales","doi":"10.1109/SPL.2011.5782620","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782620","url":null,"abstract":"Optical Transport Networks (OTN) have emerged as a key enabler to increase the capacity of current telecommunication infrastructure. ITU-T Recommendation G.709 describes these networks by defining a flexible frame structure capable of carrying different client data signals. Recently, G.709 framer devices have received much attention from the telecommunication industry as next generation 10/40/100G transport equipment demands its integration. This paper proposes a simple framer design based on independent modules which implement G.709 layer specific processes. This architecture enables different integration schemes where some modules can be implemented as ASIC while others in FPGA. The framer verification is also discussed in the context of testing its functionality in a network scenario. Finally, the implementation of a prototype on a FPGA board is described.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"47 1","pages":"25-30"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79119271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multichannel SDRAM controller design for H.264/AVC video decoder H.264/AVC视频解码器多通道SDRAM控制器设计
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782638
A. Bonatto, A. Soares, A. Susin
Embedded consumer electronics like video processing systems require large storage capacity and high bandwidth memory access. Also, those systems are built from heterogeneous processing units, designed specifically to perform dedicated tasks in order to maximize the processing power. A single off-chip memory is shared between the processing units to reduce power and save costs. The external memory access is the system bottleneck when decoding high definition video sequences in real time. This paper presents the design and validation of a multichannel DDR2 SDRAM controller design for a H.264/AVC video decoder. A four-level memory hierarchy was designed to manage the decoded video in macroblock granularity with low latency. The proposed controller is able to manage memory access in decoding 1080p H.264 video sequences. This architecture was validated and prototyped using a Xilinx Virtex-5 FPGA board.
像视频处理系统这样的嵌入式消费电子产品需要大的存储容量和高带宽的内存访问。此外,这些系统由异构处理单元构建,专门设计用于执行专用任务,以最大限度地提高处理能力。单个片外存储器在处理单元之间共享,以降低功耗和节省成本。在实时解码高清视频序列时,外部存储器访问是系统的瓶颈。本文介绍了一种用于H.264/AVC视频解码器的多通道DDR2 SDRAM控制器的设计和验证。设计了一个四层内存结构,以低延迟的宏块粒度管理解码后的视频。所提出的控制器能够在解码1080p H.264视频序列时管理存储器访问。该架构使用Xilinx Virtex-5 FPGA板进行验证和原型设计。
{"title":"Multichannel SDRAM controller design for H.264/AVC video decoder","authors":"A. Bonatto, A. Soares, A. Susin","doi":"10.1109/SPL.2011.5782638","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782638","url":null,"abstract":"Embedded consumer electronics like video processing systems require large storage capacity and high bandwidth memory access. Also, those systems are built from heterogeneous processing units, designed specifically to perform dedicated tasks in order to maximize the processing power. A single off-chip memory is shared between the processing units to reduce power and save costs. The external memory access is the system bottleneck when decoding high definition video sequences in real time. This paper presents the design and validation of a multichannel DDR2 SDRAM controller design for a H.264/AVC video decoder. A four-level memory hierarchy was designed to manage the decoded video in macroblock granularity with low latency. The proposed controller is able to manage memory access in decoding 1080p H.264 video sequences. This architecture was validated and prototyped using a Xilinx Virtex-5 FPGA board.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"53 1","pages":"137-142"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82261677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A reconfigurable GF(2M) elliptic curve cryptographic coprocessor 一种可重构的GF(2M)椭圆曲线密码协处理器
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782650
M. Morales-Sandoval, C. Feregrino-Uribe, R. Cumplido, I. Algredo-Badillo
Elliptic Curve Cryptography (ECC) is a kind of cryptography that provides the security information services using shorter keys than other known public-key crypto-algorithms without decreasing the security level. This makes ECC a good choice for implementing security services in constrained devices, like the mobile ones. However, the diversity of ECC implementation parameters recommended by international standards has led to interoperability problems among ECC implementations. This work presents the design and implementation results of a novel FPGA coprocessor for ECC than can be reconfigured at run time to support different implementation parameters and hence, different security levels. Regardless there are several related works in the literature, to our knowledge this is the first ECC coprocessor that makes use of a partial reconfigurable methodology to deal with interoperability problems in ECC. A suitable application of the proposed reconfigurable coprocessor is the security protocol IPSec, where the domain parameters for ECC-based cryptographic schemes, like digital signature or encryption, have to be negotiated and agreed upon by the communication partners at run time.
椭圆曲线密码学(ECC)是一种在不降低安全级别的前提下,使用比其他已知公钥加密算法更短的密钥提供安全信息服务的密码学。这使得ECC成为在受限设备(如移动设备)中实现安全服务的一个很好的选择。然而,国际标准推荐的ECC实现参数的多样性导致了ECC实现之间的互操作性问题。这项工作提出了一种用于ECC的新型FPGA协处理器的设计和实现结果,该协处理器可以在运行时重新配置以支持不同的实现参数,从而支持不同的安全级别。不管文献中有几个相关的作品,据我们所知,这是第一个ECC协处理器,它利用部分可重构的方法来处理ECC中的互操作性问题。提议的可重构协处理器的一个合适应用是安全协议IPSec,其中基于ecc的加密方案的域参数,如数字签名或加密,必须在运行时由通信伙伴协商和商定。
{"title":"A reconfigurable GF(2M) elliptic curve cryptographic coprocessor","authors":"M. Morales-Sandoval, C. Feregrino-Uribe, R. Cumplido, I. Algredo-Badillo","doi":"10.1109/SPL.2011.5782650","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782650","url":null,"abstract":"Elliptic Curve Cryptography (ECC) is a kind of cryptography that provides the security information services using shorter keys than other known public-key crypto-algorithms without decreasing the security level. This makes ECC a good choice for implementing security services in constrained devices, like the mobile ones. However, the diversity of ECC implementation parameters recommended by international standards has led to interoperability problems among ECC implementations. This work presents the design and implementation results of a novel FPGA coprocessor for ECC than can be reconfigured at run time to support different implementation parameters and hence, different security levels. Regardless there are several related works in the literature, to our knowledge this is the first ECC coprocessor that makes use of a partial reconfigurable methodology to deal with interoperability problems in ECC. A suitable application of the proposed reconfigurable coprocessor is the security protocol IPSec, where the domain parameters for ECC-based cryptographic schemes, like digital signature or encryption, have to be negotiated and agreed upon by the communication partners at run time.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"PP 1","pages":"209-214"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84173669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Architecture driven memory allocation for FPGA based real-time video processing systems 基于FPGA的实时视频处理系统的体系结构驱动内存分配
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782639
N. Lawal, B. Thornberg, M. O’nils
In this paper, we present an approach that uses information about the FPGA architecture to achieve optimized allocation of embedded memory in real-time video processing system. A cost function defined in terms of required memory sizes, available block- and distributed-RAM resources is used to motivate the allocation decision. This work is a high-level exploration that generates VHDL RTL modules and synthesis constraint files to specify memory allocation. Results show that the proposed approach achieves appreciable reduction in block RAM usage over previous logic to memory mapping approach at negligible increase in logic usage.
本文提出了一种利用FPGA架构信息实现实时视频处理系统中嵌入式内存优化分配的方法。根据所需内存大小、可用块和分布式ram资源定义的成本函数用于激励分配决策。这项工作是一个高层次的探索,生成VHDL RTL模块和综合约束文件,以指定内存分配。结果表明,与以前的逻辑到内存映射方法相比,所提出的方法在逻辑使用可以忽略不计的增加下实现了块RAM使用的明显减少。
{"title":"Architecture driven memory allocation for FPGA based real-time video processing systems","authors":"N. Lawal, B. Thornberg, M. O’nils","doi":"10.1109/SPL.2011.5782639","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782639","url":null,"abstract":"In this paper, we present an approach that uses information about the FPGA architecture to achieve optimized allocation of embedded memory in real-time video processing system. A cost function defined in terms of required memory sizes, available block- and distributed-RAM resources is used to motivate the allocation decision. This work is a high-level exploration that generates VHDL RTL modules and synthesis constraint files to specify memory allocation. Results show that the proposed approach achieves appreciable reduction in block RAM usage over previous logic to memory mapping approach at negligible increase in logic usage.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"15 1","pages":"143-148"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73427955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
N-continuous OFDM signal analysis of FPGA-based transmissions 基于fpga传输的n连续OFDM信号分析
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782618
E. Lizarraga, V. Sauchelli, Gabriel N. Maggio
N-Continuous OFDM systems have been proposed to achieve an important reduction of the out-of-band emitted power compared to conventional OFDM. However, system complexity has been increased and some resource demanding operations are necessary. So, this work considers the implementation in FPGA of the transmitter and also provides a novel analysis on the influence of the IFFT length in the representation of the continuity condition. Spectral measurements are practiced in the model to evaluate the performance.
与传统OFDM相比,n连续OFDM系统已被提出以实现带外发射功率的重要降低。但是,系统的复杂性增加了,并且需要进行一些资源要求较高的操作。因此,本工作考虑了发射机的FPGA实现,并对IFFT长度对连续性条件表示的影响进行了新颖的分析。在模型中进行了光谱测量来评估性能。
{"title":"N-continuous OFDM signal analysis of FPGA-based transmissions","authors":"E. Lizarraga, V. Sauchelli, Gabriel N. Maggio","doi":"10.1109/SPL.2011.5782618","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782618","url":null,"abstract":"N-Continuous OFDM systems have been proposed to achieve an important reduction of the out-of-band emitted power compared to conventional OFDM. However, system complexity has been increased and some resource demanding operations are necessary. So, this work considers the implementation in FPGA of the transmitter and also provides a novel analysis on the influence of the IFFT length in the representation of the continuity condition. Spectral measurements are practiced in the model to evaluate the performance.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"1 1","pages":"13-18"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75556108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An unified approach for convolution-based image filtering on reconfigurable systems 可重构系统中基于卷积的图像滤波的统一方法
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782626
J. Y. Mori, Camilo Sánchez-Ferreira, D. Muñoz, C. Llanos, P. Berger
Currently the market and the academic community have required applications of image and video processing with several real-time constraints. In order to seek an alternative design that allows the rapid development of real time image processing systems this paper proposes an unified hardware architecture for some image filtering algorithms in space domain, such as windowing-based operations, which are implemented on FPGAs (Field Programmable Gate Arrays). For achieving this, six different filters have been implemented in a parallel approach, separating them in simple hardware structures, allowing the algorithms to explore their parallel capabilities by using a simple systolic architecture. In this system all implemented algorithms run in parallel allowing the user to select a defined output for depicting it in a display. Both image processing and synthesis results have demonstrated the feasibility of FPGAs for implementing the proposed filtering algorithms in a full parallel approach.
目前,市场和学术界对图像和视频处理的应用有一些实时性的限制。为了寻求一种能够使实时图像处理系统快速发展的替代设计,本文提出了一种统一的硬件架构,用于在fpga(现场可编程门阵列)上实现一些空间域的图像滤波算法,如基于窗口的运算。为了实现这一目标,六种不同的滤波器以并行方式实现,将它们分离在简单的硬件结构中,允许算法通过使用简单的收缩架构来探索它们的并行能力。在这个系统中,所有实现的算法并行运行,允许用户选择一个定义的输出来在显示中描绘它。图像处理和合成的结果都证明了fpga以完全并行的方式实现所提出的滤波算法的可行性。
{"title":"An unified approach for convolution-based image filtering on reconfigurable systems","authors":"J. Y. Mori, Camilo Sánchez-Ferreira, D. Muñoz, C. Llanos, P. Berger","doi":"10.1109/SPL.2011.5782626","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782626","url":null,"abstract":"Currently the market and the academic community have required applications of image and video processing with several real-time constraints. In order to seek an alternative design that allows the rapid development of real time image processing systems this paper proposes an unified hardware architecture for some image filtering algorithms in space domain, such as windowing-based operations, which are implemented on FPGAs (Field Programmable Gate Arrays). For achieving this, six different filters have been implemented in a parallel approach, separating them in simple hardware structures, allowing the algorithms to explore their parallel capabilities by using a simple systolic architecture. In this system all implemented algorithms run in parallel allowing the user to select a defined output for depicting it in a display. Both image processing and synthesis results have demonstrated the feasibility of FPGAs for implementing the proposed filtering algorithms in a full parallel approach.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"8 1","pages":"63-68"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79687341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A H.264/AVC Quarter-Pixel Motion Estimation Refinement architecture targeting high resolution videos 针对高分辨率视频的H.264/AVC四分之一像素运动估计改进架构
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782637
M. Corrêa, M. T. Schoenknecht, L. Agostini
This paper presents a hardware design for the H.264/AVC Quarter-Pixel Motion Estimation Refinement to be used in a complete Fractional Motion Estimation architecture. The architecture was optimized to reach a high throughput through a balanced pipeline and parallelism exploration. The design was described in VHDL and synthesized to an Altera Stratix III FPGA device. The design achieves an operation frequency of 245 MHz, processing up to 39 QHDTV frames (3840×2048 pixels) per second. This architecture is also able to reach real time when processing other resolutions, like HD 1080p (1920×1080 pixels) with lower operation frequencies. The final results are very competitive when compared to related works.
本文提出了H.264/AVC四分之一像素运动估计细化的硬件设计,用于完整的分数阶运动估计体系结构。通过平衡管道和并行性探索,优化了该体系结构以达到高吞吐量。该设计用VHDL语言描述,并合成到Altera Stratix III FPGA器件上。该设计实现了245mhz的工作频率,每秒处理高达39个QHDTV帧(3840×2048像素)。该架构在处理其他分辨率时也能够达到实时性,例如HD 1080p (1920×1080像素),操作频率较低。与相关作品相比,最终的结果是非常有竞争力的。
{"title":"A H.264/AVC Quarter-Pixel Motion Estimation Refinement architecture targeting high resolution videos","authors":"M. Corrêa, M. T. Schoenknecht, L. Agostini","doi":"10.1109/SPL.2011.5782637","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782637","url":null,"abstract":"This paper presents a hardware design for the H.264/AVC Quarter-Pixel Motion Estimation Refinement to be used in a complete Fractional Motion Estimation architecture. The architecture was optimized to reach a high throughput through a balanced pipeline and parallelism exploration. The design was described in VHDL and synthesized to an Altera Stratix III FPGA device. The design achieves an operation frequency of 245 MHz, processing up to 39 QHDTV frames (3840×2048 pixels) per second. This architecture is also able to reach real time when processing other resolutions, like HD 1080p (1920×1080 pixels) with lower operation frequencies. The final results are very competitive when compared to related works.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"62 1 1","pages":"131-136"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78384615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Fast parallel audio fingerprinting implementation in reconfigurable hardware and GPUs 在可重构硬件和gpu中的快速并行音频指纹识别实现
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782656
J. Martínez, Jaime Vitola, Adriana Sanabria, C. Pedraza
One of the main challenges that Music Information Retrieval (MIR) faces is performance. This paper presents an algorithm based on fingerprinting techniques implemented in a low-cost embedded reconfigurable platform. This fast algorithm is even faster when implemented in parallel for a GPU platform. The hit rate of the implementations is practically 100% and the response time is two times faster than the response time of a top class PC, which means MIR times of up to 65 audio tracks in real time.
音乐信息检索(MIR)面临的主要挑战之一是性能问题。提出了一种在低成本嵌入式可重构平台上实现的基于指纹识别技术的算法。这种快速算法在GPU平台上并行实现时甚至更快。实现的命中率几乎是100%,响应时间比顶级PC的响应时间快两倍,这意味着实时多达65个音轨的MIR时间。
{"title":"Fast parallel audio fingerprinting implementation in reconfigurable hardware and GPUs","authors":"J. Martínez, Jaime Vitola, Adriana Sanabria, C. Pedraza","doi":"10.1109/SPL.2011.5782656","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782656","url":null,"abstract":"One of the main challenges that Music Information Retrieval (MIR) faces is performance. This paper presents an algorithm based on fingerprinting techniques implemented in a low-cost embedded reconfigurable platform. This fast algorithm is even faster when implemented in parallel for a GPU platform. The hit rate of the implementations is practically 100% and the response time is two times faster than the response time of a top class PC, which means MIR times of up to 65 audio tracks in real time.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"81 1","pages":"245-250"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84913871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A suitable FPGA implementation of floating-point matrix inversion based on Gauss-Jordan elimination 基于高斯-约当消去的浮点矩阵反演的FPGA实现
Pub Date : 2011-04-13 DOI: 10.1109/SPL.2011.5782659
J. Arias-Garcia, R. Pezzuol Jacobi, C. Llanos, M. Ayala-Rincón
This work presents an architecture to compute matrix inversions in a hardware reconfigurable FPGA with single-precision floating-point representation, whose main unit is the processing component for Gauss-Jordan elimination. This component consists of other smaller arithmetic units, organized to maintain the accuracy of the results without the need to internally normalize and de-normalize the floating-point data. The implementation of the operations and the whole unit take advantage of the resources available in the Virtex-5 FPGA. The performance and resource consumption of the implementation are improvements in comparison with different more elaborated architectures whose implementations are more complex for low cost applications. Benchmarks are done with solutions implemented previously in FPGA and software, such as Matlab.
本文提出了一种在单精度浮点表示的硬件可重构FPGA中计算矩阵反转的体系结构,其主要单元是高斯-乔丹消去的处理组件。该组件由其他较小的算术单元组成,其组织目的是维护结果的准确性,而无需在内部对浮点数据进行规范化和反规范化。操作和整个单元的实现充分利用了Virtex-5 FPGA中的可用资源。与其他更精细的体系结构相比,该实现的性能和资源消耗得到了改进,这些体系结构的实现对于低成本应用程序来说更复杂。基准测试使用以前在FPGA和软件(如Matlab)中实现的解决方案完成。
{"title":"A suitable FPGA implementation of floating-point matrix inversion based on Gauss-Jordan elimination","authors":"J. Arias-Garcia, R. Pezzuol Jacobi, C. Llanos, M. Ayala-Rincón","doi":"10.1109/SPL.2011.5782659","DOIUrl":"https://doi.org/10.1109/SPL.2011.5782659","url":null,"abstract":"This work presents an architecture to compute matrix inversions in a hardware reconfigurable FPGA with single-precision floating-point representation, whose main unit is the processing component for Gauss-Jordan elimination. This component consists of other smaller arithmetic units, organized to maintain the accuracy of the results without the need to internally normalize and de-normalize the floating-point data. The implementation of the operations and the whole unit take advantage of the resources available in the Virtex-5 FPGA. The performance and resource consumption of the implementation are improvements in comparison with different more elaborated architectures whose implementations are more complex for low cost applications. Benchmarks are done with solutions implemented previously in FPGA and software, such as Matlab.","PeriodicalId":6329,"journal":{"name":"2011 VII Southern Conference on Programmable Logic (SPL)","volume":"1993 1","pages":"263-268"},"PeriodicalIF":0.0,"publicationDate":"2011-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88193973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
期刊
2011 VII Southern Conference on Programmable Logic (SPL)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1