首页 > 最新文献

2017 IEEE International Workshop on Signal Processing Systems (SiPS)最新文献

英文 中文
Odd type DCT/DST for video coding: Relationships and low-complexity implementations 用于视频编码的奇数类型DCT/DST:关系和低复杂度实现
Pub Date : 2017-10-01 DOI: 10.1109/SiPS.2017.8110009
M. Masera, M. Martina, G. Masera
In this paper, we show a class of relationships which link Discrete Cosine Transforms (DCT) and Discrete Sine Transforms (DST) of types V, VI, VII and VIII, which have been recently considered for inclusion in the future video coding technology. In particular, the proposed relationships allow to compute the DCT-V and the DCT-VIII as functions of the DCT-VI and the DST-VII respectively, plus simple reordering and sign-inversion. Moreover, this paper exploits the proposed relationships and the Winograd factorization of the Discrete Fourier Transform to construct low-complexity factorizations for computing the DCT-V and the DCT-VIII of length 4 and 8. Finally, the proposed signal-flow-graphs have been implemented using an FPGA technology, thus showing reduced hardware utilization with respect to the direct implementation of the matrix-vector multiplication algorithm.
在本文中,我们展示了一类将V, VI, VII和VIII类型的离散余弦变换(DCT)和离散正弦变换(DST)联系起来的关系,这些关系最近被考虑包含在未来的视频编码技术中。特别是,所提出的关系允许分别计算DCT-V和DCT-VIII作为DCT-VI和DST-VII的函数,加上简单的重排序和符号反转。此外,本文利用所提出的关系式和离散傅立叶变换的Winograd分解构造了计算长度为4和8的DCT-V和DCT-VIII的低复杂度分解。最后,所提出的信号流图已使用FPGA技术实现,从而显示相对于直接实现矩阵向量乘法算法降低了硬件利用率。
{"title":"Odd type DCT/DST for video coding: Relationships and low-complexity implementations","authors":"M. Masera, M. Martina, G. Masera","doi":"10.1109/SiPS.2017.8110009","DOIUrl":"https://doi.org/10.1109/SiPS.2017.8110009","url":null,"abstract":"In this paper, we show a class of relationships which link Discrete Cosine Transforms (DCT) and Discrete Sine Transforms (DST) of types V, VI, VII and VIII, which have been recently considered for inclusion in the future video coding technology. In particular, the proposed relationships allow to compute the DCT-V and the DCT-VIII as functions of the DCT-VI and the DST-VII respectively, plus simple reordering and sign-inversion. Moreover, this paper exploits the proposed relationships and the Winograd factorization of the Discrete Fourier Transform to construct low-complexity factorizations for computing the DCT-V and the DCT-VIII of length 4 and 8. Finally, the proposed signal-flow-graphs have been implemented using an FPGA technology, thus showing reduced hardware utilization with respect to the direct implementation of the matrix-vector multiplication algorithm.","PeriodicalId":251688,"journal":{"name":"2017 IEEE International Workshop on Signal Processing Systems (SiPS)","volume":"337 8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123232119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Reduced-memory high-throughput fast-SSC polar code decoder architecture 减少内存的高吞吐量快速ssc极性码解码器架构
Pub Date : 2017-10-01 DOI: 10.1109/SiPS.2017.8110014
Furkan Ercan, C. Condo, W. Gross
Polar codes have been selected for use within 5G networks, and are being considered for data and control channel for additional 5G scenarios, like the next generation ultra reliable low latency channel. As a result, efficient fast polar code decoder implementations are essential. In this work, we present a new fast simplified successive cancellation (Fast-SSC) decoder architecture. Our proposed solution is able to reduce the memory requirements and has an improved throughput with respect to state of the art Fast-SSC decoders. We achieve these objectives through a more efficient memory utilization than that of Fast-SSC, which also enables to execute multiple instructions in a single clock cycle. Our work shows that, compared to the state of the art, memory requirements are reduced by 22.2%; at the same time, a throughput improvement of 11.6% is achieved with (1024, 512) polar codes. Comparing equal throughputs, the memory requirements are reduced by up to 60.4%.
Polar码已被选择用于5G网络,并且正在考虑用于其他5G场景的数据和控制通道,例如下一代超可靠的低延迟通道。因此,高效快速的极性码解码器实现是必不可少的。在这项工作中,我们提出了一种新的快速简化连续消去(fast - ssc)解码器架构。我们提出的解决方案能够降低内存需求,并且相对于最先进的Fast-SSC解码器具有更高的吞吐量。我们通过比Fast-SSC更有效的内存利用来实现这些目标,Fast-SSC还可以在单个时钟周期内执行多条指令。我们的研究表明,与目前的技术水平相比,内存需求降低了22.2%;同时,使用(1024,512)极性码时,吞吐量提高了11.6%。与同等吞吐量相比,内存需求最多可降低60.4%。
{"title":"Reduced-memory high-throughput fast-SSC polar code decoder architecture","authors":"Furkan Ercan, C. Condo, W. Gross","doi":"10.1109/SiPS.2017.8110014","DOIUrl":"https://doi.org/10.1109/SiPS.2017.8110014","url":null,"abstract":"Polar codes have been selected for use within 5G networks, and are being considered for data and control channel for additional 5G scenarios, like the next generation ultra reliable low latency channel. As a result, efficient fast polar code decoder implementations are essential. In this work, we present a new fast simplified successive cancellation (Fast-SSC) decoder architecture. Our proposed solution is able to reduce the memory requirements and has an improved throughput with respect to state of the art Fast-SSC decoders. We achieve these objectives through a more efficient memory utilization than that of Fast-SSC, which also enables to execute multiple instructions in a single clock cycle. Our work shows that, compared to the state of the art, memory requirements are reduced by 22.2%; at the same time, a throughput improvement of 11.6% is achieved with (1024, 512) polar codes. Comparing equal throughputs, the memory requirements are reduced by up to 60.4%.","PeriodicalId":251688,"journal":{"name":"2017 IEEE International Workshop on Signal Processing Systems (SiPS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125720328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Structured sparse ternary weight coding of deep neural networks for efficient hardware implementations 结构化稀疏三元权重编码的深度神经网络的高效硬件实现
Pub Date : 2017-07-01 DOI: 10.1109/SiPS.2017.8110021
Yoonho Boo, Wonyong Sung
Deep neural networks (DNNs) usually demand a large amount of operations for real-time inference. Especially, fully-connected layers contain a large number of weights, thus they usually need many off-chip memory accesses for inference. We propose a weight compression method for deep neural networks, which allows values of +1 or −1 only at predetermined positions of the weights so that decoding using a table can be conducted easily. For example, the structured sparse (8,2) coding allows at most two non-zero values among eight weights. This method not only enables multiplication-free DNN implementations but also compresses the weight storage by up to x32 compared to floating-point networks. Weight distribution normalization and gradual pruning techniques are applied to mitigate the performance degradation. The experiments are conducted with fully-connected deep neural networks and convolutional neural networks.
深度神经网络(dnn)通常需要大量的操作来进行实时推理。特别是,全连接层包含大量的权值,因此它们通常需要大量的片外内存访问来进行推理。我们提出了一种深度神经网络的权值压缩方法,该方法只允许在权值的预定位置取值+1或- 1,以便使用表进行解码可以很容易地进行。例如,结构化稀疏(8,2)编码在八个权重中最多允许两个非零值。这种方法不仅可以实现无乘法的DNN实现,而且与浮点网络相比,还可以将权重存储压缩到x32。采用权分布归一化和渐进式剪枝技术来缓解性能下降。实验采用全连接深度神经网络和卷积神经网络进行。
{"title":"Structured sparse ternary weight coding of deep neural networks for efficient hardware implementations","authors":"Yoonho Boo, Wonyong Sung","doi":"10.1109/SiPS.2017.8110021","DOIUrl":"https://doi.org/10.1109/SiPS.2017.8110021","url":null,"abstract":"Deep neural networks (DNNs) usually demand a large amount of operations for real-time inference. Especially, fully-connected layers contain a large number of weights, thus they usually need many off-chip memory accesses for inference. We propose a weight compression method for deep neural networks, which allows values of +1 or −1 only at predetermined positions of the weights so that decoding using a table can be conducted easily. For example, the structured sparse (8,2) coding allows at most two non-zero values among eight weights. This method not only enables multiplication-free DNN implementations but also compresses the weight storage by up to x32 compared to floating-point networks. Weight distribution normalization and gradual pruning techniques are applied to mitigate the performance degradation. The experiments are conducted with fully-connected deep neural networks and convolutional neural networks.","PeriodicalId":251688,"journal":{"name":"2017 IEEE International Workshop on Signal Processing Systems (SiPS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131872583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Efficient bit-channel reliability computation for multi-mode polar code encoders and decoders 多模极化码编码器和解码器的有效位信道可靠性计算
Pub Date : 2017-05-16 DOI: 10.1109/SiPS.2017.8109987
C. Condo, Seyyed Ali Hashemi, W. Gross
Polar codes are a family of capacity-achieving error-correcting codes, and they have been selected as part of the next generation wireless communication standard. Each polar code bit-channel is assigned a reliability value, used to determine which bits transmit information and which parity. Relative reliabilities need to be known by both encoders and decoders: in case of multi-mode systems, where multiple code lengths and code rates are supported, the storage of relative reliabilities can lead to high implementation complexity. In this work, we observe patterns among code reliabilities, and propose an approximate computation technique to easily represent the reliabilities of multiple codes, through a limited set of variables and update rules. The proposed method allows to tune the trade-off between reliability accuracy and implementation complexity. An approximate computation architecture for encoders and decoders is designed and implemented, showing 50.7% less area occupation than storage-based solutions, with less than 0.05 dB error correction performance degradation. Used within a standard SCL decoder, the proposed architecture results in up to 17.0% less area occupation.
Polar码是一组实现容量的纠错码,它们已被选为下一代无线通信标准的一部分。每个极性码位通道被分配一个可靠性值,用于确定哪些位传输信息和哪些奇偶校验。编码器和解码器都需要知道相对可靠性:在多模式系统中,支持多种码长和码率,相对可靠性的存储可能导致较高的实现复杂性。在这项工作中,我们观察了代码可靠性之间的模式,并提出了一种近似计算技术,可以通过有限的变量集和更新规则轻松表示多个代码的可靠性。所提出的方法可以在可靠性、准确性和实现复杂性之间进行权衡。设计并实现了一种编码器和解码器的近似计算架构,其占用面积比基于存储的解决方案减少50.7%,纠错性能下降小于0.05 dB。在标准SCL解码器中使用,所提出的架构可减少高达17.0%的面积占用。
{"title":"Efficient bit-channel reliability computation for multi-mode polar code encoders and decoders","authors":"C. Condo, Seyyed Ali Hashemi, W. Gross","doi":"10.1109/SiPS.2017.8109987","DOIUrl":"https://doi.org/10.1109/SiPS.2017.8109987","url":null,"abstract":"Polar codes are a family of capacity-achieving error-correcting codes, and they have been selected as part of the next generation wireless communication standard. Each polar code bit-channel is assigned a reliability value, used to determine which bits transmit information and which parity. Relative reliabilities need to be known by both encoders and decoders: in case of multi-mode systems, where multiple code lengths and code rates are supported, the storage of relative reliabilities can lead to high implementation complexity. In this work, we observe patterns among code reliabilities, and propose an approximate computation technique to easily represent the reliabilities of multiple codes, through a limited set of variables and update rules. The proposed method allows to tune the trade-off between reliability accuracy and implementation complexity. An approximate computation architecture for encoders and decoders is designed and implemented, showing 50.7% less area occupation than storage-based solutions, with less than 0.05 dB error correction performance degradation. Used within a standard SCL decoder, the proposed architecture results in up to 17.0% less area occupation.","PeriodicalId":251688,"journal":{"name":"2017 IEEE International Workshop on Signal Processing Systems (SiPS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124777050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Blind detection of polar codes 极性码的盲检测
Pub Date : 2017-05-05 DOI: 10.1109/SiPS.2017.8109977
P. Giard, Alexios Balatsoukas-Stimming, A. Burg
Polar codes were recently chosen to protect the control channel information in the next-generation mobile communication standard (5G) defined by the 3GPP. As a result, receivers will have to implement blind detection of polar coded frames in order to keep complexity, latency, and power consumption tractable. As a newly proposed class of block codes, the problem of polar-code blind detection has received very little attention. In this work, we propose a low-complexity blind-detection algorithm for polar-encoded frames. We base this algorithm on a novel detection metric with update rules that leverage the a priori knowledge of the frozen-bit locations, exploiting the inherent structures that these locations impose on a polar-encoded block of data. We show that the proposed detection metric allows to clearly distinguish polar-encoded frames from other types of data by considering the cumulative distribution functions of the detection metric, and the receiver operating characteristic. The presented results are tailored to the 5G standardization effort discussions, i.e., we consider a short low-rate polar code concatenated with a CRC.
最近,在3GPP制定的下一代移动通信标准(5G)中,为了保护控制信道信息,选择了Polar码。因此,接收器将不得不实现极性编码帧的盲检测,以保持复杂性,延迟和功耗可处理。作为一种新提出的分组码,极码盲检测问题很少受到关注。在这项工作中,我们提出了一种低复杂度的极化编码帧盲检测算法。我们将该算法建立在一种新的检测度量的基础上,该度量具有更新规则,利用冻结位位置的先验知识,利用这些位置对极编码数据块施加的固有结构。我们表明,通过考虑检测度量的累积分布函数和接收器的工作特性,所提出的检测度量可以清楚地将极编码帧与其他类型的数据区分开来。所提出的结果是针对5G标准化工作的讨论量身定制的,即,我们考虑了与CRC连接的短低速率极性码。
{"title":"Blind detection of polar codes","authors":"P. Giard, Alexios Balatsoukas-Stimming, A. Burg","doi":"10.1109/SiPS.2017.8109977","DOIUrl":"https://doi.org/10.1109/SiPS.2017.8109977","url":null,"abstract":"Polar codes were recently chosen to protect the control channel information in the next-generation mobile communication standard (5G) defined by the 3GPP. As a result, receivers will have to implement blind detection of polar coded frames in order to keep complexity, latency, and power consumption tractable. As a newly proposed class of block codes, the problem of polar-code blind detection has received very little attention. In this work, we propose a low-complexity blind-detection algorithm for polar-encoded frames. We base this algorithm on a novel detection metric with update rules that leverage the a priori knowledge of the frozen-bit locations, exploiting the inherent structures that these locations impose on a polar-encoded block of data. We show that the proposed detection metric allows to clearly distinguish polar-encoded frames from other types of data by considering the cumulative distribution functions of the detection metric, and the receiver operating characteristic. The presented results are tailored to the 5G standardization effort discussions, i.e., we consider a short low-rate polar code concatenated with a CRC.","PeriodicalId":251688,"journal":{"name":"2017 IEEE International Workshop on Signal Processing Systems (SiPS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132284398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Energy efficient norm-and-correlation-based antenna selection algorithm in spatially correlated massive multi-user MIMO systems 空间相关海量多用户MIMO系统中基于规范相关的高能效天线选择算法
Pub Date : 1900-01-01 DOI: 10.1109/SiPS.2017.8109989
Tzu-Hao Tai, Hsin-Jung Chen, W. Chung, Ta-Sung Lee
Massive multiuser multi-input multi-output (MU-MIMO) system adopting large amount of antennas is a promising technique to improve spectral efficiency and energy efficiency for next generation wireless communication systems. In practice, the channels composed of pairs of transmit and receive antennas are often correlated and computational complexity is one of the critical concern in implementation issue. Hence, antenna selection techniques can be adopted to improve system performance. In this paper, we propose a norm-and-correlation-based selection algorithm for energy efficiency maximization to decide transmit RF chain configuration under the total power constraint in massive MU-MIMO systems.
采用大量天线的大规模多用户多输入多输出(MU-MIMO)系统是提高下一代无线通信系统频谱效率和能源效率的一种很有前途的技术。在实际应用中,由发射天线对和接收天线对组成的信道往往是相互关联的,计算复杂度是实现问题中的关键问题之一。因此,可以采用天线选择技术来提高系统性能。在本文中,我们提出了一种基于范数和相关的能量效率最大化选择算法,以确定大规模MU-MIMO系统中总功率约束下的发射射频链配置。
{"title":"Energy efficient norm-and-correlation-based antenna selection algorithm in spatially correlated massive multi-user MIMO systems","authors":"Tzu-Hao Tai, Hsin-Jung Chen, W. Chung, Ta-Sung Lee","doi":"10.1109/SiPS.2017.8109989","DOIUrl":"https://doi.org/10.1109/SiPS.2017.8109989","url":null,"abstract":"Massive multiuser multi-input multi-output (MU-MIMO) system adopting large amount of antennas is a promising technique to improve spectral efficiency and energy efficiency for next generation wireless communication systems. In practice, the channels composed of pairs of transmit and receive antennas are often correlated and computational complexity is one of the critical concern in implementation issue. Hence, antenna selection techniques can be adopted to improve system performance. In this paper, we propose a norm-and-correlation-based selection algorithm for energy efficiency maximization to decide transmit RF chain configuration under the total power constraint in massive MU-MIMO systems.","PeriodicalId":251688,"journal":{"name":"2017 IEEE International Workshop on Signal Processing Systems (SiPS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125949148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2017 IEEE International Workshop on Signal Processing Systems (SiPS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1