首页 > 最新文献

IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.最新文献

英文 中文
Efficient design of symbol detector for MIMO-OFDM based wireless LANs 基于MIMO-OFDM的无线局域网符号检测器的高效设计
Pub Date : 2007-05-14 DOI: 10.1109/ICTMICC.2007.4448690
Seungpyo Noh, Yunho Jung, Jaeseok Kim
In this paper, efficient hardware architecture for MIMO-OFDM symbol detector with two transmit and two receive antennas is proposed. The proposed symbol detector supports two MIMO-OFDM modes of SFBC-OFDM and SDM-OFDM. It can be implemented with shared-architecture, since the detection algorithms of two MIMO-OFDM modes are similar. Therefore, by eliminating duplicated function blocks, reduced-complexity implementation can be possible. It was designed in a hardware description language and synthesized to gate-level circuits using 0.18 /spl mu/m CMOS standard cell library. The total logic gate count for the symbol detector is 164K. By the efficient hardware architecture, the proposed symbol detector results in the reduction of the logic gates by 34% and the power consumption by 38%.
本文提出了一种具有双收发天线的MIMO-OFDM符号检测器的高效硬件结构。所提出的符号检测器支持SFBC-OFDM和SDM-OFDM两种MIMO-OFDM模式。由于两种MIMO-OFDM模式的检测算法相似,可以采用共享架构实现。因此,通过消除重复的功能块,可以实现低复杂度的实现。采用硬件描述语言设计,采用0.18 /spl μ m CMOS标准单元库合成门级电路。符号检测器的总逻辑门计数为164K。通过高效的硬件结构,所提出的符号检测器使逻辑门减少了34%,功耗减少了38%。
{"title":"Efficient design of symbol detector for MIMO-OFDM based wireless LANs","authors":"Seungpyo Noh, Yunho Jung, Jaeseok Kim","doi":"10.1109/ICTMICC.2007.4448690","DOIUrl":"https://doi.org/10.1109/ICTMICC.2007.4448690","url":null,"abstract":"In this paper, efficient hardware architecture for MIMO-OFDM symbol detector with two transmit and two receive antennas is proposed. The proposed symbol detector supports two MIMO-OFDM modes of SFBC-OFDM and SDM-OFDM. It can be implemented with shared-architecture, since the detection algorithms of two MIMO-OFDM modes are similar. Therefore, by eliminating duplicated function blocks, reduced-complexity implementation can be possible. It was designed in a hardware description language and synthesized to gate-level circuits using 0.18 /spl mu/m CMOS standard cell library. The total logic gate count for the symbol detector is 164K. By the efficient hardware architecture, the proposed symbol detector results in the reduction of the logic gates by 34% and the power consumption by 38%.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123830386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive energy model and energy-quality evaluation of wireless transceiver front-ends 无线收发器前端综合能量模型及能量质量评价
Pub Date : 2005-12-01 DOI: 10.1109/SIPS.2005.1579876
Ye Li, B. Bakkaloglu, C. Chakrabarti
As CMOS technology scales down, digital supply voltage and digital power consumption goes down. However due to dynamic range limitations, power supply and power consumption of the RF front-ends and analog sections do not scale in the same fashion. In fact, in scaled systems, the RF section of a wireless transceiver consumes more energy than the digital part. For better understanding of the design trade offs, we first develop an accurate and comprehensive energy model for the analog front-end of wireless transceivers. Next, we evaluate a single user point-to-point wireless data communication system and a multi-user CDMA based system with respect to RF front end energy consumption and communication quality. We demonstrate the effect of occupied signal bandwidth, peak-to-average ratio (PAR), symbol rate, constellation size, and pulse-shaping roll-off factor on single user system, and the effect of number of users and multiple access interference (MAI) on CDMA based multi-user system. For a given quality specification, we show how the energy consumption can be reduced by adjusting one or more of these parameters.
随着CMOS技术的缩小,数字电源电压和数字功耗下降。然而,由于动态范围的限制,射频前端和模拟部分的电源和功耗不能以相同的方式缩放。事实上,在缩放系统中,无线收发器的射频部分比数字部分消耗更多的能量。为了更好地理解设计权衡,我们首先为无线收发器的模拟前端开发了一个准确而全面的能量模型。接下来,我们评估了单用户点对点无线数据通信系统和基于多用户CDMA的系统的射频前端能耗和通信质量。我们论证了占用的信号带宽、峰均比(PAR)、符号速率、星座大小和脉冲整形滚落因子对单用户系统的影响,以及用户数和多址干扰(MAI)对基于CDMA的多用户系统的影响。对于给定的质量规范,我们将展示如何通过调整这些参数中的一个或多个来降低能耗。
{"title":"A comprehensive energy model and energy-quality evaluation of wireless transceiver front-ends","authors":"Ye Li, B. Bakkaloglu, C. Chakrabarti","doi":"10.1109/SIPS.2005.1579876","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579876","url":null,"abstract":"As CMOS technology scales down, digital supply voltage and digital power consumption goes down. However due to dynamic range limitations, power supply and power consumption of the RF front-ends and analog sections do not scale in the same fashion. In fact, in scaled systems, the RF section of a wireless transceiver consumes more energy than the digital part. For better understanding of the design trade offs, we first develop an accurate and comprehensive energy model for the analog front-end of wireless transceivers. Next, we evaluate a single user point-to-point wireless data communication system and a multi-user CDMA based system with respect to RF front end energy consumption and communication quality. We demonstrate the effect of occupied signal bandwidth, peak-to-average ratio (PAR), symbol rate, constellation size, and pulse-shaping roll-off factor on single user system, and the effect of number of users and multiple access interference (MAI) on CDMA based multi-user system. For a given quality specification, we show how the energy consumption can be reduced by adjusting one or more of these parameters.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128380714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Low-bandwidth dynamic aspect ratio region-based motion estimation 基于低带宽动态宽高比区域的运动估计
Pub Date : 2005-12-01 DOI: 10.1109/SIPS.2005.1579900
A. Beric, B. van der Waal, R. Sethuraman, G. de Haan
In the domain of motion estimation based applications, in order to keep the bandwidth requirements low, the usage of multiple levels of memory hierarchy is a necessity. We analyze and optimize the two-level memory hierarchy system for motion estimation where the first level (L0 scratchpad) holds the search area of the estimator and the second level (L1 scratchpad) holds the region wherein the estimation is performed. In our system, the L1 scratchpad is reconfigurable and the aspect ratio of the region dynamically changes per video field. The aspect ratio is changed such that physically available region (L1) memory is maximally utilized. We extend this idea to the extreme case where the aspect ratio of the region changes from horizontal stripe to vertical column. This idea keeps the bandwidth requirements towards the off-chip image memory minimal, one access per pixel, regardless of the number of motion estimation scans. Further, switching the aspect ratio of the region to extreme values enables fast convergence of the motion estimator. To demonstrate our idea, experiments were performed on the test set of video sequences using the state-of-the-art de-interlacing as the application. The results are encouraging regarding both, objective quality metric as well as visual perception.
在基于运动估计的应用领域中,为了保持较低的带宽需求,必须使用多层内存层次结构。我们分析并优化了用于运动估计的两级内存层次系统,其中第一级(L0 scratchpad)保存估计器的搜索区域,第二级(L1 scratchpad)保存执行估计的区域。在我们的系统中,L1刮板是可重构的,并且区域的长宽比随视频字段动态变化。更改长宽比,以便最大限度地利用物理可用区域(L1)内存。我们将这个想法扩展到极端情况,即区域的纵横比从水平条纹变为垂直列。这个想法使对片外图像存储器的带宽需求最小化,每个像素一次访问,而不管运动估计扫描的数量。此外,将区域的宽高比切换到极值可以使运动估计器快速收敛。为了证明我们的想法,在视频序列的测试集上进行了实验,使用最先进的去隔行处理作为应用。结果是令人鼓舞的,无论是客观质量指标,以及视觉感知。
{"title":"Low-bandwidth dynamic aspect ratio region-based motion estimation","authors":"A. Beric, B. van der Waal, R. Sethuraman, G. de Haan","doi":"10.1109/SIPS.2005.1579900","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579900","url":null,"abstract":"In the domain of motion estimation based applications, in order to keep the bandwidth requirements low, the usage of multiple levels of memory hierarchy is a necessity. We analyze and optimize the two-level memory hierarchy system for motion estimation where the first level (L0 scratchpad) holds the search area of the estimator and the second level (L1 scratchpad) holds the region wherein the estimation is performed. In our system, the L1 scratchpad is reconfigurable and the aspect ratio of the region dynamically changes per video field. The aspect ratio is changed such that physically available region (L1) memory is maximally utilized. We extend this idea to the extreme case where the aspect ratio of the region changes from horizontal stripe to vertical column. This idea keeps the bandwidth requirements towards the off-chip image memory minimal, one access per pixel, regardless of the number of motion estimation scans. Further, switching the aspect ratio of the region to extreme values enables fast convergence of the motion estimator. To demonstrate our idea, experiments were performed on the test set of video sequences using the state-of-the-art de-interlacing as the application. The results are encouraging regarding both, objective quality metric as well as visual perception.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133734076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An AS-DSP for forward error correction applications 用于前向纠错应用的AS-DSP
Pub Date : 2005-12-01 DOI: 10.1109/SIPS.2005.1579938
T. Hsiao, Chien-Ching Lin, Hsie-Chia Chang
An application specific digital signal processor for channel coding is presented. The vector operations can improve both the performance of memory accesses and program code density. The special function units and datapaths for channel decoding accelerate the decoding speed and facilitate algorithm implementation. The processor had been fabricated in a 0.18 /spl mu/m CMOS 1P6M technology. The chip size is 7.73 mm/sup 2/ including 18k bits embedded memory, and the power consumption is 141 mW while decoding Reed-Solomon code and convolutional code. In contrast with general purpose processor designs, the results show this chip has at least 50% improvement in code density and 66% data rate enhancement.
提出了一种专用于信道编码的数字信号处理器。向量操作可以提高内存访问性能和程序代码密度。信道译码的特殊功能单元和数据路径加快了译码速度,便于算法实现。该处理器采用0.18 /spl μ m CMOS 1P6M工艺制造。芯片尺寸为7.73 mm/sup 2/,包括18k位嵌入式存储器,解码里德-所罗门码和卷积码时功耗为141 mW。结果表明,与通用处理器设计相比,该芯片的代码密度至少提高了50%,数据速率提高了66%。
{"title":"An AS-DSP for forward error correction applications","authors":"T. Hsiao, Chien-Ching Lin, Hsie-Chia Chang","doi":"10.1109/SIPS.2005.1579938","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579938","url":null,"abstract":"An application specific digital signal processor for channel coding is presented. The vector operations can improve both the performance of memory accesses and program code density. The special function units and datapaths for channel decoding accelerate the decoding speed and facilitate algorithm implementation. The processor had been fabricated in a 0.18 /spl mu/m CMOS 1P6M technology. The chip size is 7.73 mm/sup 2/ including 18k bits embedded memory, and the power consumption is 141 mW while decoding Reed-Solomon code and convolutional code. In contrast with general purpose processor designs, the results show this chip has at least 50% improvement in code density and 66% data rate enhancement.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130929364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dynamic normalization technique for decoding LDPC codes LDPC码解码的动态归一化技术
Pub Date : 2005-12-01 DOI: 10.1109/SIPS.2005.1579968
Yen-Chin Liao, Chien-Ching Lin, Chih-Wei Liu, Hsie-Chia Chang
In this paper, a dynamic normalization technique is proposed to approximate the nonlinear operation in decoding LDPC codes. The criterion in determining the normalization factor is also presented with theoretical analysis. The proposed method improves the approximation accuracy as well as the error performance of min-sum algorithm. Furthermore, the hardware implementation benefits from a simplified normalization scheme, leading to reductions in complexity and implementation loss.
本文提出了一种动态归一化技术来近似LDPC码译码过程中的非线性操作。给出了确定归一化因子的判据,并进行了理论分析。该方法提高了最小和算法的逼近精度和误差性能。此外,硬件实现受益于简化的规范化方案,从而降低了复杂性和实现损失。
{"title":"A dynamic normalization technique for decoding LDPC codes","authors":"Yen-Chin Liao, Chien-Ching Lin, Chih-Wei Liu, Hsie-Chia Chang","doi":"10.1109/SIPS.2005.1579968","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579968","url":null,"abstract":"In this paper, a dynamic normalization technique is proposed to approximate the nonlinear operation in decoding LDPC codes. The criterion in determining the normalization factor is also presented with theoretical analysis. The proposed method improves the approximation accuracy as well as the error performance of min-sum algorithm. Furthermore, the hardware implementation benefits from a simplified normalization scheme, leading to reductions in complexity and implementation loss.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122022899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Scalable transcoding for video transmission over space-time OFDM systems 时空OFDM系统视频传输的可扩展转码
Pub Date : 2005-12-01 DOI: 10.1109/SIPS.2005.1579929
Tuanjie Qian, Jun Sun, Rong Xie, Pengcheng Su, Jia Wang, Xiaokang Yang
A new scheme combining a scalable transcoder with space time block codes (STBC) for an orthogonal frequency division multiplexing (OFDM) system is proposed for robust video transmission in dispersive fading channels. The target application for such a scalable transcoder is to provide successful access to the pre-encoded high quality video from mobile wireless terminals. In the scalable transcoder, besides outputting the fine granular scalability (FGS) bitstream, both size of video frames and the bit rate are reduced. And an array processing algorithm of layer interference suppression is used at the receiver which makes the system structure provide different levels of protection to different layers. Furthermore, by considering the important level of scalable bitstream, the different bitstreams can be given different level protection by the system structure and channel coding. With the proposed system, the concurrent large diversity gain characteristic of STBC and alleviating the frequency-selective fading effect of OFDM can be achieved. The simulation results suggest that the proposed schemes integrating scalable transcoding can provide a basic quality of video transmission and outperform the convention single layer transcoding transmitted under the random and bursty error channel conditions.
针对正交频分复用(OFDM)系统中分散衰落信道的鲁棒视频传输问题,提出了一种将可扩展转码器与空时分组码(STBC)相结合的方案。这种可扩展转码器的目标应用是提供对来自移动无线终端的预编码高质量视频的成功访问。在可扩展转码器中,除了输出细粒度可扩展性(FGS)比特流外,视频帧的大小和比特率都得到了降低。在接收端采用了层干扰抑制的阵列处理算法,使得系统结构对不同的层提供了不同级别的保护。此外,考虑到可扩展比特流的重要级别,系统结构和信道编码可以对不同的比特流给予不同级别的保护。该系统既能同时实现STBC的大分集增益特性,又能减轻OFDM的频率选择性衰落效应。仿真结果表明,所提出的集成可扩展转码的方案可以提供基本的视频传输质量,并且优于传统的单层转码在随机和突发错误信道条件下的传输。
{"title":"Scalable transcoding for video transmission over space-time OFDM systems","authors":"Tuanjie Qian, Jun Sun, Rong Xie, Pengcheng Su, Jia Wang, Xiaokang Yang","doi":"10.1109/SIPS.2005.1579929","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579929","url":null,"abstract":"A new scheme combining a scalable transcoder with space time block codes (STBC) for an orthogonal frequency division multiplexing (OFDM) system is proposed for robust video transmission in dispersive fading channels. The target application for such a scalable transcoder is to provide successful access to the pre-encoded high quality video from mobile wireless terminals. In the scalable transcoder, besides outputting the fine granular scalability (FGS) bitstream, both size of video frames and the bit rate are reduced. And an array processing algorithm of layer interference suppression is used at the receiver which makes the system structure provide different levels of protection to different layers. Furthermore, by considering the important level of scalable bitstream, the different bitstreams can be given different level protection by the system structure and channel coding. With the proposed system, the concurrent large diversity gain characteristic of STBC and alleviating the frequency-selective fading effect of OFDM can be achieved. The simulation results suggest that the proposed schemes integrating scalable transcoding can provide a basic quality of video transmission and outperform the convention single layer transcoding transmitted under the random and bursty error channel conditions.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"430 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116002933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Data wordlength optimization for FPGA synthesis 用于FPGA合成的数据字长优化
Pub Date : 2005-11-02 DOI: 10.1109/SIPS.2005.1579941
N. Hervé, D. Ménard, O. Sentieys
Field programmable gate arrays (FPGAs) are now considered as a real alternative for digital signal processing (DSP) applications. But, new methodologies are still needed to automatically map a DSP application into an FPGA with respect to design constraints such as area, power consumption, execution time and time-to-market. Moreover DSP applications are frequently specified using floating-point arithmetic whereas fixed-point arithmetic should be used on FPGA. In this paper, a high-level synthesis methodology under constraints is presented. The originality is to consider a computation accuracy constraint. The methodology is based on a fixed-point operator library which characterizes the operators cost according to their wordlength. An error noise propagation model is used to compute an analytical expression of the accuracy in function of the signals wordlength. To obtain an efficient hardware implementation, the data wordlength optimization process is coupled with the high-level synthesis. In addition, the accuracy evaluation is done through an analytical method, which drastically reduces the optimization time.
现场可编程门阵列(fpga)现在被认为是数字信号处理(DSP)应用的真正替代方案。但是,考虑到诸如面积、功耗、执行时间和上市时间等设计限制,仍然需要新的方法将DSP应用自动映射到FPGA中。此外,DSP应用通常使用浮点运算,而FPGA应该使用定点运算。本文提出了一种约束条件下的高级综合方法。其独创性在于考虑了计算精度约束。该方法基于一个定点算子库,该库根据算子的字长来表征算子的代价。利用误差噪声传播模型计算了精度与信号字长函数的解析表达式。为了获得高效的硬件实现,将数据字长优化过程与高级合成相结合。此外,通过解析法进行精度评估,大大缩短了优化时间。
{"title":"Data wordlength optimization for FPGA synthesis","authors":"N. Hervé, D. Ménard, O. Sentieys","doi":"10.1109/SIPS.2005.1579941","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579941","url":null,"abstract":"Field programmable gate arrays (FPGAs) are now considered as a real alternative for digital signal processing (DSP) applications. But, new methodologies are still needed to automatically map a DSP application into an FPGA with respect to design constraints such as area, power consumption, execution time and time-to-market. Moreover DSP applications are frequently specified using floating-point arithmetic whereas fixed-point arithmetic should be used on FPGA. In this paper, a high-level synthesis methodology under constraints is presented. The originality is to consider a computation accuracy constraint. The methodology is based on a fixed-point operator library which characterizes the operators cost according to their wordlength. An error noise propagation model is used to compute an analytical expression of the accuracy in function of the signals wordlength. To obtain an efficient hardware implementation, the data wordlength optimization process is coupled with the high-level synthesis. In addition, the accuracy evaluation is done through an analytical method, which drastically reduces the optimization time.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126553720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Trellis state aggregation for soft decoding of variable length codes 可变长度码软译码的网格状态聚合
Pub Date : 2005-11-02 DOI: 10.1109/SIPS.2005.1579937
H. Jégou, S. Malinowski, C. Guillemot
This paper describes a new set of state models for soft decoding of variable length codes. A single parameter T allows to trade complexity against estimation accuracy. The extrema choices for this parameter lead respectively to construct the well-known bit-level and bit/symbol trellises. For a proper choice of the parameter T, the results obtained by running a BCJR or Viterbi estimation algorithm on the proposed state models are close to those obtained with the optimum state model. The complexity is however significantly reduced. It can be further decreased by projecting the state model on two state models of reduced size, and by combining their decoding results. This combination is shown to be optimal for the Viterbi algorithm.
本文提出了一套新的变长码软译码的状态模型。单个参数T允许在复杂性和估计精度之间进行权衡。该参数的极值选择分别导致构建众所周知的位级和位/符号格架。在适当选择参数T的情况下,对所提出的状态模型运行BCJR或Viterbi估计算法得到的结果与最优状态模型得到的结果接近。然而,复杂性大大降低了。通过将状态模型投影到减小尺寸的两个状态模型上,并结合它们的解码结果,可以进一步减小这种误差。这种组合对于Viterbi算法来说是最优的。
{"title":"Trellis state aggregation for soft decoding of variable length codes","authors":"H. Jégou, S. Malinowski, C. Guillemot","doi":"10.1109/SIPS.2005.1579937","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579937","url":null,"abstract":"This paper describes a new set of state models for soft decoding of variable length codes. A single parameter T allows to trade complexity against estimation accuracy. The extrema choices for this parameter lead respectively to construct the well-known bit-level and bit/symbol trellises. For a proper choice of the parameter T, the results obtained by running a BCJR or Viterbi estimation algorithm on the proposed state models are close to those obtained with the optimum state model. The complexity is however significantly reduced. It can be further decreased by projecting the state model on two state models of reduced size, and by combining their decoding results. This combination is shown to be optimal for the Viterbi algorithm.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114893218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A system-on-chip vector multiprocessor for transmission line modelling acceleration 用于传输线建模加速的片上系统矢量多处理器
Pub Date : 2005-11-02 DOI: 10.1109/SIPS.2005.1579931
V. Chouliaras, J. Flint, Yibin Li, J. Núñez-Yáñez
We discuss a configurable, system-on-chip vector multiprocessor for accelerating the transmission line modeling (TLM) algorithm with an architecture capable of exploiting the two primary forms of parallelism in the code, thread and data level parallelism. Theoretical results demonstrate an order of magnitude reduction in the dynamic instruction count for a scalar-processor/vector-coprocessor configuration at a vector length of sixteen 32-bit single-precision elements. Furthermore, a multi-vector SoC architecture consisting of ten such vector accelerators provides a near-linear theoretical performance benefit of the order of 88% in three out of four benchmark configurations which is orthogonal to the benefit realized by vectorization alone. We discuss in detail this potent architecture and present implementation data for the 2-way multi-processor VLSI macrocell.
我们讨论了一种可配置的片上系统矢量多处理器,用于加速传输线建模(TLM)算法,其架构能够利用代码、线程和数据级并行的两种主要并行形式。理论结果表明,在矢量长度为16个32位单精度元件的情况下,标量处理器/矢量协处理器配置的动态指令计数减少了一个数量级。此外,由十个这样的矢量加速器组成的多矢量SoC架构在四种基准配置中的三种中提供了88%的近线性理论性能优势,这与单独矢量化实现的优势是正交的。我们详细讨论了这种强大的架构,并给出了双向多处理器VLSI宏单元的实现数据。
{"title":"A system-on-chip vector multiprocessor for transmission line modelling acceleration","authors":"V. Chouliaras, J. Flint, Yibin Li, J. Núñez-Yáñez","doi":"10.1109/SIPS.2005.1579931","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579931","url":null,"abstract":"We discuss a configurable, system-on-chip vector multiprocessor for accelerating the transmission line modeling (TLM) algorithm with an architecture capable of exploiting the two primary forms of parallelism in the code, thread and data level parallelism. Theoretical results demonstrate an order of magnitude reduction in the dynamic instruction count for a scalar-processor/vector-coprocessor configuration at a vector length of sixteen 32-bit single-precision elements. Furthermore, a multi-vector SoC architecture consisting of ten such vector accelerators provides a near-linear theoretical performance benefit of the order of 88% in three out of four benchmark configurations which is orthogonal to the benefit realized by vectorization alone. We discuss in detail this potent architecture and present implementation data for the 2-way multi-processor VLSI macrocell.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117231518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Robust digital image-in-video watermarking for the emerging H.264/AVC standard 新兴的H.264/AVC标准的鲁棒数字图像视频水印
Pub Date : 2005-11-02 DOI: 10.1109/SIPS.2005.1579947
Jing Zhang, A. Ho
A novel grayscale watermark pre-processing and a robust video watermarking algorithm for the emerging video coding standard H.264/AVC are proposed in this paper for the copyright protection application. This algorithm can insert grayscale watermark patterns such as detailed trademarks or logos into the low bit-rate H.264/AVC videos in the compressed domain with good robustness and high capacity. The marked video sequences maintain good visual quality and the same overall consuming bit-rate. The proposed algorithm can robustly survive transcoding process and common signal processing, such as bit-rate reduction, Gaussian filtering and contrast enhancement.
针对新出现的视频编码标准H.264/AVC,提出了一种新的灰度水印预处理方法和鲁棒视频水印算法。该算法可以在压缩域中的低比特率H.264/AVC视频中插入详细的商标或徽标等灰度水印图案,具有良好的鲁棒性和高容量。标记的视频序列保持良好的视觉质量和相同的总体消费比特率。该算法能够在转码过程和常见的信号处理(如比特率降低、高斯滤波和对比度增强)中稳健地生存。
{"title":"Robust digital image-in-video watermarking for the emerging H.264/AVC standard","authors":"Jing Zhang, A. Ho","doi":"10.1109/SIPS.2005.1579947","DOIUrl":"https://doi.org/10.1109/SIPS.2005.1579947","url":null,"abstract":"A novel grayscale watermark pre-processing and a robust video watermarking algorithm for the emerging video coding standard H.264/AVC are proposed in this paper for the copyright protection application. This algorithm can insert grayscale watermark patterns such as detailed trademarks or logos into the low bit-rate H.264/AVC videos in the compressed domain with good robustness and high capacity. The marked video sequences maintain good visual quality and the same overall consuming bit-rate. The proposed algorithm can robustly survive transcoding process and common signal processing, such as bit-rate reduction, Gaussian filtering and contrast enhancement.","PeriodicalId":436123,"journal":{"name":"IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124341885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1