首页 > 最新文献

VLSI Signal Processing, IX最新文献

英文 中文
Estimation of average energy consumption of ripple-carry adder based on average length carry chains 基于平均进位链长度的纹波进位加法器平均能耗估算
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558326
K. Parhi, Janardhan H. SatyanarayanaDepartment
We show theoretically that the average energy consumption of a ripple-carry adder is O(W), and the upper bound on the average energy consumption is O(Wlog/sub 2/W), where W is the word-length of the operands. Our theoretical analysis is based on a simple state transition diagram (STD) model of a full adder cell and the observations that the average length of a carry propagation chain is v=2, and the average length of the maximum carry chain is v/spl les/log/sub 2/W. To verify our theoretical conclusions, we use the HEAT CAD tool to estimate the average power consumed by the ripple-carry adder for word-lengths 4/spl les/W/spl les/64. The experimental results show that, for W/spl ges/16, the error in our theoretical estimations is around 15%.
我们从理论上证明了纹波进位加法器的平均能量消耗为O(W),平均能量消耗的上界为O(Wlog/sub 2/W),其中W为操作数的字长。我们的理论分析基于一个完整加法器单元的简单状态转移图(STD)模型,并观察到进位传播链的平均长度为v=2,最大进位传播链的平均长度为v/spl les/log/sub 2/W。为了验证我们的理论结论,我们使用HEAT CAD工具来估计字长为4/spl les/W/spl les/64的纹波进位加法器的平均功耗。实验结果表明,对于W/spl /16,我们的理论估计误差在15%左右。
{"title":"Estimation of average energy consumption of ripple-carry adder based on average length carry chains","authors":"K. Parhi, Janardhan H. SatyanarayanaDepartment","doi":"10.1109/VLSISP.1996.558326","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558326","url":null,"abstract":"We show theoretically that the average energy consumption of a ripple-carry adder is O(W), and the upper bound on the average energy consumption is O(Wlog/sub 2/W), where W is the word-length of the operands. Our theoretical analysis is based on a simple state transition diagram (STD) model of a full adder cell and the observations that the average length of a carry propagation chain is v=2, and the average length of the maximum carry chain is v/spl les/log/sub 2/W. To verify our theoretical conclusions, we use the HEAT CAD tool to estimate the average power consumed by the ripple-carry adder for word-lengths 4/spl les/W/spl les/64. The experimental results show that, for W/spl ges/16, the error in our theoretical estimations is around 15%.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126218274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Low power storage exploration for H.263 video decoder H.263视频解码器的低功耗存储探索
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558310
L. Nachtergaele, Francky Catthoory, Bhanu Kapoorz, S. Janssens, D. Moolenaar
We describe a power exploration methodology for data-dominated applications using a H.263 video decoding demonstrator application. The starting point for our exploration is a C specification of the video decoder, available in the public domain from Telenor Research. We have transformed the data transfer scheme in the specification and have optimized the distributed memory organization. This results in a memory architecture with significantly reduced power consumption. For the worst-case mode using predicted and bi-directional (PB) frames, memory power consumption is reduced by a factor of 9. To achieve these results, we make use of our formalized high-level memory management methodology, partly supported in our ATOMIUM environment.
我们使用H.263视频解码演示应用程序描述了数据主导应用程序的功率勘探方法。我们探索的起点是视频解码器的C规范,可从Telenor Research的公共领域获得。我们对规范中的数据传输方案进行了转换,并对分布式内存组织进行了优化。这将产生功耗显著降低的内存架构。对于使用预测和双向(PB)帧的最坏情况模式,内存功耗降低了9倍。为了实现这些结果,我们使用了我们的形式化高级内存管理方法,ATOMIUM环境部分支持这种方法。
{"title":"Low power storage exploration for H.263 video decoder","authors":"L. Nachtergaele, Francky Catthoory, Bhanu Kapoorz, S. Janssens, D. Moolenaar","doi":"10.1109/VLSISP.1996.558310","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558310","url":null,"abstract":"We describe a power exploration methodology for data-dominated applications using a H.263 video decoding demonstrator application. The starting point for our exploration is a C specification of the video decoder, available in the public domain from Telenor Research. We have transformed the data transfer scheme in the specification and have optimized the distributed memory organization. This results in a memory architecture with significantly reduced power consumption. For the worst-case mode using predicted and bi-directional (PB) frames, memory power consumption is reduced by a factor of 9. To achieve these results, we make use of our formalized high-level memory management methodology, partly supported in our ATOMIUM environment.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122691090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Divide-and-conquer techniques for global throughput optimization 用于全局吞吐量优化的分而治之技术
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558313
L. Guerra, M. Potkonjak, J. Rabaey
The paper proposes a divide-and-conquer approach for global throughput optimization designed to coordinate existing techniques and enable their more effective use. The "divide" approach consists of logical partitioning of the computation into subparts. The techniques for partitioning the computation, and the corresponding scheme for classifying the subparts is presented. The subparts are optimized or "conquered" through coordinated application of existing optimization techniques. Optimization techniques that are effective for each class have been characterized in terms of their expected effect on throughput. The approach is not limited to a specific class of computations and gives higher, or at least equal, improvement than previously reported techniques on all examples.
本文提出了一种分而治之的全球吞吐量优化方法,旨在协调现有技术并使其更有效地使用。“划分”方法包括将计算逻辑划分为子部分。给出了计算的划分技术和相应的子部件分类方案。通过协调应用现有优化技术,对子部件进行优化或“征服”。对每个类别都有效的优化技术已经根据其对吞吐量的预期影响进行了表征。该方法不局限于特定的计算类别,并且在所有示例上都比以前报道的技术提供更高的改进,或者至少是相同的改进。
{"title":"Divide-and-conquer techniques for global throughput optimization","authors":"L. Guerra, M. Potkonjak, J. Rabaey","doi":"10.1109/VLSISP.1996.558313","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558313","url":null,"abstract":"The paper proposes a divide-and-conquer approach for global throughput optimization designed to coordinate existing techniques and enable their more effective use. The \"divide\" approach consists of logical partitioning of the computation into subparts. The techniques for partitioning the computation, and the corresponding scheme for classifying the subparts is presented. The subparts are optimized or \"conquered\" through coordinated application of existing optimization techniques. Optimization techniques that are effective for each class have been characterized in terms of their expected effect on throughput. The approach is not limited to a specific class of computations and gives higher, or at least equal, improvement than previously reported techniques on all examples.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"145 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132618607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Adaptive CDMA receiver implementation for multipath and multiuser environments 多路径和多用户环境下的自适应CDMA接收机实现
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558320
L. Lucke, L. Nelson, H. Oie
We present serial and parallel architectures for an LMS adaptive filter implementation of the minimum-mean-square-error adaptive CDMA receiver. These architectures use fixed-point numbers to represent the variables and 2-bit representation of the input signal to reduce the complexity of the arithmetic operations. We simulate the bit error rate of these architectures to study their performance in near-far and multipath environments. The simulations are used to determine the optimal wordlengths. Simulation results show that the performance of this reduced-complexity digital adaptive filter is compatible with that of the analog one using sufficient numbers of bits and it is much better than that of the conventional matched filter.
我们提出了一个LMS自适应滤波器实现的最小均方误差自适应CDMA接收机的串行和并行架构。这些体系结构使用定点数来表示变量和输入信号的2位表示来降低算术运算的复杂性。我们模拟了这些架构的误码率,以研究它们在近距离和多路径环境下的性能。模拟用于确定最佳字长。仿真结果表明,这种降低复杂度的数字自适应滤波器在使用足够的比特数的情况下,其性能与模拟滤波器相兼容,大大优于传统的匹配滤波器。
{"title":"Adaptive CDMA receiver implementation for multipath and multiuser environments","authors":"L. Lucke, L. Nelson, H. Oie","doi":"10.1109/VLSISP.1996.558320","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558320","url":null,"abstract":"We present serial and parallel architectures for an LMS adaptive filter implementation of the minimum-mean-square-error adaptive CDMA receiver. These architectures use fixed-point numbers to represent the variables and 2-bit representation of the input signal to reduce the complexity of the arithmetic operations. We simulate the bit error rate of these architectures to study their performance in near-far and multipath environments. The simulations are used to determine the optimal wordlengths. Simulation results show that the performance of this reduced-complexity digital adaptive filter is compatible with that of the analog one using sufficient numbers of bits and it is much better than that of the conventional matched filter.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130993047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Maximizing the fault-tolerance of application specific programmable signal processors 最大化应用特定的可编程信号处理器的容错性
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558361
Kyosun Kim, R. Karri, M. Potkonjak
As witnessed by their rapid market growth, application specific programmable processors (ASSP) provide an attractive alternative to both fully programmable and fully custom hardware platforms. ASPP are data paths which provide efficient implementation for any of k functional specifications assuming that only one will be executed at any given time. We combine the flexibility provided by multiple functionalities with judicious operation-to-application allocation to maximize the permanent fault-tolerance of such ASPP designs. The approach and the synthesis algorithms are demonstrated on a number of signal processing applications.
随着市场的快速增长,专用可编程处理器(ASSP)为完全可编程和完全自定义硬件平台提供了一个有吸引力的替代方案。ASPP是为k个功能规范中的任何一个提供有效实现的数据路径,假设在任何给定时间只执行一个功能规范。我们将多个功能提供的灵活性与明智的操作到应用程序分配相结合,以最大限度地提高此类ASPP设计的永久容错性。该方法和合成算法在一些信号处理应用中得到了验证。
{"title":"Maximizing the fault-tolerance of application specific programmable signal processors","authors":"Kyosun Kim, R. Karri, M. Potkonjak","doi":"10.1109/VLSISP.1996.558361","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558361","url":null,"abstract":"As witnessed by their rapid market growth, application specific programmable processors (ASSP) provide an attractive alternative to both fully programmable and fully custom hardware platforms. ASPP are data paths which provide efficient implementation for any of k functional specifications assuming that only one will be executed at any given time. We combine the flexibility provided by multiple functionalities with judicious operation-to-application allocation to maximize the permanent fault-tolerance of such ASPP designs. The approach and the synthesis algorithms are demonstrated on a number of signal processing applications.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124877074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Real-time MPEG-2 software decoding with a dual-issue RISC processor 实时MPEG-2软件解码与双问题RISC处理器
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558309
E. Holmann, A. Yamada, T. Yoshida, S. Uramoto
A single chip system for real-time MPEG-2 decoding can be created by integrating a dual-issue RISC processor with a small dedicated hardware for the variable length decoding (VLD) and block loading processes; a 32 KB instruction RAM; and a 16 KB data RAM. The VLD hardware performs the Huffman decoding on the input data. The block loader performs the half-sample prediction for motion compensation and acts as a direct memory access controller for the RISC processor. The dual-issue RISC processor, running at 250 MHz, is enhanced with a set of key sub-word and multimedia instructions for a sustained peak performance of 1000 MOPS. With this setup for MPEG-2 decoding applications, bi-directionally predicted non-intra blocks are decoded in less than 800 cycles, leading to a single chip, real-time MPEG-2 decoding system.
通过将双问题RISC处理器与用于可变长度解码(VLD)和块加载过程的小型专用硬件集成在一起,可以创建实时MPEG-2解码的单芯片系统;32 KB指令RAM;以及16 KB数据RAM。VLD硬件对输入数据执行霍夫曼解码。块加载器执行运动补偿的半样本预测,并充当RISC处理器的直接存储器访问控制器。双发行RISC处理器,运行在250兆赫,增强了一组关键子字和多媒体指令,以维持1000 MOPS的峰值性能。通过这种MPEG-2解码应用的设置,双向预测的非片内块在不到800个周期内解码,从而实现单芯片实时MPEG-2解码系统。
{"title":"Real-time MPEG-2 software decoding with a dual-issue RISC processor","authors":"E. Holmann, A. Yamada, T. Yoshida, S. Uramoto","doi":"10.1109/VLSISP.1996.558309","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558309","url":null,"abstract":"A single chip system for real-time MPEG-2 decoding can be created by integrating a dual-issue RISC processor with a small dedicated hardware for the variable length decoding (VLD) and block loading processes; a 32 KB instruction RAM; and a 16 KB data RAM. The VLD hardware performs the Huffman decoding on the input data. The block loader performs the half-sample prediction for motion compensation and acts as a direct memory access controller for the RISC processor. The dual-issue RISC processor, running at 250 MHz, is enhanced with a set of key sub-word and multimedia instructions for a sustained peak performance of 1000 MOPS. With this setup for MPEG-2 decoding applications, bi-directionally predicted non-intra blocks are decoded in less than 800 cycles, leading to a single chip, real-time MPEG-2 decoding system.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114441882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Parallel and pipelined architecture designs for distributed arithmetic-based recursive digital filters 基于分布式算法的递归数字滤波器的并行和流水线结构设计
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558279
Y. Hwang, C.-L. Su
This paper presents a distributed arithmetic based design scheme for recursive DSP systems requiring high speed computing. The proposed scheme features a bit-serial word-parallel approach and is found more efficient than the conventional bit-parallel word-serial scheme. We apply this scheme to design an ARMA filter and yield an initiation interval as small as the delay of processing only one output bit. We also incorporate the look-ahead transform and the block processing techniques in the proposed DA scheme for further speed improvement. Finally, we propose a signed digit DA scheme to solve the performance degradation problem due to the effect of data word length truncation in fixed point number computing systems.
针对需要高速计算的递归DSP系统,提出一种基于分布式算法的设计方案。该方案采用位串行字并行方法,比传统的位并行字串行方案更有效。我们应用该方案设计了一个ARMA滤波器,其起始间隔小到只处理一个输出位的延迟。为了进一步提高数据挖掘的速度,我们还在数据挖掘方案中引入了前瞻性变换和块处理技术。最后,针对定点数计算系统中由于数据字长截断而导致的性能下降问题,提出了一种符号数字数据挖掘方案。
{"title":"Parallel and pipelined architecture designs for distributed arithmetic-based recursive digital filters","authors":"Y. Hwang, C.-L. Su","doi":"10.1109/VLSISP.1996.558279","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558279","url":null,"abstract":"This paper presents a distributed arithmetic based design scheme for recursive DSP systems requiring high speed computing. The proposed scheme features a bit-serial word-parallel approach and is found more efficient than the conventional bit-parallel word-serial scheme. We apply this scheme to design an ARMA filter and yield an initiation interval as small as the delay of processing only one output bit. We also incorporate the look-ahead transform and the block processing techniques in the proposed DA scheme for further speed improvement. Finally, we propose a signed digit DA scheme to solve the performance degradation problem due to the effect of data word length truncation in fixed point number computing systems.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132877552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Ultra-low-power domain-specific multimedia processors 超低功耗专用多媒体处理器
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558379
A. Abnous, J. Rabaey
Programmability is an important requirement for portable computing and communication devices that must be flexible enough to accommodate a variety of multimedia services and communication capabilities. However, compared to dedicated, application-specific solutions, programmable devices often incur significant performance and power penalties. We present a hybrid architecture template that can be used to implement ultra-low-power programmable processors for signal processing applications.
可编程性是便携式计算和通信设备的一个重要要求,这些设备必须足够灵活,以适应各种多媒体服务和通信功能。然而,与专用的、特定于应用程序的解决方案相比,可编程设备通常会导致显著的性能和功耗损失。我们提出了一种混合架构模板,可用于实现超低功耗可编程处理器的信号处理应用。
{"title":"Ultra-low-power domain-specific multimedia processors","authors":"A. Abnous, J. Rabaey","doi":"10.1109/VLSISP.1996.558379","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558379","url":null,"abstract":"Programmability is an important requirement for portable computing and communication devices that must be flexible enough to accommodate a variety of multimedia services and communication capabilities. However, compared to dedicated, application-specific solutions, programmable devices often incur significant performance and power penalties. We present a hybrid architecture template that can be used to implement ultra-low-power programmable processors for signal processing applications.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134015490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 142
A 250 Msample/sec programmable cascaded integrator-comb decimation filter 250msample /sec可编程级联积分器梳状抽取滤波器
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558351
A. Kwentus, O. Lee, A. Willson
The implementation of a 250 Msample/sec programmable six-stage cascaded integrator-comb (CIC) decimation filter is described. The prototype IC is implemented using 0.8-/spl mu/m CMOS and contains 39,890 transistors in a core area of 8.5 mm/sup 2/. It accommodates programmable power-of-two decimation factors from 2 to 1024 with 16-bit input and output data.
描述了一种250 m采样/秒可编程六级级联积分梳(CIC)抽取滤波器的实现。原型IC采用0.8-/spl μ m CMOS实现,在8.5 mm/sup /的核心面积中包含39,890个晶体管。它容纳可编程的2次幂抽取因子,从2到1024,16位输入和输出数据。
{"title":"A 250 Msample/sec programmable cascaded integrator-comb decimation filter","authors":"A. Kwentus, O. Lee, A. Willson","doi":"10.1109/VLSISP.1996.558351","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558351","url":null,"abstract":"The implementation of a 250 Msample/sec programmable six-stage cascaded integrator-comb (CIC) decimation filter is described. The prototype IC is implemented using 0.8-/spl mu/m CMOS and contains 39,890 transistors in a core area of 8.5 mm/sup 2/. It accommodates programmable power-of-two decimation factors from 2 to 1024 with 16-bit input and output data.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130609130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Novel structures for serial multiplication over the finite field GF(2/sup m/) 有限域GF(2/sup m/)上串行乘法的新结构
Pub Date : 1996-10-30 DOI: 10.1109/VLSISP.1996.558302
M.C. Mekhallalati, A. Ashur
Two novel uni-directional systolic structures for serial multiplication over the finite field GF(2/sup m/) are presented. The architecture of the new structures posses features of regularity, modularity, and uni-directional data flow. One of the new structures is a serial-parallel structure, whereas the other structure is a fully serial one. Both structures consist of (m/2) novel cells. Due to the novel cells architectures of the new structures, the initial delay (i.e. the number of cycles required to obtain the first output) and the latency (i.e. the number of cycles required to complete the multiplication process) are decreased by 25% and 17% respectively. Also, the number of latches of the new structures are reduced by more than 20% when compared to existing uni-directional serial-parallel structures.
在有限域GF(2/sup m/)上提出了两种新的单向收缩结构。新结构的体系结构具有规律性、模块化和单向数据流的特点。其中一种结构是串并联结构,另一种结构是全串联结构。这两种结构都由(m/2)个新细胞组成。由于新结构的新颖单元结构,初始延迟(即获得第一个输出所需的循环次数)和延迟(即完成乘法过程所需的循环次数)分别降低了25%和17%。此外,与现有的单向串并联结构相比,新结构的锁存器数量减少了20%以上。
{"title":"Novel structures for serial multiplication over the finite field GF(2/sup m/)","authors":"M.C. Mekhallalati, A. Ashur","doi":"10.1109/VLSISP.1996.558302","DOIUrl":"https://doi.org/10.1109/VLSISP.1996.558302","url":null,"abstract":"Two novel uni-directional systolic structures for serial multiplication over the finite field GF(2/sup m/) are presented. The architecture of the new structures posses features of regularity, modularity, and uni-directional data flow. One of the new structures is a serial-parallel structure, whereas the other structure is a fully serial one. Both structures consist of (m/2) novel cells. Due to the novel cells architectures of the new structures, the initial delay (i.e. the number of cycles required to obtain the first output) and the latency (i.e. the number of cycles required to complete the multiplication process) are decreased by 25% and 17% respectively. Also, the number of latches of the new structures are reduced by more than 20% when compared to existing uni-directional serial-parallel structures.","PeriodicalId":290885,"journal":{"name":"VLSI Signal Processing, IX","volume":"163 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128771932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
VLSI Signal Processing, IX
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1