首页 > 最新文献

2013 8th IEEE Design and Test Symposium最新文献

英文 中文
Fault tolerance on multicore processors using deterministic multithreading 使用确定性多线程的多核处理器容错
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727107
Hamid Mushtaq, Z. Al-Ars, K. Bertels
This paper describes a software based fault tolerance approach for multithreaded programs running on multicore processors. Redundant multithreaded processes are used to detect soft errors and recover from them. Our scheme makes sure that the execution of the redundant processes is identical even in the presence of non-determinism due to shared memory accesses. This is done by making sure that the redundant processes acquire the locks for accessing the shared memory in the same order. Instead of using record/replay technique to do that, our scheme is based on deterministic multithreading, meaning that for the same input, a multithreaded program always have the same lock interleaving. Unlike record/replay systems, this eliminates the requirement for communication between the redundant processes. Moreover, our scheme is implemented totally in software, requiring no special hardware, making it very portable. Furthermore, our scheme is totally implemented at user-level, requiring no modification of the kernel. For selected benchmarks, our scheme adds an average overhead of 49% for 4 threads.
针对多核处理器上运行的多线程程序,提出了一种基于软件的容错方法。冗余多线程进程用于检测软错误并从中恢复。我们的方案确保冗余进程的执行是相同的,即使存在由于共享内存访问而导致的不确定性。这是通过确保冗余进程按照相同的顺序获得访问共享内存的锁来实现的。我们的方案不是使用记录/重放技术来做到这一点,而是基于确定性多线程,这意味着对于相同的输入,多线程程序总是具有相同的锁交错。与记录/重放系统不同,这消除了冗余进程之间的通信需求。此外,我们的方案完全在软件中实现,不需要特殊的硬件,使其非常便携。此外,我们的方案完全在用户级实现,不需要修改内核。对于选定的基准测试,我们的方案为4个线程增加了49%的平均开销。
{"title":"Fault tolerance on multicore processors using deterministic multithreading","authors":"Hamid Mushtaq, Z. Al-Ars, K. Bertels","doi":"10.1109/IDT.2013.6727107","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727107","url":null,"abstract":"This paper describes a software based fault tolerance approach for multithreaded programs running on multicore processors. Redundant multithreaded processes are used to detect soft errors and recover from them. Our scheme makes sure that the execution of the redundant processes is identical even in the presence of non-determinism due to shared memory accesses. This is done by making sure that the redundant processes acquire the locks for accessing the shared memory in the same order. Instead of using record/replay technique to do that, our scheme is based on deterministic multithreading, meaning that for the same input, a multithreaded program always have the same lock interleaving. Unlike record/replay systems, this eliminates the requirement for communication between the redundant processes. Moreover, our scheme is implemented totally in software, requiring no special hardware, making it very portable. Furthermore, our scheme is totally implemented at user-level, requiring no modification of the kernel. For selected benchmarks, our scheme adds an average overhead of 49% for 4 threads.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120949346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
On the impact of fault list partitioning in parallel implementations for dynamic test compaction considering multicore systems 考虑多核系统的动态测试压缩并行实现中故障列表划分的影响
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727082
Stelios N. Neophytou, Stavros Hadjitheophanous, M. Michael
Modern multicore systems have multiplied the processing power of computing systems, increasing the potential of solving difficult EDA problems. At the same time, careful decomposition of the problem should be made in order to explore the parallelism without compromising the quality of the result with respect to the existing non-parallel solutions. Test set compaction is one of the major EDA problems that is NP-hard and a crucial component of any ATPG methodology. This paper presents a study on the effect of fault list partitioning on a dynamic test set compaction algorithm that has shown to give very good results when considering the entire fault list. The serial algorithm is executed in different subsets of the considered fault list and the obtained results are evaluated in terms of the compaction achieved as well as the execution time. The experimental results demonstrate that the partitioning technique used highly affects the compaction quality while the execution time is significantly reduced.
现代多核系统已经成倍地提高了计算系统的处理能力,增加了解决EDA难题的潜力。同时,应仔细分解问题,以便在不影响现有非并行解的结果质量的情况下探索并行性。测试集压缩是NP-hard的主要EDA问题之一,也是任何ATPG方法的关键组成部分。本文研究了故障列表划分对动态测试集压缩算法的影响,该算法在考虑整个故障列表时得到了很好的结果。串行算法在考虑的故障列表的不同子集中执行,并根据所实现的压缩和执行时间对所获得的结果进行评估。实验结果表明,采用分区技术对压缩质量有很大的影响,同时大大减少了执行时间。
{"title":"On the impact of fault list partitioning in parallel implementations for dynamic test compaction considering multicore systems","authors":"Stelios N. Neophytou, Stavros Hadjitheophanous, M. Michael","doi":"10.1109/IDT.2013.6727082","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727082","url":null,"abstract":"Modern multicore systems have multiplied the processing power of computing systems, increasing the potential of solving difficult EDA problems. At the same time, careful decomposition of the problem should be made in order to explore the parallelism without compromising the quality of the result with respect to the existing non-parallel solutions. Test set compaction is one of the major EDA problems that is NP-hard and a crucial component of any ATPG methodology. This paper presents a study on the effect of fault list partitioning on a dynamic test set compaction algorithm that has shown to give very good results when considering the entire fault list. The serial algorithm is executed in different subsets of the considered fault list and the obtained results are evaluated in terms of the compaction achieved as well as the execution time. The experimental results demonstrate that the partitioning technique used highly affects the compaction quality while the execution time is significantly reduced.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128721776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Memory controller architectures: A comparative study 存储器控制器体系结构的比较研究
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727083
K. Khalifa, H. Fawzy, Sameh El-Ashry, K. Salah
This paper clarifies the differences between six memory architectures, which are Flex-OneNAND, Open NAND Flash Memory (ONFI 3.1), Embedded Multi-Media Card (eMMC v.5.0), Hybrid Memory Cube (HMC v.1.0) WideIO, and Universal Flash Storage (UFS). The paper shows the impact of such discriminating differences on choosing the most suitable architecture for certain application. The comparison is done in terms of most important features to microelectronics industry point of view. The comparison shows that the highest speed is given by HMC v.1.0 which reaches 15GBps supported with power management per link. On the other hand, Flex-OneNAND provides single flash chip with ultra-high density of NAND and simplified interface of NOR with the simplest architecture at very attractive price points. WideIO offers more bandwidth at lower power. Regarding the lowest power consumption, eMMC is sparkling. UFS combines the speed of SSD with the slim form factor and low power of eMMC. ONFI supports increased performance through parallelism using multiple logic units and interleaved addressing. This comparison is very powerful for designers to decide which memory controller is suitable for their applications and satisfies their requirements.
本文阐述了Flex-OneNAND、Open NAND Flash memory (ONFI 3.1)、Embedded multimedia Card (eMMC v.5.0)、Hybrid memory Cube (HMC v.1.0) WideIO和Universal Flash Storage (UFS)六种存储架构之间的差异。本文展示了这种区别性差异对选择最适合特定应用的体系结构的影响。比较是根据微电子工业的最重要的特点来进行的。比较表明,HMC v.1.0提供了最高的速度,支持每链路电源管理,达到15GBps。另一方面,Flex-OneNAND提供具有超高NAND密度和简化NOR接口的单一闪存芯片,结构最简单,价格极具吸引力。WideIO以更低的功耗提供更多的带宽。在最低功耗方面,eMMC是闪亮的。UFS结合了SSD的速度与eMMC的纤薄外形和低功耗。ONFI通过使用多个逻辑单元和交错寻址的并行性来提高性能。这种比较对设计人员决定哪种内存控制器适合他们的应用程序并满足他们的要求非常有用。
{"title":"Memory controller architectures: A comparative study","authors":"K. Khalifa, H. Fawzy, Sameh El-Ashry, K. Salah","doi":"10.1109/IDT.2013.6727083","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727083","url":null,"abstract":"This paper clarifies the differences between six memory architectures, which are Flex-OneNAND, Open NAND Flash Memory (ONFI 3.1), Embedded Multi-Media Card (eMMC v.5.0), Hybrid Memory Cube (HMC v.1.0) WideIO, and Universal Flash Storage (UFS). The paper shows the impact of such discriminating differences on choosing the most suitable architecture for certain application. The comparison is done in terms of most important features to microelectronics industry point of view. The comparison shows that the highest speed is given by HMC v.1.0 which reaches 15GBps supported with power management per link. On the other hand, Flex-OneNAND provides single flash chip with ultra-high density of NAND and simplified interface of NOR with the simplest architecture at very attractive price points. WideIO offers more bandwidth at lower power. Regarding the lowest power consumption, eMMC is sparkling. UFS combines the speed of SSD with the slim form factor and low power of eMMC. ONFI supports increased performance through parallelism using multiple logic units and interleaved addressing. This comparison is very powerful for designers to decide which memory controller is suitable for their applications and satisfies their requirements.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126848356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
High radix montgomery modular multiplication on FPGA 基于FPGA的高基数蒙哥马利模乘法
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727148
A. Mohamed, Anane Nadjia
Enhancing Montgomery modular multiplication (MMM) performances in term of speed and area is crucial for public key cryptography applications. This paper presents an efficient hardware-algorithm for a high radix MMM method that exploits the features available in the Virtex-5 Xilinx FPGA. Our main contribution in this paper is to develop hardware algorithms for radix-216 number system in the FPGA to speed up the MMM. It performs an operation of two 1024-bits numbers on 64 iterations. The CS (Carry Save) representation is advantageously used to overcome the carry propagation then the iteration cycle datapath length independent. Specials efforts were made to design, at the LUT level, the compressor 6:2, which is the key feature of our design. The resulting architecture can run with clock period equivalent to the total delay of an embedded 18×18-bits and two LUT6.
在速度和面积方面增强Montgomery模乘法(MMM)性能对于公钥加密应用至关重要。本文利用Virtex-5 Xilinx FPGA的特点,提出了一种高效的高基数MMM方法的硬件算法。本文的主要贡献是在FPGA上开发了基数-216数系统的硬件算法,以加快MMM的速度。它在64次迭代中执行两个1024位数字的操作。CS(进位保存)表示有利于克服进位传播,从而使迭代周期与数据路径长度无关。在LUT级别,我们特别努力设计了6:2的压缩机,这是我们设计的关键特征。由此产生的体系结构可以在时钟周期相当于一个嵌入式18×18-bits和两个LUT6的总延迟的情况下运行。
{"title":"High radix montgomery modular multiplication on FPGA","authors":"A. Mohamed, Anane Nadjia","doi":"10.1109/IDT.2013.6727148","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727148","url":null,"abstract":"Enhancing Montgomery modular multiplication (MMM) performances in term of speed and area is crucial for public key cryptography applications. This paper presents an efficient hardware-algorithm for a high radix MMM method that exploits the features available in the Virtex-5 Xilinx FPGA. Our main contribution in this paper is to develop hardware algorithms for radix-216 number system in the FPGA to speed up the MMM. It performs an operation of two 1024-bits numbers on 64 iterations. The CS (Carry Save) representation is advantageously used to overcome the carry propagation then the iteration cycle datapath length independent. Specials efforts were made to design, at the LUT level, the compressor 6:2, which is the key feature of our design. The resulting architecture can run with clock period equivalent to the total delay of an embedded 18×18-bits and two LUT6.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126618647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The optimum Booth radix for low power integer multipliers 低功率整数乘法器的最佳布斯基数
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727119
H. Saleh, B. Mohammad, E. Swartzlander
This paper investigates the optimum Booth integer multiplier for low power applications. Booth radix-4, radix-8 and radix-16 were compared for area, speed and power using standard-cell ASIC design flow and 28nm CMOS technology. All of the investigated designs were implemented in RTL, fully verified and then synthesized using 28nm standard-cell libraries which have low leakage slow cells, regular leakage average-speed cells and high-leakage fast-speed cells. The area, speed and power were compared to determine the best choice for low power designs. Among the three investigated designs, the Booth radix-4 was the best choice, it had the lowest area, power and fastest execution speed among the 3-choices. It is worthy of note that radix-8 had lower leakage power and overall power among the three designs when implemented using LVT cells. So for power sensitive and high-speed applications radix-8 could be a better choice with overhead of about 18% area and 3% slower.
本文研究了低功耗应用的最佳布斯整数乘法器。采用标准单元ASIC设计流程和28nm CMOS技术,对基数4、基数8和基数16的面积、速度和功耗进行了比较。所有设计都在RTL中实现,充分验证并使用28nm标准细胞库进行合成,该标准细胞库包括低泄漏慢速细胞,常规泄漏平均速度细胞和高泄漏快速细胞。对面积、速度和功率进行比较,以确定低功耗设计的最佳选择。在三个被调查的设计中,Booth基数-4是最佳选择,它在三个选择中具有最小的面积,功耗和最快的执行速度。值得注意的是,当使用LVT单元实现时,radix-8在三种设计中具有较低的泄漏功率和总功率。因此,对于功率敏感和高速应用程序,基数-8可能是更好的选择,开销约为18%的面积,速度慢3%。
{"title":"The optimum Booth radix for low power integer multipliers","authors":"H. Saleh, B. Mohammad, E. Swartzlander","doi":"10.1109/IDT.2013.6727119","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727119","url":null,"abstract":"This paper investigates the optimum Booth integer multiplier for low power applications. Booth radix-4, radix-8 and radix-16 were compared for area, speed and power using standard-cell ASIC design flow and 28nm CMOS technology. All of the investigated designs were implemented in RTL, fully verified and then synthesized using 28nm standard-cell libraries which have low leakage slow cells, regular leakage average-speed cells and high-leakage fast-speed cells. The area, speed and power were compared to determine the best choice for low power designs. Among the three investigated designs, the Booth radix-4 was the best choice, it had the lowest area, power and fastest execution speed among the 3-choices. It is worthy of note that radix-8 had lower leakage power and overall power among the three designs when implemented using LVT cells. So for power sensitive and high-speed applications radix-8 could be a better choice with overhead of about 18% area and 3% slower.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125055519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On the design of a high-performance digital radar system 一种高性能数字雷达系统的设计
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727099
H. Mir, L. Albasha
The design of a wideband digital radar system is presented. The system operates at S-band and uses a unique stretch-processing based architecture. The two fully digital receiver channels enhance the system dynamic range and enable the application of DSP algorithms. Experimental results verify that the system can achieve an in-band dynamic range of 60 dB across 600 MHz of instantaneous bandwidth.
介绍了一种宽带数字雷达系统的设计。该系统工作在s波段,并使用独特的拉伸处理架构。两个全数字接收通道增强了系统的动态范围,使DSP算法的应用成为可能。实验结果表明,该系统在600 MHz瞬时带宽范围内可实现60 dB的带内动态范围。
{"title":"On the design of a high-performance digital radar system","authors":"H. Mir, L. Albasha","doi":"10.1109/IDT.2013.6727099","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727099","url":null,"abstract":"The design of a wideband digital radar system is presented. The system operates at S-band and uses a unique stretch-processing based architecture. The two fully digital receiver channels enhance the system dynamic range and enable the application of DSP algorithms. Experimental results verify that the system can achieve an in-band dynamic range of 60 dB across 600 MHz of instantaneous bandwidth.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128336181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Performance comparison between air-gap based coaxial TSV and conventional circular TSV in 3D-ICs 3d - ic中基于气隙的同轴TSV与传统圆形TSV的性能比较
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727102
K. Salah
In this paper, performance comparison between Air-Gap Based Coaxial TSV and conventional circular TSV are presented. The comparison shows that the air-gap TSVs reduce the overall parasitic capacitance and the overall energy loss compared to the conventional circular TSV or conventional coaxial TSV.
本文对基于气隙的同轴TSV与传统圆形TSV的性能进行了比较。结果表明,与传统的圆形TSV和同轴TSV相比,气隙TSV减小了总寄生电容和总能量损失。
{"title":"Performance comparison between air-gap based coaxial TSV and conventional circular TSV in 3D-ICs","authors":"K. Salah","doi":"10.1109/IDT.2013.6727102","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727102","url":null,"abstract":"In this paper, performance comparison between Air-Gap Based Coaxial TSV and conventional circular TSV are presented. The comparison shows that the air-gap TSVs reduce the overall parasitic capacitance and the overall energy loss compared to the conventional circular TSV or conventional coaxial TSV.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128517087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Grammar-based program generation based on model finding 基于模型发现的基于语法的程序生成
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727084
Mathias Soeken, R. Drechsler
This paper presents an algorithm that generates test programs in order to test programming languages and domain specific languages using formal methods. The novelty of the approach is that it is embedded into a model driven engineering environment and it is described as a model finding problem. The grammar of the language and the respective test programs are represented as meta-models and models, respectively. As a result, model finders are utilized to generate test programs based on user constraints while additionally ensuring embedded constraints of the programmmg languages. An experimental evaluation demonstrates the applicability of the approach.
本文提出了一种生成测试程序的算法,以便使用形式化方法对编程语言和领域特定语言进行测试。该方法的新颖之处在于它被嵌入到模型驱动的工程环境中,并被描述为一个模型查找问题。语言的语法和相应的测试程序分别表示为元模型和模型。因此,模型查找器被用来生成基于用户约束的测试程序,同时额外地确保编程语言的嵌入式约束。实验验证了该方法的适用性。
{"title":"Grammar-based program generation based on model finding","authors":"Mathias Soeken, R. Drechsler","doi":"10.1109/IDT.2013.6727084","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727084","url":null,"abstract":"This paper presents an algorithm that generates test programs in order to test programming languages and domain specific languages using formal methods. The novelty of the approach is that it is embedded into a model driven engineering environment and it is described as a model finding problem. The grammar of the language and the respective test programs are represented as meta-models and models, respectively. As a result, model finders are utilized to generate test programs based on user constraints while additionally ensuring embedded constraints of the programmmg languages. An experimental evaluation demonstrates the applicability of the approach.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"2008 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125592383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Traffic-based virtual channel activation for low-power NoC 基于流量的低功耗NoC虚拟信道激活
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727077
S. Muhammad, M. El-Moursy, Ali A. Ei-Moursy, A. M. Refaat
Low leakage power with maintained high throughput NoC is achieved. Traffic-based Virtual channel Activation algorithm (TVA) is proposed to activate/deactivate virtual channels in a NoC. The proposed algorithm implements Adaptive Virtual Channel technique of switching-OFF idle virtual channels in the NoC according to traffic heaviness. TVA is an efficient and flexible algorithm which provides set of parameters to be tuned and combined to achieve high performance and high power saving. NoC average leakage power has been reduced by 73.5% with negligible less than 1% degradation in throughput.
低泄漏功率与保持高通量NoC实现。提出了基于业务量的虚拟通道激活算法(TVA)来激活/取消激活NoC中的虚拟通道。该算法实现了自适应虚拟信道技术,根据通信量的大小关闭NoC中空闲的虚拟信道。TVA算法是一种高效而灵活的算法,它提供了一组可调和组合的参数,以实现高性能和高功耗。NoC平均泄漏功率降低了73.5%,吞吐量下降不到1%。
{"title":"Traffic-based virtual channel activation for low-power NoC","authors":"S. Muhammad, M. El-Moursy, Ali A. Ei-Moursy, A. M. Refaat","doi":"10.1109/IDT.2013.6727077","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727077","url":null,"abstract":"Low leakage power with maintained high throughput NoC is achieved. Traffic-based Virtual channel Activation algorithm (TVA) is proposed to activate/deactivate virtual channels in a NoC. The proposed algorithm implements Adaptive Virtual Channel technique of switching-OFF idle virtual channels in the NoC according to traffic heaviness. TVA is an efficient and flexible algorithm which provides set of parameters to be tuned and combined to achieve high performance and high power saving. NoC average leakage power has been reduced by 73.5% with negligible less than 1% degradation in throughput.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130868250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Performance of different wavelet families using DWT and DWPT-channel equalization using ZF and MMSE 用DWT和dwpt进行不同小波族的性能分析,用ZF和MMSE进行信道均衡
Pub Date : 2013-12-01 DOI: 10.1109/IDT.2013.6727136
R. Asif, A. Hussaini, R. Abd‐Alhameed, S. R. Jones, J. Noras, E. Elkhazmi, Jonathan Rodriguez
We have studied the performance of multidimensional signaling techniques using wavelets based modulation within an orthogonally multiplexed communication system. The discrete wavelets transform and wavelet packet modulation techniques have been studied using Daubechies 2 and 8, Biothogonal1.5 and 3.1 and reverse Biorthognal 1.5 and 3.1 wavelets in the presence of Rayleigh multipath fading channels with AWGN. Results showed that DWT based systems outperform WPM systems both in terms of BER vs. SNR performance as well as processing. The performances of two different equalizations techniques, namely zero forcing (ZF) and minimum mean square error (MMSE), were also compared using DWT. When the channel is modeled using Rayleigh multipath fading, AWGN and ISI both techniques yield similar performance.
我们研究了在正交复用通信系统中使用基于小波的调制的多维信号技术的性能。在具有AWGN的瑞利多径衰落信道存在的情况下,利用Daubechies 2和8、Biothogonal1.5和3.1以及反向双正交1.5和3.1小波,研究了离散小波变换和小波包调制技术。结果表明,基于DWT的系统在误码率与信噪比性能以及处理方面都优于WPM系统。利用DWT比较了零强迫(ZF)和最小均方误差(MMSE)两种不同均衡技术的性能。当使用瑞利多径衰落信道建模时,AWGN和ISI两种技术都产生相似的性能。
{"title":"Performance of different wavelet families using DWT and DWPT-channel equalization using ZF and MMSE","authors":"R. Asif, A. Hussaini, R. Abd‐Alhameed, S. R. Jones, J. Noras, E. Elkhazmi, Jonathan Rodriguez","doi":"10.1109/IDT.2013.6727136","DOIUrl":"https://doi.org/10.1109/IDT.2013.6727136","url":null,"abstract":"We have studied the performance of multidimensional signaling techniques using wavelets based modulation within an orthogonally multiplexed communication system. The discrete wavelets transform and wavelet packet modulation techniques have been studied using Daubechies 2 and 8, Biothogonal1.5 and 3.1 and reverse Biorthognal 1.5 and 3.1 wavelets in the presence of Rayleigh multipath fading channels with AWGN. Results showed that DWT based systems outperform WPM systems both in terms of BER vs. SNR performance as well as processing. The performances of two different equalizations techniques, namely zero forcing (ZF) and minimum mean square error (MMSE), were also compared using DWT. When the channel is modeled using Rayleigh multipath fading, AWGN and ISI both techniques yield similar performance.","PeriodicalId":446826,"journal":{"name":"2013 8th IEEE Design and Test Symposium","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133281648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2013 8th IEEE Design and Test Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1