首页 > 最新文献

2011 18th IEEE International Conference on Electronics, Circuits, and Systems最新文献

英文 中文
Design of low voltage low power dual-band LNA with forward body biasing technique 基于前向体偏置技术的低电压低功耗双频LNA设计
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122344
A. Dehqan, K. Mafinezhad, E. Kargaran, H. Nabovati
A low voltage, low power dual band Low Noise Amplifier (LNA) is presented in this paper. By employing a forward body bias of the MOSFET and current reuse topology the LNA can be operated at reduced supply voltage and power consumption while maintaining high gain due to its topology. Using 0.18 um CMOS process the LNA is designed at 2.4 GHz and 5.2GHz with 13.1 dB and 14.2 dB voltage gains and 2.9dB and 2.6dB NF respectively with 0.7V supply voltage and 3mW power consumption.
本文介绍了一种低电压、低功耗的双频低噪声放大器。通过采用MOSFET的正向偏置和电流复用拓扑,LNA可以在降低电源电压和功耗的情况下工作,同时由于其拓扑结构而保持高增益。LNA采用0.18 um CMOS工艺设计,工作频率分别为2.4 GHz和5.2GHz,电压增益分别为13.1 dB和14.2 dB, NF增益分别为2.9dB和2.6dB,电源电压为0.7V,功耗为3mW。
{"title":"Design of low voltage low power dual-band LNA with forward body biasing technique","authors":"A. Dehqan, K. Mafinezhad, E. Kargaran, H. Nabovati","doi":"10.1109/ICECS.2011.6122344","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122344","url":null,"abstract":"A low voltage, low power dual band Low Noise Amplifier (LNA) is presented in this paper. By employing a forward body bias of the MOSFET and current reuse topology the LNA can be operated at reduced supply voltage and power consumption while maintaining high gain due to its topology. Using 0.18 um CMOS process the LNA is designed at 2.4 GHz and 5.2GHz with 13.1 dB and 14.2 dB voltage gains and 2.9dB and 2.6dB NF respectively with 0.7V supply voltage and 3mW power consumption.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"49 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131374277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Implementation of a cost efficient SSL based on an Angular beamformer SRP-PHAT 基于Angular波束形成器SRP-PHAT的低成本SSL实现
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122211
H. Zarghi, M. Sharifkhani, I. Gholampour
Among Sound Source Localization (SSL) methods, beamformed microphone arrays using Steering Response Power(SRP) has received significant attention. Yet, its application is stumbled by its computational complexity which cannot meet low-power/low-cost applications requirements. In this paper, Angular beamforming is presented. By applying this beamforming approach to the conventional SRP-PHAT one can find that the complexity of SRP-PHAT will decreased by two orders of magnitudes with a mere three fold reduction in resolution. The proposed method is implemented both on FPGA and 0.18um CMOS technology for a practical case.
在声源定位(SSL)方法中,利用转向响应功率(SRP)的波束形成麦克风阵列受到了广泛的关注。然而,其计算复杂度无法满足低功耗/低成本应用的要求,阻碍了其应用。本文介绍了角波束形成技术。通过将这种波束形成方法应用于传统的SRP-PHAT,可以发现SRP-PHAT的复杂性将降低两个数量级,而分辨率仅降低三倍。该方法同时在FPGA和0.18um CMOS技术上实现。
{"title":"Implementation of a cost efficient SSL based on an Angular beamformer SRP-PHAT","authors":"H. Zarghi, M. Sharifkhani, I. Gholampour","doi":"10.1109/ICECS.2011.6122211","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122211","url":null,"abstract":"Among Sound Source Localization (SSL) methods, beamformed microphone arrays using Steering Response Power(SRP) has received significant attention. Yet, its application is stumbled by its computational complexity which cannot meet low-power/low-cost applications requirements. In this paper, Angular beamforming is presented. By applying this beamforming approach to the conventional SRP-PHAT one can find that the complexity of SRP-PHAT will decreased by two orders of magnitudes with a mere three fold reduction in resolution. The proposed method is implemented both on FPGA and 0.18um CMOS technology for a practical case.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130219744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A digital circuit for extracting singular points from fingerprint images 一种从指纹图像中提取奇异点的数字电路
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122353
Rosario Arjona, I. Baturone
Since singular point extraction plays an important role in many fingerprint recognition systems, a digital circuit to implement such processing is presented herein. A novel algorithm that combines hardware efficiency with precision in the extraction of the points has been developed. The circuit architecture contains three main building blocks to carry out the three main stages of the algorithm: extraction of a partitioned directional image, smoothing, and searching for the patterns associated with singular points. The circuit processes the pixels in a serial way, following a pipeline scheme and executing in parallel several operations. The design flow employed has been supported by CAD tools. It starts with high-level descriptions and ends with the hardware prototyping into a FPGA from Xilinx.
由于奇点提取在许多指纹识别系统中起着重要的作用,本文提出了一种实现这种处理的数字电路。提出了一种结合硬件效率和精度的点提取算法。电路架构包含三个主要的构建块来执行算法的三个主要阶段:提取分割的方向图像,平滑和搜索与奇异点相关的模式。该电路以串行方式处理像素,遵循流水线方案并并行执行若干操作。所采用的设计流程已得到CAD工具的支持。它从高级描述开始,以硬件原型设计到Xilinx的FPGA结束。
{"title":"A digital circuit for extracting singular points from fingerprint images","authors":"Rosario Arjona, I. Baturone","doi":"10.1109/ICECS.2011.6122353","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122353","url":null,"abstract":"Since singular point extraction plays an important role in many fingerprint recognition systems, a digital circuit to implement such processing is presented herein. A novel algorithm that combines hardware efficiency with precision in the extraction of the points has been developed. The circuit architecture contains three main building blocks to carry out the three main stages of the algorithm: extraction of a partitioned directional image, smoothing, and searching for the patterns associated with singular points. The circuit processes the pixels in a serial way, following a pipeline scheme and executing in parallel several operations. The design flow employed has been supported by CAD tools. It starts with high-level descriptions and ends with the hardware prototyping into a FPGA from Xilinx.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128871047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Design of new full adder cell using hybrid-CMOS logic style 采用混合cmos逻辑风格的新型全加法器单元的设计
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122310
Mohammad Javad Zavarei, MohammadReza Baghbanmanesh, E. Kargaran, H. Nabovati, A. Golmakani
In this paper, we propose a novel 1-bit full adder using hybrid-CMOS logic style. The new full adder is based on a novel XOR-XNOR circuit that generates XOR and XNOR full-swing outputs simultaneously and outperforms its best counterpart showing 28% improvement in power-delay product (PDP). Design of proposed full adder is based on improvement in the PDP and it provides full-swing output with good driving capability. Simulations demonstrate that full adder successfully operates in the PDP compared to similar circuits.
在本文中,我们提出了一种新的1位全加法器,采用混合cmos逻辑风格。新的全加法器基于一种新颖的XOR-XNOR电路,可以同时产生XOR和XNOR全摆幅输出,并且在功率延迟产品(PDP)方面表现出28%的改进。本文提出的全加法器是在改进PDP的基础上设计的,它提供了具有良好驱动能力的全摆幅输出。仿真结果表明,与同类电路相比,全加法器在PDP中运行良好。
{"title":"Design of new full adder cell using hybrid-CMOS logic style","authors":"Mohammad Javad Zavarei, MohammadReza Baghbanmanesh, E. Kargaran, H. Nabovati, A. Golmakani","doi":"10.1109/ICECS.2011.6122310","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122310","url":null,"abstract":"In this paper, we propose a novel 1-bit full adder using hybrid-CMOS logic style. The new full adder is based on a novel XOR-XNOR circuit that generates XOR and XNOR full-swing outputs simultaneously and outperforms its best counterpart showing 28% improvement in power-delay product (PDP). Design of proposed full adder is based on improvement in the PDP and it provides full-swing output with good driving capability. Simulations demonstrate that full adder successfully operates in the PDP compared to similar circuits.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128898826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
An efficient multiple precision floating-point multiplier 一个高效的多精度浮点乘法器
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122237
K. Manolopoulos, D. Reisis, V. Chouliaras
The current paper presents a multi-mode floating point multiplier operating efficiently with every precision format specified by the IEEE 754–2008 standard. The design performs one quadruple precision multiplication, or two double precision multiplications in parallel, or four single precision multiplications in parallel. The proposed multiplier is pipelined to achieve execution of one quadruple multiplication in 3 cycles and either two double precision operations in parallel or four single precision operations in parallel in only 2 cycles. The proposed design improves the throughput by a factor of two compared to a double precision multiplier and by four compared to a single precision multiplication. An example implementation on VLSI verifies the design and it achieves a maximum operating frequency of 505 MHz.
本文提出了一种多模浮点乘法器,可以在IEEE 754-2008标准规定的各种精度格式下高效地工作。该设计执行一次四倍精度乘法,或并行执行两次双精度乘法,或并行执行四次单精度乘法。所提出的乘法器被流水线化,以实现在3个周期内执行一次四倍乘法,并在2个周期内并行执行两次双精度运算或并行执行四个单精度运算。与双精度乘法器相比,所提出的设计将吞吐量提高了两倍,与单精度乘法器相比提高了四倍。在VLSI上的实例实现验证了该设计,最大工作频率达到505 MHz。
{"title":"An efficient multiple precision floating-point multiplier","authors":"K. Manolopoulos, D. Reisis, V. Chouliaras","doi":"10.1109/ICECS.2011.6122237","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122237","url":null,"abstract":"The current paper presents a multi-mode floating point multiplier operating efficiently with every precision format specified by the IEEE 754–2008 standard. The design performs one quadruple precision multiplication, or two double precision multiplications in parallel, or four single precision multiplications in parallel. The proposed multiplier is pipelined to achieve execution of one quadruple multiplication in 3 cycles and either two double precision operations in parallel or four single precision operations in parallel in only 2 cycles. The proposed design improves the throughput by a factor of two compared to a double precision multiplier and by four compared to a single precision multiplication. An example implementation on VLSI verifies the design and it achieves a maximum operating frequency of 505 MHz.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121662923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Image processing technique for segmenting microstructural porosity of laser-welded thermoplastics 激光焊接热塑性塑料微结构孔隙度分割的图像处理技术
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122385
K. Leboeuf, Iman Makaremi, R. Muscedere, M. Ahmadi
Plastics are used in a truly vast number of applications, and research is continously carried out to improve every aspect of the plastics industry. A recent study of laser transmission welding [1] required cross-sectional images of the weld's microstructure to be analyzed for the presence of pores, which are tiny bubbles that may form during the weld process. It is believed that the number and size of pores may be indicative of the weld strength [1]. The current state of the art for detecting these pores involves manually drawing a contour around each one; a laborious process given that a typical sample may have hundreds-to-thousands of pores. This paper presents a segmentation system for classifying the pixels of a microstructural image of a thermoplastic laser weld as either belonging to a pore or the background. The algorithm is robust in terms of dealing with noise from flbreglass strands, cloudy pores, and varying exposure time. On average, it is estimated that the proposed algorithm is able to correctly classify pores at a rate of approximately 90% without requiring any user intervention.
塑料的应用范围非常广泛,不断进行研究以改进塑料工业的各个方面。最近一项关于激光透射焊接的研究[1]需要对焊缝微观结构的横截面图像进行分析,以确定是否存在气孔,气孔是焊接过程中可能形成的微小气泡。人们认为气孔的数量和大小可以反映焊缝的强度[1]。用于检测这些孔隙的当前技术状态涉及在每个孔周围手动绘制轮廓;考虑到一个典型的样品可能有成百上千个孔,这是一个费力的过程。本文提出了一种用于热塑性激光焊接显微结构图像像素点分类的分割系统。该算法在处理纤维玻璃线、浑浊孔隙和不同曝光时间的噪声方面具有鲁棒性。平均而言,估计所提出的算法能够在不需要任何用户干预的情况下以大约90%的率正确分类孔隙。
{"title":"Image processing technique for segmenting microstructural porosity of laser-welded thermoplastics","authors":"K. Leboeuf, Iman Makaremi, R. Muscedere, M. Ahmadi","doi":"10.1109/ICECS.2011.6122385","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122385","url":null,"abstract":"Plastics are used in a truly vast number of applications, and research is continously carried out to improve every aspect of the plastics industry. A recent study of laser transmission welding [1] required cross-sectional images of the weld's microstructure to be analyzed for the presence of pores, which are tiny bubbles that may form during the weld process. It is believed that the number and size of pores may be indicative of the weld strength [1]. The current state of the art for detecting these pores involves manually drawing a contour around each one; a laborious process given that a typical sample may have hundreds-to-thousands of pores. This paper presents a segmentation system for classifying the pixels of a microstructural image of a thermoplastic laser weld as either belonging to a pore or the background. The algorithm is robust in terms of dealing with noise from flbreglass strands, cloudy pores, and varying exposure time. On average, it is estimated that the proposed algorithm is able to correctly classify pores at a rate of approximately 90% without requiring any user intervention.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116356399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhancing synchronizability of complex networks by community weakening 利用群落弱化增强复杂网络的同步性
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122306
Jin Fan
Many complex networks show community structure, i.e., fewer connections between different subgraphs than within each subgraph. In this paper, synchronization as an important dynamics of complex networks is studied. Inspired on the growth and preferential attachment, an evolving community network is proposed firstly. By exploiting the inner- and intercommunity connections, we find that weakening the community structure, that is, rewiring the inner-community connections into the inter-community ones, could efficiently improve the synchronizability of complex networks.
许多复杂网络表现出社区结构,即不同子图之间的连接比每个子图内部的连接要少。本文将同步作为复杂网络的一个重要动力学问题进行了研究。基于生长和优先依恋的理论,本文首先提出了一个进化的社区网络。通过对社区内部连接和社区间连接的研究,我们发现弱化社区结构,即将社区内部连接重新连接到社区间连接,可以有效地提高复杂网络的同步性。
{"title":"Enhancing synchronizability of complex networks by community weakening","authors":"Jin Fan","doi":"10.1109/ICECS.2011.6122306","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122306","url":null,"abstract":"Many complex networks show community structure, i.e., fewer connections between different subgraphs than within each subgraph. In this paper, synchronization as an important dynamics of complex networks is studied. Inspired on the growth and preferential attachment, an evolving community network is proposed firstly. By exploiting the inner- and intercommunity connections, we find that weakening the community structure, that is, rewiring the inner-community connections into the inter-community ones, could efficiently improve the synchronizability of complex networks.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125468894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Middleware switch ASIC implementation 中间件开关ASIC实现
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122369
V. Petrovic, G. Schoof, S. Montenegro
The middleware (MW) switch processor as a part of the new spacecraft area network (SCAN) system for internal satellite communication provides the data transfer between different components, sensors and devices. The new data transfer approach provides more reliable, cheaper and faster solution instead of current board computer based systems. The processor is currently in fabrication process in the 250 nm IHP technology. In this paper we are representing description of MW switch architecture, comparison between different architecture approaches and properties of the implemented MW Switch processor.
中间件(MW)交换处理器作为新型航天器区域网络(SCAN)系统的一部分,用于卫星内部通信,提供不同组件、传感器和设备之间的数据传输。新的数据传输方法提供了比目前基于板机的系统更可靠、更便宜和更快的解决方案。该处理器目前正处于250纳米IHP技术的制造过程中。在本文中,我们描述了MW开关的体系结构,比较了不同的体系结构方法和实现的MW开关处理器的特性。
{"title":"Middleware switch ASIC implementation","authors":"V. Petrovic, G. Schoof, S. Montenegro","doi":"10.1109/ICECS.2011.6122369","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122369","url":null,"abstract":"The middleware (MW) switch processor as a part of the new spacecraft area network (SCAN) system for internal satellite communication provides the data transfer between different components, sensors and devices. The new data transfer approach provides more reliable, cheaper and faster solution instead of current board computer based systems. The processor is currently in fabrication process in the 250 nm IHP technology. In this paper we are representing description of MW switch architecture, comparison between different architecture approaches and properties of the implemented MW Switch processor.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127349406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An ultra-fast hybrid simulation framework for ASIP 一种用于ASIP的超快速混合仿真框架
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122373
Ji Qiu, Xiang Gao, Yifei Jiang, Xu Xiao
ISS (Instruction Set Simulator) plays an important role in pre-silicon software development for ASIP. However, the speed of traditional simulation is too slow to effectively support full-scale software development. In this paper, we propose a hybrid simulation framework which further improves the previous simulation methods by aggressively utilizing the host machine resources. The utilization is achieved by categorizing instructions of ASIP application into two types, namely custom and basic instructions, via binary instrumentation. Then in a way of hybrid simulation, only custom instructions are simulated on the ISS and basic instructions are executed fast and natively on the host machine. We implement this framework for an industrial ASIP to validate our approach. Experimental results show that when the implemented ISS, namely GS-Sim, is applied to practical multimedia decoders, an average simulation speed up to 1058.5MIPS can be achieved, which is 34.7 times of the state-of-art dynamic binary translation simulator and is the fastest to the best of our knowledge.
指令集模拟器在ASIP的预硅软件开发中起着重要的作用。然而,传统的仿真速度太慢,无法有效地支持全面的软件开发。在本文中,我们提出了一种混合仿真框架,通过积极利用主机资源,进一步改进了以前的仿真方法。通过二进制检测,将ASIP应用程序的指令分为自定义指令和基本指令两种类型,从而实现了对ASIP应用程序的利用。然后采用混合仿真的方式,在国际空间站上只模拟自定义指令,在主机上快速本地执行基本指令。我们为一个工业ASIP实现这个框架来验证我们的方法。实验结果表明,将实现的ISS即GS-Sim应用于实际的多媒体解码器中,平均模拟速度可达1058.5MIPS,是目前最先进的动态二进制转换模拟器的34.7倍,是目前所知的最快的模拟速度。
{"title":"An ultra-fast hybrid simulation framework for ASIP","authors":"Ji Qiu, Xiang Gao, Yifei Jiang, Xu Xiao","doi":"10.1109/ICECS.2011.6122373","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122373","url":null,"abstract":"ISS (Instruction Set Simulator) plays an important role in pre-silicon software development for ASIP. However, the speed of traditional simulation is too slow to effectively support full-scale software development. In this paper, we propose a hybrid simulation framework which further improves the previous simulation methods by aggressively utilizing the host machine resources. The utilization is achieved by categorizing instructions of ASIP application into two types, namely custom and basic instructions, via binary instrumentation. Then in a way of hybrid simulation, only custom instructions are simulated on the ISS and basic instructions are executed fast and natively on the host machine. We implement this framework for an industrial ASIP to validate our approach. Experimental results show that when the implemented ISS, namely GS-Sim, is applied to practical multimedia decoders, an average simulation speed up to 1058.5MIPS can be achieved, which is 34.7 times of the state-of-art dynamic binary translation simulator and is the fastest to the best of our knowledge.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122210832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A multi-bit error tolerant register file for a high reliable embedded processor 用于高可靠性嵌入式处理器的多比特容错寄存器文件
Pub Date : 2011-12-01 DOI: 10.1109/ICECS.2011.6122330
S. Esmaeeli, M. Hosseini, B. Vahdat, B. Rashidian
The vulnerability of microprocessors to soft errors is increasing due to continuous shrinking in fabrication process. Recent studies show that 1–5% of the SEUs (single event upset) can cause MBUs (multiple bit upsets). The probability of MBU generation due to SEU is increasing because of the reduction in minimum energy required to flip a memory bit in modern technologies. Register file is the most sensitive component in a microprocessor. In this paper, we present an innovative way to protect registers in a 64-bit register file for a RISC processor using extended Hamming (8, 4) code (SEC-DED code) and narrow-width values. A narrow-width value can be represented by half number of bits of the register width. Two additional bits for each data register have been used to store the information for a narrow-width value. Each 64-bit data in register file has its unique 64-bit extended Hamming code that is stored in another register file in a bit-interleaved manner. Two copies of narrow-width values can be stored in one register and each copy has its unique extended Hamming code in other register file. Proposed method has been tested using fault injection simulation with SPEC2000 benchmarks. Error probability of a word that stores generated values for register file in SPEC2000 benchmarks and is protected with proposed method is less than the error probability of the same word that is protected with TMR or various extended Hamming codes. The implementation on a Xilinx Virtex-4 FPGA shows that the area overhead of a register file with 64-bit wide and more than 64-word entry that is protected with proposed method is less than the area overhead of the same register file that is protected with TMR. Error detection and correction is performed in parallel with execute stage to prevent performance degradation. More than 99% of errors in adjacent 32 bits in data or extended Hamming code registers can be corrected with the proposed method. Presented method employs pure combinational logics and can be used for 16-bit and 32-bit register files too.
由于微处理器在制造过程中的不断缩小,其对软误差的脆弱性越来越大。最近的研究表明,1-5%的seu(单事件扰流)会导致MBUs(多比特扰流)。由于现代技术中翻转存储位所需的最小能量降低,由于SEU而产生MBU的可能性正在增加。寄存器文件是微处理器中最敏感的元件。在本文中,我们提出了一种创新的方法来保护64位寄存器文件中的寄存器为RISC处理器使用扩展汉明(8,4)码(SEC-DED码)和窄宽度值。窄宽度值可以用寄存器宽度的一半位数表示。每个数据寄存器的两个附加位用于存储窄宽度值的信息。寄存器文件中的每个64位数据都有其独特的64位扩展汉明码,以位交错的方式存储在另一个寄存器文件中。窄宽度值的两个副本可以存储在一个寄存器中,每个副本在其他寄存器文件中都有其独特的扩展汉明码。用SPEC2000基准测试测试了该方法的故障注入仿真。在SPEC2000基准测试中,存储寄存器文件生成值并使用本文方法保护的单词的错误概率小于使用TMR或各种扩展汉明码保护的相同单词的错误概率。在Xilinx Virtex-4 FPGA上的实现表明,使用所提出的方法保护的64位宽且多于64字条目的寄存器文件的面积开销小于使用TMR保护的相同寄存器文件的面积开销。错误检测和纠正与执行阶段并行执行,以防止性能下降。该方法可对数据或扩展汉明码寄存器中相邻32位的错误进行99%以上的校正。该方法采用纯组合逻辑,可用于16位和32位的寄存器文件。
{"title":"A multi-bit error tolerant register file for a high reliable embedded processor","authors":"S. Esmaeeli, M. Hosseini, B. Vahdat, B. Rashidian","doi":"10.1109/ICECS.2011.6122330","DOIUrl":"https://doi.org/10.1109/ICECS.2011.6122330","url":null,"abstract":"The vulnerability of microprocessors to soft errors is increasing due to continuous shrinking in fabrication process. Recent studies show that 1–5% of the SEUs (single event upset) can cause MBUs (multiple bit upsets). The probability of MBU generation due to SEU is increasing because of the reduction in minimum energy required to flip a memory bit in modern technologies. Register file is the most sensitive component in a microprocessor. In this paper, we present an innovative way to protect registers in a 64-bit register file for a RISC processor using extended Hamming (8, 4) code (SEC-DED code) and narrow-width values. A narrow-width value can be represented by half number of bits of the register width. Two additional bits for each data register have been used to store the information for a narrow-width value. Each 64-bit data in register file has its unique 64-bit extended Hamming code that is stored in another register file in a bit-interleaved manner. Two copies of narrow-width values can be stored in one register and each copy has its unique extended Hamming code in other register file. Proposed method has been tested using fault injection simulation with SPEC2000 benchmarks. Error probability of a word that stores generated values for register file in SPEC2000 benchmarks and is protected with proposed method is less than the error probability of the same word that is protected with TMR or various extended Hamming codes. The implementation on a Xilinx Virtex-4 FPGA shows that the area overhead of a register file with 64-bit wide and more than 64-word entry that is protected with proposed method is less than the area overhead of the same register file that is protected with TMR. Error detection and correction is performed in parallel with execute stage to prevent performance degradation. More than 99% of errors in adjacent 32 bits in data or extended Hamming code registers can be corrected with the proposed method. Presented method employs pure combinational logics and can be used for 16-bit and 32-bit register files too.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131674724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
2011 18th IEEE International Conference on Electronics, Circuits, and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1