首页 > 最新文献

Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)最新文献

英文 中文
New 4-transistor XOR and XNOR designs 新的4晶体管XOR和XNOR设计
H. Bui, A. Al-Sheraidah, Yuke Wang
Exclusive-OR and exclusive-NOR gates are important in digital circuits. This paper proposes a new set of low power 4-transistor XOR/XNOR gates. Simulations have been performed on the circuits along with 10 other XOR gates. The results show that the new XOR/XNOR gates consistently consume less power than any other XOR/XNOR gate known at the time of publication. In fact, it consumes up to more than 3 times less power than the complementary CMOS implementation and can have 34% better propagation delay. The static energy-recovery XOR gate, which is the second least power-consuming, dissipates 10% more power than the new gates.
异或门和异或门在数字电路中非常重要。本文提出了一种新型的低功耗四晶体管的异或或门。对电路和其他10个异或门进行了仿真。结果表明,新的XOR/XNOR门在发布时始终比任何其他已知的XOR/XNOR门消耗更少的功率。事实上,它的功耗比互补的CMOS实现低3倍以上,并且可以有34%的传播延迟。静态能量回收异或门,这是第二低的功耗,耗电量比新的门多10%。
{"title":"New 4-transistor XOR and XNOR designs","authors":"H. Bui, A. Al-Sheraidah, Yuke Wang","doi":"10.1109/APASIC.2000.896899","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896899","url":null,"abstract":"Exclusive-OR and exclusive-NOR gates are important in digital circuits. This paper proposes a new set of low power 4-transistor XOR/XNOR gates. Simulations have been performed on the circuits along with 10 other XOR gates. The results show that the new XOR/XNOR gates consistently consume less power than any other XOR/XNOR gate known at the time of publication. In fact, it consumes up to more than 3 times less power than the complementary CMOS implementation and can have 34% better propagation delay. The static energy-recovery XOR gate, which is the second least power-consuming, dissipates 10% more power than the new gates.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115118532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 99
A 1.0 Gbps clock and data recovery circuit with two-XOR phase-frequency detector 带有双xor相频检测器的1.0 Gbps时钟和数据恢复电路
Dong-Hee Kim, Jin-Ku Kang
This paper describes a 1.0 Gbps clock and data recovery circuit with a simple PFD structure. The proposed circuit is based on a single loop controlled by a Phase Frequency Detector (PFD) which has two-XOR gates. The VCO composed of four differential buffer stages generates eight differential clocks each spaced by 45/spl deg/. The PFD generates the VCO control signal by comparing two different phase clocks and input data. The phase frequency capture range of PFD is decided by VCO operating range which is 380-720 MHz. The circuit operates on 800 Mbps to 1.2 Gbps data rate under 2.5 V supply using 0.25 /spl mu/m CMOS HSPICE simulation.
本文介绍了一种具有简单PFD结构的1.0 Gbps时钟和数据恢复电路。所提出的电路是基于一个由相位频率检测器(PFD)控制的单环,它有两个异或门。由四个差分缓冲级组成的VCO产生八个差分时钟,每个时钟间隔45/spl度/。PFD通过比较两个不同的相位时钟和输入数据来产生压控振荡器控制信号。PFD的相位频率捕获范围由压控振荡器工作范围380 ~ 720 MHz决定。该电路采用0.25 /spl mu/m CMOS HSPICE模拟,在2.5 V电源下工作于800 Mbps至1.2 Gbps的数据速率。
{"title":"A 1.0 Gbps clock and data recovery circuit with two-XOR phase-frequency detector","authors":"Dong-Hee Kim, Jin-Ku Kang","doi":"10.1109/APASIC.2000.896943","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896943","url":null,"abstract":"This paper describes a 1.0 Gbps clock and data recovery circuit with a simple PFD structure. The proposed circuit is based on a single loop controlled by a Phase Frequency Detector (PFD) which has two-XOR gates. The VCO composed of four differential buffer stages generates eight differential clocks each spaced by 45/spl deg/. The PFD generates the VCO control signal by comparing two different phase clocks and input data. The phase frequency capture range of PFD is decided by VCO operating range which is 380-720 MHz. The circuit operates on 800 Mbps to 1.2 Gbps data rate under 2.5 V supply using 0.25 /spl mu/m CMOS HSPICE simulation.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128597051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A low-complexity frame memory compression algorithm and its implementation for MPEG-2 video decoder 一种用于MPEG-2视频解码器的低复杂度帧存储压缩算法及其实现
T. Lee, Yong Kim, Sungjae Jang, Hoon Yoo, Jechang Jeong
An algorithm and its implementation to compress video frame data to be stored in external memory are proposed, thereby enabling a reduction of memory requirements and bandwidth. To support random access capability, frames are organized as 1/spl times/8 fixed-length segments, which are used as compression units. For both compression efficiency and low complexity, modified Hadamard transform (MHT) and Golomb-Rice (GR) coding are utilized. The compression ratio is fixed at 50% and objective and subjective video quality assessments are performed. Efficient ASIC implementation including Golomb-Rice encoding/decoding is also proposed to reduce the total compression/decompression latency time.
提出了一种将视频帧数据压缩到外部存储器的算法及其实现,从而降低了存储需求和带宽。为了支持随机访问能力,帧被组织为1/spl乘以/8个固定长度的段,这些段用作压缩单元。为了提高压缩效率和降低复杂度,采用了改进的Hadamard变换(MHT)和Golomb-Rice (GR)编码。压缩比固定为50%,并进行客观和主观视频质量评估。还提出了包括Golomb-Rice编码/解码在内的高效ASIC实现,以减少总压缩/解压缩延迟时间。
{"title":"A low-complexity frame memory compression algorithm and its implementation for MPEG-2 video decoder","authors":"T. Lee, Yong Kim, Sungjae Jang, Hoon Yoo, Jechang Jeong","doi":"10.1109/APASIC.2000.896919","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896919","url":null,"abstract":"An algorithm and its implementation to compress video frame data to be stored in external memory are proposed, thereby enabling a reduction of memory requirements and bandwidth. To support random access capability, frames are organized as 1/spl times/8 fixed-length segments, which are used as compression units. For both compression efficiency and low complexity, modified Hadamard transform (MHT) and Golomb-Rice (GR) coding are utilized. The compression ratio is fixed at 50% and objective and subjective video quality assessments are performed. Efficient ASIC implementation including Golomb-Rice encoding/decoding is also proposed to reduce the total compression/decompression latency time.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126098468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
CMOS capacitor coupling logic (C/sup 3/L) circuits CMOS电容耦合逻辑(C/sup 3/L)电路
Hong-Yi Huang, Teng-Neng Wang
The multi-valued logic has been drawing considerable attention as a promising candidate for building future integrated circuits. The capacitor coupling technique is one of the effective methods to approach the performance issue. In this paper the capacitor coupling logic (C/sup 3/L) circuit to implement CMOS logic gates is proposed. The multiple inputs NAND, NOR, AOI, and OAI gates can be easily realized using the technique. Furthermore, the carry and sum circuit of a full-adder is designed and verified.
多值逻辑作为构建未来集成电路的一个有前途的候选方案,已经引起了相当大的关注。电容耦合技术是解决性能问题的有效方法之一。本文提出了电容耦合逻辑(C/sup 3/L)电路来实现CMOS逻辑门。使用该技术可以很容易地实现多输入NAND、NOR、AOI和OAI门。设计并验证了全加法器的进位和电路。
{"title":"CMOS capacitor coupling logic (C/sup 3/L) circuits","authors":"Hong-Yi Huang, Teng-Neng Wang","doi":"10.1109/APASIC.2000.896901","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896901","url":null,"abstract":"The multi-valued logic has been drawing considerable attention as a promising candidate for building future integrated circuits. The capacitor coupling technique is one of the effective methods to approach the performance issue. In this paper the capacitor coupling logic (C/sup 3/L) circuit to implement CMOS logic gates is proposed. The multiple inputs NAND, NOR, AOI, and OAI gates can be easily realized using the technique. Furthermore, the carry and sum circuit of a full-adder is designed and verified.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128302547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A carry-free 54 b/spl times/54 b multiplier using equivalent bit conversion algorithm 使用等效位转换算法的无携带54b /spl倍/ 54b乘法器
Yun Kim, B. Song, J. Grosspietsch, S. Gillig
An equivalent bit conversion algorithm (EBCA) is proposed to eliminate the need for final carry propagation in redundant binary (RB) to normal binary (NB) conversion in the RB multiplier. When the EBCA is applied, conventional power-consuming carry-propagating adders are replaced by simple, minimum-sized converters, and complete carry-free multiplication from input to output is achieved. The proposed method significantly reduces the power and the conversion time in the final adder stage of traditional multipliers. The prototype fabricated in 0.35 /spl mu/m CMOS demonstrates that the 54 b/spl times/54 b multiplier consumes only 53.4 mW at 3.3 V for 74 MHz operation.
提出了一种等效位转换算法(EBCA),以消除冗余二进制(RB)到正常二进制(NB)转换过程中最终进位传播的需要。当应用EBCA时,传统的功耗携带传播加法器被简单,最小尺寸的转换器所取代,并且实现了从输入到输出的完全无携带乘法。该方法大大降低了传统乘法器在最后加法器阶段的功率和转换时间。在0.35 /spl mu/m CMOS中制作的原型表明,54 b/spl倍/54 b倍增器在3.3 V下74 MHz工作时仅消耗53.4 mW。
{"title":"A carry-free 54 b/spl times/54 b multiplier using equivalent bit conversion algorithm","authors":"Yun Kim, B. Song, J. Grosspietsch, S. Gillig","doi":"10.1109/APASIC.2000.896911","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896911","url":null,"abstract":"An equivalent bit conversion algorithm (EBCA) is proposed to eliminate the need for final carry propagation in redundant binary (RB) to normal binary (NB) conversion in the RB multiplier. When the EBCA is applied, conventional power-consuming carry-propagating adders are replaced by simple, minimum-sized converters, and complete carry-free multiplication from input to output is achieved. The proposed method significantly reduces the power and the conversion time in the final adder stage of traditional multipliers. The prototype fabricated in 0.35 /spl mu/m CMOS demonstrates that the 54 b/spl times/54 b multiplier consumes only 53.4 mW at 3.3 V for 74 MHz operation.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121420894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A differential type CMOS phase frequency detector 差分型CMOS相位频率检测器
R. Chang, Lung-Chih Kuo
We propose a new differential-type CMOS phase frequency detector for a PLL design. The circuit uses two D-FFs and two delay buffers. Besides, it adopts two reset functions R1 and R2 to avoid the UP and DN being logic-1 simultaneously. Thus, any mismatch current of the charge pump circuit will not affect the performance of the PLL. The detector can greatly reduce the dead-zone phenomenon in the phase characteristic, which is important in low-jitter applications. In order to detect the smallest phase error the new detector employs delay buffers to shift the phase error. The circuit is simulated by HSPICE with the 0.35 /spl mu/m CMOS technology.
我们提出了一种用于锁相环设计的新型差分型CMOS相频检测器。电路使用两个d - ff和两个延迟缓冲器。同时采用R1和R2两种复位功能,避免UP和DN同时为logic-1。因此,电荷泵电路的任何失配电流都不会影响锁相环的性能。该检测器可以大大减少相位特性中的死区现象,这在低抖动应用中具有重要意义。为了检测出最小的相位误差,该检测器采用延迟缓冲器来偏移相位误差。采用0.35 /spl mu/m CMOS技术,利用HSPICE对电路进行了仿真。
{"title":"A differential type CMOS phase frequency detector","authors":"R. Chang, Lung-Chih Kuo","doi":"10.1109/APASIC.2000.896908","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896908","url":null,"abstract":"We propose a new differential-type CMOS phase frequency detector for a PLL design. The circuit uses two D-FFs and two delay buffers. Besides, it adopts two reset functions R1 and R2 to avoid the UP and DN being logic-1 simultaneously. Thus, any mismatch current of the charge pump circuit will not affect the performance of the PLL. The detector can greatly reduce the dead-zone phenomenon in the phase characteristic, which is important in low-jitter applications. In order to detect the smallest phase error the new detector employs delay buffers to shift the phase error. The circuit is simulated by HSPICE with the 0.35 /spl mu/m CMOS technology.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"228 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131419145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Analysis and implementation of interface for heterogeneous system 异构系统接口的分析与实现
Hwisung Jung, M. Lee
We designed asynchronous event logic library with 0.25 /spl mu/m CMOS technology and high-speed asynchronous FIFO operating at 1.6 GHz. Optimized asynchronous standard cell layouts and Verilog models are designed for top-down design methodology. A method for mitigating a design bottleneck when it comes to tolerate clock skew is described. This communication scheme using clock control circuits, which is used for freeing of synchronization failures, is analyzed and implemented. With clock control circuit and FIFO, we implemented high-speed 32 bit-interface chip for heterogeneous system. The size of the core is about 1.1 mm/spl times/1.1 mm.
采用0.25 /spl mu/m的CMOS技术和1.6 GHz的高速异步FIFO技术设计了异步事件逻辑库。优化的异步标准单元布局和Verilog模型是为自上而下的设计方法而设计的。描述了一种减轻设计瓶颈的方法,当它涉及到容忍时钟倾斜时。分析并实现了利用时钟控制电路消除同步故障的通信方案。采用时钟控制电路和FIFO技术,实现了异构系统的高速32位接口芯片。铁芯尺寸约为1.1 mm/spl倍/1.1 mm。
{"title":"Analysis and implementation of interface for heterogeneous system","authors":"Hwisung Jung, M. Lee","doi":"10.1109/APASIC.2000.896930","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896930","url":null,"abstract":"We designed asynchronous event logic library with 0.25 /spl mu/m CMOS technology and high-speed asynchronous FIFO operating at 1.6 GHz. Optimized asynchronous standard cell layouts and Verilog models are designed for top-down design methodology. A method for mitigating a design bottleneck when it comes to tolerate clock skew is described. This communication scheme using clock control circuits, which is used for freeing of synchronization failures, is analyzed and implemented. With clock control circuit and FIFO, we implemented high-speed 32 bit-interface chip for heterogeneous system. The size of the core is about 1.1 mm/spl times/1.1 mm.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130025812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A QPSK/16 QAM receiver chip for LMDS application 一种用于LMDS的QPSK/16 QAM接收芯片
K. Park, D. Shin, Jun S. Lee, M. Sunwoo
This paper presents a QPSK/16 QAM down stream receiver ASIC chip for LMDS (Local Multipoint Distribution Services) applications. The proposed LMDS chip consists of a blind equalizer, a timing recovery block and a carrier recovery block. The blind equalizer uses a DFE (Decision Feedback Equalizer) structure using CMA (Constant Modulus Algorithm). The symbol timing recovery uses the modified parabolic interpolator and the decision-directed carrier recovery is used to remove the carrier frequency offset, phase offset and phase jitter. The implemented LMDS receiver can support four data rates, 10, 20, 30 and 40 Mbps and can accommodate the symbol rate up to 10 Mbaud. This symbol rate is faster than existing QAM receivers.
提出了一种适用于LMDS (Local Multipoint Distribution Services)应用的QPSK/16 QAM下行接收ASIC芯片。该LMDS芯片由盲均衡器、定时恢复块和载波恢复块组成。盲均衡器采用恒模算法的DFE(决策反馈均衡器)结构。符号时序恢复采用改进的抛物线插值器,决策导向载波恢复用于消除载波频率偏移、相位偏移和相位抖动。所实现的LMDS接收器可支持10、20、30和40 Mbps四种数据速率,并可容纳高达10 Mbaud的符号速率。这种符号速率比现有的QAM接收器快。
{"title":"A QPSK/16 QAM receiver chip for LMDS application","authors":"K. Park, D. Shin, Jun S. Lee, M. Sunwoo","doi":"10.1109/APASIC.2000.896945","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896945","url":null,"abstract":"This paper presents a QPSK/16 QAM down stream receiver ASIC chip for LMDS (Local Multipoint Distribution Services) applications. The proposed LMDS chip consists of a blind equalizer, a timing recovery block and a carrier recovery block. The blind equalizer uses a DFE (Decision Feedback Equalizer) structure using CMA (Constant Modulus Algorithm). The symbol timing recovery uses the modified parabolic interpolator and the decision-directed carrier recovery is used to remove the carrier frequency offset, phase offset and phase jitter. The implemented LMDS receiver can support four data rates, 10, 20, 30 and 40 Mbps and can accommodate the symbol rate up to 10 Mbaud. This symbol rate is faster than existing QAM receivers.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130167541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dynamic TLB management structure to support different page sizes 支持不同页面大小的动态TLB管理结构
Jung-Hoon Lee, Jang-Soo Lee, Shin-Dug Kim
Translation look-aside buffers (TLBs) are small caches to speed-up address translation in processors with virtual memory. There are many methods for improving TLB performance, such as increasing the number of entries in TLB and supporting large page or multiple page sizes. The best way is to support multiple page sizes, but any operating system doesn't support multiple page sizes in user mode. Also software must select a proper page-size assignment policy to take advantage of the larger pages. So, we propose a new structure of TLB supporting two pages to obtain the effect of multiple page sizes with high performance and at low cost without operating system support. According to result of comparison and analysis, the proposed method with fewer entries results in similar performance compared with the conventional TLB with many entries. Also in the case of same area size, it is shown that miss ratio of the proposed TLB can be reduced by as much as 90% comparing with conventional fully-associative TLB.
转换暂置缓冲区(tlb)是一种小型缓存,用于在具有虚拟内存的处理器中加速地址转换。有许多方法可以提高TLB性能,例如增加TLB中的条目数量,支持大页面或多页面大小。最好的方法是支持多种页面大小,但并不是所有操作系统都支持用户模式下的多种页面大小。此外,软件必须选择适当的页面大小分配策略,以利用较大的页面。因此,我们提出了一种新的支持两个页面的TLB结构,以在没有操作系统支持的情况下获得高性能和低成本的多页面大小效果。对比分析结果表明,该方法与传统的多条目TLB方法相比,具有较少条目的性能相当。此外,在相同面积的情况下,与传统的全关联TLB相比,所提出的TLB的缺失率可降低高达90%。
{"title":"A dynamic TLB management structure to support different page sizes","authors":"Jung-Hoon Lee, Jang-Soo Lee, Shin-Dug Kim","doi":"10.1109/APASIC.2000.896967","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896967","url":null,"abstract":"Translation look-aside buffers (TLBs) are small caches to speed-up address translation in processors with virtual memory. There are many methods for improving TLB performance, such as increasing the number of entries in TLB and supporting large page or multiple page sizes. The best way is to support multiple page sizes, but any operating system doesn't support multiple page sizes in user mode. Also software must select a proper page-size assignment policy to take advantage of the larger pages. So, we propose a new structure of TLB supporting two pages to obtain the effect of multiple page sizes with high performance and at low cost without operating system support. According to result of comparison and analysis, the proposed method with fewer entries results in similar performance compared with the conventional TLB with many entries. Also in the case of same area size, it is shown that miss ratio of the proposed TLB can be reduced by as much as 90% comparing with conventional fully-associative TLB.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131086444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The design and implementation of CalmlRISC32 floating point unit CalmlRISC32浮点单元的设计与实现
C. Jeong, W. Park, Sang-Woo Kim, T. Han
The CalmRISC32 FPU is RISC style coprocessor for embedded systems. It supports IEEE-754 standard single precision addition/subtraction, floating-point multiplication, floating-point division, format conversion, comparison, rounding, and load/store. It also supports four rounding modes, and precise exception. It can execute and complete instructions out of order if some constraint is resolved-data dependency, resource conflict, and exception prediction. Standard cell base design technique is used to save design time and cost. First prototype is running at about 70 Mhz with worst-case delay in gate level simulation.
CalmRISC32 FPU是嵌入式系统的RISC风格协处理器。它支持IEEE-754标准的单精度加减、浮点乘法、浮点除法、格式转换、比较、舍入和加载/存储。它还支持四种舍入模式和精确异常。如果解决了某些约束(数据依赖、资源冲突和异常预测),它可以乱序执行和完成指令。采用标准单元基设计技术,节省了设计时间和成本。第一个原型在门电平模拟中以最坏情况延迟运行在约70 Mhz。
{"title":"The design and implementation of CalmlRISC32 floating point unit","authors":"C. Jeong, W. Park, Sang-Woo Kim, T. Han","doi":"10.1109/APASIC.2000.896974","DOIUrl":"https://doi.org/10.1109/APASIC.2000.896974","url":null,"abstract":"The CalmRISC32 FPU is RISC style coprocessor for embedded systems. It supports IEEE-754 standard single precision addition/subtraction, floating-point multiplication, floating-point division, format conversion, comparison, rounding, and load/store. It also supports four rounding modes, and precise exception. It can execute and complete instructions out of order if some constraint is resolved-data dependency, resource conflict, and exception prediction. Standard cell base design technique is used to save design time and cost. First prototype is running at about 70 Mhz with worst-case delay in gate level simulation.","PeriodicalId":313978,"journal":{"name":"Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128975400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Proceedings of Second IEEE Asia Pacific Conference on ASICs. AP-ASIC 2000 (Cat. No.00EX434)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1