首页 > 最新文献

2015 28th IEEE International System-on-Chip Conference (SOCC)最新文献

英文 中文
Reconfigurable hardware architecture of the spatial pooler for hierarchical temporal memory 分层时间存储器空间池的可重构硬件结构
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406930
Abdullah M. Zyarah, D. Kudithipudi
Self-learning hardware systems, with high-degree of plasticity, are critical in performing spatio-temporal tasks in next-generation computing systems. To this end, hierarchical temporal memory (HTM) offers time-based online-learning algorithms that store and recall temporal and spatial patterns. One of the key building blocks in HTM is the spatial pooler. In this paper, we propose a reconfigurable and scalable spatial pooler architecture that is ported onto a Xilinx Virtex-IV FPGA fabric. The concept of synthetic synapses is proposed for dynamic interconnections. The spatial pooler architecture is verified for two different datasets, MNIST and EU numberplate font, with ≈ 91% and ≈ 90% accuracy respectively. Moreover, the proposed hardware model offers speed up of 4817X over the software realization. These results indicate that the proposed architecture can serve as a core to build the HTM in hardware and eventually as a standalone self-learning hardware system.
具有高度可塑性的自学习硬件系统在下一代计算系统中执行时空任务至关重要。为此,分层时间记忆(HTM)提供了基于时间的在线学习算法,可以存储和回忆时间和空间模式。HTM中的关键构建块之一是空间池程序。在本文中,我们提出了一种可重构和可扩展的空间池架构,该架构移植到Xilinx Virtex-IV FPGA结构上。针对动态互连,提出了合成突触的概念。在MNIST和EU车牌字体两种不同的数据集上验证了空间池架构,准确率分别为≈91%和≈90%。此外,所提出的硬件模型比软件实现的速度提高了4817X。这些结果表明,所提出的体系结构可以作为在硬件中构建HTM的核心,并最终作为独立的自学习硬件系统。
{"title":"Reconfigurable hardware architecture of the spatial pooler for hierarchical temporal memory","authors":"Abdullah M. Zyarah, D. Kudithipudi","doi":"10.1109/SOCC.2015.7406930","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406930","url":null,"abstract":"Self-learning hardware systems, with high-degree of plasticity, are critical in performing spatio-temporal tasks in next-generation computing systems. To this end, hierarchical temporal memory (HTM) offers time-based online-learning algorithms that store and recall temporal and spatial patterns. One of the key building blocks in HTM is the spatial pooler. In this paper, we propose a reconfigurable and scalable spatial pooler architecture that is ported onto a Xilinx Virtex-IV FPGA fabric. The concept of synthetic synapses is proposed for dynamic interconnections. The spatial pooler architecture is verified for two different datasets, MNIST and EU numberplate font, with ≈ 91% and ≈ 90% accuracy respectively. Moreover, the proposed hardware model offers speed up of 4817X over the software realization. These results indicate that the proposed architecture can serve as a core to build the HTM in hardware and eventually as a standalone self-learning hardware system.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122110379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A process-variation-aware multi-scenario high-level synthesis algorithm for distributed-register architectures 面向分布式寄存器体系结构的进程变化感知多场景高级综合算法
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406898
Koki Igawa, Youhua Shi, M. Yanagisawa, N. Togawa
In order to tackle a process-variation problem, we can define several scenarios, each of which corresponds to a particular LSI behavior, such as a typical-case scenario and a worst-case scenario. By designing a single LSI chip which realizes multiple scenarios simultaneously, we can have a process-variation-tolerant LSI chip. In this paper, we propose a process-variation-aware low-latency and multi-scenario high-level synthesis algorithm targeting new distributed-register architectures, called HDR architectures. We assume two scenarios, a typical-case scenario and a worst-case scenario, and realize them onto a single chip. We first schedule/bind each of the scenarios independently. After that, we commonize the scheduling/binding results for the typical-case and worst-case scenarios and thus generate a commonized area-minimized floorplan result. Experimental results show that our algorithm reduces the latency of the typical-case scenario by up to 50% without increasing the latency of the worst-case scenario, compared with several existing methods.
为了解决工艺变化问题,我们可以定义几个场景,每个场景对应于一个特定的LSI行为,例如典型情况场景和最坏情况场景。通过设计一个同时实现多种场景的LSI芯片,我们可以得到一个耐工艺变化的LSI芯片。在本文中,我们提出了一种针对新的分布式寄存器体系结构(称为HDR体系结构)的进程变化感知的低延迟和多场景高级综合算法。我们假设了两个场景,一个典型的场景和一个最坏的场景,并将它们实现在一个芯片上。我们首先独立地调度/绑定每个场景。之后,我们将典型情况和最坏情况的调度/绑定结果统一起来,从而生成一个通用的面积最小化平面图结果。实验结果表明,与现有的几种方法相比,我们的算法在不增加最坏情况延迟的情况下,将典型情况的延迟减少了50%。
{"title":"A process-variation-aware multi-scenario high-level synthesis algorithm for distributed-register architectures","authors":"Koki Igawa, Youhua Shi, M. Yanagisawa, N. Togawa","doi":"10.1109/SOCC.2015.7406898","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406898","url":null,"abstract":"In order to tackle a process-variation problem, we can define several scenarios, each of which corresponds to a particular LSI behavior, such as a typical-case scenario and a worst-case scenario. By designing a single LSI chip which realizes multiple scenarios simultaneously, we can have a process-variation-tolerant LSI chip. In this paper, we propose a process-variation-aware low-latency and multi-scenario high-level synthesis algorithm targeting new distributed-register architectures, called HDR architectures. We assume two scenarios, a typical-case scenario and a worst-case scenario, and realize them onto a single chip. We first schedule/bind each of the scenarios independently. After that, we commonize the scheduling/binding results for the typical-case and worst-case scenarios and thus generate a commonized area-minimized floorplan result. Experimental results show that our algorithm reduces the latency of the typical-case scenario by up to 50% without increasing the latency of the worst-case scenario, compared with several existing methods.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130649097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
SESSION T2A: Tutorial: Advanced ESD protection design for CMOS circuits and systems T2A:教程:CMOS电路和系统的高级ESD保护设计
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406884
M. Ker
To reduce the weight of electronic products, to integrate more functions into the electronic products, as well as to reduce the power consumption of electronic products, the CMOS technology has been developed into nanometer scale to realize VLSI/SoC for electronic systems. With the transistors in the nano-scale dimension, the gate-oxide thickness of MOSFET is only 10~15Å for operating with sub-1V power supply. Such thinner gate oxide is very easily ruptured by electrostatic discharge (ESD) events, which frequently happen in our environments with the voltage level of hundreds or even thousands volts. The integrated circuits (ICs) are weaker to sustain such ESD stresses during the assembly, testing, package, and the applications. To verify the ESD reliability of IC products for safe applications, there are already some industry ESD test standards developed, such as Human Body Model (HBM) and Charged Device Model (CDM), to verify the ESD robustness of IC products. Besides, in the IEC 61000-4-2 standard, the electronic products are zapped by the ESD gun with ESD voltage of even up to 15kV in the air-discharge mode. How to design the on-chip ESD protection circuits to effectively protect the integrated circuits realized by the nano-scale CMOS devices is a quite difficult challenge to IC industry. The on-chip ESD protection circuit must be included in the beginning phase of chip design. In this Tutorial, a brief introduction on ESD issue and test standards to IC products is presented with some failure analysis pictures from real IC products to demonstrate the impact of ESD on IC products. The basic design concept for on-chip ESD protection circuit will be presented. Some useful ESD protection designs for high-speed I/O and RF circuits will be mentioned. To achieve the whole-chip ESD protection by using the active power-rail ESD clamp circuit will be emphasized. Additional consideration on the active power-rail ESD clamp circuit realized in the nanoscale CMOS processes will be especially addressed. After the component-level ESD protection, the system-level ESD protection design will be brief discussed. ESD protection for CMOS ICs is not only the process issue but also highly dependent to the design issue, which has been an important topic that the IC designers need to know.
为了减轻电子产品的重量,将更多的功能集成到电子产品中,以及降低电子产品的功耗,CMOS技术已经发展到纳米尺度,以实现电子系统的VLSI/SoC。采用纳米尺寸的晶体管,在低于1v的电源下工作时,MOSFET的栅极氧化层厚度仅为10~15Å。这种较薄的栅极氧化物很容易被静电放电(ESD)事件破坏,而静电放电事件在我们的环境中经常发生,电压水平在几百伏甚至几千伏。集成电路(ic)在组装、测试、封装和应用过程中承受这种ESD应力的能力较弱。为了验证IC产品在安全应用中的ESD可靠性,已经制定了一些行业ESD测试标准,如人体模型(HBM)和充电器件模型(CDM),以验证IC产品的ESD稳健性。此外,在IEC 61000-4-2标准中,电子产品在空气放电模式下受到ESD枪的电击,ESD电压甚至高达15kV。如何设计片上ESD保护电路,有效地保护由纳米级CMOS器件实现的集成电路,是集成电路行业面临的一个相当艰巨的挑战。片内ESD保护电路必须包含在芯片设计的开始阶段。在本教程中,简要介绍了ESD问题和IC产品的测试标准,并提供了一些实际IC产品的失效分析图片,以说明ESD对IC产品的影响。本文将介绍片上ESD保护电路的基本设计概念。本文将介绍一些适用于高速I/O和RF电路的ESD保护设计。着重介绍了利用有源电源轨ESD钳位电路实现全芯片ESD保护的方法。本文将特别讨论在纳米级CMOS工艺中实现的有源电源轨ESD钳位电路的附加考虑。元器件级ESD防护之后,将简要讨论系统级ESD防护设计。CMOS集成电路的ESD保护不仅是工艺问题,而且高度依赖于设计问题,这已经成为集成电路设计人员需要了解的重要课题。
{"title":"SESSION T2A: Tutorial: Advanced ESD protection design for CMOS circuits and systems","authors":"M. Ker","doi":"10.1109/SOCC.2015.7406884","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406884","url":null,"abstract":"To reduce the weight of electronic products, to integrate more functions into the electronic products, as well as to reduce the power consumption of electronic products, the CMOS technology has been developed into nanometer scale to realize VLSI/SoC for electronic systems. With the transistors in the nano-scale dimension, the gate-oxide thickness of MOSFET is only 10~15Å for operating with sub-1V power supply. Such thinner gate oxide is very easily ruptured by electrostatic discharge (ESD) events, which frequently happen in our environments with the voltage level of hundreds or even thousands volts. The integrated circuits (ICs) are weaker to sustain such ESD stresses during the assembly, testing, package, and the applications. To verify the ESD reliability of IC products for safe applications, there are already some industry ESD test standards developed, such as Human Body Model (HBM) and Charged Device Model (CDM), to verify the ESD robustness of IC products. Besides, in the IEC 61000-4-2 standard, the electronic products are zapped by the ESD gun with ESD voltage of even up to 15kV in the air-discharge mode. How to design the on-chip ESD protection circuits to effectively protect the integrated circuits realized by the nano-scale CMOS devices is a quite difficult challenge to IC industry. The on-chip ESD protection circuit must be included in the beginning phase of chip design. In this Tutorial, a brief introduction on ESD issue and test standards to IC products is presented with some failure analysis pictures from real IC products to demonstrate the impact of ESD on IC products. The basic design concept for on-chip ESD protection circuit will be presented. Some useful ESD protection designs for high-speed I/O and RF circuits will be mentioned. To achieve the whole-chip ESD protection by using the active power-rail ESD clamp circuit will be emphasized. Additional consideration on the active power-rail ESD clamp circuit realized in the nanoscale CMOS processes will be especially addressed. After the component-level ESD protection, the system-level ESD protection design will be brief discussed. ESD protection for CMOS ICs is not only the process issue but also highly dependent to the design issue, which has been an important topic that the IC designers need to know.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126264275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 12-bit 1.74-mW 20-MS/s DAC with resistor-string and current-steering hybrid architecture 一个12位1.74 mw 20 ms /s DAC,具有电阻串和电流转向混合架构
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406897
Bill Ma, Qinjin Huang, Fengqi Yu
This paper presents a novel segmented hybrid digital-to-analog converter (DAC). It uses a resistor-string as the LSB part for low-power consumption, and uses a current-steering array as the MSB part for high-speed and small size. The LSB and MSB parts are combined by a slew-rate-enhanced class AB output amplifier for high speed. Compared to resistor string DACs, current steering DACs, or resistor-capacitor hybrid DACs, the proposed DAC shows a better tradeoff between power and speed at low power application demanding a sampling clock between 1 MHz and 100 MHz. The prototype is a 12-bit DAC implemented in 0.18-μm CMOS technology with the worst measured DNL/INL of 6.38 LSB / 7.55 LSB. The analogue part power consumption is 1.24 mW and the digital part 0.5mW at 1.35-V power supply at 20-MS/s sampling rate. Its output is single-end buffered voltage with a range of 500 mV. The core area is 0.16 mm2.
提出了一种新型分段混合数模转换器(DAC)。它采用电阻串作为低功耗的LSB部分,采用电流转向阵列作为高速小尺寸的MSB部分。LSB和MSB部分由一个旋转速率增强的AB类输出放大器组成,以实现高速。与电阻串DAC、电流转向DAC或电阻-电容混合DAC相比,所提出的DAC在要求采样时钟在1 MHz和100 MHz之间的低功耗应用中表现出更好的功率和速度权衡。该原型是采用0.18 μm CMOS技术实现的12位DAC,测量的最差DNL/INL为6.38 LSB / 7.55 LSB。模拟部分功耗为1.24 mW,数字部分功耗为0.5mW,电源为1.35 v,采样率为20 ms /s。其输出为单端缓冲电压,电压范围为500mv。核心面积为0.16 mm2。
{"title":"A 12-bit 1.74-mW 20-MS/s DAC with resistor-string and current-steering hybrid architecture","authors":"Bill Ma, Qinjin Huang, Fengqi Yu","doi":"10.1109/SOCC.2015.7406897","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406897","url":null,"abstract":"This paper presents a novel segmented hybrid digital-to-analog converter (DAC). It uses a resistor-string as the LSB part for low-power consumption, and uses a current-steering array as the MSB part for high-speed and small size. The LSB and MSB parts are combined by a slew-rate-enhanced class AB output amplifier for high speed. Compared to resistor string DACs, current steering DACs, or resistor-capacitor hybrid DACs, the proposed DAC shows a better tradeoff between power and speed at low power application demanding a sampling clock between 1 MHz and 100 MHz. The prototype is a 12-bit DAC implemented in 0.18-μm CMOS technology with the worst measured DNL/INL of 6.38 LSB / 7.55 LSB. The analogue part power consumption is 1.24 mW and the digital part 0.5mW at 1.35-V power supply at 20-MS/s sampling rate. Its output is single-end buffered voltage with a range of 500 mV. The core area is 0.16 mm2.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126506207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Novel ECC structure and evaluation method for NAND flash memory NAND快闪记忆体的新型ECC结构与评估方法
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406921
Jiang Xiao-bo, Tang Xue-qing, Huang Wei-pei
The evaluation of error correction code (ECC) for NAND flash memory is increasingly complicated by the increasing bit error rate in memory. The concept of error-free information capacity is proposed to evaluate the performance ECC of NAND flash memory. The new method simultaneously considers the capacity and reliability of NAND flash memory. Low-density parity-check (LDPC) codes with a medium code rate can improve the integrated performance of NAND flash memory in order of magnitudes. Observations provide guides for the development of ECC schemes in NAND flash memory in future. An ECC structure based on adaptive LDPC codes is also presented in this paper. The new structure achieves integrated performance of both capacity and reliability in NAND flash memory.
随着内存误码率的不断提高,NAND闪存的纠错码评估变得越来越复杂。提出了无差错信息容量的概念来评价NAND闪存的ECC性能。该方法同时考虑了NAND闪存的容量和可靠性。具有中等码率的低密度奇偶校验码(LDPC)可以将NAND闪存的集成性能提高数个数量级。观察结果为未来NAND闪存中ECC方案的发展提供了指导。本文还提出了一种基于自适应LDPC码的ECC结构。新结构实现了NAND闪存容量和可靠性的综合性能。
{"title":"Novel ECC structure and evaluation method for NAND flash memory","authors":"Jiang Xiao-bo, Tang Xue-qing, Huang Wei-pei","doi":"10.1109/SOCC.2015.7406921","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406921","url":null,"abstract":"The evaluation of error correction code (ECC) for NAND flash memory is increasingly complicated by the increasing bit error rate in memory. The concept of error-free information capacity is proposed to evaluate the performance ECC of NAND flash memory. The new method simultaneously considers the capacity and reliability of NAND flash memory. Low-density parity-check (LDPC) codes with a medium code rate can improve the integrated performance of NAND flash memory in order of magnitudes. Observations provide guides for the development of ECC schemes in NAND flash memory in future. An ECC structure based on adaptive LDPC codes is also presented in this paper. The new structure achieves integrated performance of both capacity and reliability in NAND flash memory.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126256404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A novel fault-tolerant router architecture for network-on-chip reconfiguration 一种基于片上网络重构的容错路由器结构
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406966
P. Yan, Shixiong Jiang, R. Sridhar
In Network-on-Chip (NoC) architectures, a faulty router can isolate a functional processing element (PE) from other nodes, severely restricting the performance of the system. This paper presents a fault-tolerant router architecture that can avoid PE isolation even if the router fails. In this design, we connect the local port of the router with one of the other four ports through a fault tolerant control unit that works with the fault detection signal. If the router fails, the control unit will turn on and connect the PE with the neighboring router directly, thus protecting the system functioning. A revised XY-Routing algorithm is also presented to achieve the NoC reconfiguration when router fails. Theoretical analysis show that for a 10*10 mesh NoC, the reliability at year 10 is 20 times better than NoC implemented with traditional router. Also our design has greatly improved value for mean time to failure (MTTF). In video and DSP applications, simulation results show better power control with faulty PEs. The new architecture also has other advantages over annealing re-mapping algorithm. Our design has only 12% area overhead, which is significantly better than other approaches to deal with faults.
在片上网络(NoC)架构中,出现故障的路由器可能会将功能处理单元(PE)与其他节点隔离开来,从而严重限制系统的性能。本文提出了一种容错路由器结构,即使路由器发生故障也能避免PE隔离。在本设计中,我们通过容错控制单元将路由器的本地端口与其他四个端口中的一个连接起来,容错控制单元与故障检测信号一起工作。当路由器发生故障时,控制单元开启,PE直接与相邻路由器连接,保护系统正常运行。提出了一种改进的XY-Routing算法,实现了路由器故障时NoC的重新配置。理论分析表明,对于10*10 mesh NoC, 10年的可靠性是传统路由器NoC的20倍。此外,我们的设计大大提高了平均无故障时间(MTTF)的价值。在视频和DSP应用中,仿真结果表明故障pe具有较好的功率控制效果。与退火重映射算法相比,新架构还具有其他优点。我们的设计只有12%的面积开销,这明显优于其他处理故障的方法。
{"title":"A novel fault-tolerant router architecture for network-on-chip reconfiguration","authors":"P. Yan, Shixiong Jiang, R. Sridhar","doi":"10.1109/SOCC.2015.7406966","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406966","url":null,"abstract":"In Network-on-Chip (NoC) architectures, a faulty router can isolate a functional processing element (PE) from other nodes, severely restricting the performance of the system. This paper presents a fault-tolerant router architecture that can avoid PE isolation even if the router fails. In this design, we connect the local port of the router with one of the other four ports through a fault tolerant control unit that works with the fault detection signal. If the router fails, the control unit will turn on and connect the PE with the neighboring router directly, thus protecting the system functioning. A revised XY-Routing algorithm is also presented to achieve the NoC reconfiguration when router fails. Theoretical analysis show that for a 10*10 mesh NoC, the reliability at year 10 is 20 times better than NoC implemented with traditional router. Also our design has greatly improved value for mean time to failure (MTTF). In video and DSP applications, simulation results show better power control with faulty PEs. The new architecture also has other advantages over annealing re-mapping algorithm. Our design has only 12% area overhead, which is significantly better than other approaches to deal with faults.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123836858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
The evolutionary spintronic technologies and their usage in high performance computing 进化自旋电子技术及其在高性能计算中的应用
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406981
Hai Helen Li, Xiuyuan Bi, Zhenyu Sun
This paper gives a comprehensive summary of our study in using the spintronic technologies for the on-chip cache density improvement of high performance computing systems. We will start with the spin-transfer torque random access memory (STT-RAM) at the early of stage of commercialization and then extend it to the emerging racetrack memory that has been successfully demonstrated at device and small array level. In multi-level cell (MLC) STT-RAM cache, the cell design constrains, e.g., the switching current requirement and asymmetry in write operations, severely limit the density benefit. Moreover, the two-step read/write accesses and inflexible data mapping strategy may even result in system performance degradation. This paper will discuss our circuit and architecture combined solution. Advanced spintronic technology, i.e., racetrack memory, enables an extremely high storage density and offers a faster-than-Moores law scaling path. Unorthodox new memory hierarchies are necessary to minimize the impact of pseudo-sequential accesses of racetrack memory.
本文对自旋电子技术在提高高性能计算系统片上缓存密度方面的研究进行了综述。我们将从商业化早期阶段的自旋转移扭矩随机存取存储器(STT-RAM)开始,然后将其扩展到已经在设备和小阵列级别成功演示的新兴赛道存储器。在多层单元(MLC) STT-RAM缓存中,单元设计的限制,例如开关电流要求和写操作中的不对称,严重限制了密度效益。此外,两步读/写访问和不灵活的数据映射策略甚至可能导致系统性能下降。本文将讨论我们的电路和架构相结合的解决方案。先进的自旋电子技术,即赛道存储器,可以实现极高的存储密度,并提供比摩尔定律更快的缩放路径。非正统的新内存层次结构是必要的,以尽量减少伪顺序访问赛道内存的影响。
{"title":"The evolutionary spintronic technologies and their usage in high performance computing","authors":"Hai Helen Li, Xiuyuan Bi, Zhenyu Sun","doi":"10.1109/SOCC.2015.7406981","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406981","url":null,"abstract":"This paper gives a comprehensive summary of our study in using the spintronic technologies for the on-chip cache density improvement of high performance computing systems. We will start with the spin-transfer torque random access memory (STT-RAM) at the early of stage of commercialization and then extend it to the emerging racetrack memory that has been successfully demonstrated at device and small array level. In multi-level cell (MLC) STT-RAM cache, the cell design constrains, e.g., the switching current requirement and asymmetry in write operations, severely limit the density benefit. Moreover, the two-step read/write accesses and inflexible data mapping strategy may even result in system performance degradation. This paper will discuss our circuit and architecture combined solution. Advanced spintronic technology, i.e., racetrack memory, enables an extremely high storage density and offers a faster-than-Moores law scaling path. Unorthodox new memory hierarchies are necessary to minimize the impact of pseudo-sequential accesses of racetrack memory.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"171 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134414134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low-latency power-efficient adaptive router design for network-on-chip 面向片上网络的低延迟节能自适应路由器设计
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406965
N. Nasirian, M. Bayoumi
Network-on-chip (NOC) technology has offered an efficient solution for scalability problem. Following the advent of the NOC technology, decreasing the static power consumption has been at the focus of research and development. Since the routers are the most important and main power-consuming modules of NOC, most of the contributions are related to improvement in router micro-architecture design [1], [2]. Power-gating is currently an effective solution in this area but it causes overhead in terms of delays and in some cases it deteriorates the performance. In this paper, a power efficient design for the network-on-chip (NOC) routers using adaptive routing has been proposed. The Proposed scheme is directing the traffic in a power-gated network with respect to the routers status. In this way, we avoid to turn on the routers, that are in sleep state by alternating the paths. Our simulation has shown that we can achieve near 80% reduction in static power consumption compared to non-power-gated design and we improved the average delay by 35% in comparison with conventional power-gated design.
片上网络(NOC)技术为可扩展性问题提供了一种有效的解决方案。随着NOC技术的出现,降低静态功耗一直是研究和开发的重点。由于路由器是NOC中最重要和最主要的功耗模块,因此大部分贡献都与路由器微架构设计的改进有关[1],[2]。功率门控目前是该领域的一种有效解决方案,但它会在延迟方面造成开销,并且在某些情况下会降低性能。本文提出了一种基于自适应路由的片上网络(NOC)路由器的节能设计方案。所提出的方案是根据路由器的状态对电源门控网络中的流量进行定向。这样,通过路径的交替,避免了开启处于休眠状态的路由器。我们的仿真表明,与非功率门控设计相比,我们可以实现近80%的静态功耗降低,并且与传统的功率门控设计相比,我们将平均延迟提高了35%。
{"title":"Low-latency power-efficient adaptive router design for network-on-chip","authors":"N. Nasirian, M. Bayoumi","doi":"10.1109/SOCC.2015.7406965","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406965","url":null,"abstract":"Network-on-chip (NOC) technology has offered an efficient solution for scalability problem. Following the advent of the NOC technology, decreasing the static power consumption has been at the focus of research and development. Since the routers are the most important and main power-consuming modules of NOC, most of the contributions are related to improvement in router micro-architecture design [1], [2]. Power-gating is currently an effective solution in this area but it causes overhead in terms of delays and in some cases it deteriorates the performance. In this paper, a power efficient design for the network-on-chip (NOC) routers using adaptive routing has been proposed. The Proposed scheme is directing the traffic in a power-gated network with respect to the routers status. In this way, we avoid to turn on the routers, that are in sleep state by alternating the paths. Our simulation has shown that we can achieve near 80% reduction in static power consumption compared to non-power-gated design and we improved the average delay by 35% in comparison with conventional power-gated design.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"96 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114653182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Cascoded flipped voltage follower based output-capacitorless low-dropout regulator for SoCs 基于级联编码翻转电压从动器的soc无输出电容低差稳压器
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406985
Guangxiang Li, Jianping Guo, Yanqi Zheng, Mo Huang, Dihu Chen
A novel cascoded flipped voltage follower (CAFVF) based output-capacitorless low-dropout (LDO) regulator is proposed and implemented in 0.18-μm CMOS technology. With a cascode current source (CCS) embedded into the CAFVF structure, the proposed LDO regulator achieves 58.6-dB DC gain in heavy loading condition (100 mA), which is 44-dB for the conventional CAFVF counterpart under identical conditions. The cascode compensation technique is introduced to widen the loop bandwidth and reduce the minimal loading requirement. With a 5-pF compensation capacitor, the minimum load current to keep the proposed LDO regulator stable is reduced to 50 μA. In addition, the unity-gain frequency (UGF) is extended from 1.51 MHz to 2.36 MHz in 100-mA loading condition. Moreover, an accurate stability analysis without ignoring any channel resistance has been presented in this work. Simulation results show that the LDO regulator consumes an ultra-low quiescent current (Iq) of 14 μA for input voltage ranging from 1.2 V to 1.8 V, with a dropout voltage (Vdrop) of 200 mV.
提出了一种新型的基于级联编码翻转电压从动器(CAFVF)的无输出电容低差(LDO)稳压器,并在0.18 μm CMOS工艺上实现。通过将级联码电流源(CCS)嵌入到CAFVF结构中,所提出的LDO稳压器在重负载条件下(100 mA)可实现58.6 db DC增益,而在相同条件下,传统CAFVF对应器件的增益为44 db。引入级联补偿技术,拓宽环路带宽,降低最小负载要求。采用5-pF补偿电容,保持LDO稳压器稳定的最小负载电流降至50 μA。此外,在100 ma负载条件下,单位增益频率(UGF)从1.51 MHz扩展到2.36 MHz。此外,在不忽略任何通道阻力的情况下,本文给出了精确的稳定性分析。仿真结果表明,LDO稳压器在1.2 V ~ 1.8 V的输入电压范围内消耗14 μA的超低静态电流(Iq),电压降为200 mV。
{"title":"Cascoded flipped voltage follower based output-capacitorless low-dropout regulator for SoCs","authors":"Guangxiang Li, Jianping Guo, Yanqi Zheng, Mo Huang, Dihu Chen","doi":"10.1109/SOCC.2015.7406985","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406985","url":null,"abstract":"A novel cascoded flipped voltage follower (CAFVF) based output-capacitorless low-dropout (LDO) regulator is proposed and implemented in 0.18-μm CMOS technology. With a cascode current source (CCS) embedded into the CAFVF structure, the proposed LDO regulator achieves 58.6-dB DC gain in heavy loading condition (100 mA), which is 44-dB for the conventional CAFVF counterpart under identical conditions. The cascode compensation technique is introduced to widen the loop bandwidth and reduce the minimal loading requirement. With a 5-pF compensation capacitor, the minimum load current to keep the proposed LDO regulator stable is reduced to 50 μA. In addition, the unity-gain frequency (UGF) is extended from 1.51 MHz to 2.36 MHz in 100-mA loading condition. Moreover, an accurate stability analysis without ignoring any channel resistance has been presented in this work. Simulation results show that the LDO regulator consumes an ultra-low quiescent current (Iq) of 14 μA for input voltage ranging from 1.2 V to 1.8 V, with a dropout voltage (Vdrop) of 200 mV.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"197 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121080722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
ESD protection design with stacked low-voltage devices for high-voltage pins of battery-monitoring IC 电池监控IC高压引脚堆叠低压器件ESD防护设计
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406987
Chia-Tsen Dai, M. Ker
For high-voltage (HV) application, an on-chip ESD protection solution has been proposed in a 0.25-μm HV BCD process by using low-voltage (LV) p-type devices with the stacked configuration. Experimental results in silicon chip have verified that the proposed design can successfully protect the 60-V pins of a battery-monitoring IC against over 8-kV human-body-mode (HBM) ESD stress.
针对高压应用,提出了一种基于0.25 μ HV BCD工艺的片上ESD保护解决方案,该方案采用了具有堆叠结构的低压p型器件。在硅芯片上的实验结果验证了所提出的设计可以成功地保护电池监测IC的60v引脚免受超过8kv人体模式(HBM) ESD应力的影响。
{"title":"ESD protection design with stacked low-voltage devices for high-voltage pins of battery-monitoring IC","authors":"Chia-Tsen Dai, M. Ker","doi":"10.1109/SOCC.2015.7406987","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406987","url":null,"abstract":"For high-voltage (HV) application, an on-chip ESD protection solution has been proposed in a 0.25-μm HV BCD process by using low-voltage (LV) p-type devices with the stacked configuration. Experimental results in silicon chip have verified that the proposed design can successfully protect the 60-V pins of a battery-monitoring IC against over 8-kV human-body-mode (HBM) ESD stress.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124546233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2015 28th IEEE International System-on-Chip Conference (SOCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1