首页 > 最新文献

2018 IEEE International Solid - State Circuits Conference - (ISSCC)最新文献

英文 中文
A 14.5mm2 8nW −59.7dBm-sensitivity ultrasonic wake-up receiver for power-, area-, and interference-constrained applications 14.5mm2 8nW−59.7 dbm灵敏度超声唤醒接收器,适用于功率、面积和干扰受限的应用
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310380
Angad S. Rekhi, A. Arbabian
The next generation of the Internet of Things is envisioned to include unobtrusive, distributed mm-sized nodes capable of sensing and communicating information about their surroundings. Wake-up receivers (WuRXs) — ultra-low-power receivers that monitor their environment for a wake-up signature — are an important part of this vision, as they can extend the lifetime of a wireless node by keeping it asleep until interrogated. The state of the art in WuRXs, measured in terms of sensitivity and power, has recently been advanced by streamlining the signal path to include fewer power-hungry gain stages and instead obtaining the gain at the chip-antenna interface [1,2]. This has led to excellent power-sensitivity performance, but the accompanying increase in size hampers the applicability of these techniques in size-conscious applications, such as surveillance, asset tracking, and ubiquitous sensing. Moreover, size-reduction methods based on RF antenna miniaturization are fundamentally limited by high antenna Qs at low size-to-wavelength ratios [3]; efficiently matching to these antennas requires, in turn, high-Q passives that are generally unavailable at mm-scale.
下一代物联网被设想为包括不引人注目的、分布式的毫米大小的节点,这些节点能够感知和交流周围环境的信息。唤醒接收器(WuRXs)——超低功耗接收器,监测其环境的唤醒信号——是这一愿景的重要组成部分,因为它们可以通过保持无线节点休眠直到被询问来延长其寿命。在灵敏度和功率方面,最近通过简化信号路径以包含更少的功耗增益级,而不是在芯片天线接口获得增益,从而提高了wurx的技术水平[1,2]。这带来了出色的功率灵敏度性能,但随之而来的尺寸增加阻碍了这些技术在尺寸敏感应用中的适用性,例如监视、资产跟踪和无处不在的传感。此外,基于射频天线小型化的尺寸减小方法从根本上受到低尺寸与波长比[3]下的高天线q的限制;有效地匹配这些天线需要高q无源,而高q无源通常在毫米尺度上是不可用的。
{"title":"A 14.5mm2 8nW −59.7dBm-sensitivity ultrasonic wake-up receiver for power-, area-, and interference-constrained applications","authors":"Angad S. Rekhi, A. Arbabian","doi":"10.1109/ISSCC.2018.8310380","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310380","url":null,"abstract":"The next generation of the Internet of Things is envisioned to include unobtrusive, distributed mm-sized nodes capable of sensing and communicating information about their surroundings. Wake-up receivers (WuRXs) — ultra-low-power receivers that monitor their environment for a wake-up signature — are an important part of this vision, as they can extend the lifetime of a wireless node by keeping it asleep until interrogated. The state of the art in WuRXs, measured in terms of sensitivity and power, has recently been advanced by streamlining the signal path to include fewer power-hungry gain stages and instead obtaining the gain at the chip-antenna interface [1,2]. This has led to excellent power-sensitivity performance, but the accompanying increase in size hampers the applicability of these techniques in size-conscious applications, such as surveillance, asset tracking, and ubiquitous sensing. Moreover, size-reduction methods based on RF antenna miniaturization are fundamentally limited by high antenna Qs at low size-to-wavelength ratios [3]; efficiently matching to these antennas requires, in turn, high-Q passives that are generally unavailable at mm-scale.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"44 1","pages":"454-456"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84586568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
SkyLake-SP: A 14nm 28-Core xeon® processor SkyLake-SP: 14纳米28核xeon®处理器
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310170
S. Tam, H. Muljono, Min Huang, Sitaraman V. Iyer, Kalapi Royneogi, Nagmohan Satti, Rizwan Qureshi, Wei Chen, Tom Wang, Hubert Hsieh, S. Vora, Eddie Wang
SkyLake-SP (Scalable Performance), code name SKX, is the next generation Xeon® server processor fabricated on the Intel® 14nm tri-gate CMOS technology with 11-metal layers [1,2]. The SKX processor family has three core-count configurations. Each SKX core is accompanied by 1MB of dedicated L2 (2nd level cache) and 1.375MB of non-exclusive L3 (3rd level cache). At its maximum configuration of 28 cores, the SKX processor supports 6 DDR4 channels (2666MT/s), 3×20-lanes UPI processor-to-processor links (10.4GT/s) and x48+4 PCIE links (8GT/s). SKX supports per-core power-performance optimization enabled by on-die integrated voltage regulators (FIVR) [3, 4]. A new 2-dimensional synchronous on-die MESH fabric interconnects all the on-die components. Fig. 2.1.1 shows the overall architecture of the SKX processor.
SkyLake-SP (Scalable Performance),代号SKX,是采用Intel®14nm三栅极CMOS技术制造的下一代至强®服务器处理器,具有11金属层[1,2]。SKX处理器家族有三种内核计数配置。每个SKX核心都伴随着1MB的专用L2(第二级缓存)和1.375MB的非排他L3(第三级缓存)。SKX处理器的最大配置为28核,支持6个DDR4通道(2666MT/s), 3×20-lanes UPI处理器到处理器链路(10.4GT/s)和x48+4 PCIE链路(8GT/s)。SKX支持芯片上集成电压调节器(FIVR)实现的每核功率性能优化[3,4]。一种新的二维同步模上MESH结构将所有模上组件互连起来。图2.1.1显示了SKX处理器的整体架构。
{"title":"SkyLake-SP: A 14nm 28-Core xeon® processor","authors":"S. Tam, H. Muljono, Min Huang, Sitaraman V. Iyer, Kalapi Royneogi, Nagmohan Satti, Rizwan Qureshi, Wei Chen, Tom Wang, Hubert Hsieh, S. Vora, Eddie Wang","doi":"10.1109/ISSCC.2018.8310170","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310170","url":null,"abstract":"SkyLake-SP (Scalable Performance), code name SKX, is the next generation Xeon® server processor fabricated on the Intel® 14nm tri-gate CMOS technology with 11-metal layers [1,2]. The SKX processor family has three core-count configurations. Each SKX core is accompanied by 1MB of dedicated L2 (2nd level cache) and 1.375MB of non-exclusive L3 (3rd level cache). At its maximum configuration of 28 cores, the SKX processor supports 6 DDR4 channels (2666MT/s), 3×20-lanes UPI processor-to-processor links (10.4GT/s) and x48+4 PCIE links (8GT/s). SKX supports per-core power-performance optimization enabled by on-die integrated voltage regulators (FIVR) [3, 4]. A new 2-dimensional synchronous on-die MESH fabric interconnects all the on-die components. Fig. 2.1.1 shows the overall architecture of the SKX processor.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"33 1","pages":"34-36"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84626774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
An inverse-class-F CMOS VCO with intrinsic-high-Q 1st- and 2nd-harmonic resonances for 1/f2-to-1/f3 phase-noise suppression achieving 196.2dBc/Hz FOM 一种反f类CMOS压控振荡器,具有高q的一、二谐共振,1/f2- 1/f3相位噪声抑制,实现196.2dBc/Hz FOM
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310340
Chee-Cheow Lim, Jun Yin, Pui-in Mak, H. Ramiah, R. Martins
Second-harmonic common-mode (CM) resonance has been explored for LC oscillators to improve their phase noise (PN) in the past. Its implementation evolves from an explicit design [1] that relies on an extra tail tank, to a recent implicit design [2], where the resonator itself offers a CM impedance peak at 2x the oscillation frequency (FLO): Explicit design (Fig. 23.5.1-upper): a high-Q tail tank (QTAIL) is desirable to raise its impedance |ZTAIL| at 2FLO and to prevent the loss of LTAIL from penalizing the PN in the 1/f2 region [3]. To compare with the theoretical limit (FOMMAX), the FOM in the 1/f2 PN region is plotted against LTAIL at different QTAIL. Closing the gap between FOMMAX and FOM imposes an excessive QTAIL of 20 or beyond, which can hardly be achieved and maintained over a wide tuning range.
为了改善LC振荡器的相位噪声(PN),过去一直在研究二阶谐波共模共振。它的实现从依赖于额外尾槽的显式设计[1]演变为最近的隐式设计[2],其中谐振器本身在振荡频率(FLO)的2倍处提供CM阻抗峰值:显式设计(图23.5.1-上):高q尾槽(QTAIL)需要在2FLO处提高其阻抗|ZTAIL,并防止lttail的损失损害1/f2区域的PN[3]。为了与理论极限(FOMMAX)进行比较,将1/f2 PN区域的FOM与不同QTAIL的LTAIL进行对比。缩小FOMMAX和FOM之间的差距会施加20或更高的过度QTAIL,这很难在宽调谐范围内实现和维持。
{"title":"An inverse-class-F CMOS VCO with intrinsic-high-Q 1st- and 2nd-harmonic resonances for 1/f2-to-1/f3 phase-noise suppression achieving 196.2dBc/Hz FOM","authors":"Chee-Cheow Lim, Jun Yin, Pui-in Mak, H. Ramiah, R. Martins","doi":"10.1109/ISSCC.2018.8310340","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310340","url":null,"abstract":"Second-harmonic common-mode (CM) resonance has been explored for LC oscillators to improve their phase noise (PN) in the past. Its implementation evolves from an explicit design [1] that relies on an extra tail tank, to a recent implicit design [2], where the resonator itself offers a CM impedance peak at 2x the oscillation frequency (F<inf>LO</inf>): Explicit design (Fig. 23.5.1-upper): a high-Q tail tank (Q<inf>TAIL</inf>) is desirable to raise its impedance |Z<inf>TAIL</inf>| at 2FLO and to prevent the loss of L<inf>TAIL</inf> from penalizing the PN in the 1/f<sup>2</sup> region [3]. To compare with the theoretical limit (FOM<inf>MAX</inf>), the FOM in the 1/f<sup>2</sup> PN region is plotted against L<inf>TAIL</inf> at different QTAIL. Closing the gap between FOM<inf>MAX</inf> and FOM imposes an excessive Q<inf>TAIL</inf> of 20 or beyond, which can hardly be achieved and maintained over a wide tuning range.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"8 1","pages":"374-376"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82080958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
A 7.4-to-14GHz PLL with 54fsrms jitter in 16nm FinFET for integrated RF-data-converter SoCs 用于集成rf数据转换器soc的16nm FinFET中具有54fsrms抖动的7.4至14ghz锁相环
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310342
D. Turker, Ade Bekele, P. Upadhyaya, B. Verbruggen, Ying Cao, Shaojun Ma, C. Erdmann, B. Farley, Y. Frans, Ken Chang
Direct-RF data converters [1,2] have seen increased adoption in remote-radio-head TX and RX, due to their unparalleled bandwidth and flexibility. However, since these converters need to directly synthesize and sample multi-GHz radio signals, the sampling clock must exhibit excellent phase-noise performance, to minimize self- and adjacent-channel mixing, and strong suppression of reference and harmonic spurs, to meet stringent out-of-band emissions and minimize aliased energy. Furthermore, a wide range of sampling frequencies is required for the flexibility to cover multiple bands. Due to these stringent requirements, typically, external PLLs are employed, adding to the BOM cost. This work presents techniques for a fully integrated 7.4-to-14GHz PLL in 16nm FinFET that has 54fsrms jitter to satisfy the low noise requirements of RF data converters.
由于其无与伦比的带宽和灵活性,直接射频数据转换器[1,2]在远程无线电头TX和RX中的应用越来越多。然而,由于这些转换器需要直接合成和采样多ghz无线电信号,采样时钟必须具有优异的相位噪声性能,以最大限度地减少自信道和邻接信道混频,以及对参考杂散和谐波杂散的强抑制,以满足严格的带外发射和最小化混叠能量。此外,为了灵活地覆盖多个频带,需要广泛的采样频率范围。由于这些严格的要求,通常采用外部锁相环,增加了BOM成本。这项工作提出了在16nm FinFET中完全集成7.4至14ghz锁相环的技术,该技术具有54fsrms的抖动,以满足RF数据转换器的低噪声要求。
{"title":"A 7.4-to-14GHz PLL with 54fsrms jitter in 16nm FinFET for integrated RF-data-converter SoCs","authors":"D. Turker, Ade Bekele, P. Upadhyaya, B. Verbruggen, Ying Cao, Shaojun Ma, C. Erdmann, B. Farley, Y. Frans, Ken Chang","doi":"10.1109/ISSCC.2018.8310342","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310342","url":null,"abstract":"Direct-RF data converters [1,2] have seen increased adoption in remote-radio-head TX and RX, due to their unparalleled bandwidth and flexibility. However, since these converters need to directly synthesize and sample multi-GHz radio signals, the sampling clock must exhibit excellent phase-noise performance, to minimize self- and adjacent-channel mixing, and strong suppression of reference and harmonic spurs, to meet stringent out-of-band emissions and minimize aliased energy. Furthermore, a wide range of sampling frequencies is required for the flexibility to cover multiple bands. Due to these stringent requirements, typically, external PLLs are employed, adding to the BOM cost. This work presents techniques for a fully integrated 7.4-to-14GHz PLL in 16nm FinFET that has 54fsrms jitter to satisfy the low noise requirements of RF data converters.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"6 1","pages":"378-380"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84120457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Droop mitigation using critical-path sensors and an on-chip distributed power supply estimation engine in the z14™ enterprise processor 在z14™企业处理器中使用关键路径传感器和片上分布式电源估计引擎来降低功耗
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310303
Christos Vezyrtzis, T. Strach, P. Chuang, P. Lobo, R. Rizzolo, Tobias Webel, Pawel Owczarczyk, A. Buyuktosunoglu, Ramon Bertran Monfort, D. Hui, Susan M. Eickhoff, M. Floyd, G. Salem, S. Carey, Stelios G. Tsapepas, P. Restle
Enterprise server processor designs, which operate at extreme high frequencies and power envelopes, depend critically on power supply noise mitigation techniques. With supply voltage scaling, very high current draws, and broad usage of clock gating, advanced solutions are needed for next-generation products to minimize droop mitigation response time, which can be defined as the latency from when a dangerous droop begins until a countermeasure is effective.
在极高频率和功率包络下工作的企业服务器处理器设计,严重依赖于电源噪声缓解技术。随着电源电压的缩放、非常高的电流消耗和时钟门控的广泛使用,下一代产品需要先进的解决方案来最大限度地减少电压下降的响应时间,这可以定义为从危险电压下降开始到对策有效的延迟时间。
{"title":"Droop mitigation using critical-path sensors and an on-chip distributed power supply estimation engine in the z14™ enterprise processor","authors":"Christos Vezyrtzis, T. Strach, P. Chuang, P. Lobo, R. Rizzolo, Tobias Webel, Pawel Owczarczyk, A. Buyuktosunoglu, Ramon Bertran Monfort, D. Hui, Susan M. Eickhoff, M. Floyd, G. Salem, S. Carey, Stelios G. Tsapepas, P. Restle","doi":"10.1109/ISSCC.2018.8310303","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310303","url":null,"abstract":"Enterprise server processor designs, which operate at extreme high frequencies and power envelopes, depend critically on power supply noise mitigation techniques. With supply voltage scaling, very high current draws, and broad usage of clock gating, advanced solutions are needed for next-generation products to minimize droop mitigation response time, which can be defined as the latency from when a dangerous droop begins until a countermeasure is effective.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"59 1","pages":"300-302"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83969957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A 20Gb/s transceiver with framed-pulsewidth modulation in 40nm CMOS 采用40nm CMOS的帧脉宽调制的20Gb/s收发器
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310288
Sejun Jeon, Woohyun Kwon, Taehun Yoon, Jong-Hyeok Yoon, Kyeongha Kwon, Jaehyeok Yang, Hyeon-Min Bae
Expanding signal bandwidths in high-speed links is increasing intersymbol interference (ISI), which necessitates the enhancement of spectral efficiency. Recently, various modulation schemes including pulse amplitude modulation (PAM) [1], pulsewidth modulation (PWM) [2], permutation modulation (PM) [3] and duo-binary signaling [4] have been investigated in high-speed wireline links to increase spectral efficiency. However, multi-level signaling schemes suffer from SNR reduction and tighter linearity requirements when compared to conventional NRZ signaling. In this work, a 20Gb/s serial link transceiver employing a framed-pulsewidth modulation (FPWM) scheme that overcomes the SNR degradation without linearity requirement is presented. The FPWM scheme encodes data at the location and the width of pulses in a frame spanning multiple UIs while maintaining a minimum pulsewidth equal to 1UI. The test-chip achieves a coding gain of 33%, which allows the total throughput of 20Gb/s while keeping the baud rate of 15Gb/s. The equalization core incorporates programmable 3-tap pre-emphasis at the transmitter and a continuous-time linear equalizer (CTLE) at the receiver, to compensate for channel insertion loss of up to 12dB at the baud frequency. The transceiver IC is implemented in 40nm CMOS and consumes 90.6mW from a 0.9V supply.
高速链路中信号带宽的扩大导致码间干扰(ISI)的增加,这就要求提高频谱效率。最近,各种调制方案,包括脉冲幅度调制(PAM)[1]、脉宽调制(PWM)[2]、排列调制(PM)[3]和双二进制信令[4]已经在高速有线链路中进行了研究,以提高频谱效率。然而,与传统的NRZ信号相比,多级信号方案遭受信噪比降低和更严格的线性要求。在这项工作中,提出了一种20Gb/s串行链路收发器,采用帧脉冲宽度调制(FPWM)方案,克服了信噪比下降而不需要线性要求。FPWM方案对跨多个ui的帧中的位置和脉冲宽度的数据进行编码,同时保持最小脉冲宽度等于1UI。测试芯片实现了33%的编码增益,使总吞吐量达到20Gb/s,同时保持15Gb/s的波特率。均衡核心在发送端集成了可编程的3分路预强调,在接收端集成了连续时间线性均衡器(CTLE),以补偿在波特频率下高达12dB的信道插入损耗。收发器IC在40nm CMOS中实现,从0.9V电源消耗90.6mW。
{"title":"A 20Gb/s transceiver with framed-pulsewidth modulation in 40nm CMOS","authors":"Sejun Jeon, Woohyun Kwon, Taehun Yoon, Jong-Hyeok Yoon, Kyeongha Kwon, Jaehyeok Yang, Hyeon-Min Bae","doi":"10.1109/ISSCC.2018.8310288","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310288","url":null,"abstract":"Expanding signal bandwidths in high-speed links is increasing intersymbol interference (ISI), which necessitates the enhancement of spectral efficiency. Recently, various modulation schemes including pulse amplitude modulation (PAM) [1], pulsewidth modulation (PWM) [2], permutation modulation (PM) [3] and duo-binary signaling [4] have been investigated in high-speed wireline links to increase spectral efficiency. However, multi-level signaling schemes suffer from SNR reduction and tighter linearity requirements when compared to conventional NRZ signaling. In this work, a 20Gb/s serial link transceiver employing a framed-pulsewidth modulation (FPWM) scheme that overcomes the SNR degradation without linearity requirement is presented. The FPWM scheme encodes data at the location and the width of pulses in a frame spanning multiple UIs while maintaining a minimum pulsewidth equal to 1UI. The test-chip achieves a coding gain of 33%, which allows the total throughput of 20Gb/s while keeping the baud rate of 15Gb/s. The equalization core incorporates programmable 3-tap pre-emphasis at the transmitter and a continuous-time linear equalizer (CTLE) at the receiver, to compensate for channel insertion loss of up to 12dB at the baud frequency. The transceiver IC is implemented in 40nm CMOS and consumes 90.6mW from a 0.9V supply.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"38 1","pages":"270-272"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82415516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
SC: Hardware approaches to machine learning and inference SC:机器学习和推理的硬件方法
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310415
D. Friedman
Advances in artificial intelligence are already changing how computing systems interact with users and interact with their environments, with further dramatic changes on the horizon. In this context, machine learning and inference operations have become a critically important computational workload, and the importance of this workload will continue to increase. Today, GPU-, CPU-, and FPGA-based engines dominate the compute landscape for learning and for inference, but the exploration of alternative, enhanced, or complementary compute capability in this problem space is an active and growing research area. In this short course, we will provide a framework for understanding some of the computational challenges in machine learning and inference and discuss emerging technical approaches aimed at meeting those challenges.
人工智能的进步已经改变了计算系统与用户以及与环境交互的方式,未来还会发生更大的变化。在这种背景下,机器学习和推理操作已经成为一个至关重要的计算工作负载,并且这个工作负载的重要性将继续增加。今天,基于GPU、CPU和fpga的引擎在学习和推理的计算领域占据主导地位,但在这个问题空间中探索替代、增强或互补的计算能力是一个活跃且不断发展的研究领域。在这个短期课程中,我们将提供一个框架来理解机器学习和推理中的一些计算挑战,并讨论旨在应对这些挑战的新兴技术方法。
{"title":"SC: Hardware approaches to machine learning and inference","authors":"D. Friedman","doi":"10.1109/ISSCC.2018.8310415","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310415","url":null,"abstract":"Advances in artificial intelligence are already changing how computing systems interact with users and interact with their environments, with further dramatic changes on the horizon. In this context, machine learning and inference operations have become a critically important computational workload, and the importance of this workload will continue to increase. Today, GPU-, CPU-, and FPGA-based engines dominate the compute landscape for learning and for inference, but the exploration of alternative, enhanced, or complementary compute capability in this problem space is an active and growing research area. In this short course, we will provide a framework for understanding some of the computational challenges in machine learning and inference and discuss emerging technical approaches aimed at meeting those challenges.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"19 1","pages":"533-534"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80987781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A phase-domain readout circuit for a CMOS-compatible thermal-conductivity-based carbon dioxide sensor 一种用于cmos兼容热导二氧化碳传感器的相域读出电路
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310319
Z. Cai, R. V. Veldhoven, H. Suy, G. D. Graaf, K. Makinwa, M. Pertijs
The measurement of carbon-dioxide (CO2) concentration is very important in home and building automation, e.g. to control ventilation in energy-efficient buildings. This application requires compact, low-cost sensors that can measure CO2 concentration with a resolution of <200 ppm over a 2500ppm range. Conventional optical (NDIR-based) CO2 sensors require components that are CMOS-incompatible, difficult to miniaturize and power-hungry [1]. Due to their CMOS compatibility, thermal-conductivity-based sensors are an attractive alternative [2,3]. They exploit the fact that the thermal conductivity (TC) of CO2 is lower than that of the other constituents of air, so that CO2 concentration can be indirectly measured via the heat loss of a hot wire to ambient. However, this approach requires the detection of very small changes in TC (0.25 ppm per ppm CO2 [3]).
二氧化碳(CO2)浓度的测量在家庭和建筑自动化中非常重要,例如在节能建筑中控制通风。这种应用需要紧凑,低成本的传感器,可以在2500ppm范围内以< 200ppm的分辨率测量二氧化碳浓度。传统的光学(基于ndir的)CO2传感器需要的组件与cmos不兼容,难以小型化并且耗电[1]。由于其CMOS兼容性,基于热导率的传感器是一个有吸引力的替代方案[2,3]。他们利用二氧化碳的热导率(TC)低于空气中其他成分的事实,因此可以通过热丝向环境的热损失间接测量二氧化碳浓度。然而,这种方法需要检测非常小的TC变化(每ppm CO2 0.25 ppm)[3]。
{"title":"A phase-domain readout circuit for a CMOS-compatible thermal-conductivity-based carbon dioxide sensor","authors":"Z. Cai, R. V. Veldhoven, H. Suy, G. D. Graaf, K. Makinwa, M. Pertijs","doi":"10.1109/ISSCC.2018.8310319","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310319","url":null,"abstract":"The measurement of carbon-dioxide (CO2) concentration is very important in home and building automation, e.g. to control ventilation in energy-efficient buildings. This application requires compact, low-cost sensors that can measure CO2 concentration with a resolution of <200 ppm over a 2500ppm range. Conventional optical (NDIR-based) CO2 sensors require components that are CMOS-incompatible, difficult to miniaturize and power-hungry [1]. Due to their CMOS compatibility, thermal-conductivity-based sensors are an attractive alternative [2,3]. They exploit the fact that the thermal conductivity (TC) of CO2 is lower than that of the other constituents of air, so that CO2 concentration can be indirectly measured via the heat loss of a hot wire to ambient. However, this approach requires the detection of very small changes in TC (0.25 ppm per ppm CO2 [3]).","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"59 1","pages":"332-334"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78717573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A recursive-memory brain-state classifier with 32-channel track-and-zoom Δ2 Σ ADCs and Charge-Balanced Programmable Waveform Neurostimulators 递归记忆脑状态分类器与32通道跟踪和缩放Δ2 Σ adc和电荷平衡可编程波形神经刺激器
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310301
Gerard O’Leary, Mohammad Reza Pazhouhandeh, Michael Chang, David M. Groppe, T. Valiante, N. Verma, R. Genov
The advancement of closed-loop neuromodulation for treating neurological disorders demands: (1) analog circuits monitoring the brain activity uninterruptedly even during neurostimulation, (2) energy-efficient high-efficacy processors for responsive, adaptive, personalized neurostimulation, and (3) safe neurostimulation paradigms with rich spatio-temporal stimuli for controlling the brain's complex dynamics. This paper presents an implantable neural interface processor (NURIP) that addresses these requirements — it performs brain state classification for reliable seizure prediction and contingent seizure abortion.
闭环神经调节治疗神经系统疾病的进展需要:(1)模拟电路在神经刺激过程中不间断地监测大脑活动;(2)高效节能的处理器,用于响应性、适应性、个性化的神经刺激;(3)具有丰富时空刺激的安全神经刺激范式,用于控制大脑的复杂动态。本文提出了一种植入式神经接口处理器(NURIP)来解决这些要求——它执行脑状态分类,以可靠地预测癫痫发作和偶然的癫痫流产。
{"title":"A recursive-memory brain-state classifier with 32-channel track-and-zoom Δ2 Σ ADCs and Charge-Balanced Programmable Waveform Neurostimulators","authors":"Gerard O’Leary, Mohammad Reza Pazhouhandeh, Michael Chang, David M. Groppe, T. Valiante, N. Verma, R. Genov","doi":"10.1109/ISSCC.2018.8310301","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310301","url":null,"abstract":"The advancement of closed-loop neuromodulation for treating neurological disorders demands: (1) analog circuits monitoring the brain activity uninterruptedly even during neurostimulation, (2) energy-efficient high-efficacy processors for responsive, adaptive, personalized neurostimulation, and (3) safe neurostimulation paradigms with rich spatio-temporal stimuli for controlling the brain's complex dynamics. This paper presents an implantable neural interface processor (NURIP) that addresses these requirements — it performs brain state classification for reliable seizure prediction and contingent seizure abortion.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"32 1","pages":"296-298"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91254292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
A 16Gb/s/pin 8Gb GDDR6 DRAM with bandwidth extension techniques for high-speed applications 16Gb/s/引脚8Gb GDDR6 DRAM,带宽扩展技术,适用于高速应用
Pub Date : 2018-02-01 DOI: 10.1109/ISSCC.2018.8310258
Kyu-Dong Hwang, Boram Kim, Sangyeon Byeon, Kyu-Young Kim, Daehan Kwon, Hyun-Bae Lee, Geun-Il Lee, Sang-Sic Yoon, Jin-Youp Cha, Soo-young Jang, Seung-Hun Lee, Yongsuk Joo, Gang-Sik Lee, Sung-Soo Xi, Soo-Bin Lim, Kyung-Ho Chu, Joohwan Cho, J. Chun, Jonghoon Oh, Jinkook Kim, S. Lee
Recently the demand for high-bandwidth graphic DRAM, for game consoles and graphic cards, has dramatically increased due to the development of virtual reality, artificial intelligence, deep learning, autonomous driving cars, etc. These applications require greater data transfer speeds than pervious devices, GDDR5 [1] and GDDR5X [2], which are limited to 12Gb/s/pin. This paper introduces an 8Gb GDDR6 operating at up to 16Gb/s/pin. To exceed the prior speed limit various bandwidth extension techniques are proposed. WCK is driven with a dividing scheme to overcome speed limitations and to reduce power consumption. In addition, a dual-band architecture with different types of nibble drivers is proposed in order to cover stability of CML-to-CMOS in all frequency regions; CML nibble is used for high-speed, while CMOS nibble is used for low-speed. A DC-split scheme is implemented for duty-cycle correction and skew compensation. The bandwidth of the high-frequency divider is extended by using a proposed mode-changed flip-flop. The receiver uses a loop-unrolled one-tap decision-feedback equalizer (DFE) designed to eliminate channel inter-symbol interference (ISI). A two-stage pre-amplifier is also used for bandwidth extension. The transmitter uses a 4:1 multiplexer using a half-rate sampler, where a 1UI pulse is unnecessary to minimize the full-rate operation. To secure on-chip signal transmission characteristic, the bandwidth limitation of transistor in a DRAM process is extended by adopting an on-chip feedback EQ filter.
最近,由于虚拟现实、人工智能、深度学习、自动驾驶汽车等的发展,游戏机和显卡对高带宽图形DRAM的需求急剧增加。这些应用需要比以前的设备GDDR5[1]和GDDR5X[2]更高的数据传输速度,这些设备限制在12Gb/s/pin。本文介绍了一种8Gb的GDDR6,其工作速率高达16Gb/s/pin。为了超越先前的速度限制,提出了各种带宽扩展技术。WCK是驱动与分割方案,以克服速度限制,并降低功耗。此外,为了覆盖CML-to-CMOS在所有频率区域的稳定性,提出了具有不同类型蚕食驱动器的双频结构;CML蚕食用于高速,CMOS蚕食用于低速。采用直流分路方案进行占空比校正和偏斜补偿。高频分频器的带宽通过使用一种提出的变模触发器得到扩展。接收机采用循环展开的单抽头决策反馈均衡器(DFE),旨在消除信道符号间干扰(ISI)。两级前置放大器也用于带宽扩展。发射器使用使用半速率采样器的4:1多路复用器,其中不需要1UI脉冲来最小化全速率操作。为了保证片上信号的传输特性,采用片上反馈EQ滤波器扩展了DRAM工艺中晶体管的带宽限制。
{"title":"A 16Gb/s/pin 8Gb GDDR6 DRAM with bandwidth extension techniques for high-speed applications","authors":"Kyu-Dong Hwang, Boram Kim, Sangyeon Byeon, Kyu-Young Kim, Daehan Kwon, Hyun-Bae Lee, Geun-Il Lee, Sang-Sic Yoon, Jin-Youp Cha, Soo-young Jang, Seung-Hun Lee, Yongsuk Joo, Gang-Sik Lee, Sung-Soo Xi, Soo-Bin Lim, Kyung-Ho Chu, Joohwan Cho, J. Chun, Jonghoon Oh, Jinkook Kim, S. Lee","doi":"10.1109/ISSCC.2018.8310258","DOIUrl":"https://doi.org/10.1109/ISSCC.2018.8310258","url":null,"abstract":"Recently the demand for high-bandwidth graphic DRAM, for game consoles and graphic cards, has dramatically increased due to the development of virtual reality, artificial intelligence, deep learning, autonomous driving cars, etc. These applications require greater data transfer speeds than pervious devices, GDDR5 [1] and GDDR5X [2], which are limited to 12Gb/s/pin. This paper introduces an 8Gb GDDR6 operating at up to 16Gb/s/pin. To exceed the prior speed limit various bandwidth extension techniques are proposed. WCK is driven with a dividing scheme to overcome speed limitations and to reduce power consumption. In addition, a dual-band architecture with different types of nibble drivers is proposed in order to cover stability of CML-to-CMOS in all frequency regions; CML nibble is used for high-speed, while CMOS nibble is used for low-speed. A DC-split scheme is implemented for duty-cycle correction and skew compensation. The bandwidth of the high-frequency divider is extended by using a proposed mode-changed flip-flop. The receiver uses a loop-unrolled one-tap decision-feedback equalizer (DFE) designed to eliminate channel inter-symbol interference (ISI). A two-stage pre-amplifier is also used for bandwidth extension. The transmitter uses a 4:1 multiplexer using a half-rate sampler, where a 1UI pulse is unnecessary to minimize the full-rate operation. To secure on-chip signal transmission characteristic, the bandwidth limitation of transistor in a DRAM process is extended by adopting an on-chip feedback EQ filter.","PeriodicalId":6617,"journal":{"name":"2018 IEEE International Solid - State Circuits Conference - (ISSCC)","volume":"240 1","pages":"210-212"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86700642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
期刊
2018 IEEE International Solid - State Circuits Conference - (ISSCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1