2021 IEEE International Solid- State Circuits Conference (ISSCC)最新文献_第10页

29.6 A Distributed Digital LDO with Time-Multiplexing Calibration Loop Achieving 40A/mm2 Current Density and 1mA-to-6.4A Ultra-Wide Load Range in 5nm FinFET CMOS 29.6 5nm FinFET CMOS中具有时间复用校准环路的分布式数字LDO，实现40A/mm2电流密度和1ma至6.4A超宽负载范围

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365964

Dong-Hoon Jung, Tae-Hwang Kong, Jun-Hyeok Yang, SangHo Kim, Kwang-Ho Kim, J. Park, Michael Choi, Jongshin Shin

Although the number of cores is increasing continuously in modern microprocessors for applications such as HPC and AI, the available power is strictly limited by the thermal power budget. To overcome this limitation, recently, each core has been implemented with a dedicated integrated voltage regulator to increase the efficiency of power usage. Distributed digital LDO (DLDO) is a powerful solution for the integrated voltage regulator because it can supply uniform power over the entire core with reduced IR drop and help the thermal management [1– 4]. In the previous distributed DLDOs [1– 3], even though all LDO outputs are connected to drive the power-delivery network, the LDOs operate independently using their own controller, which occupies a large portion of the LDO size. Therefore, the current density in these types of structures is low. In [4], the distributed DLDO uses a dual-loop structure. In this scheme, the high current density can be achieved because the four shared global controllers control the 16 local LDOs (LLDOs) for highly accurate regulation. However, the LLDOs consume large quiescent current since they operate at a switching frequency of several-GHz for a fast transient response. Besides, the load current range is narrow due to the small switching duty-cycle range of the power FETs. Because of these drawbacks, the structure proposed in [4] has limitations in practical applications.

尽管用于高性能计算和人工智能等应用的现代微处理器的核心数量不断增加，但可用功率受到热功率预算的严格限制。为了克服这一限制，最近，每个核心都实现了一个专用的集成电压调节器，以提高电力使用效率。分布式数字LDO (DLDO)是集成稳压器的强大解决方案，因为它可以在整个核心上提供均匀的电源，减少IR下降，并有助于热管理[1 - 4]。在之前的分布式dldo[1 - 3]中，尽管所有的LDO输出都连接起来驱动供电网络，但LDO使用自己的控制器独立运行，这占据了LDO的很大一部分大小。因此，这类结构中的电流密度很低。在[4]中，分布式DLDO采用双环结构。在该方案中，由于四个共享全局控制器控制16个局部ldo (lldo)，因此可以实现高电流密度，从而实现高精度调节。然而，lldo消耗了很大的静态电流，因为它们在几ghz的开关频率下工作，以获得快速的瞬态响应。此外，由于功率场效应管的开关占空比较小，负载电流范围较窄。由于这些缺点，[4]中提出的结构在实际应用中存在局限性。

{"title":"29.6 A Distributed Digital LDO with Time-Multiplexing Calibration Loop Achieving 40A/mm2 Current Density and 1mA-to-6.4A Ultra-Wide Load Range in 5nm FinFET CMOS","authors":"Dong-Hoon Jung, Tae-Hwang Kong, Jun-Hyeok Yang, SangHo Kim, Kwang-Ho Kim, J. Park, Michael Choi, Jongshin Shin","doi":"10.1109/ISSCC42613.2021.9365964","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365964","url":null,"abstract":"Although the number of cores is increasing continuously in modern microprocessors for applications such as HPC and AI, the available power is strictly limited by the thermal power budget. To overcome this limitation, recently, each core has been implemented with a dedicated integrated voltage regulator to increase the efficiency of power usage. Distributed digital LDO (DLDO) is a powerful solution for the integrated voltage regulator because it can supply uniform power over the entire core with reduced IR drop and help the thermal management [1– 4]. In the previous distributed DLDOs [1– 3], even though all LDO outputs are connected to drive the power-delivery network, the LDOs operate independently using their own controller, which occupies a large portion of the LDO size. Therefore, the current density in these types of structures is low. In [4], the distributed DLDO uses a dual-loop structure. In this scheme, the high current density can be achieved because the four shared global controllers control the 16 local LDOs (LLDOs) for highly accurate regulation. However, the LLDOs consume large quiescent current since they operate at a switching frequency of several-GHz for a fast transient response. Besides, the load current range is narrow due to the small switching duty-cycle range of the power FETs. Because of these drawbacks, the structure proposed in [4] has limitations in practical applications.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131850046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

4.4 A 1.3TOPS/W @ 32GOPS Fully Integrated 10-Core SoC for IoT End-Nodes with 1.7μW Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode 4.4 1.3TOPS/W @ 32GOPS完全集成的10核SoC，用于物联网终端节点，具有基于mram的状态保持睡眠模式的1.7μW认知唤醒

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365939

D. Rossi, Francesco Conti, M. Eggimann, Stefan Mach, Alfio Di Mauro, M. Guermandi, Giuseppe Tagliavini, A. Pullini, Igor Loi, Jie Chen, E. Flamand, L. Benini

The Internet-of-Things requires end-nodes with ultra-low-power always-on capability for long battery lifetime, as well as high performance, energy efficiency, and extreme flexibility to deal with complex and fast-evolving near-sensor analytics algorithms (NSAAs). We present Vega, an always-on IoT end-node SoC capable of scaling from a 1.7$mu$ W fully retentive COGNITIVE sleep mode up to 32.2GOPS (@49.4mW) peak performance on NSAAs, including mobile DNN inference, exploiting 1.6MB of state- retentive SRAM, and 4MB of non-volatile MRAM. To meet the performance and flexibility requirements of NSAAs, the SoC features 10 RISC-V cores: one core for SoC and IO management and a 9-core cluster supporting multi-precision SIMD integer and floating- point computation. Two programmable machine-learning (ML) accelerators boost energy efficiency in sleep and active state, respectively.

物联网要求终端节点具有超低功耗、长电池寿命、高性能、能效和极高的灵活性，以应对复杂且快速发展的近传感器分析算法(NSAAs)。我们提出了Vega，一种始终在线的物联网终端节点SoC，能够从1.7$ $ mu$ $ W的完全保留认知睡眠模式扩展到NSAAs上的32.2GOPS (49.4 mw)峰值性能，包括移动DNN推理，利用1.6MB的状态保留SRAM和4MB的非易失性MRAM。为了满足NSAAs的性能和灵活性要求，SoC具有10个RISC-V内核:一个内核用于SoC和IO管理，一个9核集群支持多精度SIMD整数和浮点计算。两个可编程机器学习(ML)加速器分别提高睡眠和活动状态下的能量效率。

引用次数: 24

29.8 115nA@3V ULPMark-CP Score 1205 SCVR-Less Dynamic Voltage-Stacking Scheme for IoT MCU 29.8 115nA@3V ULPMark-CP Score 1205物联网MCU SCVR-Less动态电压叠加方案

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9366025

Xiaomin Li, Yibo Xu, Lizheng Ren, Weiwei Ge, Jianlong Cai, Xinning Liu, Jun Yang

Limited by battery capacity, advanced MCUs for IoT applications require ultra-low power consumption. In a conventional design, most modules except the crystal oscillator (XO32), real-time clock (RTC), and retention memory are turned off to reduce the current in sleep state, but the sleep power still accounts for most of the total power consumption. When the load current is reduced to $sim 100$ nA, the transient current of a switched capacitor voltage regulator (SCVR) remains unchanged $(sim 100$ nA), so that power efficiency is low and the sleep current cannot be reduced further. Voltage stacking has been proposed to address power efficiency [1, 2]. Prior voltage stacking architectures could not realize dynamic switching between flat mode and stack mode, leading to high dynamic power in the normal state. In addition, the SCVR still consumes some power during the sleep state [3]. This paper proposes a dynamic voltage-stacking scheme, which supports two operating modes: a flat mode in the normal state and a stack mode in the sleep state. In the flat mode, the retention memory, RTC, and XO32 are connected in parallel and are powered by the SCVR. In the stack mode, the four instances are connected in series, including the SRAM1 (level1), the SRAM2 (level2), the XO32, and the RTC (level3), and the on-chip SCVR is shut down for power saving.

由于电池容量的限制，用于物联网应用的先进mcu需要超低功耗。在传统设计中，除了晶体振荡器(XO32)、实时时钟(RTC)和保持存储器之外，大多数模块在休眠状态下关闭以减少电流，但休眠功耗仍然占总功耗的大部分。当负载电流降低到$sim 100$ nA时，开关电容稳压器(SCVR)的暂态电流保持$(sim 100$ nA)不变，因此功率效率较低，睡眠电流无法进一步降低。电压堆叠已被提出以解决功率效率问题[1,2]。先前的电压堆叠架构无法实现平面模式和堆叠模式的动态切换，导致在正常状态下动态功率很高。另外，在休眠状态[3]期间，SCVR仍然会消耗一些功率。本文提出了一种动态电压叠加方案，该方案支持两种工作模式:正常状态下的平坦模式和休眠状态下的堆叠模式。在平面模式下，保留内存、RTC和XO32并联连接，由SCVR供电。在堆叠模式下，SRAM1 (level1)、SRAM2 (level2)、XO32和RTC (level3)四个实例依次连接，为了省电，关闭片上SCVR。

{"title":"29.8 115nA@3V ULPMark-CP Score 1205 SCVR-Less Dynamic Voltage-Stacking Scheme for IoT MCU","authors":"Xiaomin Li, Yibo Xu, Lizheng Ren, Weiwei Ge, Jianlong Cai, Xinning Liu, Jun Yang","doi":"10.1109/ISSCC42613.2021.9366025","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9366025","url":null,"abstract":"Limited by battery capacity, advanced MCUs for IoT applications require ultra-low power consumption. In a conventional design, most modules except the crystal oscillator (XO32), real-time clock (RTC), and retention memory are turned off to reduce the current in sleep state, but the sleep power still accounts for most of the total power consumption. When the load current is reduced to $sim 100$ nA, the transient current of a switched capacitor voltage regulator (SCVR) remains unchanged $(sim 100$ nA), so that power efficiency is low and the sleep current cannot be reduced further. Voltage stacking has been proposed to address power efficiency [1, 2]. Prior voltage stacking architectures could not realize dynamic switching between flat mode and stack mode, leading to high dynamic power in the normal state. In addition, the SCVR still consumes some power during the sleep state [3]. This paper proposes a dynamic voltage-stacking scheme, which supports two operating modes: a flat mode in the normal state and a stack mode in the sleep state. In the flat mode, the retention memory, RTC, and XO32 are connected in parallel and are powered by the SCVR. In the stack mode, the four instances are connected in series, including the SRAM1 (level1), the SRAM2 (level2), the XO32, and the RTC (level3), and the on-chip SCVR is shut down for power saving.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128657690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

11.7 A 56Gb/s 50mW NRZ Receiver in 28nm CMOS 11.7在28nm CMOS 56Gb/s 50mW NRZ接收器

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365997

Atharav Atharav, B. Razavi

The power consumption of wireline transceivers has become increasingly critical as higher data rates and a larger numbers of lanes per chip are sought [1] –[6]. While attractive for lossy channels, PAM-4 signaling has mostly dictated ADC-based receivers (RXs) and relatively high power consumption [1], [2]. Non-return-to-zero (NRZ) receivers, on the other hand, can be realized in the analog domain, potentially consuming less power, but they must deal with a greater loss. This paper introduces an NRZ RX that achieves more than a twofold reduction in power while exhibiting BER < 10-12 for a channel loss of 25dB at 28GHz. The proposed design can compete with PAM-4 counterparts and/or serve in 112Gb/s systems that must also support 56Gb/s reception. Figure 11.7.1 shows the RX architecture. The data path consists of a CTLE core, a DFE core, a discrete-time linear equalizer (DTLE) [4], and a DMUX. The receiver performance is greatly improved by a number of feedforward and feedback paths. Also proposed is a half-rate “band-pass” CDR that avoids loading the main data path and the use of quadrature VCOs.

随着更高的数据速率和每个芯片更大数量的通道被寻求[1]-[6]，有线收发器的功耗变得越来越关键。虽然对有损信道很有吸引力，但PAM-4信令主要取决于基于adc的接收器(RXs)和相对较高的功耗[1]，[2]。另一方面，非归零(NRZ)接收器可以在模拟域实现，潜在地消耗更少的功率，但它们必须处理更大的损耗。本文介绍了一种NRZ RX，在28GHz信道损耗为25dB的情况下，其功率降低了两倍以上，而误码率< 10-12。所提出的设计可以与PAM-4竞争，并且/或在必须支持56Gb/s接收的112Gb/s系统中服务。RX架构如图11.7.1所示。数据路径由CTLE核心、DFE核心、离散时间线性均衡器(DTLE)[4]和DMUX组成。通过多种前馈和反馈路径，大大提高了接收机的性能。还提出了一种半速率“带通”CDR，它避免了加载主数据路径和使用正交vco。

引用次数: 5

12.3 Exploring PUF-Controlled PA Spectral Regrowth for Physical-Layer Identification of IoT Nodes 12.3探索puf控制的PA谱再生用于物联网节点的物理层识别

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365941

Qiang Zhou, Yan He, Kaiyuan Yang, T. Chi

It is projected that 75 billion Internet-of-Things (IoT) devices will be deployed for applications such as wearable electronics and smart home by 2025. Securing IoT devices is one of the most significant barriers we need to overcome for large-scale IoT adoption. Conventional wireless security has been implemented solely using upper-layer cryptography [1]. Unfortunately, IoT nodes are often energy-constrained and may not have enough computational resources to implement advanced asymmetric cryptographic algorithms and public-key-infrastructures (PKI) [2]–[3]. To overcome this challenge, there has been growing interest in leveraging the physical impairments of the radios that are bonded to specific TX for secure identification [4] –[6], a.k.a. RF fingerprinting. If Bob (the RX) has sufficient sensitivity, it can identify Alice (the legitimate TX) and the malicious impersonator during demodulation based on their inherent radio signatures, similar to how we distinguish different people based on their unique voice signatures (Fig. 12.3.1). As the device-dependent radio impairments come from process variation, it is challenging for impersonators to forge in practice. In addition, unlike conventional identification approach that device IDs are inserted in preambles and checked only once a while, RF fingerprinting enables continuous identification at any moment during communication, leading to a tighter bond between the data packet and device.

据预测，到2025年，可穿戴电子产品和智能家居等应用领域将部署750亿台物联网(IoT)设备。确保物联网设备的安全是大规模采用物联网需要克服的最重要障碍之一。传统的无线安全仅使用上层加密技术实现[1]。不幸的是，物联网节点通常受到能量限制，可能没有足够的计算资源来实现高级非对称加密算法和公钥基础设施(PKI)[2] -[3]。为了克服这一挑战，人们对利用绑定到特定TX的无线电的物理损伤进行安全识别[4]-[6](即射频指纹识别)越来越感兴趣。如果Bob (RX)具有足够的灵敏度，它可以根据其固有的无线电签名在解调期间识别Alice(合法TX)和恶意模仿者，类似于我们如何根据其独特的语音签名区分不同的人(图12.3.1)。由于设备相关的无线电损伤来自于工艺变化，因此仿冒者在实践中很难伪造。此外，与传统的识别方法不同，设备id插入序言中，只检查一次，射频指纹识别可以在通信过程中的任何时刻进行连续识别，从而使数据包和设备之间的联系更加紧密。

{"title":"12.3 Exploring PUF-Controlled PA Spectral Regrowth for Physical-Layer Identification of IoT Nodes","authors":"Qiang Zhou, Yan He, Kaiyuan Yang, T. Chi","doi":"10.1109/ISSCC42613.2021.9365941","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365941","url":null,"abstract":"It is projected that 75 billion Internet-of-Things (IoT) devices will be deployed for applications such as wearable electronics and smart home by 2025. Securing IoT devices is one of the most significant barriers we need to overcome for large-scale IoT adoption. Conventional wireless security has been implemented solely using upper-layer cryptography [1]. Unfortunately, IoT nodes are often energy-constrained and may not have enough computational resources to implement advanced asymmetric cryptographic algorithms and public-key-infrastructures (PKI) [2]–[3]. To overcome this challenge, there has been growing interest in leveraging the physical impairments of the radios that are bonded to specific TX for secure identification [4] –[6], a.k.a. RF fingerprinting. If Bob (the RX) has sufficient sensitivity, it can identify Alice (the legitimate TX) and the malicious impersonator during demodulation based on their inherent radio signatures, similar to how we distinguish different people based on their unique voice signatures (Fig. 12.3.1). As the device-dependent radio impairments come from process variation, it is challenging for impersonators to forge in practice. In addition, unlike conventional identification approach that device IDs are inserted in preambles and checked only once a while, RF fingerprinting enables continuous identification at any moment during communication, leading to a tighter bond between the data packet and device.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133510129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

High-Performance and Small Form-Factor mm-Wave CMOS Radars for Automotive and Industrial Sensing in 76-to-81GHz and 57-to-64GHz Bands 用于76- 81ghz和57- 64ghz频段的汽车和工业传感的高性能小尺寸毫米波CMOS雷达

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365838

K. Dandu, S. Samala, K. Bhatia, M. Moallem, Karthik Subburaj, Z. Ahmad, Daniel Breen, Sunhwan Jang, Tim Davis, Mayank Singh, Shankar Ram, Vashishth Dudhia, M. Dewilde, Dheeraj Shetty, J. Samuel, Z. Parkar, Cathy Chi, Pilar Loya, Zachary Crawford, John B. Herrington, R. Kulak, Abhinav Daga, Rakesh Raavi, R. Teja, Rajesh Veettil, Daniel Khemraj, Indu Prathapan, P. Narayanan, Naveen Narayanan, Sangamesh Anandwade, Jasbir Singh, V. Srinivasan, Neeraj P. Nayak, K. Ramasubramanian, B. Ginsburg, V. Rentala

Millimeterwave (mm-Wave) radar sensors operating in the 76-to-81 GHz band are a key component of advanced driver-assistance systems (ADAS) for enhanced automotive safety. The recent entry of CMOS solutions in this space has accelerated development of multi-mode Radars that can support long, medium and short-range applications [1–3]. As ADAS applications evolve to support higher levels of autonomy, there is increased demand on radar sensors for improved maximum range, velocity, and angular resolution. Emerging automotive in-cabin occupancy sensing applications are creating opportunities for short-range, high-resolution sensors operating in 60/77GHz bands (depending on the regulatory market). The unlicensed 60GHz band has also enabled industrial sensing opportunities across diverse markets such as robotics, building automation, and healthcare. Several of these broad-market applications require inexpensive and small form factor sensors that can be deployed on low cost PCBs (e.g., FR4) without expertise in mm-Wave design. In this paper, we describe our high-performance 76-to-81GHz FMCW Automotive Radar that supports multi-chip cascading to enable higher angular resolution and a compact 57-to-64 GHz single-chip Radar with integrated antennas on package. All devices are built on a 45nm bulk CMOS technology with 9 metal layers and packaged using flip-chip BGA technology.

工作在76至81 GHz频段的毫米波(mm-Wave)雷达传感器是增强汽车安全性的高级驾驶员辅助系统(ADAS)的关键组成部分。最近进入该领域的CMOS解决方案加速了多模雷达的发展，可以支持长、中、近距离应用[1-3]。随着ADAS应用不断发展，以支持更高级别的自主性，对雷达传感器的需求不断增加，以提高最大范围、速度和角度分辨率。新兴的汽车舱内占用感测应用为60/77GHz频段(取决于监管市场)的短距离高分辨率传感器创造了机会。未经许可的60GHz频段还为机器人、楼宇自动化和医疗保健等不同市场提供了工业传感机会。这些广泛的市场应用中有几个需要廉价和小尺寸的传感器，这些传感器可以部署在低成本的pcb(例如FR4)上，而无需在毫米波设计方面的专业知识。在本文中，我们介绍了我们的高性能76至81ghz FMCW汽车雷达，该雷达支持多芯片级联以实现更高的角度分辨率，以及紧凑型57至64 GHz单芯片雷达，其封装上集成了天线。所有器件均采用45纳米体CMOS技术，采用9层金属层，并采用倒装芯片BGA技术封装。

{"title":"High-Performance and Small Form-Factor mm-Wave CMOS Radars for Automotive and Industrial Sensing in 76-to-81GHz and 57-to-64GHz Bands","authors":"K. Dandu, S. Samala, K. Bhatia, M. Moallem, Karthik Subburaj, Z. Ahmad, Daniel Breen, Sunhwan Jang, Tim Davis, Mayank Singh, Shankar Ram, Vashishth Dudhia, M. Dewilde, Dheeraj Shetty, J. Samuel, Z. Parkar, Cathy Chi, Pilar Loya, Zachary Crawford, John B. Herrington, R. Kulak, Abhinav Daga, Rakesh Raavi, R. Teja, Rajesh Veettil, Daniel Khemraj, Indu Prathapan, P. Narayanan, Naveen Narayanan, Sangamesh Anandwade, Jasbir Singh, V. Srinivasan, Neeraj P. Nayak, K. Ramasubramanian, B. Ginsburg, V. Rentala","doi":"10.1109/ISSCC42613.2021.9365838","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365838","url":null,"abstract":"Millimeterwave (mm-Wave) radar sensors operating in the 76-to-81 GHz band are a key component of advanced driver-assistance systems (ADAS) for enhanced automotive safety. The recent entry of CMOS solutions in this space has accelerated development of multi-mode Radars that can support long, medium and short-range applications [1–3]. As ADAS applications evolve to support higher levels of autonomy, there is increased demand on radar sensors for improved maximum range, velocity, and angular resolution. Emerging automotive in-cabin occupancy sensing applications are creating opportunities for short-range, high-resolution sensors operating in 60/77GHz bands (depending on the regulatory market). The unlicensed 60GHz band has also enabled industrial sensing opportunities across diverse markets such as robotics, building automation, and healthcare. Several of these broad-market applications require inexpensive and small form factor sensors that can be deployed on low cost PCBs (e.g., FR4) without expertise in mm-Wave design. In this paper, we describe our high-performance 76-to-81GHz FMCW Automotive Radar that supports multi-chip cascading to enable higher angular resolution and a compact 57-to-64 GHz single-chip Radar with integrated antennas on package. All devices are built on a 45nm bulk CMOS technology with 9 metal layers and packaged using flip-chip BGA technology.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133371579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

7.4 A 256×128 3D-Stacked (45nm) SPAD FLASH LiDAR with 7-Level Coincidence Detection and Progressive Gating for 100m Range and 10klux Background Light 7.4 256×128 3d堆叠(45nm) SPAD FLASH激光雷达，具有7级重合检测和渐进门控，适用于100m范围和10klux背景光

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9366010

Preethi Padmanabhan, Chao Zhang, M. Cazzaniga, Baris C. Efe, A. Ximenes, Myung-Jae Lee, E. Charbon

3D vision is an increasingly important feature in many applications of consumer, automotive, industrial, and medical imaging. Long range, high depth resolution, high spatial resolution, and high frame rates, are often conflicting requirements and difficult to be simultaneously achieved, especially in extreme ambient light conditions. In order to address range and depth resolution, direct time-of-flight has emerged as a powerful technique to perform light detection and ranging (LiDAR), thanks to advances in low-jitter optical detectors, such as single-photon avalanche diodes (SPADs), and accurate time-to-digital converters (TDCs) [1]–[5]. High spatial resolution can be achieved using scanning, at a cost of system complexity and somewhat lower frame rates [1], [3], while FLASH sensors [2], 4, [5] offer an alternative for both high frame rates and large pixel counts, but at limited ambient light conditions, due to typically long exposure times.

3D视觉在消费、汽车、工业和医疗成像的许多应用中越来越重要。远距离、高深度分辨率、高空间分辨率和高帧率往往是相互冲突的要求，难以同时实现，特别是在极端环境光条件下。为了解决距离和深度分辨率问题，由于低抖动光学探测器(如单光子雪崩二极管(spad))和精确的时间-数字转换器(tdc)[1] -[5]的进步，直接飞行时间已成为执行光探测和测距(LiDAR)的强大技术。高空间分辨率可以通过扫描来实现，但代价是系统复杂性和较低的帧率[1]，[3]，而FLASH传感器[2]，4，[5]提供了高帧率和大像素计数的替代方案，但在有限的环境光条件下，由于典型的长曝光时间。

引用次数: 31

11.2 A 26.5625-to-106.25Gb/s XSR SerDes with 1.55pJ/b Efficiency in 7nm CMOS 11.2在7nm CMOS上具有1.55pJ/b效率的26.5625- 106.25 gb /s XSR SerDes

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365975

Ravi Shivnaraine, Marcus van Ierssel, K. Farzan, D. DiClemente, G. Ng, Nanyan Y. Wang, J. Musayev, Gairik Dutta, M. Shibata, A. Moradi, H. Vahedi, Manavi Farzad, Prabhnoor Kainth, Matt Yu, N. Nguyen, J. Pham, A. McLaren

The increasing connectivity of devices in our daily lives has driven the need for higher bandwidth in network and data centers. Recently, we have seen the development of 112Gb/s SerDes, particularly for long-reach interfaces [1– 3]. In high-density switch ASICs, we see an increasing demand to improve both area efficiency (mm2/lane) and signaling efficiencies (pJ/b) [1– 6]. In a switch ASIC, keeping the SerDes power low translates into broader system power savings since additional power and cost for cooling can be limited or even avoided entirely. One path forward to achieve these important system gains is co-packaged optics (CPO) with an extra-short-reach (XSR) interface. In these applications the switch ASIC and optical engine are no more than 50mm apart which represents a total loss of approximately 10dB at 106.25Gb/s.

在我们的日常生活中，越来越多的设备连接，推动了对网络和数据中心更高带宽的需求。最近，我们已经看到了112Gb/s服务器的发展，特别是对于长距离接口[1 - 3]。在高密度开关asic中，我们看到对提高面积效率(mm2/lane)和信令效率(pJ/b)的需求不断增加[1 - 6]。在开关ASIC中，保持较低的SerDes功率可以转化为更广泛的系统功耗节约，因为可以限制甚至完全避免额外的功耗和冷却成本。实现这些重要系统增益的一个途径是带有超短距离(XSR)接口的共封装光学器件(CPO)。在这些应用中，开关ASIC和光引擎之间的距离不超过50mm，在106.25Gb/s的速度下，总损耗约为10dB。

引用次数: 6

17.5 A 98.2%-Efficiency Reciprocal Direct Charge Recycling Inductor-First DC-DC Converter 17.5效率98.2%的反向直接电荷回收电感-首个DC-DC变换器

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365860

Abdullah Abdulslam, P. Mercier

Inductive DC-DC converters are fundamentally limited by the trade-off between conduction losses and switching losses. Miniaturized converters used in applications such as mobile devices suffer badly from this trade off, as a small inductor has a large DCR, which contributes large I2 RDCR conduction losses, while a small inductance desires high frequency operation, which implies high CGATE V2 f hard charging switching losses from the power MOSFET gate drivers. Interestingly, the rise/fall time of such drivers cannot be too rapid, regardless of switching frequency, due to inductive ringing causing potential voltage stresses [1], [2]. To ease the conduction/switching loss trade-off, it is possible to exploit the requirement for finite rise/fall time by replacing conventionally hard-switching gate drivers with adiabatic charge-recycling (CR) gate drivers. As depicted in Fig. 17.5.1 (top right), CR can, through the help of inductor LR, recycle the charge stored on CGATE to another capacitance, CSTORE (and vice-versa), theoretically with 100% efficiency. This approach was demonstrated in [3], where the charge on the power MOSFET gates are recycled to two auxiliary capacitors through two separate inductors (Fig. 17.5.1, top left). However, besides the overhead of two inductors, recycling with separate storage capacitors introduces indirect losses, while the separated duty-cycled resonate gate drivers makes non-overlap timing control between power MOSFETs difficult. By AC-coupling the power NMOS to the resonant gate driver as in [4] (Fig. 17.5.1, bottom left), it is possible to reduce the number of resonant inductors to 1. However, the non-overlap time cannot be precisely controlled, leading to potentially large overlap losses, and the limited duty-cycle control through driver slope modulation prevents robust regulation across a wide output range.

电感式DC-DC变换器从根本上受限于传导损耗和开关损耗之间的权衡。在移动设备等应用中使用的小型化转换器严重受到这种权衡的影响，因为小型电感具有大的DCR，这有助于大的I2 rdrcr导通损耗，而小型电感需要高频操作，这意味着功率MOSFET栅极驱动器的硬充电开关损耗高CGATE V2。有趣的是，无论开关频率如何，这种驱动器的上升/下降时间都不能太快，因为感应振铃会产生潜在的电压应力[1]，[2]。为了减轻传导/开关损耗的权衡，可以通过用绝热电荷回收(CR)栅极驱动器取代传统的硬开关栅极驱动器来满足有限上升/下降时间的要求。如图17.5.1(右上)所示，CR可以通过电感器LR的帮助，将存储在CGATE上的电荷回收到另一个电容CSTORE上(反之亦然)，理论上效率为100%。这种方法在[3]中得到了证明，其中功率MOSFET栅极上的电荷通过两个独立的电感回收到两个辅助电容器(图17.5.1，左上)。然而，除了两个电感器的开销外，使用单独的存储电容进行回收会引入间接损耗，而分离的占空比谐振栅驱动器使得功率mosfet之间的非重叠定时控制变得困难。通过将功率NMOS与谐振栅极驱动器进行交流耦合，如图[4]所示(图17.5.1，左下)，可以将谐振电感的数量减少到1个。然而，非重叠时间不能精确控制，导致潜在的大重叠损失，并且通过驱动器斜率调制的有限占空比控制阻碍了在宽输出范围内的鲁棒调节。

{"title":"17.5 A 98.2%-Efficiency Reciprocal Direct Charge Recycling Inductor-First DC-DC Converter","authors":"Abdullah Abdulslam, P. Mercier","doi":"10.1109/ISSCC42613.2021.9365860","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365860","url":null,"abstract":"Inductive DC-DC converters are fundamentally limited by the trade-off between conduction losses and switching losses. Miniaturized converters used in applications such as mobile devices suffer badly from this trade off, as a small inductor has a large DCR, which contributes large I2 RDCR conduction losses, while a small inductance desires high frequency operation, which implies high CGATE V2 f hard charging switching losses from the power MOSFET gate drivers. Interestingly, the rise/fall time of such drivers cannot be too rapid, regardless of switching frequency, due to inductive ringing causing potential voltage stresses [1], [2]. To ease the conduction/switching loss trade-off, it is possible to exploit the requirement for finite rise/fall time by replacing conventionally hard-switching gate drivers with adiabatic charge-recycling (CR) gate drivers. As depicted in Fig. 17.5.1 (top right), CR can, through the help of inductor LR, recycle the charge stored on CGATE to another capacitance, CSTORE (and vice-versa), theoretically with 100% efficiency. This approach was demonstrated in [3], where the charge on the power MOSFET gates are recycled to two auxiliary capacitors through two separate inductors (Fig. 17.5.1, top left). However, besides the overhead of two inductors, recycling with separate storage capacitors introduces indirect losses, while the separated duty-cycled resonate gate drivers makes non-overlap timing control between power MOSFETs difficult. By AC-coupling the power NMOS to the resonant gate driver as in [4] (Fig. 17.5.1, bottom left), it is possible to reduce the number of resonant inductors to 1. However, the non-overlap time cannot be precisely controlled, leading to potentially large overlap losses, and the limited duty-cycle control through driver slope modulation prevents robust regulation across a wide output range.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117024137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

36.2 An EM/Power SCA-Resilient AES-256 with Synthesizable Signature Attenuation Using Digital-Friendly Current Source and RO-Bleed-Based Integrated Local Feedback and Global Switched-Mode Control 36.2基于数字友好电流源和基于ro -流血的集成局部反馈和全局切换模式控制的EM/Power sca弹性AES-256可合成签名衰减

2021 IEEE International Solid- State Circuits Conference (ISSCC)

Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365978

A. Ghosh, D. Das, Josef Danial, V. De, Santosh K. Ghosh, Shreyas Sen

Mathematically secure cryptographic algorithms leak side-channel information in the form of correlated power and electromagnetic (EM) signals, leading to physical side-channel analysis (SCA) attacks. Circuit-level countermeasures against power/EM SCA include current equalizer [1], series LDO [2], IVR [3], enhancing protection up to 10M traces. Recently, current domain signature attenuation [4] and randomized NL-LDO cascaded with arithmetic countermeasures [5] achieved $gt1mathrm{B}$ minimum traces to disclosure (MTD) with a single and two countermeasures, respectively. Among these, the highest protection with a single strategy is achieved using signature attenuation [4], [6], which utilized a current source making the supply current mostly constant. While being highly resilient to SCA, [4] required analog-biased cascode current sources and an analog bleed path, making it not easily scalable across different technology generations. Conversely, [2], [5] are synthesizable but a single countermeasure only achieved moderate protection (up to 10M MTD). This work embraces the concept of signature attenuation in the current domain, but makes it fully-synthesizable with digital current sources, control loop and the bleed to increase the MTD from 10M [5] to $250mathrm{M} (25 times $ improvement, Fig. 36.2.1) using a single synthesizable countermeasure. Finally, combining the digital signature attenuation circuit (DSAC) with a second synthesizable generic technique in the form of a time-varying transfer function (TVTF), this work achieves an MTD $gt1.25mathrm{B}$ for both EM and power SCA.

数学上安全的加密算法以相关功率和电磁(EM)信号的形式泄漏侧信道信息，导致物理侧信道分析(SCA)攻击。针对功率/EM SCA的电路级对策包括电流均衡器[1]，系列LDO [2]， IVR[3]，可增强长达10M走线的保护。近年来，电流域特征衰减[4]和随机化NL-LDO级联算术对抗[5]分别实现了$gt1mathrm{B}$最小披露痕迹(MTD)。其中，使用特征衰减[4]，[6]实现了单一策略的最高保护，该策略利用电流源使电源电流基本恒定。虽然[4]对SCA具有很高的弹性，但它需要模拟偏级级码电流源和模拟输出路径，这使得它不容易在不同的技术世代之间进行扩展。相反，[2]和[5]是可合成的，但单个对抗措施只能实现中等保护(高达10M MTD)。这项工作包含了电流域中的特征衰减的概念，但使其与数字电流源、控制回路和出血完全合成，使用单个可合成对策将MTD从10M[5]增加到250 mathm {M}(改进25倍，图36.2.1)。最后，将数字签名衰减电路(DSAC)与时变传递函数(TVTF)形式的第二种可合成通用技术相结合，本工作实现了EM和功率SCA的MTD $gt1.25 maththrm {B}$。

{"title":"36.2 An EM/Power SCA-Resilient AES-256 with Synthesizable Signature Attenuation Using Digital-Friendly Current Source and RO-Bleed-Based Integrated Local Feedback and Global Switched-Mode Control","authors":"A. Ghosh, D. Das, Josef Danial, V. De, Santosh K. Ghosh, Shreyas Sen","doi":"10.1109/ISSCC42613.2021.9365978","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365978","url":null,"abstract":"Mathematically secure cryptographic algorithms leak side-channel information in the form of correlated power and electromagnetic (EM) signals, leading to physical side-channel analysis (SCA) attacks. Circuit-level countermeasures against power/EM SCA include current equalizer [1], series LDO [2], IVR [3], enhancing protection up to 10M traces. Recently, current domain signature attenuation [4] and randomized NL-LDO cascaded with arithmetic countermeasures [5] achieved $gt1mathrm{B}$ minimum traces to disclosure (MTD) with a single and two countermeasures, respectively. Among these, the highest protection with a single strategy is achieved using signature attenuation [4], [6], which utilized a current source making the supply current mostly constant. While being highly resilient to SCA, [4] required analog-biased cascode current sources and an analog bleed path, making it not easily scalable across different technology generations. Conversely, [2], [5] are synthesizable but a single countermeasure only achieved moderate protection (up to 10M MTD). This work embraces the concept of signature attenuation in the current domain, but makes it fully-synthesizable with digital current sources, control loop and the bleed to increase the MTD from 10M [5] to $250mathrm{M} (25 times $ improvement, Fig. 36.2.1) using a single synthesizable countermeasure. Finally, combining the digital signature attenuation circuit (DSAC) with a second synthesizable generic technique in the form of a time-varying transfer function (TVTF), this work achieves an MTD $gt1.25mathrm{B}$ for both EM and power SCA.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"337 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122842533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16