首页 > 最新文献

2021 IEEE International Solid- State Circuits Conference (ISSCC)最新文献

英文 中文
A 400-to-1000nm 24μ W Monolithic PPG Sensor with 0.3A/W Spectral Responsivity for Miniature Wearables 一种400- 1000nm 24μ W单片PPG传感器,光谱响应率为0.3A/W,用于微型可穿戴设备
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9366035
Sung-jin Jung, Jeil Ryu, Wang-hyun Kim, Seunghoon Lee, Jongboo Kim, Hyelim Park, Tae-Min Jang, Haedo Jeong, Juhwa Kim, Jeongho Park, Raeyoung Kim, Jeongho Park, H. Jo, Whee Jin Kim, Jangbeom Yang, Bongjin Sohn, Yuncheol Han, Inchun Lim, Seoungjae Yoo, Changsoon Park, Dae-Geun Jang, Byung-Hoon Ko, J. Lim, Jihon Kim, Kyungho Lee, Jesuk Lee, Yongin Park, Long Yan
Incorporating different wavelength (400 to 1000nm) LEDs, photoplethysmography (PPG) sensors allow wearable devices to monitor various health parameters such as heart rate (HR), oxygen saturation (SpO2), and blood pressure (BP). Nowadays, PPG sensing technology at the wrist is well established. To cope with the large degree of motion turbulence presented at the wrist, PPG sensors use Green (Gr) LEDs together with multiple photodiodes (PD), and they are driven by wide-dynamic-range (DR) current-sensing front-ends [1]. It is attractive to use a near-infra-red (nIR) PPG sensor in a True Wireless Stereo (TWS), as the ear provides the best site to measure heart rhythm (more blood flow, constant distance from the heart, and less motion than at the finger or wrist). However, TWS requires a PPG sensor that is more stringent on size and power consumption (shown in Fig. 28.2.1). A promising solution [2, 3] is integrating an array of PDs with an ADC to dramatically reduce power while also providing monolithic integration. However, the limited DR (<80 dB) and the poor spectral responsivity remain challenging. This work advances [1] by demonstrating a CMOS monolithic PPG sensor, and improves spectral responsivity more than $4 times (0.3mathrm{A} /mathrm{W}$ across 400 to 1000nm) compared to [2]. The sensor is fabricated by back-side illumination (BSI) CMOS technology providing 90dB DR (18dB improvement from [3]) while consuming only $24 mu mathrm{W}$ power and 5.5mm 2 silicon area.
结合不同波长(400至1000nm)的led,光电体积脉搏描记(PPG)传感器允许可穿戴设备监测各种健康参数,如心率(HR)、氧饱和度(SpO2)和血压(BP)。如今,手腕上的PPG传感技术已经很成熟。为了应对手腕处出现的大程度运动湍流,PPG传感器使用Green (Gr) led和多个光电二极管(PD),并由宽动态范围(DR)电流传感前端驱动[1]。在真正的无线立体声(TWS)中使用近红外(nIR) PPG传感器是很有吸引力的,因为耳朵提供了测量心律的最佳位置(更多的血流量,与心脏的距离恒定,比手指或手腕的运动更少)。然而,TWS对PPG传感器的尺寸和功耗要求更为严格(如图28.2.1所示)。一个很有前途的解决方案[2,3]是将一组pd与一个ADC集成在一起,以显著降低功耗,同时提供单片集成。然而,有限的DR (<80 dB)和较差的光谱响应率仍然是一个挑战。这项工作通过展示CMOS单片PPG传感器来推进[1],与[2]相比,光谱响应率提高了4倍以上(在400至1000nm范围内0.3 mathm {a} / mathm {W}$)。该传感器采用背面照明(BSI) CMOS技术制造,提供90dB DR(比[3]提高了18dB),功耗仅为$24 mu mathm {W}$,硅面积为5.5mm。
{"title":"A 400-to-1000nm 24μ W Monolithic PPG Sensor with 0.3A/W Spectral Responsivity for Miniature Wearables","authors":"Sung-jin Jung, Jeil Ryu, Wang-hyun Kim, Seunghoon Lee, Jongboo Kim, Hyelim Park, Tae-Min Jang, Haedo Jeong, Juhwa Kim, Jeongho Park, Raeyoung Kim, Jeongho Park, H. Jo, Whee Jin Kim, Jangbeom Yang, Bongjin Sohn, Yuncheol Han, Inchun Lim, Seoungjae Yoo, Changsoon Park, Dae-Geun Jang, Byung-Hoon Ko, J. Lim, Jihon Kim, Kyungho Lee, Jesuk Lee, Yongin Park, Long Yan","doi":"10.1109/ISSCC42613.2021.9366035","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9366035","url":null,"abstract":"Incorporating different wavelength (400 to 1000nm) LEDs, photoplethysmography (PPG) sensors allow wearable devices to monitor various health parameters such as heart rate (HR), oxygen saturation (SpO2), and blood pressure (BP). Nowadays, PPG sensing technology at the wrist is well established. To cope with the large degree of motion turbulence presented at the wrist, PPG sensors use Green (Gr) LEDs together with multiple photodiodes (PD), and they are driven by wide-dynamic-range (DR) current-sensing front-ends [1]. It is attractive to use a near-infra-red (nIR) PPG sensor in a True Wireless Stereo (TWS), as the ear provides the best site to measure heart rhythm (more blood flow, constant distance from the heart, and less motion than at the finger or wrist). However, TWS requires a PPG sensor that is more stringent on size and power consumption (shown in Fig. 28.2.1). A promising solution [2, 3] is integrating an array of PDs with an ADC to dramatically reduce power while also providing monolithic integration. However, the limited DR (<80 dB) and the poor spectral responsivity remain challenging. This work advances [1] by demonstrating a CMOS monolithic PPG sensor, and improves spectral responsivity more than $4 times (0.3mathrm{A} /mathrm{W}$ across 400 to 1000nm) compared to [2]. The sensor is fabricated by back-side illumination (BSI) CMOS technology providing 90dB DR (18dB improvement from [3]) while consuming only $24 mu mathrm{W}$ power and 5.5mm 2 silicon area.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"248 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124723581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A 24Gb/s/pin 8Gb GDDR6 with a Half-Rate Daisy-Chain-Based Clocking Architecture and IO Circuitry for Low-Noise Operation 24Gb/s/引脚8Gb GDDR6,半速率菊花链时钟架构和低噪声操作IO电路
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365844
Kyunghoon Kim, Joo-Hyung Chae, Jaehyeok Yang, Ji-Hyo Kang, Gang-Sik Lee, Sangyeon Byeon, Youngtaek Kim, Boram Kim, Donghoon Kim, Yeongmuk Cho, Kangmoo Choi, Hye-Lim Park, Junghwan Ji, S. Jeong, Yongsuk Joo, Jaehoon Cha, Mi-Lim Park, Hongdeuk Kim, Sijun Park, K. Kong, Sunho Kim, Sangkwon Lee, J. Chun, Hyung-Seuk Kim, S. Cha
The demand for high-performance graphics systems used for artificial intelligence continues to grow; this trend requires graphics systems to achieve ever higher bandwidths. Enabling GDDR6 DRAM to achieve data rates beyond 18Gb/s/pin [1] requires identifying and solving factors that affect the speed of a memory interface. Prior studies have showed that the memory interface is vulnerable from the signal integrity (SI) and power integrity (PI) perspective, since it is based on a parallel interface using single-ended signaling. Furthermore, circuit schemes to mitigate process, voltage, and temperature (PVT) variations in sub-nanometer DRAM process are required to improve performance. To achieve 24Gb/s/pin on a 1.35V DRAM process, this work proposes a GDDR6 DRAM with a half-rate clocking architecture and optimized I/O.
对用于人工智能的高性能图形系统的需求持续增长;这种趋势要求图形系统实现更高的带宽。要使GDDR6 DRAM实现超过18Gb/s/pin的数据速率[1],需要识别和解决影响内存接口速度的因素。先前的研究表明,从信号完整性(SI)和功率完整性(PI)的角度来看,存储接口是脆弱的,因为它是基于使用单端信令的并行接口。此外,为了提高性能,需要在亚纳米DRAM工艺中减小工艺、电压和温度(PVT)变化的电路方案。为了在1.35V DRAM上实现24Gb/s/pin,本工作提出了一种具有半速率时钟架构和优化I/O的GDDR6 DRAM。
{"title":"A 24Gb/s/pin 8Gb GDDR6 with a Half-Rate Daisy-Chain-Based Clocking Architecture and IO Circuitry for Low-Noise Operation","authors":"Kyunghoon Kim, Joo-Hyung Chae, Jaehyeok Yang, Ji-Hyo Kang, Gang-Sik Lee, Sangyeon Byeon, Youngtaek Kim, Boram Kim, Donghoon Kim, Yeongmuk Cho, Kangmoo Choi, Hye-Lim Park, Junghwan Ji, S. Jeong, Yongsuk Joo, Jaehoon Cha, Mi-Lim Park, Hongdeuk Kim, Sijun Park, K. Kong, Sunho Kim, Sangkwon Lee, J. Chun, Hyung-Seuk Kim, S. Cha","doi":"10.1109/ISSCC42613.2021.9365844","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365844","url":null,"abstract":"The demand for high-performance graphics systems used for artificial intelligence continues to grow; this trend requires graphics systems to achieve ever higher bandwidths. Enabling GDDR6 DRAM to achieve data rates beyond 18Gb/s/pin [1] requires identifying and solving factors that affect the speed of a memory interface. Prior studies have showed that the memory interface is vulnerable from the signal integrity (SI) and power integrity (PI) perspective, since it is based on a parallel interface using single-ended signaling. Furthermore, circuit schemes to mitigate process, voltage, and temperature (PVT) variations in sub-nanometer DRAM process are required to improve performance. To achieve 24Gb/s/pin on a 1.35V DRAM process, this work proposes a GDDR6 DRAM with a half-rate clocking architecture and optimized I/O.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"33 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129454196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
14.5 A 1V W-Band Bidirectional Transceiver Front-End with <1dB T/R Switch Loss, <1°/dB Phase/Gain Resolution and 12.3% TX PAE at 15.1dBm Output Power in 65nm CMOS Technology 14.5 v w波段双向收发前端,T/R开关损耗<1dB,相位/增益分辨率<1°/dB,输出功率15.1dBm, txpae 12.3%,采用65nm CMOS技术
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365944
Wei Zhu, Jiawen Wang, Ruitao Wang, Yan Wang
Millimeter-wave (mm-wave) imaging radars and communication systems operating at W-band obtain an ever-increasing attention due to their high-resolution and high data rate [1] –[6]. However, because of the challenges of lower active device gain and greater passive loss, most W-Band transceivers (TRXs) do not integrate T/R switches and attenuators [1] –[6], which will significantly reduce the system performance and increase the cost. In this paper, we propose a W-band bidirectional phased-array TRX FE in a 65nm CMOS technology to support communication and radar applications in which three critical issues of higher mm-wave-band phased-array TRX FE have been initially dealt with: 1) the large insertion loss (IL) of the T/R switch, 2) the limited resolution and 3) gain/phase variation of the phase shifter (PS) and attenuator. The concept of EM coupling was applied in the W-band circuit design and the couple-based T/R switches, PSs and attenuators were integrated in the TRX FE, achieving <1dB T/R switch IL, > 12.3% peak PAE at 15.1dBm output power and <1°/dB phase/gain resolution with <±2.1dB/±6° gain/phase variation.
工作在w波段的毫米波(mm-wave)成像雷达和通信系统因其高分辨率和高数据速率而受到越来越多的关注[1]-[6]。然而,由于低有源器件增益和大无源损耗的挑战,大多数w波段收发器(trx)没有集成T/R开关和衰减器[1]-[6],这将大大降低系统性能并增加成本。在本文中,我们提出了一种w波段双向相控阵TRX FE,采用65nm CMOS技术,以支持通信和雷达应用,其中高毫米波段相控阵TRX FE的三个关键问题已经初步解决:1)T/R开关的大插入损耗(IL), 2)有限的分辨率和3)移相器(PS)和衰减器的增益/相位变化。在w波段电路设计中应用了EM耦合的概念,并在TRX FE中集成了基于耦合的T/R开关、ps和衰减器,在15.1dBm输出功率下实现了12.3%的峰值PAE,相位/增益分辨率<1°/dB,增益/相位变化<±2.1dB/±6°。
{"title":"14.5 A 1V W-Band Bidirectional Transceiver Front-End with <1dB T/R Switch Loss, <1°/dB Phase/Gain Resolution and 12.3% TX PAE at 15.1dBm Output Power in 65nm CMOS Technology","authors":"Wei Zhu, Jiawen Wang, Ruitao Wang, Yan Wang","doi":"10.1109/ISSCC42613.2021.9365944","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365944","url":null,"abstract":"Millimeter-wave (mm-wave) imaging radars and communication systems operating at W-band obtain an ever-increasing attention due to their high-resolution and high data rate [1] –[6]. However, because of the challenges of lower active device gain and greater passive loss, most W-Band transceivers (TRXs) do not integrate T/R switches and attenuators [1] –[6], which will significantly reduce the system performance and increase the cost. In this paper, we propose a W-band bidirectional phased-array TRX FE in a 65nm CMOS technology to support communication and radar applications in which three critical issues of higher mm-wave-band phased-array TRX FE have been initially dealt with: 1) the large insertion loss (IL) of the T/R switch, 2) the limited resolution and 3) gain/phase variation of the phase shifter (PS) and attenuator. The concept of EM coupling was applied in the W-band circuit design and the couple-based T/R switches, PSs and attenuators were integrated in the TRX FE, achieving <1dB T/R switch IL, > 12.3% peak PAE at 15.1dBm output power and <1°/dB phase/gain resolution with <±2.1dB/±6° gain/phase variation.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129718911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Programmable Neural-Network Inference Accelerator Based on Scalable In-Memory Computing 基于可扩展内存计算的可编程神经网络推理加速器
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365788
Hongyang Jia, Murat Ozatay, Yinqi Tang, Hossein Valavi, Rakshit Pathak, Jinseok Lee, N. Verma
This paper presents a scalable neural-network (NN) inference accelerator in 16nm, based on an array of programmable cores employing mixed-signal In-Memory Computing (IMC), digital Near-Memory Computing (NMC), and localized buffering/control. IMC achieves high energy efficiency and throughput for matrix-vector multiplications (MVMs), which dominate NNs; but, scalability poses numerous challenges, both technologically, going to advanced nodes to maintain gains over digital architectures, and architecturally, for full execution of diverse NNs. Recent demonstrations have explored integrating IMC in programmable processors [1, 2], but have not achieved IMC efficiency and throughput for full executions. The central challenge is drastically different physical design points and associated tradeoffs incurred by IMC compared to digital engines. Namely, IMC substantially increases compute energy efficiency and HW density/parallelism, but retains the overheads of HW virtualization (state and data swapping/buffering/communication across spatial/temporal computation mappings). The demonstrated architecture is co-designed with SW-mapping algorithms (encapsulated in a custom graph compiler), to provide efficiency across a broad range of mapping strategies, to overcome these overheads.
本文提出了一种可扩展的16nm神经网络推理加速器,该加速器基于一系列可编程内核,采用混合信号内存计算(IMC)、数字近内存计算(NMC)和局部缓冲/控制。IMC实现了主导神经网络的矩阵向量乘法(mvm)的高能量效率和吞吐量;但是,可扩展性带来了许多挑战,无论是在技术上,走向先进的节点以保持对数字架构的优势,还是在架构上,为了全面执行各种神经网络。最近的演示已经探索了将IMC集成到可编程处理器中[1,2],但尚未实现完整执行的IMC效率和吞吐量。与数字引擎相比,IMC面临的主要挑战是物理设计点和相关权衡的差异。也就是说,IMC大大提高了计算能源效率和硬件密度/并行性,但保留了硬件虚拟化的开销(跨空间/时间计算映射的状态和数据交换/缓冲/通信)。演示的体系结构是与sw映射算法(封装在自定义图形编译器中)共同设计的,以便在广泛的映射策略中提供效率,以克服这些开销。
{"title":"A Programmable Neural-Network Inference Accelerator Based on Scalable In-Memory Computing","authors":"Hongyang Jia, Murat Ozatay, Yinqi Tang, Hossein Valavi, Rakshit Pathak, Jinseok Lee, N. Verma","doi":"10.1109/ISSCC42613.2021.9365788","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365788","url":null,"abstract":"This paper presents a scalable neural-network (NN) inference accelerator in 16nm, based on an array of programmable cores employing mixed-signal In-Memory Computing (IMC), digital Near-Memory Computing (NMC), and localized buffering/control. IMC achieves high energy efficiency and throughput for matrix-vector multiplications (MVMs), which dominate NNs; but, scalability poses numerous challenges, both technologically, going to advanced nodes to maintain gains over digital architectures, and architecturally, for full execution of diverse NNs. Recent demonstrations have explored integrating IMC in programmable processors [1, 2], but have not achieved IMC efficiency and throughput for full executions. The central challenge is drastically different physical design points and associated tradeoffs incurred by IMC compared to digital engines. Namely, IMC substantially increases compute energy efficiency and HW density/parallelism, but retains the overheads of HW virtualization (state and data swapping/buffering/communication across spatial/temporal computation mappings). The demonstrated architecture is co-designed with SW-mapping algorithms (encapsulated in a custom graph compiler), to provide efficiency across a broad range of mapping strategies, to overcome these overheads.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129357852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
A 5nm 5.7GHz@1.0V and 1.3GHz@0.5V 4kb Standard-Cell- Based Two-Port Register File with a 16T Bitcell with No Half-Selection Issue 一个5nm 5.7GHz@1.0V和1.3GHz@0.5V 4kb基于标准单元的双端口寄存器文件,具有16T位单元,没有半选择问题
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9366000
H. Fujiwara, Y. Nien, Chih-Yu Lin, H. Pan, H. Hsu, Shin-Rung Wu, Yao-Yi Liu, Yen-Huei Chen, H. Liao, Jonathan Chang
Continued scaling of the transistor increases random Vt variation, which limits the minimum operating voltage $(V_{mathrm{MIN}})$. Furthermore, fin formation differences between the SRAM bitcells, the peripheral circuits and the standard logic degrade area efficiency due to the empty spaces at fin-to-fin boundary and the required dummy [1]. Memories with small capacities that use the classical SRAM design suffer from this issue the most. In this paper, we will propose a 5nm digital-based SRAM macro with a 16T cell supporting a bit-write-mask operation. We adopted the standard cell rules for the proposed SRAM layout design. The area of the 16T cell is larger than the foundry’s 6T SRAM cell; however, the total macro area of a small capacity SRAM is smaller since there is no empty space in the macro and due to its simple peripheral circuit. In addition, the proposed SRAM can be directly abutted with the standard cell region. The proposed SRAM can support ultra-wide range voltage operation due to the advantages of a digital-based bitcell design.
晶体管的持续缩放增加了Vt的随机变化,这限制了最小工作电压$(V_{ mathm {MIN}})$。此外,SRAM位单元、外围电路和标准逻辑之间的鳍形差异,由于鳍到鳍边界的空白空间和所需的假体,会降低区域效率[1]。使用经典SRAM设计的小容量存储器最容易受到这个问题的影响。在本文中,我们将提出一个5nm基于数字的SRAM宏,其16T单元支持位写掩码操作。我们采用标准单元规则进行SRAM布局设计。16T单元的面积比代工的6T SRAM单元大;然而,小容量SRAM的总宏面积较小,因为宏中没有空白空间,并且由于其简单的外围电路。此外,所提出的SRAM可以直接与标准单元区域相邻。由于基于数字位单元设计的优势,所提出的SRAM可以支持超宽范围电压工作。
{"title":"A 5nm 5.7GHz@1.0V and 1.3GHz@0.5V 4kb Standard-Cell- Based Two-Port Register File with a 16T Bitcell with No Half-Selection Issue","authors":"H. Fujiwara, Y. Nien, Chih-Yu Lin, H. Pan, H. Hsu, Shin-Rung Wu, Yao-Yi Liu, Yen-Huei Chen, H. Liao, Jonathan Chang","doi":"10.1109/ISSCC42613.2021.9366000","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9366000","url":null,"abstract":"Continued scaling of the transistor increases random Vt variation, which limits the minimum operating voltage $(V_{mathrm{MIN}})$. Furthermore, fin formation differences between the SRAM bitcells, the peripheral circuits and the standard logic degrade area efficiency due to the empty spaces at fin-to-fin boundary and the required dummy [1]. Memories with small capacities that use the classical SRAM design suffer from this issue the most. In this paper, we will propose a 5nm digital-based SRAM macro with a 16T cell supporting a bit-write-mask operation. We adopted the standard cell rules for the proposed SRAM layout design. The area of the 16T cell is larger than the foundry’s 6T SRAM cell; however, the total macro area of a small capacity SRAM is smaller since there is no empty space in the macro and due to its simple peripheral circuit. In addition, the proposed SRAM can be directly abutted with the standard cell region. The proposed SRAM can support ultra-wide range voltage operation due to the advantages of a digital-based bitcell design.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"03 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127396733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
32.5 A 24GHz Self-Calibrated ADPLL-Based FMCW Synthesizer with 0.01% rms Frequency Error Under 3.2GHz Chirp Bandwidth and 320MHz/μs Slope 32.5基于adpll的24GHz自校准FMCW合成器,在3.2GHz啁啾带宽下,斜率为320MHz/μs,频率误差为0.01%
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365949
Zhengkun Shen, Haoyun Jiang, Fan Yang, Yixiao Wang, Zherui Zhang, Junhua Liu, H. Liao
Frequency synthesizers are critical for millimeter-wave (mm-wave) frequency-modulated continuous-wave (FMCW) radars. Large-chirp-bandwidth ($BW_{chirp})$ sawtooth waveforms are required to be synthesized with fast slope and high-frequency linearity for accurate detection of targets or high-quality imaging. Fractional-N phase-locked loops (PLLs) with a two-point-modulation (TPM) scheme are widely used to synthesize fast high-linearity chirps [1] –[3]. However, for a wideband multi-bank digitally controlled oscillator (DCO), the intrinsic $1/ surd Lmathrm{C}$ nonlinearity and the frequency discontinuity from overlaps between adjacent tuning bands introduce a significant gain mismatch between the two modulation paths of the TPM scheme and degrade the chirp linearity [3], [5]. To linearize the DCO tuning curve, a piecewise linear pre-distortion method is commonly used as shown in Fig. 32.5.1 [3]. In this method, the overlaps are mitigated by scaling every band with the same factor SC, which is based on the assumption that ratios of each DCO tuning band to the corresponding coarse frequency step remain the same. In practice, precise matching between these bands cannot be guaranteed. Each tuning band is then linearly fitted with its average gain $g_{i}$, but non-ideal residual frequency errors may still deteriorate the chirp linearity. As a DCO bandwidth increases, the mismatches and residual frequency errors tend to be more severe, making this method unsuitable for wideband FMCW synthesizers.
频率合成器是毫米波(mm波)调频连续波(FMCW)雷达的关键器件。大啁啾带宽($BW_{chirp})$锯齿波需要具有快速斜率和高线性度才能准确检测目标或实现高质量成像。双点调制(TPM)方案的分数n锁相环(pll)被广泛用于合成快速的高线性啁啾[1]-[3]。然而,对于宽带多组数字控制振荡器(DCO),固有的$1/ surd L maththrm {C}$非线性和相邻调谐带之间重叠的频率不连续导致TPM方案的两个调制路径之间的显著增益失配,并降低了啁啾线性度[3],[5]。为了使DCO调谐曲线线性化,通常采用分段线性预失真方法,如图32.5.1[3]所示。在这种方法中,通过用相同的SC因子缩放每个频带来减轻重叠,这是基于每个DCO调谐频带与相应粗频率步长的比率保持不变的假设。在实际应用中,不能保证这些波段之间的精确匹配。然后用其平均增益$g_{i}$线性拟合每个调谐带,但非理想剩余频率误差仍可能使啁啾线性恶化。随着DCO带宽的增加,失配和剩余频率误差趋于严重,使得该方法不适用于宽带FMCW合成器。
{"title":"32.5 A 24GHz Self-Calibrated ADPLL-Based FMCW Synthesizer with 0.01% rms Frequency Error Under 3.2GHz Chirp Bandwidth and 320MHz/μs Slope","authors":"Zhengkun Shen, Haoyun Jiang, Fan Yang, Yixiao Wang, Zherui Zhang, Junhua Liu, H. Liao","doi":"10.1109/ISSCC42613.2021.9365949","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365949","url":null,"abstract":"Frequency synthesizers are critical for millimeter-wave (mm-wave) frequency-modulated continuous-wave (FMCW) radars. Large-chirp-bandwidth ($BW_{chirp})$ sawtooth waveforms are required to be synthesized with fast slope and high-frequency linearity for accurate detection of targets or high-quality imaging. Fractional-N phase-locked loops (PLLs) with a two-point-modulation (TPM) scheme are widely used to synthesize fast high-linearity chirps [1] –[3]. However, for a wideband multi-bank digitally controlled oscillator (DCO), the intrinsic $1/ surd Lmathrm{C}$ nonlinearity and the frequency discontinuity from overlaps between adjacent tuning bands introduce a significant gain mismatch between the two modulation paths of the TPM scheme and degrade the chirp linearity [3], [5]. To linearize the DCO tuning curve, a piecewise linear pre-distortion method is commonly used as shown in Fig. 32.5.1 [3]. In this method, the overlaps are mitigated by scaling every band with the same factor SC, which is based on the assumption that ratios of each DCO tuning band to the corresponding coarse frequency step remain the same. In practice, precise matching between these bands cannot be guaranteed. Each tuning band is then linearly fitted with its average gain $g_{i}$, but non-ideal residual frequency errors may still deteriorate the chirp linearity. As a DCO bandwidth increases, the mismatches and residual frequency errors tend to be more severe, making this method unsuitable for wideband FMCW synthesizers.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127519848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
7.9 1/2.74-inch 32Mpixel-Prototype CMOS Image Sensor with 0.64μ m Unit Pixels Separated by Full-Depth Deep-Trench Isolation 7.9 /2.74英寸32mpixel原型CMOS图像传感器,采用全深度深沟隔离,单位像素为0.64μ m
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365751
Jongeun Park, Sungbong Park, K. Cho, Taehun Lee, Changkyu Lee, Donghyun Kim, Beomsuk Lee, Sungin Kim, H. Ji, Dongmo Im, Haeyong Park, Jinyoung Kim, J. Cha, Taehoon Kim, I. Joe, Soojin Hong, Chongkwang Chang, Jingyun Kim, W. Shim, Taehee Kim, Jamie Lee, D. Park, Euiyeol Kim, Howoo Park, Jaekyu Lee, Yitae Kim, JungChak Ahn, Youngki Chung, ChungSam Jun, Hyunchul Kim, Changrok Moon, Ho-Kyu Kang
For years, there has been a strong drive for sub-micron pixel development, in spite of reaching the visible light diffraction limit, because a smaller pixel pitch of CMOS image sensors (CISs) is inevitably required for ever-miniaturizing camera modules as mobile devices incorporate more cameras, few of which are dedicated to ultra-high-resolution zoomed images [1]. To that end, image sensor vendors have tried to find new ways to avoid reduction in sensitivity and more crosstalk in the sensor through pixel architecture change and/or fabrication process refinement [2] –[4]. For example, a $0.7 mu m$ pixel sensor was demonstrated with acceptable photodiode (PD) full-well capacity (FWC) of $gt 6$,000e- as well as signal-to-noise ratio (SNR) of $sim32$ dB without optical/electrical crosstalk by employing state-of-the-art full-depth deep-trench isolations (FDTIs). [4] However, further scaling requires elaborate fabrication innovation and layout ideas. At the same time, meeting every aspect of pixel performance compared to the previous generation becomes even more difficult, e.g., with respect to dark or illuminated characteristics, fixed-pattern or temporal noises, etc. The latter, in particular, is associated with in-pixel source-follower (SF) amplifiers. Therefore, electrical performance of scaled in-pixel transistors cannot be overlooked. In this paper, a 32-megpixel (MP) CIS with $0.64 mu m$ unit pixels is demonstrated with FDTI design. Innovations in terms of fabrication and design to achieve this performance with scaling are discussed.
多年来,尽管达到了可见光衍射极限,但亚微米像素的发展一直受到强烈的推动,因为随着移动设备包含更多的相机,越来越小型化的相机模块不可避免地需要更小的CMOS图像传感器(CISs)的像素间距,其中很少有专门用于超高分辨率缩放图像[1]。为此,图像传感器供应商试图通过改变像素架构和/或改进制造工艺来寻找新的方法来避免传感器中灵敏度降低和更多串扰[2]-[4]。例如,通过采用最先进的全深度深沟隔离(FDTIs), $0.7 mu m$像素传感器具有可接受的光电二极管(PD)全井容量(FWC) $gt 6$,000e,以及$sim32$ dB的信噪比(SNR),没有光/电串扰。[4]然而,进一步的规模化需要精细的制造创新和布局理念。与此同时,与上一代相比,满足像素性能的各个方面变得更加困难,例如,关于黑暗或照明特性,固定模式或时间噪声等。后者尤其与像素内源跟随器(SF)放大器相关。因此,缩放像素内晶体管的电性能不容忽视。本文采用FDTI设计,演示了一个单位像素为$0.64 mu m$的3200万像素CIS。在制造和设计方面的创新,以实现这种性能与缩放进行了讨论。
{"title":"7.9 1/2.74-inch 32Mpixel-Prototype CMOS Image Sensor with 0.64μ m Unit Pixels Separated by Full-Depth Deep-Trench Isolation","authors":"Jongeun Park, Sungbong Park, K. Cho, Taehun Lee, Changkyu Lee, Donghyun Kim, Beomsuk Lee, Sungin Kim, H. Ji, Dongmo Im, Haeyong Park, Jinyoung Kim, J. Cha, Taehoon Kim, I. Joe, Soojin Hong, Chongkwang Chang, Jingyun Kim, W. Shim, Taehee Kim, Jamie Lee, D. Park, Euiyeol Kim, Howoo Park, Jaekyu Lee, Yitae Kim, JungChak Ahn, Youngki Chung, ChungSam Jun, Hyunchul Kim, Changrok Moon, Ho-Kyu Kang","doi":"10.1109/ISSCC42613.2021.9365751","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365751","url":null,"abstract":"For years, there has been a strong drive for sub-micron pixel development, in spite of reaching the visible light diffraction limit, because a smaller pixel pitch of CMOS image sensors (CISs) is inevitably required for ever-miniaturizing camera modules as mobile devices incorporate more cameras, few of which are dedicated to ultra-high-resolution zoomed images [1]. To that end, image sensor vendors have tried to find new ways to avoid reduction in sensitivity and more crosstalk in the sensor through pixel architecture change and/or fabrication process refinement [2] –[4]. For example, a $0.7 mu m$ pixel sensor was demonstrated with acceptable photodiode (PD) full-well capacity (FWC) of $gt 6$,000e- as well as signal-to-noise ratio (SNR) of $sim32$ dB without optical/electrical crosstalk by employing state-of-the-art full-depth deep-trench isolations (FDTIs). [4] However, further scaling requires elaborate fabrication innovation and layout ideas. At the same time, meeting every aspect of pixel performance compared to the previous generation becomes even more difficult, e.g., with respect to dark or illuminated characteristics, fixed-pattern or temporal noises, etc. The latter, in particular, is associated with in-pixel source-follower (SF) amplifiers. Therefore, electrical performance of scaled in-pixel transistors cannot be overlooked. In this paper, a 32-megpixel (MP) CIS with $0.64 mu m$ unit pixels is demonstrated with FDTI design. Innovations in terms of fabrication and design to achieve this performance with scaling are discussed.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129966254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
13.4 A 1GS/s 6-to-8b 0.5mW/Qubit Cryo-CMOS SAR ADC for Quantum Computing in 40nm CMOS 13.4个用于40nm CMOS量子计算的1GS/s 6- 8b 0.5mW/Qubit Cryo-CMOS SAR ADC
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365927
G. Kiene, A. Catania, Ramon W. J. Overwater, P. Bruschi, E. Charbon, M. Babaie, F. Sebastiano
Quantum computers (QCs) promise significant speedup for relevant computational problems that are intractable by classical computers. QCs process information stored in quantum bits (qubits) that must be typically cooled down to cryogenic temperatures. Since state-of-the-art QCs employ only a few qubits, those qubits can be driven and read out by room-temperature electronics connected to the cryogenic qubits by only a few wires. However, practical QCs will require more than thousands of qubits, making this approach impractical due to system complexity and reliability concerns. Although frequency multiplexing would reduce the interconnects to room temperature by fitting many qubit channels in the same physical interconnect, an excessive number of interconnects would still be required. An alternative, more scalable solution is a cryogenic electronic interface operating very close to the quantum processor to keep the whole control loop at cryogenic temperature, hence avoiding any high-speed interconnect to room temperature. This system must comprise drivers, readout circuits (LNAs, ADCs), and a digital controller to steer the quantum-algorithm execution [1]. While cryogenic CMOS (cryo-CMOS) wideband drivers and LNAs supporting qubit frequency multiplexing have been shown before [1] –[3], no wideband cryo-CMOS ADC has been demonstrated yet.
量子计算机(qc)有望显著加快经典计算机难以解决的相关计算问题。qc处理存储在量子比特(量子位)中的信息,这些信息通常必须冷却到低温。由于最先进的量子计算机只使用几个量子位,这些量子位可以通过连接到低温量子位的几根电线的室温电子设备来驱动和读取。然而,实际的qc将需要超过数千个量子位,由于系统复杂性和可靠性问题,这种方法不切实际。虽然频率复用可以通过在同一物理互连中安装许多量子比特通道来将互连降低到室温,但仍然需要过多的互连。另一种更具可扩展性的解决方案是一个低温电子接口,它非常靠近量子处理器,使整个控制回路保持在低温下,从而避免任何高速互连到室温。该系统必须包括驱动器、读出电路(lna、adc)和一个数字控制器来引导量子算法的执行[1]。虽然低温CMOS (cryo-CMOS)宽带驱动器和支持量子比特频率复用的LNAs之前已经被展示过[1]-[3],但还没有宽带cryo-CMOS ADC被展示过。
{"title":"13.4 A 1GS/s 6-to-8b 0.5mW/Qubit Cryo-CMOS SAR ADC for Quantum Computing in 40nm CMOS","authors":"G. Kiene, A. Catania, Ramon W. J. Overwater, P. Bruschi, E. Charbon, M. Babaie, F. Sebastiano","doi":"10.1109/ISSCC42613.2021.9365927","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365927","url":null,"abstract":"Quantum computers (QCs) promise significant speedup for relevant computational problems that are intractable by classical computers. QCs process information stored in quantum bits (qubits) that must be typically cooled down to cryogenic temperatures. Since state-of-the-art QCs employ only a few qubits, those qubits can be driven and read out by room-temperature electronics connected to the cryogenic qubits by only a few wires. However, practical QCs will require more than thousands of qubits, making this approach impractical due to system complexity and reliability concerns. Although frequency multiplexing would reduce the interconnects to room temperature by fitting many qubit channels in the same physical interconnect, an excessive number of interconnects would still be required. An alternative, more scalable solution is a cryogenic electronic interface operating very close to the quantum processor to keep the whole control loop at cryogenic temperature, hence avoiding any high-speed interconnect to room temperature. This system must comprise drivers, readout circuits (LNAs, ADCs), and a digital controller to steer the quantum-algorithm execution [1]. While cryogenic CMOS (cryo-CMOS) wideband drivers and LNAs supporting qubit frequency multiplexing have been shown before [1] –[3], no wideband cryo-CMOS ADC has been demonstrated yet.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130997458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
11.6 A 100Gb/s-8.3dBm-Sensitivity PAM-4 Optical Receiver with Integrated TIA, FFE and Direct-Feedback DFE in 28nm CMOS 11.6 100Gb/s-8.3 dbm灵敏度PAM-4光接收机,集成TIA、FFE和直接反馈DFE,采用28nm CMOS
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9365802
Hao Li, J. Sharma, Chun-Ming Hsu, G. Balamurugan, J. Jaussi
Several 400G Ethernet standards (e.g. 400G-DR4/FR4) have been developed to address the rapid increase in interconnect BW demand created by data-centric computing [1]. Low-cost100Gb/s PAM-4 optical transceivers are critical to spur their adoption in high volume by data centers. While low-cost integrated silicon-photonic 100Gb/s PAM-4 transmitters have been demonstrated recently, the electronics in current receiver solutions is more disaggregated. They typically employ a standalone BiCMOS TIA 1C followed by a 100G PAM-4 (ADC+DSP)-based SerDes 1C (designed to equalize high-loss electrical channels), which results in higher power dissipation and package cost. To address these drawbacks, we present a 100Gb/s PAM-4 optical RX with a single-chip Solution integrating all 0f the RX electronics in a bulk CMOS process. While standalone l00Gb/s PAM-4 CMOS linear TIAs have been shown in prior work [2], [3], their integration with subsequent SerDes has not yet been demonstrated.
已经开发了几种400G以太网标准(例如400G- dr4 /FR4),以解决以数据为中心的计算所产生的互连BW需求的快速增长[1]。低成本的100gb /s PAM-4光收发器对于促进数据中心大量采用它们至关重要。虽然最近已经展示了低成本集成硅光子100Gb/s PAM-4发射机,但目前接收器解决方案中的电子器件更加分散。它们通常采用独立的BiCMOS TIA 1C,然后是基于100G PAM-4 (ADC+DSP)的SerDes 1C(旨在均衡高损耗电通道),这导致更高的功耗和封装成本。为了解决这些缺点,我们提出了一种100Gb/s PAM-4光学RX,其单芯片解决方案将所有RX电子器件集成在批量CMOS工艺中。虽然独立的l00Gb/s PAM-4 CMOS线性TIAs已经在先前的工作中得到展示[2],[3],但它们与后续SerDes的集成尚未得到证明。
{"title":"11.6 A 100Gb/s-8.3dBm-Sensitivity PAM-4 Optical Receiver with Integrated TIA, FFE and Direct-Feedback DFE in 28nm CMOS","authors":"Hao Li, J. Sharma, Chun-Ming Hsu, G. Balamurugan, J. Jaussi","doi":"10.1109/ISSCC42613.2021.9365802","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9365802","url":null,"abstract":"Several 400G Ethernet standards (e.g. 400G-DR4/FR4) have been developed to address the rapid increase in interconnect BW demand created by data-centric computing [1]. Low-cost100Gb/s PAM-4 optical transceivers are critical to spur their adoption in high volume by data centers. While low-cost integrated silicon-photonic 100Gb/s PAM-4 transmitters have been demonstrated recently, the electronics in current receiver solutions is more disaggregated. They typically employ a standalone BiCMOS TIA 1C followed by a 100G PAM-4 (ADC+DSP)-based SerDes 1C (designed to equalize high-loss electrical channels), which results in higher power dissipation and package cost. To address these drawbacks, we present a 100Gb/s PAM-4 optical RX with a single-chip Solution integrating all 0f the RX electronics in a bulk CMOS process. While standalone l00Gb/s PAM-4 CMOS linear TIAs have been shown in prior work [2], [3], their integration with subsequent SerDes has not yet been demonstrated.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130436903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
4.7 A 91mW 90fps Super-Resolution Processor for Full HD Images 4.7一个91mW 90fps的全高清图像超分辨率处理器
Pub Date : 2021-02-13 DOI: 10.1109/ISSCC42613.2021.9366026
Hsueh-Yen Shen, Yu-Chi Lee, Tzu-Wei Tong, Chia-Hsiang Yang
Super resolution is the process of reconstructing a high-resolution (HR) image from a low-resolution (LR) one. Super-resolution technology enables high-resolution video streaming, image zoom-in, and far object recognition. Fig. 4.7.1 shows such an application scenario. The details of the videos/images can be reconstructed and projected to a higher-resolution screen, thereby providing a better visual experience. A hardware accelerator is needed to speed up the super-resolution process to support real-time high-resolution video streaming. Conventionally, dictionary-based approaches, such as ANR/GR [1] and A+ [2], convert the LR image into the HR one from learned mapping functions. Neural network (NN)-based algorithms generate better-quality super-resolution images by extracting features from training [3]. However, the complexity of the dictionary-based and the NN-based algorithms is excessively high, making them unsuitable for high-speed applications [4]. A rapid and accurate image super resolution (RAISR) algorithm [4] is proposed to achieve comparable quality with a much faster processing speed when compared to the previous solutions. It employs pre-learned filters to enhance the image quality based on bicubic interpolation. A pre-learned filter (also known as kernel) is selected by a hash function to address the structure-related details.
超分辨率是指从低分辨率图像重建高分辨率图像的过程。超分辨率技术可实现高分辨率视频流、图像放大和远距离物体识别。该应用场景如图4.7.1所示。视频/图像的细节可以重建并投射到更高分辨率的屏幕上,从而提供更好的视觉体验。为了支持实时高分辨率视频流,需要一个硬件加速器来加速超分辨率过程。通常,基于字典的方法,如ANR/GR[1]和A+[2],将LR图像从学习到的映射函数转换为HR图像。基于神经网络(NN)的算法通过从训练[3]中提取特征来生成质量更好的超分辨率图像。然而,基于字典和基于神经网络的算法的复杂性过高,不适合高速应用[4]。提出了一种快速准确的图像超分辨率(RAISR)算法[4],与以往的解决方案相比,以更快的处理速度获得相当的质量。它采用基于双三次插值的预学习滤波器来提高图像质量。通过散列函数选择预学习过滤器(也称为内核)来处理与结构相关的细节。
{"title":"4.7 A 91mW 90fps Super-Resolution Processor for Full HD Images","authors":"Hsueh-Yen Shen, Yu-Chi Lee, Tzu-Wei Tong, Chia-Hsiang Yang","doi":"10.1109/ISSCC42613.2021.9366026","DOIUrl":"https://doi.org/10.1109/ISSCC42613.2021.9366026","url":null,"abstract":"Super resolution is the process of reconstructing a high-resolution (HR) image from a low-resolution (LR) one. Super-resolution technology enables high-resolution video streaming, image zoom-in, and far object recognition. Fig. 4.7.1 shows such an application scenario. The details of the videos/images can be reconstructed and projected to a higher-resolution screen, thereby providing a better visual experience. A hardware accelerator is needed to speed up the super-resolution process to support real-time high-resolution video streaming. Conventionally, dictionary-based approaches, such as ANR/GR [1] and A+ [2], convert the LR image into the HR one from learned mapping functions. Neural network (NN)-based algorithms generate better-quality super-resolution images by extracting features from training [3]. However, the complexity of the dictionary-based and the NN-based algorithms is excessively high, making them unsuitable for high-speed applications [4]. A rapid and accurate image super resolution (RAISR) algorithm [4] is proposed to achieve comparable quality with a much faster processing speed when compared to the previous solutions. It employs pre-learned filters to enhance the image quality based on bicubic interpolation. A pre-learned filter (also known as kernel) is selected by a hash function to address the structure-related details.","PeriodicalId":371093,"journal":{"name":"2021 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127840027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2021 IEEE International Solid- State Circuits Conference (ISSCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1