首页 > 最新文献

IEEE Solid-State Circuits Letters最新文献

英文 中文
A RISC-V SoC With Reconfigurable Custom Instructions on a Synthesized eFPGA Fabric in 22nm FinFET 在22nm FinFET合成eFPGA结构上具有可重构自定义指令的RISC-V SoC
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-07 DOI: 10.1109/LSSC.2025.3618942
Prashanth Mohan;Siddharth Das;Ken Mai
This letter presents a flexible and energy-efficient RISC-V system-on-chip (SoC) in 22nm FinFET technology, achieving state-of-the-art performance by tightly integrating the CPU with a synthesized embedded FPGA (embedded field programmable gate array (eFPGA)), enabling the implementation of reconfigurable custom instructions. The tight integration of the eFPGA with SoC scratchpad memory facilitates parallel high-bandwidth (16 GB/s) access to scratchpad SRAMs and enables rapid eFPGA reconfiguration ( $2~mu $ s) to switch between custom instruction sets. The eFPGA fabric itself is optimized for compute-intensive tasks, featuring fused logic tiles that amortize routing overheads to achieve a compute density of 22.3 GOPS/mm2. The measurement results from the fabricated chip demonstrate a peak energy efficiency of 748 GOPS/W (INT8) while improving throughput and energy by 1-2 orders of magnitude compared to the CPU for accelerated applications.
本文介绍了一种灵活节能的RISC-V片上系统(SoC),采用22nm FinFET技术,通过将CPU与合成嵌入式FPGA(嵌入式现场可编程门阵列(eFPGA))紧密集成,实现了最先进的性能,实现了可重构定制指令的实现。eFPGA与SoC刮刮板存储器的紧密集成促进了对刮刮板sram的并行高带宽(16 GB/s)访问,并实现了快速eFPGA重新配置($2~mu $ s)以在自定义指令集之间切换。eFPGA结构本身针对计算密集型任务进行了优化,具有融合的逻辑块,可分摊路由开销,实现22.3 GOPS/mm2的计算密度。制造芯片的测量结果表明,峰值能量效率为748 GOPS/W (INT8),同时与加速应用的CPU相比,吞吐量和能量提高了1-2个数量级。
{"title":"A RISC-V SoC With Reconfigurable Custom Instructions on a Synthesized eFPGA Fabric in 22nm FinFET","authors":"Prashanth Mohan;Siddharth Das;Ken Mai","doi":"10.1109/LSSC.2025.3618942","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3618942","url":null,"abstract":"This letter presents a flexible and energy-efficient RISC-V system-on-chip (SoC) in 22nm FinFET technology, achieving state-of-the-art performance by tightly integrating the CPU with a synthesized embedded FPGA (embedded field programmable gate array (eFPGA)), enabling the implementation of reconfigurable custom instructions. The tight integration of the eFPGA with SoC scratchpad memory facilitates parallel high-bandwidth (16 GB/s) access to scratchpad SRAMs and enables rapid eFPGA reconfiguration (<inline-formula> <tex-math>$2~mu $ </tex-math></inline-formula>s) to switch between custom instruction sets. The eFPGA fabric itself is optimized for compute-intensive tasks, featuring fused logic tiles that amortize routing overheads to achieve a compute density of 22.3 GOPS/mm2. The measurement results from the fabricated chip demonstrate a peak energy efficiency of 748 GOPS/W (INT8) while improving throughput and energy by 1-2 orders of magnitude compared to the CPU for accelerated applications.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"321-324"},"PeriodicalIF":2.0,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Benchmark of Cryo-CMOS Dynamic Comparators in a 40 nm Bulk CMOS Technology Cryo-CMOS动态比较器在40纳米体CMOS技术中的基准测试
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-06 DOI: 10.1109/LSSC.2025.3618161
Bram Veraverbeke;Filip Tavernier
All cryo-CMOS quantum-classical control interfaces require an analog-to-digital converter (ADC) bridging the analog qubits and the digital control logic. Dynamic comparators play a crucial role in the precision, speed, and power consumption of these ADCs. Yet, their performance is severely impacted by the cryogenic environment. Therefore, this letter benchmarks, for the first time in the literature, four comparator topologies at room temperature (RT) and 6K. Their noise, delay, and energy consumption are characterized, allowing the identification of the best topology for a given application. This analysis shows that the strongARM (SA) comparator is the most energy efficient, closely followed by the double tail comparator. However, the reduced voltage headroom at 6K almost doubles the SA’s delay compared to RT and leaves it susceptible to common-mode and supply voltage variations. In contrast, the recently proposed triple tail comparator with capacitive over-neutralization limits the delay increase to only 13ps by separating the preamplifier and the latch. Furthermore, its boosted preamplification gain makes it notably more resilient to voltage variations, ensuring a highly robust cryogenic operation favorable for scaled technologies.
所有cryo-CMOS量子经典控制接口都需要模数转换器(ADC)桥接模拟量子位和数字控制逻辑。动态比较器在这些adc的精度、速度和功耗方面起着至关重要的作用。然而,低温环境严重影响了它们的性能。因此,本文在文献中首次对室温(RT)和6K下的四种比较器拓扑进行了基准测试。对它们的噪声、延迟和能耗进行了表征,从而可以确定给定应用程序的最佳拓扑。分析表明,强arm (SA)比较器是最节能的,其次是双尾比较器。然而,与RT相比,6K时降低的电压净空几乎使SA的延迟增加了一倍,并且使其容易受到共模和电源电压变化的影响。相比之下,最近提出的带有电容过中和的三尾比较器通过分离前置放大器和锁存器,将延迟增加限制在仅13ps。此外,其增强的预放大增益使其对电压变化的适应能力更强,确保了高度稳健的低温操作,有利于规模化技术。
{"title":"A Benchmark of Cryo-CMOS Dynamic Comparators in a 40 nm Bulk CMOS Technology","authors":"Bram Veraverbeke;Filip Tavernier","doi":"10.1109/LSSC.2025.3618161","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3618161","url":null,"abstract":"All cryo-CMOS quantum-classical control interfaces require an analog-to-digital converter (ADC) bridging the analog qubits and the digital control logic. Dynamic comparators play a crucial role in the precision, speed, and power consumption of these ADCs. Yet, their performance is severely impacted by the cryogenic environment. Therefore, this letter benchmarks, for the first time in the literature, four comparator topologies at room temperature (RT) and 6K. Their noise, delay, and energy consumption are characterized, allowing the identification of the best topology for a given application. This analysis shows that the strongARM (SA) comparator is the most energy efficient, closely followed by the double tail comparator. However, the reduced voltage headroom at 6K almost doubles the SA’s delay compared to RT and leaves it susceptible to common-mode and supply voltage variations. In contrast, the recently proposed triple tail comparator with capacitive over-neutralization limits the delay increase to only 13ps by separating the preamplifier and the latch. Furthermore, its boosted preamplification gain makes it notably more resilient to voltage variations, ensuring a highly robust cryogenic operation favorable for scaled technologies.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"317-320"},"PeriodicalIF":2.0,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Entropy Analog-Based Strong Physical Unclonable Function With Area-to-Entropy-ratio of 166 F2/bit 基于高熵类比的强物理不可克隆函数,面积熵比为166f2 /bit
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-30 DOI: 10.1109/LSSC.2025.3616263
Alessandro Catania;Sebastiano Strangio;Maksym Paliy;Christian Sbrana;Michele Bertozzi;Giuseppe Iannaccone
In this letter, we present a high-entropy strong physically unclonable function (PUF) utilizing weak-inversion current mirrors implemented in a standard 65-nm CMOS technology. Each response bit of the proposed PUF relies on the threshold voltage differences of minimum-sized transistors arranged in a $32times 32$ matrix. The analog operating principle enables encoding at least three effective bits per transistor pair, significantly improving entropy density. Leveraging a bit-masking technique, the design achieves remarkable robustness, attaining a bit error rate (BER) as low as 0.22% even under substantial supply voltage and temperature variations, with less than 10% discarded bits. The presented architecture exhibits a record area-to-entropy ratio of $166~rm {F^{2}}$ /bit, confirming its suitability for highly secure, compact applications in hardware security.
在这封信中,我们提出了一个高熵强物理不可克隆函数(PUF),利用在标准65纳米CMOS技术中实现的弱反转电流镜。所提出的PUF的每个响应位依赖于排列在32 × 32矩阵中的最小尺寸晶体管的阈值电压差。模拟操作原理使每个晶体管对至少编码三个有效位,显著提高熵密度。利用位掩蔽技术,该设计实现了出色的鲁棒性,即使在电源电压和温度变化很大的情况下,误码率(BER)也低至0.22%,丢弃的比特少于10%。所提出的架构显示了$166~rm {F^{2}}$ /bit的创纪录的面积熵比,证实了它适合于高度安全,紧凑的硬件安全应用。
{"title":"High-Entropy Analog-Based Strong Physical Unclonable Function With Area-to-Entropy-ratio of 166 F2/bit","authors":"Alessandro Catania;Sebastiano Strangio;Maksym Paliy;Christian Sbrana;Michele Bertozzi;Giuseppe Iannaccone","doi":"10.1109/LSSC.2025.3616263","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3616263","url":null,"abstract":"In this letter, we present a high-entropy strong physically unclonable function (PUF) utilizing weak-inversion current mirrors implemented in a standard 65-nm CMOS technology. Each response bit of the proposed PUF relies on the threshold voltage differences of minimum-sized transistors arranged in a <inline-formula> <tex-math>$32times 32$ </tex-math></inline-formula> matrix. The analog operating principle enables encoding at least three effective bits per transistor pair, significantly improving entropy density. Leveraging a bit-masking technique, the design achieves remarkable robustness, attaining a bit error rate (BER) as low as 0.22% even under substantial supply voltage and temperature variations, with less than 10% discarded bits. The presented architecture exhibits a record area-to-entropy ratio of <inline-formula> <tex-math>$166~rm {F^{2}}$ </tex-math></inline-formula>/bit, confirming its suitability for highly secure, compact applications in hardware security.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"309-312"},"PeriodicalIF":2.0,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145255900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-Junction Monolithically Integrated Monitoring Photodiode With a Two-Stage 18 GHz 18 pA/√Hz TIA in 22-nm FDSOI 采用两级18ghz 18pa /√Hz TIA的22nm FDSOI双结单片集成监测光电二极管
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-29 DOI: 10.1109/LSSC.2025.3615656
Jelle H. T. Bakker;Nimit Jain;Paul Klatser;Mark S. Oude Alink;Bram Nauta
We present a monolithically integrated (MI) dualjunction monitoring photodiode (PD) and transimpedance amplifier (TIA). The photocurrent originates from the deep Nwell (DNW)/P-type substrate (PSUB) $({lt }5~ mathrm {GHz})$ and the P-Well $(mathrm {PW}) / mathrm {DNW}({gt }1~ mathrm {GHz})$ junctions. The presented combination of bulk PD and 22 nm fully-depleted silicon-on-insulator (FDSOI) TIA (18-GHz bandwidth (BW), $17.8 ~mathrm {pA} / sqrt {mathrm {Hz}}$ noise level) advances the state-of-the-art in MI high-speed optical monitoring and reduces the inherent tradeoff in MI solutions regarding PD (responsivity & BW) and RF circuitry $(f_{t})$ performance.
我们提出了一种单片集成(MI)双结监测光电二极管(PD)和跨阻放大器(TIA)。光电流来源于深Nwell (DNW)/ p型衬底(PSUB) $({lt}5~ mathrm {GHz})$和P-Well $(mathrm {PW}) / mathrm {DNW}({gt}1~ mathrm {GHz})$结。所提出的体PD和22 nm完全耗尽绝缘体上硅(FDSOI) TIA (18 ghz带宽(BW), 17.8 ~ mathm {pA} / sqrt { mathm {Hz}}$噪声级)的组合,推进了MI高速光学监测的最新技术,并减少了MI解决方案中PD(响应性和BW)和RF电路(f_{t})$性能的固有权衡。
{"title":"Dual-Junction Monolithically Integrated Monitoring Photodiode With a Two-Stage 18 GHz 18 pA/√Hz TIA in 22-nm FDSOI","authors":"Jelle H. T. Bakker;Nimit Jain;Paul Klatser;Mark S. Oude Alink;Bram Nauta","doi":"10.1109/LSSC.2025.3615656","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3615656","url":null,"abstract":"We present a monolithically integrated (MI) dualjunction monitoring photodiode (PD) and transimpedance amplifier (TIA). The photocurrent originates from the deep Nwell (DNW)/P-type substrate (PSUB) <inline-formula> <tex-math>$({lt }5~ mathrm {GHz})$ </tex-math></inline-formula> and the P-Well <inline-formula> <tex-math>$(mathrm {PW}) / mathrm {DNW}({gt }1~ mathrm {GHz})$ </tex-math></inline-formula> junctions. The presented combination of bulk PD and 22 nm fully-depleted silicon-on-insulator (FDSOI) TIA (18-GHz bandwidth (BW), <inline-formula> <tex-math>$17.8 ~mathrm {pA} / sqrt {mathrm {Hz}}$ </tex-math></inline-formula> noise level) advances the state-of-the-art in MI high-speed optical monitoring and reduces the inherent tradeoff in MI solutions regarding PD (responsivity & BW) and RF circuitry <inline-formula> <tex-math>$(f_{t})$ </tex-math></inline-formula> performance.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"313-316"},"PeriodicalIF":2.0,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An 8.5 MHz 42 ppm/°C Relaxation Oscillator With Charge-Pump Delay Cancellation and Digital Chopping Demodulation 8.5 MHz 42 ppm/°C弛豫振荡器与电荷泵延迟抵消和数字斩波解调
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-29 DOI: 10.1109/LSSC.2025.3615833
Yongjia Li;Jianlin Xia;Feng Cheng;Yifan Cao;Jin Wu;Encheng Zhu;Xiaofeng Sun;Dejin Wang;Long Zhang;Zhongyuan Fang;Weifeng Sun
This letter presents an RC oscillator featuring a mixed-signal compensation loop that simultaneously mitigates comparator offset, loop delay, switch on-resistance, and temperature dependency. The oscillator employs an auxiliary comparator, a charge pump, and a differential difference amplifier (DDA)-based main comparator to suppress ramping voltage overshoots caused by device and loop imperfections. Moreover, the auxiliary comparator employs chopper and digital demodulation techniques to suppress its own offset, further enhancing the accuracy of the calibration loop. By generating both ramping and reference voltages, the proportional-to-absolute-temperature (PTAT) core ensures that the temperature coefficient (TC) of the oscillation frequency primarily depends on passive RC components. Fabricated in a $0.18~mu $ m BCD process, the design achieves an average frequency TC of 42 ppm/°C from −20 °C to 125 °C across ten samples. Benefiting from the proposed architecture, the oscillator operates at 8.45 MHz with a fast startup calibration time of $50~mu $ s.
这封信提出了一个RC振荡器具有混合信号补偿回路,同时减轻比较器偏移,环路延迟,开关导通电阻和温度依赖性。该振荡器采用一个辅助比较器、一个电荷泵和一个基于差分放大器(DDA)的主比较器来抑制由器件和回路缺陷引起的斜坡电压过调。此外,辅助比较器采用斩波和数字解调技术来抑制其自身的偏移,进一步提高了校准环路的精度。通过产生斜坡电压和参考电压,绝对温度比例(PTAT)核心确保振荡频率的温度系数(TC)主要取决于无源RC元件。在$0.18~mu $ m的BCD工艺中制造,该设计在10个样品中实现了从- 20°C到125°C的平均频率TC为42 ppm/°C。得益于所提出的架构,振荡器工作频率为8.45 MHz,启动校准时间为$50~mu $ s。
{"title":"An 8.5 MHz 42 ppm/°C Relaxation Oscillator With Charge-Pump Delay Cancellation and Digital Chopping Demodulation","authors":"Yongjia Li;Jianlin Xia;Feng Cheng;Yifan Cao;Jin Wu;Encheng Zhu;Xiaofeng Sun;Dejin Wang;Long Zhang;Zhongyuan Fang;Weifeng Sun","doi":"10.1109/LSSC.2025.3615833","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3615833","url":null,"abstract":"This letter presents an RC oscillator featuring a mixed-signal compensation loop that simultaneously mitigates comparator offset, loop delay, switch on-resistance, and temperature dependency. The oscillator employs an auxiliary comparator, a charge pump, and a differential difference amplifier (DDA)-based main comparator to suppress ramping voltage overshoots caused by device and loop imperfections. Moreover, the auxiliary comparator employs chopper and digital demodulation techniques to suppress its own offset, further enhancing the accuracy of the calibration loop. By generating both ramping and reference voltages, the proportional-to-absolute-temperature (PTAT) core ensures that the temperature coefficient (TC) of the oscillation frequency primarily depends on passive RC components. Fabricated in a <inline-formula> <tex-math>$0.18~mu $ </tex-math></inline-formula>m BCD process, the design achieves an average frequency TC of 42 ppm/°C from −20 °C to 125 °C across ten samples. Benefiting from the proposed architecture, the oscillator operates at 8.45 MHz with a fast startup calibration time of <inline-formula> <tex-math>$50~mu $ </tex-math></inline-formula>s.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"333-336"},"PeriodicalIF":2.0,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Compact Current-Reusing 6-mW 66–92 GHz Frequency Quadrupler With 5% Peak Power Added Efficiency and >36 dBc Harmonic Rejection in 22-nm FDSOI CMOS 一个紧凑的电流复用6mw 66 - 92ghz频率四倍器,峰值功率增加效率5%,谐波抑制> 36dbc,采用22nm FDSOI CMOS
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-24 DOI: 10.1109/LSSC.2025.3614381
Shankkar Balasubramanian;Kristof Vaesen;Piet Wambacq;Carsten Wulff
This letter presents a frequency quadrupler with 32% fractional bandwidth (66–92 GHz) and 5% peak power-added efficiency (PAE), capable of operating with an input power of 0 dBm. The quadrupler consisting of two cascaded frequency doublers uses a multiport driven push-push complementary architecture for the first stage to generate differential signals for the second doubler with high fundamental harmonic rejection. The second doubler based on the nMOS-based push-push architecture uses gain enhancement to achieve a maximum conversion gain of –4 dB for the quadrupler. The quadrupler with an output saturation power (P ${}_{text {sat}}$ ) of –2.6 dBm achieves first- to third-harmonic rejections of more than 36 dBc across the 3-dB bandwidth. The compact quadrupler has a core area of 0.09 mm2, while consuming a DC power of 6.2 mW from a 0.8 V supply with an input power of 0 dBm at 20 GHz.
这封信介绍了一种频率四倍器,具有32%的分数带宽(66-92 GHz)和5%的峰值功率附加效率(PAE),能够在0 dBm的输入功率下工作。由两个级联倍频器组成的四倍频器采用多端口驱动的推-推互补结构作为第一级,为具有高基频抑制的第二倍频器产生差分信号。第二个倍频器基于基于nmos的推推式架构,使用增益增强来实现四倍频器的最大转换增益-4 dB。输出饱和功率(P ${}_{text {sat}}$)为- 2.6 dBm的四倍器在3db带宽上实现了超过36 dBc的一到三次谐波抑制。紧凑型四倍器的核心面积为0.09 mm2,在20 GHz时,从0.8 V电源和0 dBm输入功率消耗6.2 mW的直流功率。
{"title":"A Compact Current-Reusing 6-mW 66–92 GHz Frequency Quadrupler With 5% Peak Power Added Efficiency and >36 dBc Harmonic Rejection in 22-nm FDSOI CMOS","authors":"Shankkar Balasubramanian;Kristof Vaesen;Piet Wambacq;Carsten Wulff","doi":"10.1109/LSSC.2025.3614381","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3614381","url":null,"abstract":"This letter presents a frequency quadrupler with 32% fractional bandwidth (66–92 GHz) and 5% peak power-added efficiency (PAE), capable of operating with an input power of 0 dBm. The quadrupler consisting of two cascaded frequency doublers uses a multiport driven push-push complementary architecture for the first stage to generate differential signals for the second doubler with high fundamental harmonic rejection. The second doubler based on the nMOS-based push-push architecture uses gain enhancement to achieve a maximum conversion gain of –4 dB for the quadrupler. The quadrupler with an output saturation power (P<inline-formula> <tex-math>${}_{text {sat}}$ </tex-math></inline-formula>) of –2.6 dBm achieves first- to third-harmonic rejections of more than 36 dBc across the 3-dB bandwidth. The compact quadrupler has a core area of 0.09 mm2, while consuming a DC power of 6.2 mW from a 0.8 V supply with an input power of 0 dBm at 20 GHz.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"301-304"},"PeriodicalIF":2.0,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11178244","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145255858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Battery-Free BLE Backscatter Communication Chip for Wearable Systems 一种用于可穿戴系统的无电池BLE反向散射通信芯片
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-22 DOI: 10.1109/LSSC.2025.3612423
Yongling Zhang;Ji Xiong;Junzai Chen;Xiaoyu Li;Jinrui Zuo;Yan Wang;Xiaoyi Wang;Miao Meng
This letter presents a backscatter chip that features bidirectional communication with commodity bluetooth low-energy (BLE) transceivers. For uplink, the chip reflects a reverse-whitened BLE tone into single-sideband (SSB) GFSK-modulated BLE packets via a proposed replica VCO-based GFSK modulator and an inductor-free SSB reflector. For downlink, the BLE packets are frequency down-converted passively by utilizing a reverse-whitened BLE tone followed by a proposed self-calibrated GFSK demodulator to recover the downlink data at ultralow power with improved robustness. A dual-linearly polarized microstrip patch antenna (DPMPA) is integrated to enable concurrent RF energy harvesting and communication in a wearable form factor. Implemented in 65-nm CMOS, the chip consumes $1.4~mu $ W for downlink and $15.8~mu $ W for uplink. Wireless tests demonstrated a 50 cm downlink and >3 m uplink ranges at 20 dBm EIRP.
这封信介绍了一种反向散射芯片,具有与商品蓝牙低功耗(BLE)收发器双向通信的特点。对于上行链路,该芯片通过提出的基于复制vco的GFSK调制器和无电感的SSB反射器将反向白化的BLE音调反射到单边带(SSB) GFSK调制的BLE数据包中。对于下行链路,通过使用反向白化BLE音调,然后使用拟议的自校准GFSK解调器,被动地进行频率降转换,以超低功耗恢复下行链路数据,并提高鲁棒性。集成了双线极化微带贴片天线(DPMPA),以实现可穿戴形式的并发射频能量收集和通信。该芯片采用65nm CMOS实现,下行功耗为1.4~mu $ W,上行功耗为15.8~mu $ W。无线测试表明,在20 dBm EIRP下,下行链路为50 cm,上行链路为bbb30 m。
{"title":"A Battery-Free BLE Backscatter Communication Chip for Wearable Systems","authors":"Yongling Zhang;Ji Xiong;Junzai Chen;Xiaoyu Li;Jinrui Zuo;Yan Wang;Xiaoyi Wang;Miao Meng","doi":"10.1109/LSSC.2025.3612423","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3612423","url":null,"abstract":"This letter presents a backscatter chip that features bidirectional communication with commodity bluetooth low-energy (BLE) transceivers. For uplink, the chip reflects a reverse-whitened BLE tone into single-sideband (SSB) GFSK-modulated BLE packets via a proposed replica VCO-based GFSK modulator and an inductor-free SSB reflector. For downlink, the BLE packets are frequency down-converted passively by utilizing a reverse-whitened BLE tone followed by a proposed self-calibrated GFSK demodulator to recover the downlink data at ultralow power with improved robustness. A dual-linearly polarized microstrip patch antenna (DPMPA) is integrated to enable concurrent RF energy harvesting and communication in a wearable form factor. Implemented in 65-nm CMOS, the chip consumes <inline-formula> <tex-math>$1.4~mu $ </tex-math></inline-formula>W for downlink and <inline-formula> <tex-math>$15.8~mu $ </tex-math></inline-formula>W for uplink. Wireless tests demonstrated a 50 cm downlink and >3 m uplink ranges at 20 dBm EIRP.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"297-300"},"PeriodicalIF":2.0,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145223688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Opal: A 16-nm Coarse-Grained Reconfigurable Array SoC for Full Sparse Machine Learning Applications Opal:用于全稀疏机器学习应用的16nm粗粒度可重构阵列SoC
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-22 DOI: 10.1109/LSSC.2025.3613245
Po-Han Chen;Bo Wun Cheng;Michael Oduoza;Zhouhua Xie;Rupert Lu;Sai Gautham Ravipati;Kalhan Koul;Alex Carsello;Yuchen Mei;Mark Horowitz;Priyanka Raina
Sparsity has recently attracted increased attention in the machine learning (ML) community due to its potential to improve performance and energy efficiency by eliminating ineffectual computations. As ML models evolve rapidly, reconfigurable architectures, such as coarse-grained reconfigurable arrays (CGRAs), are being explored to adapt to and accelerate emerging models. Previous CGRA designs have supported unstructured sparsity and reported promising speedups and energy savings for compute-intensive kernels. However, these approaches still face performance bottlenecks when accelerating entire sparse ML networks. In this letter, we identify the primary sources of inefficiency in prior CGRA-based approaches and present Opal, a CGRA SoC with three key contributions: 1) flexible dataflow architecture supporting Gustavson’s dataflow for sparse matrix multiplication; 2) high-throughput sparse hardware primitives; and 3) enhanced processing elements to support mapping all ML operations on the CGRA. As a result, Opal achieves a 66% to 79% reduction in runtime and energy consumption across our evaluated sparse graph neural network benchmarks compared to prior CGRA solutions which only target kernel acceleration.
稀疏性最近在机器学习(ML)社区引起了越来越多的关注,因为它有可能通过消除无效的计算来提高性能和能源效率。随着机器学习模型的快速发展,人们正在探索诸如粗粒度可重构阵列(CGRAs)之类的可重构架构,以适应和加速新兴模型。以前的CGRA设计支持非结构化稀疏性,并报告了对计算密集型内核有希望的加速和节能。然而,这些方法在加速整个稀疏ML网络时仍然面临性能瓶颈。在这封信中,我们确定了先前基于CGRA的方法效率低下的主要来源,并提出了Opal,一种具有三个关键贡献的CGRA SoC: 1)支持稀疏矩阵乘法的Gustavson数据流的灵活数据流架构;2)高吞吐量稀疏硬件原语;3)增强的处理元素,以支持在CGRA上映射所有ML操作。因此,在我们评估的稀疏图神经网络基准测试中,与之前只针对内核加速的CGRA解决方案相比,Opal在运行时间和能耗方面减少了66%到79%。
{"title":"Opal: A 16-nm Coarse-Grained Reconfigurable Array SoC for Full Sparse Machine Learning Applications","authors":"Po-Han Chen;Bo Wun Cheng;Michael Oduoza;Zhouhua Xie;Rupert Lu;Sai Gautham Ravipati;Kalhan Koul;Alex Carsello;Yuchen Mei;Mark Horowitz;Priyanka Raina","doi":"10.1109/LSSC.2025.3613245","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3613245","url":null,"abstract":"Sparsity has recently attracted increased attention in the machine learning (ML) community due to its potential to improve performance and energy efficiency by eliminating ineffectual computations. As ML models evolve rapidly, reconfigurable architectures, such as coarse-grained reconfigurable arrays (CGRAs), are being explored to adapt to and accelerate emerging models. Previous CGRA designs have supported unstructured sparsity and reported promising speedups and energy savings for compute-intensive kernels. However, these approaches still face performance bottlenecks when accelerating entire sparse ML networks. In this letter, we identify the primary sources of inefficiency in prior CGRA-based approaches and present Opal, a CGRA SoC with three key contributions: 1) flexible dataflow architecture supporting Gustavson’s dataflow for sparse matrix multiplication; 2) high-throughput sparse hardware primitives; and 3) enhanced processing elements to support mapping all ML operations on the CGRA. As a result, Opal achieves a 66% to 79% reduction in runtime and energy consumption across our evaluated sparse graph neural network benchmarks compared to prior CGRA solutions which only target kernel acceleration.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"293-296"},"PeriodicalIF":2.0,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145223703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fully Dynamic Capacitive Touch Sensor With Tri-Level Energy Recycling and Compressive Sensing Technique 基于三能级能量回收和压缩传感技术的全动态电容式触摸传感器
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-19 DOI: 10.1109/LSSC.2025.3612093
Xiangdong Feng;Zhiyu Wang;Haoyang Li;Jiaqing Li;Wei-Chin Lin;Xin Hu;Zhong Tang;Yuyan Liu;Qinwen Fan;Yuxuan Luo;Bo Zhao
Capacitive touch screens have become the dominant user interface over the past decade. Achieving high framerates with low power consumption remains a critical design goal for touch systems. The conventional charge-recycling technique reduces driving power by 64%, but it relies on off-chip capacitors. To address this issue, we propose a tri-level energy recycling scheme, in which energy released during the 2-to-1 transition is recycled to power the 0-to-1 transition on the complementary channel. This approach achieves a 25% power reduction using on-chip transmission gates. Additionally, a compressive sensing method is introduced to selectively process touched RX channels while bypassing the others, reducing the number of fine ADCs by a factor of four compared to conventional two-step sensing. The proposed techniques are implemented in a 65-nm CMOS process and integrated into a $32times 20$ channel prototype occupying 2.4 mm2. Measurement results show that the chip consumes only 2.6 mW at a framerate of 1513 Hz. The signal-to-noise ratio (SNR) reaches 49.7 dB for finger touch and 28.7 dB for a 1-mm $Phi $ stylus, resulting in an energy efficiency of 10.66 pJ/step.
在过去的十年里,电容式触摸屏已经成为了主流的用户界面。以低功耗实现高帧率仍然是触摸系统的关键设计目标。传统的充电回收技术可以减少64%的驱动功率,但它依赖于片外电容器。为了解决这个问题,我们提出了一个三级能量回收方案,其中在2到1转换过程中释放的能量被回收,为互补通道上的0到1转换提供动力。这种方法使用片上传输门实现了25%的功耗降低。此外,还引入了一种压缩感知方法,可以选择性地处理触摸RX通道,同时绕过其他通道,与传统的两步感知相比,将精细adc的数量减少了四倍。所提出的技术在65纳米CMOS工艺中实现,并集成到占地2.4 mm2的32 × 20美元通道原型中。测量结果表明,该芯片在1513 Hz的帧率下功耗仅为2.6 mW。手指触摸的信噪比(SNR)达到49.7 dB, 1 mm $Phi $触控笔的信噪比达到28.7 dB,能效为10.66 pJ/step。
{"title":"A Fully Dynamic Capacitive Touch Sensor With Tri-Level Energy Recycling and Compressive Sensing Technique","authors":"Xiangdong Feng;Zhiyu Wang;Haoyang Li;Jiaqing Li;Wei-Chin Lin;Xin Hu;Zhong Tang;Yuyan Liu;Qinwen Fan;Yuxuan Luo;Bo Zhao","doi":"10.1109/LSSC.2025.3612093","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3612093","url":null,"abstract":"Capacitive touch screens have become the dominant user interface over the past decade. Achieving high framerates with low power consumption remains a critical design goal for touch systems. The conventional charge-recycling technique reduces driving power by 64%, but it relies on off-chip capacitors. To address this issue, we propose a tri-level energy recycling scheme, in which energy released during the 2-to-1 transition is recycled to power the 0-to-1 transition on the complementary channel. This approach achieves a 25% power reduction using on-chip transmission gates. Additionally, a compressive sensing method is introduced to selectively process touched RX channels while bypassing the others, reducing the number of fine ADCs by a factor of four compared to conventional two-step sensing. The proposed techniques are implemented in a 65-nm CMOS process and integrated into a <inline-formula> <tex-math>$32times 20$ </tex-math></inline-formula> channel prototype occupying 2.4 mm2. Measurement results show that the chip consumes only 2.6 mW at a framerate of 1513 Hz. The signal-to-noise ratio (SNR) reaches 49.7 dB for finger touch and 28.7 dB for a 1-mm <inline-formula> <tex-math>$Phi $ </tex-math></inline-formula> stylus, resulting in an energy efficiency of 10.66 pJ/step.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"337-340"},"PeriodicalIF":2.0,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145405290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 40.68-MHz, 200-ns-Settling Active Rectifier for mm-Sized Implants 一个40.68 mhz, 200-ns沉降有源整流器毫米大小的植入物
IF 2 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-18 DOI: 10.1109/LSSC.2025.3611484
Ronald Wijermars;Yi-Han Ou-Yang;Sijun Du;Dante G. Muratore
This letter describes a fast-settling active rectifier for a 40.68 MHz wireless power transfer receiver for implantable applications. Fast-settling and low power are achieved through a novel direct voltage-domain compensation technique. The rectifier maintains high efficiency during load and link variations required for downlink communication. The system was fabricated in 40nm CMOS and achieves a voltage conversion ratio of 93.9% and a simulated power conversion efficiency of 90.1% in a 0.19 mm2 area, resulting in a 118 mW/mm2 power density while integrating the resonance and filter capacitors. The worst-case settling of the ON- and OFF-delay compensation in the active rectifier is 200 ns, which is the fastest reported to date.
这封信描述了用于可植入应用的40.68 MHz无线电力传输接收器的快速沉降有源整流器。通过一种新颖的直接电压域补偿技术,实现了快速沉降和低功耗。整流器在下行通信所需的负载和链路变化期间保持高效率。该系统采用40nm CMOS工艺,在0.19 mm2面积内实现了93.9%的电压转换比和90.1%的模拟功率转换效率,在集成谐振和滤波电容的情况下,功率密度为118 mW/mm2。有源整流器ON- off延时补偿的最坏情况沉降速度为200ns,是迄今为止报道的最快的。
{"title":"A 40.68-MHz, 200-ns-Settling Active Rectifier for mm-Sized Implants","authors":"Ronald Wijermars;Yi-Han Ou-Yang;Sijun Du;Dante G. Muratore","doi":"10.1109/LSSC.2025.3611484","DOIUrl":"https://doi.org/10.1109/LSSC.2025.3611484","url":null,"abstract":"This letter describes a fast-settling active rectifier for a 40.68 MHz wireless power transfer receiver for implantable applications. Fast-settling and low power are achieved through a novel direct voltage-domain compensation technique. The rectifier maintains high efficiency during load and link variations required for downlink communication. The system was fabricated in 40nm CMOS and achieves a voltage conversion ratio of 93.9% and a simulated power conversion efficiency of 90.1% in a 0.19 mm2 area, resulting in a 118 mW/mm2 power density while integrating the resonance and filter capacitors. The worst-case settling of the ON- and OFF-delay compensation in the active rectifier is 200 ns, which is the fastest reported to date.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"305-308"},"PeriodicalIF":2.0,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145255928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Solid-State Circuits Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1