首页 > 最新文献

Integration-The Vlsi Journal最新文献

英文 中文
A local positive feedback loop-reused technique for enhancing performance of folded cascode amplifier 提高折叠式级联放大器性能的局部正反馈回路重复使用技术
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-12 DOI: 10.1016/j.vlsi.2024.102277

A current-reused folded cascode operational transconductance amplifier (OTA) using a local positive feedback (LPFB) technique has been proposed in previous literature, which does not achieve maximum unity gain-bandwidth (GBW). Besides, the stability of LPFB in the LPFB-OTA is limited by local common mode feedback (LCMFB) resistors. Based on the analysis, a local positive feedback loop-reused (LPFBR) technique is proposed to improve the performance of conventional LPFB OTA. For a fair comparison, both conventional and proposed OTAs working at saturation region are designed and simulated in SMIC 0.18μm process. The simulated results demonstrate that the proposed LPFBR-OTA has almost 10.5 times the bandwidth and maintains stability compared to that of the conventional LPFB-OTA under the condition that LCMFB resistors are increased by a factor of 10.

以前的文献提出了一种采用局部正反馈(LPFB)技术的电流复用折叠级联运算跨导放大器(OTA),但这种放大器无法实现最大统一增益带宽(GBW)。此外,LPFB-OTA 中 LPFB 的稳定性受到本地共模反馈(LCMFB)电阻的限制。在分析的基础上,提出了一种本地正反馈回路重复使用(LPFBR)技术,以改善传统 LPFB OTA 的性能。为了进行公平比较,在中芯国际 0.18μm 工艺中设计并仿真了工作在饱和区的传统和拟议 OTA。仿真结果表明,在 LCMFB 电阻增加 10 倍的条件下,拟议的 LPFBR-OTA 的带宽几乎是传统 LPFB-OTA 的 10.5 倍,并且保持稳定。
{"title":"A local positive feedback loop-reused technique for enhancing performance of folded cascode amplifier","authors":"","doi":"10.1016/j.vlsi.2024.102277","DOIUrl":"10.1016/j.vlsi.2024.102277","url":null,"abstract":"<div><p>A current-reused folded cascode operational transconductance amplifier (OTA) using a local positive feedback (LPFB) technique has been proposed in previous literature, which does not achieve maximum unity gain-bandwidth (GBW). Besides, the stability of LPFB in the LPFB-OTA is limited by local common mode feedback (LCMFB) resistors. Based on the analysis, a local positive feedback loop-reused (LPFBR) technique is proposed to improve the performance of conventional LPFB OTA. For a fair comparison, both conventional and proposed OTAs working at saturation region are designed and simulated in SMIC <span><math><mrow><mn>0</mn><mo>.</mo><mn>18</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span> process. The simulated results demonstrate that the proposed LPFBR-OTA has almost 10.5 times the bandwidth and maintains stability compared to that of the conventional LPFB-OTA under the condition that LCMFB resistors are increased by a factor of 10.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating error correction and detection techniques in RISC-V processor microarchitecture for enhanced reliability 在 RISC-V 处理器微体系结构中集成纠错和检测技术以提高可靠性
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-12 DOI: 10.1016/j.vlsi.2024.102282

An essential consideration in processor design is ensuring reliability, particularly in demanding environments such as outer space and nuclear plants. To mitigate the effects of errors and enable error recovery, processors need to incorporate fault tolerance techniques. One common type of error is SEU (Single Event Upset), which affects various microelectronic devices including microprocessors, microcontrollers, and semiconductor memory devices. While error mitigation techniques have been developed for processors based on architectures like ARM (Advanced RISC Machine) and MIPS (Million Instructions Per Second), there is a gap in research for open-source ISAs (Instruction Set Architecture) like RISC-V, which this paper aims to address. This paper focuses on designing a fault-tolerant microarchitecture for a RISC-V processor that can correct one-bit errors, detect up to two-bit errors, and integrate lockstep and pipeline rollback features at a lower LUTs (Look Up Tables) consumption by re-using the same hardware pipeline for error mitigation and recovery through instruction mimicking. By incorporating these features, the proposed approach enhances the system’s fault tolerance by detecting and correcting errors caused by transient events and achieves a lower effective die size upon realization compared to contemporary works. The proposed microarchitecture design was simulated and synthesized using the Vivado Design Suite 2023.1 and implemented on a Zynq 7000 SoC ZC702 Evaluation Kit.

处理器设计的一个基本考虑因素是确保可靠性,尤其是在外层空间和核电厂等苛刻的环境中。为了减轻错误的影响并实现错误恢复,处理器需要采用容错技术。一种常见的错误类型是 SEU(单次事件猝发),它会影响各种微电子器件,包括微处理器、微控制器和半导体存储器件。虽然针对基于 ARM(高级 RISC 机器)和 MIPS(每秒百万条指令)架构的处理器开发了错误缓解技术,但针对 RISC-V 等开源 ISA(指令集架构)的研究还存在空白,本文旨在解决这一问题。本文的重点是为 RISC-V 处理器设计一种容错微体系结构,该体系结构可以纠正一位错误、检测多达两位的错误,并通过模仿指令重新使用相同的硬件流水线进行错误缓解和恢复,从而在较低的 LUT(查找表)消耗下集成锁步和流水线回滚功能。通过集成这些功能,所提出的方法可以检测和纠正瞬态事件引起的错误,从而增强系统的容错能力,与同类产品相比,实现了更小的有效芯片尺寸。我们使用 Vivado Design Suite 2023.1 对提出的微体系结构设计进行了仿真和综合,并在 Zynq 7000 SoC ZC702 评估套件上实现了该设计。
{"title":"Integrating error correction and detection techniques in RISC-V processor microarchitecture for enhanced reliability","authors":"","doi":"10.1016/j.vlsi.2024.102282","DOIUrl":"10.1016/j.vlsi.2024.102282","url":null,"abstract":"<div><p>An essential consideration in processor design is ensuring reliability, particularly in demanding environments such as outer space and nuclear plants. To mitigate the effects of errors and enable error recovery, processors need to incorporate fault tolerance techniques. One common type of error is SEU (Single Event Upset), which affects various microelectronic devices including microprocessors, microcontrollers, and semiconductor memory devices. While error mitigation techniques have been developed for processors based on architectures like ARM (Advanced RISC Machine) and MIPS (Million Instructions Per Second), there is a gap in research for open-source ISAs (Instruction Set Architecture) like RISC-V, which this paper aims to address. This paper focuses on designing a fault-tolerant microarchitecture for a RISC-V processor that can correct one-bit errors, detect up to two-bit errors, and integrate lockstep and pipeline rollback features at a lower LUTs (Look Up Tables) consumption by re-using the same hardware pipeline for error mitigation and recovery through instruction mimicking. By incorporating these features, the proposed approach enhances the system’s fault tolerance by detecting and correcting errors caused by transient events and achieves a lower effective die size upon realization compared to contemporary works. The proposed microarchitecture design was simulated and synthesized using the Vivado Design Suite 2023.1 and implemented on a Zynq 7000 SoC ZC702 Evaluation Kit.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general and accurate pattern search method for various scenarios 适用于各种情况的通用精确模式搜索法
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-11 DOI: 10.1016/j.vlsi.2024.102281

As chip designs grow in complexity, the optical proximity correction (OPC) process becomes increasingly time-consuming. As a result, pattern search technology is becoming a foundation stone of many tasks for manufacturing, such as lithography simulation, hotspot detection, mask optimization, and so on. The most difficult challenge of pattern search is to locate clips by specific patterns within the layout accurately and efficiently. In this paper, we present a generalized pattern search method capable in diverse scenarios, including patterns with hollow shapes, shifting edges, and multi-layer situations. Experimental results show that our method outperforms commercial tools in pattern locating accuracy and handling search problems involving complex patterns which is not directly support by commercial tools yet.

随着芯片设计日趋复杂,光学接近校正(OPC)过程变得越来越耗时。因此,图案搜索技术正成为光刻模拟、热点检测、光罩优化等许多制造任务的基石。图案搜索最困难的挑战是如何准确、高效地在版图中通过特定图案定位剪辑。在本文中,我们提出了一种通用的图案搜索方法,该方法可适用于多种场景,包括具有空心形状、边缘移动和多层情况的图案。实验结果表明,我们的方法在图案定位精度和处理复杂图案搜索问题方面优于商业工具,而商业工具尚不能直接支持复杂图案的搜索。
{"title":"A general and accurate pattern search method for various scenarios","authors":"","doi":"10.1016/j.vlsi.2024.102281","DOIUrl":"10.1016/j.vlsi.2024.102281","url":null,"abstract":"<div><p>As chip designs grow in complexity, the optical proximity correction (OPC) process becomes increasingly time-consuming. As a result, pattern search technology is becoming a foundation stone of many tasks for manufacturing, such as lithography simulation, hotspot detection, mask optimization, and so on. The most difficult challenge of pattern search is to locate clips by specific patterns within the layout accurately and efficiently. In this paper, we present a generalized pattern search method capable in diverse scenarios, including patterns with hollow shapes, shifting edges, and multi-layer situations. Experimental results show that our method outperforms commercial tools in pattern locating accuracy and handling search problems involving complex patterns which is not directly support by commercial tools yet.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142171740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synchronous control of memristive hindmarsh-rose neuron models with extreme multistability 具有极端多稳定性的记忆性后马什-蔷薇神经元模型的同步控制
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-10 DOI: 10.1016/j.vlsi.2024.102280

In this paper, the circuit simulation is achieved by the established Hindmarsh-Rose (HR) neuron model and the system is applied in projection synchronization. The chaotic behaviors of the neural network model are analyzed using bifurcation diagrams, Lyapunov exponents spectra, phase diagrams and time series diagrams. The dynamics analysis of the neuron model shows a variety of firing behaviors and extreme multistability behavior. The model is then simulated through circuit multisim to demonstrate the possibility in a physical sense. Finally, synchronization is induced to the memristive neural system through projection control, and the experimental results show that the model embodies a good synchronization effect in the process of projection synchronization, which helps to improve the security of signal transmission and the confidentiality of the system, and lays the foundation for the secure communication afterwards.

本文通过已建立的 Hindmarsh-Rose (HR) 神经元模型实现了电路仿真,并将该系统应用于投影同步。本文利用分岔图、Lyapunov 指数谱、相位图和时间序列图分析了神经网络模型的混沌行为。神经元模型的动力学分析显示了多种发射行为和极端多稳定性行为。然后通过电路 Multisim 对模型进行仿真,从物理意义上证明了这种可能性。最后,通过投影控制对记忆神经系统进行同步诱导,实验结果表明,该模型在投影同步过程中体现了良好的同步效果,有助于提高信号传输的安全性和系统的保密性,为之后的安全通信奠定了基础。
{"title":"Synchronous control of memristive hindmarsh-rose neuron models with extreme multistability","authors":"","doi":"10.1016/j.vlsi.2024.102280","DOIUrl":"10.1016/j.vlsi.2024.102280","url":null,"abstract":"<div><p>In this paper, the circuit simulation is achieved by the established Hindmarsh-Rose (HR) neuron model and the system is applied in projection synchronization. The chaotic behaviors of the neural network model are analyzed using bifurcation diagrams, Lyapunov exponents spectra, phase diagrams and time series diagrams. The dynamics analysis of the neuron model shows a variety of firing behaviors and extreme multistability behavior. The model is then simulated through circuit multisim to demonstrate the possibility in a physical sense. Finally, synchronization is induced to the memristive neural system through projection control, and the experimental results show that the model embodies a good synchronization effect in the process of projection synchronization, which helps to improve the security of signal transmission and the confidentiality of the system, and lays the foundation for the secure communication afterwards.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A wide-output buck DC-DC power management IC 宽输出降压型 DC-DC 电源管理集成电路
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-06 DOI: 10.1016/j.vlsi.2024.102278

-This article designs and develops a wide-input voltage, high-efficiency, small-size, and peak current-mode control step-down DC-DC converter. The Cadence Spectre simulation tool is used for system simulation to verify the performance of the chip. The overall research content of the article includes the function of the output under heavy load, light load and the stability of the output under transient load changes. The specific content of the research is buck synchronous step-down DC-DC converter chip with pulse modulated. It is provided with an input-voltage range of 6 V–80 V and maximum output-voltage range 72 V. The chip possesses wide operating temperature range of −20 °C to 130 °C. The 92 % high-efficiency can be achieved by using a PWM/PFM hybrid modulation method. When achieving transient load jump, the output voltage change shall not exceed 150 mV. The maximum load current of the chip is 1 A. Furthermore, the chip is packaged and samples can be obtained, and the output light load/heavy load, and other functions are tested through the circuit board. In addition, the chip achieved tape out by using 0.18 μm CMOS process with size of 2027 μm × 2020 μm. The converter features current mode control to simplify external compensation and optimize transient response through a wide range of inductors and output capacitors. It can be adopted user-programmable soft-start time to prevent inrush current during startup. It also includes thermal shutdown protection to provide safe and smooth operation in operating conditions.

-本文设计并开发了一种宽输入电压、高效率、小尺寸和峰值电流模式控制的降压型 DC-DC 转换器。采用 Cadence Spectre 仿真工具进行系统仿真,以验证芯片的性能。文章的总体研究内容包括重负载、轻负载下的输出功能以及瞬态负载变化下的输出稳定性。具体研究内容为带脉冲调制的降压同步降压型 DC-DC 转换器芯片。该芯片的输入电压范围为 6 V-80 V,最大输出电压范围为 72 V。通过使用 PWM/PFM 混合调制方法,可实现 92% 的高效率。在实现瞬态负载跳变时,输出电压变化不超过 150 mV。芯片的最大负载电流为 1 A。此外,该芯片经过封装,可以获得样品,并通过电路板测试输出轻载/重载等功能。此外,该芯片采用 0.18 μm CMOS 工艺制成,尺寸为 2027 μm × 2020 μm。该转换器采用电流模式控制,可简化外部补偿,并通过各种电感器和输出电容器优化瞬态响应。它可采用用户可编程软启动时间,以防止启动期间出现浪涌电流。它还包括热关断保护,可在工作条件下提供安全平稳的运行。
{"title":"A wide-output buck DC-DC power management IC","authors":"","doi":"10.1016/j.vlsi.2024.102278","DOIUrl":"10.1016/j.vlsi.2024.102278","url":null,"abstract":"<div><p>-This article designs and develops a wide-input voltage, high-efficiency, small-size, and peak current-mode control step-down DC-DC converter. The Cadence Spectre simulation tool is used for system simulation to verify the performance of the chip. The overall research content of the article includes the function of the output under heavy load, light load and the stability of the output under transient load changes. The specific content of the research is buck synchronous step-down DC-DC converter chip with pulse modulated. It is provided with an input-voltage range of 6 V–80 V and maximum output-voltage range 72 V. The chip possesses wide operating temperature range of −20 °C to 130 °C. The 92 % high-efficiency can be achieved by using a PWM/PFM hybrid modulation method. When achieving transient load jump, the output voltage change shall not exceed 150 mV. The maximum load current of the chip is 1 A. Furthermore, the chip is packaged and samples can be obtained, and the output light load/heavy load, and other functions are tested through the circuit board. In addition, the chip achieved tape out by using 0.18 μm CMOS process with size of 2027 μm <span><math><mrow><mo>×</mo></mrow></math></span> 2020 μm. The converter features current mode control to simplify external compensation and optimize transient response through a wide range of inductors and output capacitors. It can be adopted user-programmable soft-start time to prevent inrush current during startup. It also includes thermal shutdown protection to provide safe and smooth operation in operating conditions.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142164178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LA-ring based non-linear components: Application to image security 基于 LA 环的非线性组件:图像安全应用
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-05 DOI: 10.1016/j.vlsi.2024.102279

The prevalent utilization of symmetric block ciphers in contemporary information security systems reinforces the need for immediate action to increase their effectiveness. This task is considered crucial in the synthesis of high-quality cryptographic primitives, particularly S-boxes. In accordance with this requirement, the current article demonstrates a strategy for generating an 8 × 8 S-box drawing over an LA-ring of order 1024, which is considered a substantial class of non-associative rings. For the purpose of investigating LA-ring and their practical uses, it is essential to have illustrative examples. However, obtaining such examples using current methods is tedious and yields limited results. Therefore, this research explores an intriguing opportunity for an extensive exploration of LA-ring, far exceeding the limitations previously established and offering a valuable analytical approach for creating examples of higher-order LA-ring by drawing upon lower orders. The manuscript also performs a variety of standard evaluation tests based on five core indicators, which highlight their potential as parallels to the existing frameworks. By using the crafted S-box, an image encryption approach is launched that aims to enhance security measures. It is therefore seen that the recommended S-box has shown a high potential for causing confusion during the substitution phase, and a 3D chaotic map is implemented for the pixel permutation in order to create diffusion into the colour image. Certainly, the discovery is leading to a foundational framework among academics and is expected to serve as the basis for numerous implementations in the future.

对称块密码在当代信息安全系统中的广泛使用,使我们更有必要立即采取行动提高其有效性。这项任务被认为是合成高质量密码基元,特别是 S-box 的关键。根据这一要求,本文展示了在阶数为 1024 的 LA 环上生成 8 × 8 S-box 绘图的策略,LA 环被认为是非关联环的一个重要类别。为了研究 LA 环及其实际用途,必须有能说明问题的例子。然而,使用现有方法获取此类示例非常繁琐,且结果有限。因此,本研究为广泛探索 LA 环探索了一个令人感兴趣的机会,远远超出了以前建立的限制,并为通过借鉴低阶 LA 环创建高阶 LA 环示例提供了一种有价值的分析方法。该手稿还基于五项核心指标进行了各种标准评估测试,凸显了其与现有框架并行的潜力。通过使用精心制作的 S-box,推出了一种旨在加强安全措施的图像加密方法。由此可见,所推荐的 S-box 在替换阶段极有可能造成混淆,而为了在彩色图像中形成扩散,像素置换采用了三维混沌图。当然,这一发现正在学术界形成一个基础框架,并有望成为未来众多实施方案的基础。
{"title":"LA-ring based non-linear components: Application to image security","authors":"","doi":"10.1016/j.vlsi.2024.102279","DOIUrl":"10.1016/j.vlsi.2024.102279","url":null,"abstract":"<div><p>The prevalent utilization of symmetric block ciphers in contemporary information security systems reinforces the need for immediate action to increase their effectiveness. This task is considered crucial in the synthesis of high-quality cryptographic primitives, particularly S-boxes. In accordance with this requirement, the current article demonstrates a strategy for generating an 8 × 8 S-box drawing over an LA-ring of order 1024, which is considered a substantial class of non-associative rings. For the purpose of investigating LA-ring and their practical uses, it is essential to have illustrative examples. However, obtaining such examples using current methods is tedious and yields limited results. Therefore, this research explores an intriguing opportunity for an extensive exploration of LA-ring, far exceeding the limitations previously established and offering a valuable analytical approach for creating examples of higher-order LA-ring by drawing upon lower orders. The manuscript also performs a variety of standard evaluation tests based on five core indicators, which highlight their potential as parallels to the existing frameworks. By using the crafted S-box, an image encryption approach is launched that aims to enhance security measures. It is therefore seen that the recommended S-box has shown a high potential for causing confusion during the substitution phase, and a 3D chaotic map is implemented for the pixel permutation in order to create diffusion into the colour image. Certainly, the discovery is leading to a foundational framework among academics and is expected to serve as the basis for numerous implementations in the future.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 3D-stack DRAM-based PNM architecture design 基于 3D 堆栈 DRAM 的 PNM 架构设计
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-02 DOI: 10.1016/j.vlsi.2024.102266

The article examines methods for integrating 3D-stacked DRAM with AI logic chips, in order to overcome the memory bandwidth challenges faced in the AI inference of transformer models. The findings indicate that this approach can yield a 9x to 3x reduction in power consumption while maintaining similar performance levels, or alternatively, an 8x improvement in performance with comparable power consumption.

文章研究了将三维堆叠 DRAM 与人工智能逻辑芯片集成的方法,以克服变压器模型的人工智能推理所面临的内存带宽挑战。研究结果表明,这种方法可以在保持类似性能水平的同时将功耗降低 9 到 3 倍,或者在功耗相当的情况下将性能提高 8 倍。
{"title":"A 3D-stack DRAM-based PNM architecture design","authors":"","doi":"10.1016/j.vlsi.2024.102266","DOIUrl":"10.1016/j.vlsi.2024.102266","url":null,"abstract":"<div><p>The article examines methods for integrating 3D-stacked DRAM with AI logic chips, in order to overcome the memory bandwidth challenges faced in the AI inference of transformer models. The findings indicate that this approach can yield a 9x to 3x reduction in power consumption while maintaining similar performance levels, or alternatively, an 8x improvement in performance with comparable power consumption.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142164176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated electrical silicon interconnects for short-range high-speed millimeter-wave and terahertz communications 用于短程高速毫米波和太赫兹通信的集成电子硅互连器件
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-08-31 DOI: 10.1016/j.vlsi.2024.102267

—Millimeter-wave and terahertz interconnects implemented in advanced complementary metal oxide semiconductor (CMOS) technologies have emerged as promising solutions to fix the issues encountered by baseband interconnects and optical interconnects across specific communication ranges. Over the last decade, significant attempts to advance millimeter-wave and terahertz electronics and platforms have been made. Notably, there have been ground-breaking advancements in active components, including modulation techniques, low-noise receivers, efficient and high-output-power signal generators, and high-frequency clock synthesizers. Nevertheless, since energy efficiency is of paramount importance for interconnect applications, it is necessary to prioritize efficiency enhancements over improvements in signal power, signal integrity and noise related performance. Strategies to improve system output power and phase noise as well as strategies to reduce channel loss and channel electromagnetic crosstalk should leverage alternative approaches, such as architectural optimizations and array configurations, rather than prioritizing energy efficiency. As such, the progression of passive channel technology is equally vital. While reducing channel insertion loss is essential for extending communication reach, channel dispersion and crosstalk limitations at the interface level present critical challenges to achieving optimal bandwidth over distances of up to a few meters. This underscores the need for a balanced focus on both active and passive component innovations to fully harness the potential of millimeter-wave and terahertz interconnects in overcoming the limitations of current CMOS technologies.

-采用先进的互补金属氧化物半导体(CMOS)技术实现的毫米波和太赫兹互连已成为解决基带互连和光互连在特定通信范围内遇到的问题的有前途的解决方案。在过去十年中,人们为推动毫米波和太赫兹电子技术和平台的发展做出了重大尝试。值得注意的是,有源元件取得了突破性进展,包括调制技术、低噪声接收器、高效和高输出功率信号发生器以及高频时钟合成器。然而,由于能效对互连应用至关重要,因此有必要优先提高能效,而不是改善信号功率、信号完整性和噪声相关性能。改善系统输出功率和相位噪声的策略,以及降低通道损耗和通道电磁串扰的策略,都应采用其他方法,如架构优化和阵列配置,而不是优先考虑能效。因此,无源信道技术的发展同样至关重要。虽然降低信道插入损耗对扩大通信覆盖范围至关重要,但接口层面的信道色散和串扰限制对实现最远几米的最佳带宽提出了严峻挑战。这突出表明,要充分利用毫米波和太赫兹互连的潜力,克服当前 CMOS 技术的局限性,就必须均衡地关注有源和无源元件的创新。
{"title":"Integrated electrical silicon interconnects for short-range high-speed millimeter-wave and terahertz communications","authors":"","doi":"10.1016/j.vlsi.2024.102267","DOIUrl":"10.1016/j.vlsi.2024.102267","url":null,"abstract":"<div><p>—Millimeter-wave and terahertz interconnects implemented in advanced complementary metal oxide semiconductor (CMOS) technologies have emerged as promising solutions to fix the issues encountered by baseband interconnects and optical interconnects across specific communication ranges. Over the last decade, significant attempts to advance millimeter-wave and terahertz electronics and platforms have been made. Notably, there have been ground-breaking advancements in active components, including modulation techniques, low-noise receivers, efficient and high-output-power signal generators, and high-frequency clock synthesizers. Nevertheless, since energy efficiency is of paramount importance for interconnect applications, it is necessary to prioritize efficiency enhancements over improvements in signal power, signal integrity and noise related performance. Strategies to improve system output power and phase noise as well as strategies to reduce channel loss and channel electromagnetic crosstalk should leverage alternative approaches, such as architectural optimizations and array configurations, rather than prioritizing energy efficiency. As such, the progression of passive channel technology is equally vital. While reducing channel insertion loss is essential for extending communication reach, channel dispersion and crosstalk limitations at the interface level present critical challenges to achieving optimal bandwidth over distances of up to a few meters. This underscores the need for a balanced focus on both active and passive component innovations to fully harness the potential of millimeter-wave and terahertz interconnects in overcoming the limitations of current CMOS technologies.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142228524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An analytical placement algorithm with looking-ahead routing topology optimization 具有前瞻性路由拓扑优化功能的分析放置算法
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-08-31 DOI: 10.1016/j.vlsi.2024.102264

Placement is a critical step in the modern VLSI design flow, as it dramatically determines the performance of circuit designs. Most placement algorithms estimate the design performance with a half-perimeter wirelength (HPWL) and target it as their optimization objective. The wirelength model used by these algorithms limits their ability to optimize the internal routing topology, which can lead to discrepancies between estimates and the actual routing wirelength. This paper proposes an analytical placement algorithm to optimize the internal routing topology. We first introduce a differential wirelength model in the global placement stage based on an ideal routing topology RSMT. Through screening and tracing various segments, this model can generate meaningful gradients for interior points during gradient computation. Then, after global placement, we propose a cell refinement algorithm and further optimize the routing wirelength with swift density control. Experiments on ICCAD2015 benchmarks show that our algorithm can achieve a 3% improvement in routing wirelength, 0.8% in HPWL, and 23.8% in TNS compared with the state-of-the-art analytical placer. On industrial benchmarks, our algorithm can also achieve a 10.6% improvement in routing wirelength, 27.3% in WNS, and 34.4% in TNS.

贴装是现代超大规模集成电路设计流程中的关键步骤,因为它极大地决定了电路设计的性能。大多数布局算法使用半周线长(HPWL)估算设计性能,并将其作为优化目标。这些算法使用的线长模型限制了其优化内部路由拓扑的能力,从而导致估计值与实际路由线长之间的差异。本文提出了一种优化内部路由拓扑的分析性布局算法。我们首先基于理想路由拓扑 RSMT,在全局布局阶段引入了一个差分线长模型。通过筛选和跟踪各种线段,该模型可在梯度计算过程中为内部点生成有意义的梯度。然后,在全局布局之后,我们提出了一种单元细化算法,并通过快速密度控制进一步优化路由线长。在 ICCAD2015 基准上进行的实验表明,与最先进的分析放置器相比,我们的算法可实现 3% 的路由线长改进、0.8% 的 HPWL 改进和 23.8% 的 TNS 改进。在工业基准上,我们的算法还能将路由线长提高 10.6%,将 WNS 提高 27.3%,将 TNS 提高 34.4%。
{"title":"An analytical placement algorithm with looking-ahead routing topology optimization","authors":"","doi":"10.1016/j.vlsi.2024.102264","DOIUrl":"10.1016/j.vlsi.2024.102264","url":null,"abstract":"<div><p>Placement is a critical step in the modern VLSI design flow, as it dramatically determines the performance of circuit designs. Most placement algorithms estimate the design performance with a half-perimeter wirelength (HPWL) and target it as their optimization objective. The wirelength model used by these algorithms limits their ability to optimize the internal routing topology, which can lead to discrepancies between estimates and the actual routing wirelength. This paper proposes an analytical placement algorithm to optimize the internal routing topology. We first introduce a differential wirelength model in the global placement stage based on an ideal routing topology RSMT. Through screening and tracing various segments, this model can generate meaningful gradients for interior points during gradient computation. Then, after global placement, we propose a cell refinement algorithm and further optimize the routing wirelength with swift density control. Experiments on ICCAD2015 benchmarks show that our algorithm can achieve a 3% improvement in routing wirelength, 0.8% in HPWL, and 23.8% in TNS compared with the state-of-the-art analytical placer. On industrial benchmarks, our algorithm can also achieve a 10.6% improvement in routing wirelength, 27.3% in WNS, and 34.4% in TNS.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142150576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A three-stage single-miller CMOS OTA with no lower load capacitor limit 无负载电容下限的三级单填充 CMOS OTA
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-08-30 DOI: 10.1016/j.vlsi.2024.102269

This work proposes a Single Miller Capacitor (SMC) compensated three-stage Operational Transconductance Amplifier (OTA) for a wide range of load capacitors with a zero minimum load capacitor. The proposed three-stage OTA does not require a minimum load capacitor for OTA to be stable. The proposed work uses two different feed-forward transconductors to enhance the small-signal and large-signal performances of the OTA. This OTA achieves more than 70° phase margin and more than 10dB gain margin with a load capacitor range of 0 to 500pF and consumes less quiescent current. The proposed OTA uses a smaller SMC of 2pF to drive a wide range of load capacitors. Furthermore, it saves the active area of the chip. The proposed OTA is simulated in a cadence virtuoso tool using UMC 90nm CMOS technology with BSIM4 MOSFETs.

这项研究提出了一种单米勒电容器(SMC)补偿式三级运算跨导放大器(OTA),适用于多种负载电容器,最小负载电容器为零。拟议的三级 OTA 不需要最小负载电容就能实现稳定的 OTA。建议的工作使用两个不同的前馈跨导来增强 OTA 的小信号和大信号性能。该 OTA 在 0 至 500pF 的负载电容范围内实现了 70° 以上的相位裕度和 10dB 以上的增益裕度,并消耗较少的静态电流。拟议的 OTA 使用 2pF 的较小 SMC,可驱动各种负载电容器。此外,它还节省了芯片的有效面积。我们在 cadence virtuoso 工具中使用联电 90nm CMOS 技术和 BSIM4 MOSFET 对拟议的 OTA 进行了仿真。
{"title":"A three-stage single-miller CMOS OTA with no lower load capacitor limit","authors":"","doi":"10.1016/j.vlsi.2024.102269","DOIUrl":"10.1016/j.vlsi.2024.102269","url":null,"abstract":"<div><p>This work proposes a Single Miller Capacitor (SMC) compensated three-stage Operational Transconductance Amplifier (OTA) for a wide range of load capacitors with a zero minimum load capacitor. The proposed three-stage OTA does not require a minimum load capacitor for OTA to be stable. The proposed work uses two different feed-forward transconductors to enhance the small-signal and large-signal performances of the OTA. This OTA achieves more than <span><math><mrow><mn>70</mn></mrow></math></span>° phase margin and more than <span><math><mrow><mn>10</mn><mspace></mspace><mi>dB</mi></mrow></math></span> gain margin with a load capacitor range of 0 to <span><math><mrow><mn>500</mn><mspace></mspace><mi>pF</mi></mrow></math></span> and consumes less quiescent current. The proposed OTA uses a smaller SMC of <span><math><mrow><mn>2</mn><mspace></mspace><mi>pF</mi></mrow></math></span> to drive a wide range of load capacitors. Furthermore, it saves the active area of the chip. The proposed OTA is simulated in a cadence virtuoso tool using UMC <span><math><mrow><mn>90</mn><mspace></mspace><mi>nm</mi></mrow></math></span> CMOS technology with BSIM4 MOSFETs.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142122849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Integration-The Vlsi Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1