首页 > 最新文献

Integration-The Vlsi Journal最新文献

英文 中文
Optimal design of mixed dielectric coaxial-annular TSV using GWO algorithm based on artificial neural network 使用基于人工神经网络的 GWO 算法优化设计混合介质同轴环形 TSV
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-05-08 DOI: 10.1016/j.vlsi.2024.102205
Liwen Zhang, He Yang, Chen Yang, Jincan Zhang, Jinchan Wang

The single-objective and single-parameter optimization method is commonly used in the structure optimization of TSV to improve the transmission characteristics, for which a structure design scheme that simultaneously satisfies multiple target requirements is difficult to obtain. Moreover, the method cannot simultaneously optimize different design parameters. Aiming at the above problems, a global optimization method based on the grey wolf optimization (GWO) algorithm and artificial neural network (ANN) model is proposed. With the presented mixed dielectric coaxial-annular TSV model, firstly six key design parameters A-F are selected as optimization variables by the control variable method. The L25(56) orthogonal experiment is designed for Taguchi analysis and analysis of variance (ANOVA). Then, three prediction models, ANN, support vector machine (SVM), and extreme learning machine (ELM), are developed with the extended orthogonal data as the training sets. It is found that the ANN model performed best. To search for the global optimal solution, the genetic algorithm (GA) and GWO algorithm, combined with the ANN model are applied, respectively. The results show that the GWO algorithm is more successful in solving the problem of falling into the local optimum than GA, and the convergence speed is faster and more stable. After GWO-ANN optimization, the performance of each S-parameter index is greatly improved, S11 reduces by 14.05 dB, S21 increases by 0.33 dB, and S31 reduces by 12.50 dB at 30 GHz.

单目标、单参数优化方法通常用于 TSV 的结构优化以改善传输特性,而同时满足多个目标要求的结构设计方案很难获得。此外,该方法无法同时优化不同的设计参数。针对上述问题,本文提出了一种基于灰狼优化(GWO)算法和人工神经网络(ANN)模型的全局优化方法。利用所提出的混合介质同轴环形 TSV 模型,首先通过控制变量法选取 A-F 六个关键设计参数作为优化变量。设计 L25(56) 正交实验,进行田口分析和方差分析(ANOVA)。然后,以扩展的正交数据为训练集,建立了三个预测模型,即 ANN、支持向量机(SVM)和极端学习机(ELM)。结果发现,ANN 模型表现最佳。为了寻找全局最优解,分别应用了遗传算法(GA)和 GWO 算法,并将其与 ANN 模型相结合。结果表明,GWO 算法比 GA 更好地解决了陷入局部最优的问题,而且收敛速度更快、更稳定。经过 GWO-ANN 优化后,各 S 参数指标的性能都得到了极大改善,在 30 GHz 频率下,S11 降低了 14.05 dB,S21 增加了 0.33 dB,S31 降低了 12.50 dB。
{"title":"Optimal design of mixed dielectric coaxial-annular TSV using GWO algorithm based on artificial neural network","authors":"Liwen Zhang,&nbsp;He Yang,&nbsp;Chen Yang,&nbsp;Jincan Zhang,&nbsp;Jinchan Wang","doi":"10.1016/j.vlsi.2024.102205","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102205","url":null,"abstract":"<div><p>The single-objective and single-parameter optimization method is commonly used in the structure optimization of TSV to improve the transmission characteristics, for which a structure design scheme that simultaneously satisfies multiple target requirements is difficult to obtain. Moreover, the method cannot simultaneously optimize different design parameters. Aiming at the above problems, a global optimization method based on the grey wolf optimization (GWO) algorithm and artificial neural network (ANN) model is proposed. With the presented mixed dielectric coaxial-annular TSV model, firstly six key design parameters A-F are selected as optimization variables by the control variable method. The L<sub>25</sub>(5<sup>6</sup>) orthogonal experiment is designed for Taguchi analysis and analysis of variance (ANOVA). Then, three prediction models, ANN, support vector machine (SVM), and extreme learning machine (ELM), are developed with the extended orthogonal data as the training sets. It is found that the ANN model performed best. To search for the global optimal solution, the genetic algorithm (GA) and GWO algorithm, combined with the ANN model are applied, respectively. The results show that the GWO algorithm is more successful in solving the problem of falling into the local optimum than GA, and the convergence speed is faster and more stable. After GWO-ANN optimization, the performance of each <em>S</em>-parameter index is greatly improved, <em>S</em><sub>11</sub> reduces by 14.05 dB, <em>S</em><sub>21</sub> increases by 0.33 dB, and <em>S</em><sub>31</sub> reduces by 12.50 dB at 30 GHz.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102205"},"PeriodicalIF":1.9,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140948440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of CMOS fully differential multipath two-stage OTA with boosted slew rate and power efficiency 设计可提高压摆率和能效的 CMOS 全差分多路径两级 OTA
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-05-07 DOI: 10.1016/j.vlsi.2024.102204
Zahra Hashemi, Mostafa Yargholi

A CMOS fully differential multipath two-stage operational transconductance amplifier (OTA) with boosted slew rate and power efficiency is proposed in this paper. The new OTA consists of two gain stages. The basic structure of the proposed OTA is the recycling folded cascode (RFC) structure. By using the multipath technique in the first stage of the proposed OTA, it leads to an increase in gain and a decrease in power consumption. In addition, a high-speed current mirror is applied to increase the phase margin. The second stage with a class-AB amplifier is used to increase the transconductance and slew rate of the output. Moreover, the power efficiency of the proposed OTA is boosted compared to the recycling double-folded cascode (RDFC) OTA. This makes the proposed OTA more appropriate for applications that require low power consumption, such as neural amplifiers. Design and simulation of the proposed OTA is done in 0.18 μm standard CMOS technology with a 1 V supply voltage. Post-layout simulation results of the proposed OTA demonstrate that the OTA dissipates 180 nW of power, while showing a 136.7 dB voltage gain, and 127.1 kHz unity gain frequency for a capacitive load of 30 pF. Thus, compared to the RDFC OTA, the proposed OTA provides a 250 % increase in slew rate and a 20 % increase in PSRR and CMRR, while power consumption is reduced by 10 %. The proposed OTA is robust against process, voltage, and temperature (PVT) variations. The recommended OTA achieves a good figure of merit (FOM) over the previous OTAs.

本文提出了一种具有更高转速和能效的 CMOS 全差分多路两级运算转导放大器(OTA)。新型 OTA 由两个增益级组成。拟议 OTA 的基本结构是循环折叠级联(RFC)结构。通过在第一级 OTA 中使用多路径技术,可提高增益并降低功耗。此外,还采用了高速电流镜来增加相位裕量。带有 AB 类放大器的第二级用于提高输出的跨导和压摆率。此外,与循环双折叠级联(RDFC)OTA 相比,拟议 OTA 的功率效率有所提高。这使得拟议的 OTA 更适合需要低功耗的应用,如神经放大器。拟议 OTA 的设计和仿真采用 0.18 μm 标准 CMOS 技术,电源电压为 1 V。拟议 OTA 的布局后仿真结果表明,该 OTA 的耗散功率为 180 nW,电压增益为 136.7 dB,在 30 pF 的电容负载条件下,单位增益频率为 127.1 kHz。因此,与 RDFC OTA 相比,拟议 OTA 的压摆率提高了 250%,PSRR 和 CMRR 提高了 20%,同时功耗降低了 10%。建议的 OTA 对工艺、电压和温度(PVT)变化具有鲁棒性。与之前的 OTA 相比,推荐的 OTA 实现了良好的性能指标(FOM)。
{"title":"Design of CMOS fully differential multipath two-stage OTA with boosted slew rate and power efficiency","authors":"Zahra Hashemi,&nbsp;Mostafa Yargholi","doi":"10.1016/j.vlsi.2024.102204","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102204","url":null,"abstract":"<div><p>A CMOS fully differential multipath two-stage operational transconductance amplifier (OTA) with boosted slew rate and power efficiency is proposed in this paper. The new OTA consists of two gain stages. The basic structure of the proposed OTA is the recycling folded cascode (RFC) structure. By using the multipath technique in the first stage of the proposed OTA, it leads to an increase in gain and a decrease in power consumption. In addition, a high-speed current mirror is applied to increase the phase margin. The second stage with a class-AB amplifier is used to increase the transconductance and slew rate of the output. Moreover, the power efficiency of the proposed OTA is boosted compared to the recycling double-folded cascode (RDFC) OTA. This makes the proposed OTA more appropriate for applications that require low power consumption, such as neural amplifiers. Design and simulation of the proposed OTA is done in 0.18 μm standard CMOS technology with a 1 V supply voltage. Post-layout simulation results of the proposed OTA demonstrate that the OTA dissipates 180 nW of power, while showing a 136.7 dB voltage gain, and 127.1 kHz unity gain frequency for a capacitive load of 30 pF. Thus, compared to the RDFC OTA, the proposed OTA provides a 250 % increase in slew rate and a 20 % increase in PSRR and CMRR, while power consumption is reduced by 10 %. The proposed OTA is robust against process, voltage, and temperature (PVT) variations. The recommended OTA achieves a good figure of merit (FOM) over the previous OTAs.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102204"},"PeriodicalIF":1.9,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140918517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neuro-inspired hardware solutions for high-performance computing: A TiO2-based nano-synaptic device approach with backpropagation 高性能计算的神经启发硬件解决方案:基于 TiO2 的纳米突触设备与反向传播方法
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-05-07 DOI: 10.1016/j.vlsi.2024.102206
Yildiran Yilmaz , Fatih Gül

Computer-based machine learning algorithms that produce impressive performance results are computationally demanding and thus subject to high energy consumption during training and testing. Therefore, compact neuro-inspired devices are required to achieve efficiency in hardware resource consumption for the smooth implementation of neural network applications that require low energy and area. In this paper, learning characteristics and performances of the nanoscale titanium dioxide (TiO2) based synaptic device have been analyzed by implementing it in the hardware-based neural network for digit classification. Our model is experimentally validated by using 32-nm CMOS technology and the results demonstrate that the model provides high computational ability with better accuracy and efficiency in resource consumption with low energy and less area. The proposed model exhibits 20% energy gain and 16.82% accuracy improvement and 18% less total latency compared to the state-of-the-art Ag:Si synaptic device-based neural network. Furthermore, when compared to the software-based (i.e., computer-based) implementation of neural networks, our TiO2-based model not only achieved an impressive accuracy rate of 90.01% on the MNIST dataset but also did so with reduced energy consumption. Consequently, our model, characterized by a low hardware implementation cost, emerges as a promising neuro-inspired hardware solution for various neural network applications. The proposed model has further demonstrated outstanding performance in experiments involving both the MNIST and Fisher’s Iris datasets. On the latter dataset, the model exhibited notable precision (94.5%), recall (91.5%), and an impressive F1-score (92.9%), accompanied by a commendable accuracy rate of 93.04%.

基于计算机的机器学习算法能产生令人印象深刻的性能结果,但对计算要求很高,因此在训练和测试过程中能耗很高。因此,需要紧凑型神经启发器件来实现硬件资源消耗的高效率,以便顺利实施需要低能耗和低面积的神经网络应用。本文通过在基于硬件的数字分类神经网络中实施基于纳米级二氧化钛(TiO2)的突触装置,分析了该装置的学习特性和性能。我们的模型采用 32 纳米 CMOS 技术进行了实验验证,结果表明该模型具有较高的计算能力、较好的准确性和资源消耗效率,而且能耗低、占地面积小。与最先进的基于 Ag:Si 突触器件的神经网络相比,所提出的模型实现了 20% 的能量增益和 16.82% 的准确率提高,总延迟时间减少了 18%。此外,与基于软件(即基于计算机)实现的神经网络相比,我们基于 TiO2 的模型不仅在 MNIST 数据集上实现了 90.01% 的惊人准确率,而且还降低了能耗。因此,我们的模型具有硬件实施成本低的特点,是各种神经网络应用中一种前景广阔的神经启发硬件解决方案。在涉及 MNIST 和 Fisher's Iris 数据集的实验中,所提出的模型进一步展示了出色的性能。在费舍尔虹膜数据集上,该模型表现出显著的精确度(94.5%)、召回率(91.5%)和令人印象深刻的 F1 分数(92.9%),以及 93.04% 的准确率。
{"title":"Neuro-inspired hardware solutions for high-performance computing: A TiO2-based nano-synaptic device approach with backpropagation","authors":"Yildiran Yilmaz ,&nbsp;Fatih Gül","doi":"10.1016/j.vlsi.2024.102206","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102206","url":null,"abstract":"<div><p>Computer-based machine learning algorithms that produce impressive performance results are computationally demanding and thus subject to high energy consumption during training and testing. Therefore, compact neuro-inspired devices are required to achieve efficiency in hardware resource consumption for the smooth implementation of neural network applications that require low energy and area. In this paper, learning characteristics and performances of the nanoscale titanium dioxide (<span><math><msub><mrow><mi>TiO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>) based synaptic device have been analyzed by implementing it in the hardware-based neural network for digit classification. Our model is experimentally validated by using 32-nm CMOS technology and the results demonstrate that the model provides high computational ability with better accuracy and efficiency in resource consumption with low energy and less area. The proposed model exhibits 20% energy gain and 16.82% accuracy improvement and 18% less total latency compared to the state-of-the-art <span><math><mi>Ag</mi></math></span>:<span><math><mi>Si</mi></math></span> synaptic device-based neural network. Furthermore, when compared to the software-based (i.e., computer-based) implementation of neural networks, our <span><math><msub><mrow><mi>TiO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-based model not only achieved an impressive accuracy rate of 90.01% on the MNIST dataset but also did so with reduced energy consumption. Consequently, our model, characterized by a low hardware implementation cost, emerges as a promising neuro-inspired hardware solution for various neural network applications. The proposed model has further demonstrated outstanding performance in experiments involving both the MNIST and Fisher’s Iris datasets. On the latter dataset, the model exhibited notable precision (94.5%), recall (91.5%), and an impressive F1-score (92.9%), accompanied by a commendable accuracy rate of 93.04%.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102206"},"PeriodicalIF":1.9,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140918518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring BTI aging effects on spatial power density and temperature profiles of VLSI chips 探索 BTI 老化对超大规模集成电路芯片空间功率密度和温度曲线的影响
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-30 DOI: 10.1016/j.vlsi.2024.102202
Sachin Sachdeva, Jincong Lu, Hussam Amrouch , Sheldon X.-D. Tan

The Long-term reliability of a chip, encompassing factors like bias temperature instability (BTI), plays a substantial role in the chip's operational efficiency and overall lifespan. Most studies primarily center around performance-related aspects like delay and timing impacts, and fewer studies are performed on reliability impacts on the spatial power density and thermal profiles of the chips. In this study, we propose to investigate the BTI impacts on the spatial power density and temperature profiles of VLSI chips for the first time. We assessed the BTI aging impact on the on-chip spatial power density and temperature for two widely used circuit functional blocks (dual port RAM, Discrete Cosine Transform (DCT) block) at T = 130C and T = 25C to account for the worst-case BTI degradation, using degradation-aware cell libraries for a 10-year aging scenario. Furthermore, we showcased the essential role of BTI aging-aware timing analysis in evaluating the impact of BTI aging on total power, on-chip spatial power density, and thermal maps. Neglecting this aspect can result in a substantial underestimation of the results related to the parameters mentioned above. We developed a power map generation method from the circuit layout and power analysis from EDA tools. We demonstrate that both circuits’ maximum power density reduction is approximately 12 % and 20 %, respectively. Furthermore, to analyze the BTI impact on spatial temperature, we built the heat transfer model using a multiphysics tool to imitate a real chip (Intel i7-8650U) and performed thermal simulations to evaluate the spatial thermal map. The resulting maximum temperature reduction for both these circuits is approximately 10 % and 12 %, respectively, which is quite significant.

Our analysis has further unveiled that, in the context of a specific circuit, the position of maximum power density and the occurrence of a hot spot remains consistent over time, unaffected by aging. However, it's important to note that these positions can vary between different circuits, primarily influenced by the workload the circuit is currently handling. Furthermore, our findings demonstrate that the effects of Bias Temperature Instability (BTI) aging are significantly more pronounced when the circuit operates at higher temperatures (T = 130C) compared to lower operating temperatures (T = 25C).

芯片的长期可靠性包括偏置温度不稳定性(BTI)等因素,对芯片的运行效率和整体寿命起着重要作用。大多数研究主要围绕延迟和时序影响等与性能相关的方面,而关于可靠性对芯片空间功率密度和热曲线影响的研究较少。在本研究中,我们首次提出研究 BTI 对 VLSI 芯片空间功率密度和温度曲线的影响。我们评估了两个广泛使用的电路功能块(双端口 RAM、离散余弦变换 (DCT) 块)在 T = 130 oC 和 T = 25 oC 下的 BTI 老化对芯片空间功率密度和温度的影响,以考虑最坏情况下的 BTI 退化,并使用了 10 年老化情况下的退化感知单元库。此外,我们还展示了 BTI 老化感知时序分析在评估 BTI 老化对总功率、片上空间功率密度和热图的影响方面的重要作用。忽视这一方面会导致与上述参数相关的结果被大大低估。我们从电路布局和 EDA 工具的功率分析中开发了一种功率图生成方法。我们证明,这两种电路的最大功率密度降幅分别约为 12% 和 20%。此外,为了分析 BTI 对空间温度的影响,我们使用多物理场工具模仿真实芯片(英特尔 i7-8650U)建立了热传导模型,并进行了热仿真,以评估空间热图。我们的分析进一步揭示出,在特定电路中,最大功率密度的位置和热点的出现随着时间的推移保持一致,不受老化的影响。不过,值得注意的是,这些位置在不同的电路中会有所不同,主要受电路当前处理的工作量影响。此外,我们的研究结果表明,与较低的工作温度(T = 25◦C)相比,电路在较高温度(T = 130◦C)下工作时,偏置温度不稳定性(BTI)老化的影响要明显得多。
{"title":"Exploring BTI aging effects on spatial power density and temperature profiles of VLSI chips","authors":"Sachin Sachdeva,&nbsp;Jincong Lu,&nbsp;Hussam Amrouch ,&nbsp;Sheldon X.-D. Tan","doi":"10.1016/j.vlsi.2024.102202","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102202","url":null,"abstract":"<div><p>The Long-term reliability of a chip, encompassing factors like bias temperature instability (BTI), plays a substantial role in the chip's operational efficiency and overall lifespan. Most studies primarily center around performance-related aspects like delay and timing impacts, and fewer studies are performed on reliability impacts on the spatial power density and thermal profiles of the chips. In this study, we propose to investigate the BTI impacts on the spatial power density and temperature profiles of VLSI chips for the first time. We assessed the BTI aging impact on the on-chip spatial power density and temperature for two widely used circuit functional blocks (dual port RAM, Discrete Cosine Transform (DCT) block) at T = 130<sup><em>◦</em></sup>C and T = 25<sup><em>◦</em></sup>C to account for the worst-case BTI degradation, using degradation-aware cell libraries for a 10-year aging scenario. Furthermore, we showcased the essential role of BTI aging-aware timing analysis in evaluating the impact of BTI aging on total power, on-chip spatial power density, and thermal maps. Neglecting this aspect can result in a substantial underestimation of the results related to the parameters mentioned above. We developed a power map generation method from the circuit layout and power analysis from EDA tools. We demonstrate that both circuits’ maximum power density reduction is approximately 12 % and 20 %, respectively. Furthermore, to analyze the BTI impact on spatial temperature, we built the heat transfer model using a multiphysics tool to imitate a real chip (Intel i7-8650U) and performed thermal simulations to evaluate the spatial thermal map. The resulting maximum temperature reduction for both these circuits is approximately 10 % and 12 %, respectively, which is quite significant.</p><p>Our analysis has further unveiled that, in the context of a specific circuit, the position of maximum power density and the occurrence of a hot spot remains consistent over time, unaffected by aging. However, it's important to note that these positions can vary between different circuits, primarily influenced by the workload the circuit is currently handling. Furthermore, our findings demonstrate that the effects of Bias Temperature Instability (BTI) aging are significantly more pronounced when the circuit operates at higher temperatures (T = 130<sup><em>◦</em></sup>C) compared to lower operating temperatures (T = 25<sup><em>◦</em></sup>C).</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102202"},"PeriodicalIF":1.9,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141097293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A rail-to-rail high speed comparator with LVDS output in 0.18-μm SiGe BiCMOS Technology 采用 0.18μm SiGe BiCMOS 技术制造的具有 LVDS 输出的轨至轨高速比较器
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-27 DOI: 10.1016/j.vlsi.2024.102198
Qiyan Sun , Ruiyong Tu , Jin Xie , Yihong Gong , Sini Wu , Jinghu Li , Zhicong Luo

Achieving low propagation delay in comparators under low input overdrive voltage is challenging. To overcome this difficulty, this paper presents a novel rail-to-rail high-speed comparator. By clamping the output node of the current summation circuit relative to a fixed level VC, the overdrive recovery time under large signal is successfully reduced. Moreover,by adopting a cascaded approach with multiple stages of high bandwidth and low gain,not only is the comparator’s gain enhanced,but it also acquires higher bandwidth. Ultimately, the comparator’s output is transmitted at high speed through an LVDS interface. This design is implemented using 0.18μm SiGe BiCMOS technology. Simulation results show that the comparator has a static power consumption of 26.4 mW, and for 5 mV input overdrive, the average propagation delay is about 1.09 ns.

在低输入过驱动电压下实现比较器的低传播延迟具有挑战性。为了克服这一困难,本文提出了一种新型轨至轨高速比较器。通过将电流求和电路的输出节点相对于固定电平 VC 进行箝位,成功缩短了大信号下的过驱动恢复时间。此外,通过采用多级高带宽、低增益的级联方法,不仅提高了比较器的增益,还获得了更高的带宽。比较器的输出最终通过 LVDS 接口高速传输。该设计采用 0.18μm SiGe BiCMOS 技术实现。仿真结果表明,该比较器的静态功耗为 26.4 mW,在 5 mV 输入过驱动条件下,平均传播延迟约为 1.09 ns。
{"title":"A rail-to-rail high speed comparator with LVDS output in 0.18-μm SiGe BiCMOS Technology","authors":"Qiyan Sun ,&nbsp;Ruiyong Tu ,&nbsp;Jin Xie ,&nbsp;Yihong Gong ,&nbsp;Sini Wu ,&nbsp;Jinghu Li ,&nbsp;Zhicong Luo","doi":"10.1016/j.vlsi.2024.102198","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102198","url":null,"abstract":"<div><p>Achieving low propagation delay in comparators under low input overdrive voltage is challenging. To overcome this difficulty, this paper presents a novel rail-to-rail high-speed comparator. By clamping the output node of the current summation circuit relative to a fixed level <span><math><msub><mrow><mi>V</mi></mrow><mrow><mi>C</mi></mrow></msub></math></span>, the overdrive recovery time under large signal is successfully reduced. Moreover,by adopting a cascaded approach with multiple stages of high bandwidth and low gain,not only is the comparator’s gain enhanced,but it also acquires higher bandwidth. Ultimately, the comparator’s output is transmitted at high speed through an LVDS interface. This design is implemented using <span><math><mrow><mn>0</mn><mo>.</mo><mn>18</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span> SiGe BiCMOS technology. Simulation results show that the comparator has a static power consumption of 26.4 mW, and for 5 mV input overdrive, the average propagation delay is about 1.09 ns.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102198"},"PeriodicalIF":1.9,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140816071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A non-degenerate n-dimensional integer domain chaotic map model with application to PRNG 应用于 PRNG 的非退化 n 维整数域混沌图模型
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-25 DOI: 10.1016/j.vlsi.2024.102200
Mengdi Zhao, Hongjun Liu

To address the limitations of existing chaotic maps, we proposed a non-degenerate n-dimensional (n ≥ 2) integer domain chaotic map (nD-IDCM) model that can construct any non-degenerate n-dimensional integer domain chaotic maps. Moreover, we analyzed its chaotic behavior through Lyapunov exponent, and found that the nD-IDCM generates chaotic sequences in the integer domain, which effectively resolves the issue of finite precision effect when implementing existing chaotic maps on computers or digital devices. To verify the effectiveness of nD-IDCM, we presented two instances to demonstrate how the positive Lyapunov exponents can be regulated by manipulating the parameter matrix. Subsequently, we have scrutinized their dynamical behavior using Kolmogorov entropy, sample entropy, correlation dimension and randomness testing via TestU01. Finally, to assess the feasibility of nD-IDCM, we devised a keyed pseudo random number generator (PRNG) based on a 3D-IDCM that can ensure superior randomness and unpredictability. Experimental results indicated that integer domain chaotic maps constructed using nD-IDCM have desirable Lyapunov exponents and exhibit ergodicity within a sufficient larger chaotic range.

针对现有混沌图的局限性,我们提出了一种非退化n维(n≥2)整数域混沌图(nD-IDCM)模型,它可以构造任意非退化n维整数域混沌图。此外,我们还通过Lyapunov指数分析了它的混沌行为,发现nD-IDCM能在整数域产生混沌序列,从而有效解决了在计算机或数字设备上实现现有混沌图时有限精度效应的问题。为了验证 nD-IDCM 的有效性,我们提出了两个实例,演示如何通过操纵参数矩阵来调节正 Lyapunov 指数。随后,我们通过 TestU01 使用科尔莫哥洛夫熵、样本熵、相关维度和随机性测试仔细研究了它们的动力学行为。最后,为了评估 nD-IDCM 的可行性,我们设计了一种基于 3D-IDCM 的密钥伪随机数生成器(PRNG),它能确保卓越的随机性和不可预测性。实验结果表明,利用 nD-IDCM 构建的整数域混沌图具有理想的 Lyapunov 指数,并在足够大的混沌范围内表现出遍历性。
{"title":"A non-degenerate n-dimensional integer domain chaotic map model with application to PRNG","authors":"Mengdi Zhao,&nbsp;Hongjun Liu","doi":"10.1016/j.vlsi.2024.102200","DOIUrl":"10.1016/j.vlsi.2024.102200","url":null,"abstract":"<div><p>To address the limitations of existing chaotic maps, we proposed a non-degenerate <em>n</em>-dimensional (<em>n</em> ≥ 2) integer domain chaotic map (<em>n</em>D-IDCM) model that can construct any non-degenerate <em>n</em>-dimensional integer domain chaotic maps. Moreover, we analyzed its chaotic behavior through Lyapunov exponent, and found that the <em>n</em>D-IDCM generates chaotic sequences in the integer domain, which effectively resolves the issue of finite precision effect when implementing existing chaotic maps on computers or digital devices. To verify the effectiveness of <em>n</em>D-IDCM, we presented two instances to demonstrate how the positive Lyapunov exponents can be regulated by manipulating the parameter matrix. Subsequently, we have scrutinized their dynamical behavior using Kolmogorov entropy, sample entropy, correlation dimension and randomness testing via TestU01. Finally, to assess the feasibility of <em>n</em>D-IDCM, we devised a keyed pseudo random number generator (PRNG) based on a 3D-IDCM that can ensure superior randomness and unpredictability. Experimental results indicated that integer domain chaotic maps constructed using <em>n</em>D-IDCM have desirable Lyapunov exponents and exhibit ergodicity within a sufficient larger chaotic range.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102200"},"PeriodicalIF":1.9,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140775056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of novel low cost triple-node-upset self-recoverable hardened latch 设计新型低成本三节点嵌入式可自动恢复加固闩锁
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-24 DOI: 10.1016/j.vlsi.2024.102199
Hui Xu , Shuo Zhu , Ruijun Ma , Zhengfeng Huang , Huaguo Liang , Haojie Sun , Chaoming Liu

CMOS devices are increasingly affected by triple-node-upset as transistor characteristics reduce, particularly in radiation environments. For the shortcomings of the existing radiation hardened designs, including high overhead and high delay, this paper proposes a novel low cost triple-node-upset self-recoverable latch. Simulation results show that compared with the existing triple-node-upset hardened designs, the proposed latch has reduced power consumption, delay, and power-delay product by 34.57 %, 6.42 %, and 34.98 %, respectively.

随着晶体管特性的降低,尤其是在辐射环境中,CMOS 器件受三重节点重置的影响越来越大。针对现有辐射加固设计的高开销和高延迟等缺点,本文提出了一种新型低成本三节点重置自恢复锁存器。仿真结果表明,与现有的三节点上集加固设计相比,所提出的锁存器在功耗、延迟和功率-延迟乘积方面分别降低了 34.57%、6.42% 和 34.98%。
{"title":"Design of novel low cost triple-node-upset self-recoverable hardened latch","authors":"Hui Xu ,&nbsp;Shuo Zhu ,&nbsp;Ruijun Ma ,&nbsp;Zhengfeng Huang ,&nbsp;Huaguo Liang ,&nbsp;Haojie Sun ,&nbsp;Chaoming Liu","doi":"10.1016/j.vlsi.2024.102199","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102199","url":null,"abstract":"<div><p>CMOS devices are increasingly affected by triple-node-upset as transistor characteristics reduce, particularly in radiation environments. For the shortcomings of the existing radiation hardened designs, including high overhead and high delay, this paper proposes a novel low cost triple-node-upset self-recoverable latch. Simulation results show that compared with the existing triple-node-upset hardened designs, the proposed latch has reduced power consumption, delay, and power-delay product by 34.57 %, 6.42 %, and 34.98 %, respectively.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102199"},"PeriodicalIF":1.9,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140650477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble learning model for effective thermal simulation of multi-core CPUs 多核 CPU 有效热模拟的集合学习模型
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-24 DOI: 10.1016/j.vlsi.2024.102201
Lin Jiang , Anthony Dowling, Yu Liu, Ming-C. Cheng

An ensemble data-learning approach based on proper orthogonal decomposition (POD) and Galerkin projection (EnPOD-GP) is proposed for thermal simulations of multi-core CPUs to improve training efficiency and the model accuracy for a previously developed global POD-GP method (GPOD-GP). GPOD-GP generates one set of basis functions (or POD modes) to account for thermal behavior in response to variations in dynamic power maps (PMs) in the entire chip, which is computationally intensive to cover possible variations of all power sources. EnPOD-GP however acquires multiple sets of POD modes to significantly improve training efficiency and effectiveness, and its simulation accuracy is independent of any dynamic PM. Compared to finite element simulation, both GPOD-GP and EnPOD-GP offer a computational speedup over 3 orders of magnitude. For a processor with a small number of cores, GPOD-GP provides a more efficient approach. When high accuracy is desired and/or a processor with more cores is involved, EnPOD-GP is more preferable in terms of training effort and simulation accuracy and efficiency. Additionally, the error resulting from EnPOD-GP can be precisely predicted for any random spatiotemporal power excitation.

针对多核 CPU 的热仿真,提出了一种基于适当正交分解(POD)和 Galerkin 投影(EnPOD-GP)的集合数据学习方法,以提高先前开发的全局 POD-GP 方法(GPOD-GP)的训练效率和模型精度。GPOD-GP 生成一组基函数(或 POD 模式)来解释热行为,以响应整个芯片中动态功率图 (PM) 的变化,这需要大量计算才能涵盖所有功率源的可能变化。然而,EnPOD-GP 可获取多组 POD 模式,从而显著提高训练效率和效果,而且其仿真精度与任何动态 PM 无关。与有限元模拟相比,GPOD-GP 和 EnPOD-GP 的计算速度提高了 3 个数量级。对于内核数量较少的处理器,GPOD-GP 提供了一种更高效的方法。当需要高精度和/或更多内核的处理器时,EnPOD-GP 在训练工作量、仿真精度和效率方面更为可取。此外,对于任何随机时空功率激励,EnPOD-GP 产生的误差都可以精确预测。
{"title":"Ensemble learning model for effective thermal simulation of multi-core CPUs","authors":"Lin Jiang ,&nbsp;Anthony Dowling,&nbsp;Yu Liu,&nbsp;Ming-C. Cheng","doi":"10.1016/j.vlsi.2024.102201","DOIUrl":"10.1016/j.vlsi.2024.102201","url":null,"abstract":"<div><p>An ensemble data-learning approach based on proper orthogonal decomposition (POD) and Galerkin projection (EnPOD-GP) is proposed for thermal simulations of multi-core CPUs to improve training efficiency and the model accuracy for a previously developed global POD-GP method (GPOD-GP). GPOD-GP generates one set of basis functions (or POD modes) to account for thermal behavior in response to variations in dynamic power maps (PMs) in the entire chip, which is computationally intensive to cover possible variations of all power sources. EnPOD-GP however acquires multiple sets of POD modes to significantly improve training efficiency and effectiveness, and its simulation accuracy is independent of any dynamic PM. Compared to finite element simulation, both GPOD-GP and EnPOD-GP offer a computational speedup over 3 orders of magnitude. For a processor with a small number of cores, GPOD-GP provides a more efficient approach. When high accuracy is desired and/or a processor with more cores is involved, EnPOD-GP is more preferable in terms of training effort and simulation accuracy and efficiency. Additionally, the error resulting from EnPOD-GP can be precisely predicted for any random spatiotemporal power excitation.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102201"},"PeriodicalIF":1.9,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0167926024000658/pdfft?md5=1bfea626d6bed7a5cf9433aa649eaf0a&pid=1-s2.0-S0167926024000658-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140783197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Co-design based FPGA implementation of an efficient new speech hyperchaotic cryptosystem in the transform domain 基于协同设计的 FPGA 实现变换域高效新型语音超混沌密码系统
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-23 DOI: 10.1016/j.vlsi.2024.102197
Mohamed Salah Azzaz, Redouane Kaibou, Bachir Madani

In this paper a new encryption system has been designed and implemented for real-time speech transmission to reduce bandwidth requirements, increase security and minimize residual intelligibility. To guarantee robustness and lightweight computation, the developed cryptosystem has been carried out in the wavelet transform domain based on a hyperchaotic model to generate mask and permutation keys. The cryptographic system has been designed using a hardware-software (HW/SW) co-design approach by developing several IP-cores in a relatively short development time. The performances and security evaluation of the system have been validated through simulation results followed by an experimental validation through the implementation of an encrypted speech signal transmission between two low cost Nexys-4 DDR FPGA platforms, operating in real-time for both wired and wireless communications. Compared to similar works, high performances have been obtained in terms of bandwidth efficiency due to the use of DWT, limited area of FPGA resources, low power consumption and high security level with a large keyspace that is sufficient to resist against brute force attacks. The designed system can be a very useful solution for many real-time secure integrated voice communication systems, multiple communication purposes, military, professional or personal high level of conversations security.

本文为实时语音传输设计并实施了一种新的加密系统,以降低带宽要求、提高安全性并最大限度地减少残余可懂度。为保证稳健性和轻量级计算,所开发的加密系统在小波变换域中基于超混沌模型生成掩码和置换密钥。加密系统的设计采用了硬件/软件(HW/SW)协同设计方法,在相对较短的开发时间内开发了多个 IP 核。系统的性能和安全性评估已通过仿真结果得到验证,随后通过在两个低成本 Nexys-4 DDR FPGA 平台之间实现加密语音信号传输进行了实验验证,实时运行于有线和无线通信。与同类研究相比,由于使用了 DWT,该系统在带宽效率、有限的 FPGA 资源面积、低功耗和高安全级别等方面都取得了很高的性能。所设计的系统对于许多实时安全综合语音通信系统、多种通信用途、军事、专业或个人高级对话安全都是非常有用的解决方案。
{"title":"Co-design based FPGA implementation of an efficient new speech hyperchaotic cryptosystem in the transform domain","authors":"Mohamed Salah Azzaz,&nbsp;Redouane Kaibou,&nbsp;Bachir Madani","doi":"10.1016/j.vlsi.2024.102197","DOIUrl":"https://doi.org/10.1016/j.vlsi.2024.102197","url":null,"abstract":"<div><p>In this paper a new encryption system has been designed and implemented for real-time speech transmission to reduce bandwidth requirements, increase security and minimize residual intelligibility. To guarantee robustness and lightweight computation, the developed cryptosystem has been carried out in the wavelet transform domain based on a hyperchaotic model to generate mask and permutation keys. The cryptographic system has been designed using a hardware-software (HW/SW) co-design approach by developing several IP-cores in a relatively short development time. The performances and security evaluation of the system have been validated through simulation results followed by an experimental validation through the implementation of an encrypted speech signal transmission between two low cost Nexys-4 DDR FPGA platforms, operating in real-time for both wired and wireless communications. Compared to similar works, high performances have been obtained in terms of bandwidth efficiency due to the use of DWT, limited area of FPGA resources, low power consumption and high security level with a large keyspace that is sufficient to resist against brute force attacks. The designed system can be a very useful solution for many real-time secure integrated voice communication systems, multiple communication purposes, military, professional or personal high level of conversations security.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102197"},"PeriodicalIF":1.9,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140650478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing code allocation for hybrid on-chip memory in IoT systems 优化物联网系统中混合片上存储器的代码分配
IF 1.9 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-04-20 DOI: 10.1016/j.vlsi.2024.102195
Zhe Sun , Zimeng Zhou , Fang-Wei Fu

With the increasing application of IoT devices, the memory subsystem, as the performance and energy bottleneck of IoT systems, has received a lot of attention. One of the keys is on-chip memory which can bridge the performance gap between the CPU and main memory. While many off-the-shelf embedded processors utilize the hybrid on-chip memory architecture containing scratchpad memories (SPMs) and caches, most existing literature ignores the collaboration between caches and SPMs. This paper proposes static SPM allocation strategies for the architecture mentioned above in IoT systems, which try to minimize the overall instruction memory subsystem latency and/or energy consumption. We capture the intra- and inter-task cache conflict misses via a fine-grained temporal cache behavior model. Based on this cache conflict information, we propose an integer linear programming (ILP) algorithm to generate an optimal static function level SPM allocation for system performance. Furthermore, to improve the scalability of the proposed allocation scheme for an enormous task set, we offer the interference factor to calculate the interference impact quantitatively. Then, based on the interference factor, we present two approximate knapsack based heuristic algorithms to provide near optimal static allocation schemes at both function- and basic block-level granularities, which favors fast design space exploration. The experiment results demonstrate that the proposed solution achieves a 30.85% improvement in memory performance, and up to 31.39% reduction in energy consumption, compared to the existing SPM allocation scheme at the function level. In addition, the proposed basic block level allocation algorithm shows better performance than our function level allocation algorithm and other basic block level allocation algorithm.

随着物联网设备的应用日益广泛,存储器子系统作为物联网系统的性能和能耗瓶颈受到了广泛关注。片上存储器是关键之一,它可以弥补 CPU 和主存储器之间的性能差距。虽然许多现成的嵌入式处理器都采用了包含刮板存储器(SPM)和高速缓存的混合片上存储器架构,但大多数现有文献都忽略了高速缓存和 SPM 之间的协作。本文针对物联网系统中的上述架构提出了静态 SPM 分配策略,以尽量减少整体指令存储器子系统的延迟和/或能耗。我们通过细粒度的时间缓存行为模型捕捉任务内和任务间的缓存冲突缺失。基于这些缓存冲突信息,我们提出了一种整数线性规划(ILP)算法,以生成最优的静态函数级 SPM 分配,从而提高系统性能。此外,为了提高所提分配方案在处理庞大任务集时的可扩展性,我们提供了干扰系数来定量计算干扰影响。然后,基于干扰系数,我们提出了两种基于knapsack的近似启发式算法,在函数级和基本块级粒度上提供接近最优的静态分配方案,这有利于快速探索设计空间。实验结果表明,与现有的函数级 SPM 分配方案相比,所提出的解决方案可将内存性能提高 30.85%,能耗降低 31.39%。此外,与我们的函数级分配算法和其他基本块级分配算法相比,所提出的基本块级分配算法显示出更好的性能。
{"title":"Optimizing code allocation for hybrid on-chip memory in IoT systems","authors":"Zhe Sun ,&nbsp;Zimeng Zhou ,&nbsp;Fang-Wei Fu","doi":"10.1016/j.vlsi.2024.102195","DOIUrl":"10.1016/j.vlsi.2024.102195","url":null,"abstract":"<div><p>With the increasing application of IoT devices, the memory subsystem, as the performance and energy bottleneck of IoT systems, has received a lot of attention. One of the keys is on-chip memory which can bridge the performance gap between the CPU and main memory. While many off-the-shelf embedded processors utilize the hybrid on-chip memory architecture containing scratchpad memories (SPMs) and caches, most existing literature ignores the collaboration between caches and SPMs. This paper proposes static SPM allocation strategies for the architecture mentioned above in IoT systems, which try to minimize the overall instruction memory subsystem latency and/or energy consumption. We capture the intra- and inter-task cache conflict misses via a fine-grained temporal cache behavior model. Based on this cache conflict information, we propose an integer linear programming (ILP) algorithm to generate an optimal static function level SPM allocation for system performance. Furthermore, to improve the scalability of the proposed allocation scheme for an enormous task set, we offer the interference factor to calculate the interference impact quantitatively. Then, based on the interference factor, we present two approximate knapsack based heuristic algorithms to provide near optimal static allocation schemes at both function- and basic block-level granularities, which favors fast design space exploration. The experiment results demonstrate that the proposed solution achieves a 30.85% improvement in memory performance, and up to 31.39% reduction in energy consumption, compared to the existing SPM allocation scheme at the function level. In addition, the proposed basic block level allocation algorithm shows better performance than our function level allocation algorithm and other basic block level allocation algorithm.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"97 ","pages":"Article 102195"},"PeriodicalIF":1.9,"publicationDate":"2024-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140794438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Integration-The Vlsi Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1