首页 > 最新文献

ETRI Journal最新文献

英文 中文
AONet: Attention network with optional activation for unsupervised video anomaly detection AONet:可选择激活的注意力网络,用于无监督视频异常检测
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-28 DOI: 10.4218/etrij.2024-0115
Akhrorjon Akhmadjon Ugli Rakhmonov, Barathi Subramanian, Bahar Amirian Varnousefaderani, Jeonghong Kim

Anomaly detection in video surveillance is crucial but challenging due to the rarity of irregular events and ambiguity of defining anomalies. We propose a method called AONet that utilizes a spatiotemporal module to extract spatiotemporal features efficiently, as well as a residual autoencoder equipped with an attention network for effective future frame prediction in video anomaly detection. AONet utilizes a novel activation function called OptAF that combines the strengths of the ReLU, leaky ReLU, and sigmoid functions. Furthermore, the proposed method employs a combination of robust loss functions to address various aspects of prediction errors and enhance training effectiveness. The performance of the proposed method is evaluated on three widely used benchmark datasets. The results indicate that the proposed method outperforms existing state-of-the-art methods and demonstrates comparable performance, achieving area under the curve values of 97.0%, 86.9%, and 73.8% on the UCSD Ped2, CUHK Avenue, and ShanghaiTech Campus datasets, respectively. Additionally, the high speed of the proposed method enables its application to real-time tasks.

视频监控中的异常检测至关重要,但由于非正常事件的罕见性和异常定义的模糊性,异常检测具有挑战性。我们提出了一种名为 AONet 的方法,它利用时空模块有效提取时空特征,并利用配备注意力网络的残差自动编码器在视频异常检测中有效预测未来帧。AONet 采用了一种名为 OptAF 的新型激活函数,它结合了 ReLU、leaky ReLU 和 sigmoid 函数的优点。此外,所提出的方法还采用了鲁棒损失函数的组合,以解决预测误差的各个方面并提高训练效果。我们在三个广泛使用的基准数据集上评估了所提方法的性能。结果表明,所提出的方法优于现有的最先进方法,在 UCSD Ped2、CUHK Avenue 和 ShanghaiTech Campus 数据集上的曲线下面积值分别达到 97.0%、86.9% 和 73.8%,性能相当。此外,所提方法的高速性使其能够应用于实时任务。
{"title":"AONet: Attention network with optional activation for unsupervised video anomaly detection","authors":"Akhrorjon Akhmadjon Ugli Rakhmonov,&nbsp;Barathi Subramanian,&nbsp;Bahar Amirian Varnousefaderani,&nbsp;Jeonghong Kim","doi":"10.4218/etrij.2024-0115","DOIUrl":"https://doi.org/10.4218/etrij.2024-0115","url":null,"abstract":"<p>Anomaly detection in video surveillance is crucial but challenging due to the rarity of irregular events and ambiguity of defining anomalies. We propose a method called AONet that utilizes a spatiotemporal module to extract spatiotemporal features efficiently, as well as a residual autoencoder equipped with an attention network for effective future frame prediction in video anomaly detection. AONet utilizes a novel activation function called OptAF that combines the strengths of the ReLU, leaky ReLU, and sigmoid functions. Furthermore, the proposed method employs a combination of robust loss functions to address various aspects of prediction errors and enhance training effectiveness. The performance of the proposed method is evaluated on three widely used benchmark datasets. The results indicate that the proposed method outperforms existing state-of-the-art methods and demonstrates comparable performance, achieving area under the curve values of 97.0%, 86.9%, and 73.8% on the UCSD Ped2, CUHK Avenue, and ShanghaiTech Campus datasets, respectively. Additionally, the high speed of the proposed method enables its application to real-time tasks.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"890-903"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0115","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142525465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators NEST-C:用于带有人工智能加速器的异构计算系统的深度学习编译器框架
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-28 DOI: 10.4218/etrij.2024-0139
Jeman Park, Misun Yu, Jinse Kwon, Junmo Park, Jemin Lee, Yongin Kwon

Deep learning (DL) has significantly advanced artificial intelligence (AI); however, frameworks such as PyTorch, ONNX, and TensorFlow are optimized for general-purpose GPUs, leading to inefficiencies on specialized accelerators such as neural processing units (NPUs) and processing-in-memory (PIM) devices. These accelerators are designed to optimize both throughput and energy efficiency but they require more tailored optimizations. To address these limitations, we propose the NEST compiler (NEST-C), a novel DL framework that improves the deployment and performance of models across various AI accelerators. NEST-C leverages profiling-based quantization, dynamic graph partitioning, and multi-level intermediate representation (IR) integration for efficient execution on diverse hardware platforms. Our results show that NEST-C significantly enhances computational efficiency and adaptability across various AI accelerators, achieving higher throughput, lower latency, improved resource utilization, and greater model portability. These benefits contribute to more efficient DL model deployment in modern AI applications.

深度学习(DL)极大地推动了人工智能(AI)的发展;然而,PyTorch、ONNX 和 TensorFlow 等框架是针对通用 GPU 优化的,导致神经处理单元(NPU)和内存处理(PIM)设备等专用加速器的效率低下。这些加速器旨在优化吞吐量和能效,但它们需要更有针对性的优化。为了解决这些局限性,我们提出了 NEST 编译器(NEST-C),这是一个新颖的 DL 框架,可改善模型在各种人工智能加速器上的部署和性能。NEST-C 利用基于剖析的量化、动态图分割和多级中间表示(IR)集成,在不同的硬件平台上高效执行。我们的研究结果表明,NEST-C 显著提高了各种人工智能加速器的计算效率和适应性,实现了更高的吞吐量、更低的延迟、更高的资源利用率和更强的模型可移植性。这些优势有助于在现代人工智能应用中更高效地部署 DL 模型。
{"title":"NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators","authors":"Jeman Park,&nbsp;Misun Yu,&nbsp;Jinse Kwon,&nbsp;Junmo Park,&nbsp;Jemin Lee,&nbsp;Yongin Kwon","doi":"10.4218/etrij.2024-0139","DOIUrl":"https://doi.org/10.4218/etrij.2024-0139","url":null,"abstract":"<p>Deep learning (DL) has significantly advanced artificial intelligence (AI); however, frameworks such as PyTorch, ONNX, and TensorFlow are optimized for general-purpose GPUs, leading to inefficiencies on specialized accelerators such as neural processing units (NPUs) and processing-in-memory (PIM) devices. These accelerators are designed to optimize both throughput and energy efficiency but they require more tailored optimizations. To address these limitations, we propose the NEST compiler (NEST-C), a novel DL framework that improves the deployment and performance of models across various AI accelerators. NEST-C leverages profiling-based quantization, dynamic graph partitioning, and multi-level intermediate representation (IR) integration for efficient execution on diverse hardware platforms. Our results show that NEST-C significantly enhances computational efficiency and adaptability across various AI accelerators, achieving higher throughput, lower latency, improved resource utilization, and greater model portability. These benefits contribute to more efficient DL model deployment in modern AI applications.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"851-864"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0139","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142525392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Free-space quantum key distribution transmitter system using WDM filter for channel integration 利用波分复用滤波器实现信道集成的自由空间量子密钥分发发射机系统
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-28 DOI: 10.4218/etrij.2024-0142
Minchul Kim, Kyongchun Lim, Joong-Seon Choe, Byung-Seok Choi, Kap-Joong Kim, Ju Hee Baek, Chun Ju Youn

In this study, we report a transmitter system for free-space quantum key distribution (QKD) using the BB84 protocol, which does not require an internal alignment process, by using a wavelength-division multiplexing (WDM) filter and polarization-encoding module. With a custom-made WDM filter, the signals required for QKD can be integrated by simply connecting fibers, thus avoiding the laborious internal alignment required for free-space QKD systems using conventional bulk-optic setups. The WDM filter is designed to multiplex the single-mode signals from 785-nm quantum and 1550-nm synchronization channels for spatial-mode matching while maintaining the polarization relations. The measured insertion loss and isolation are 1.8 dB and 32.6 dB for 785 nm and 0.7 dB and 28.3 dB for 1550 nm, respectively. We also evaluate the QKD performance of the proposed system. The sifted key rate and quantum bit error rate are 1.6 Mbps and 0.62%, respectively, at an operating speed of 100 MHz, rendering our system comparable to conventional systems using bulk-optic devices for channel integration.

在这项研究中,我们报告了一种使用 BB84 协议的自由空间量子密钥分发(QKD)发射机系统,该系统使用波分复用(WDM)滤波器和偏振编码模块,无需内部对准过程。有了定制的波分复用滤波器,只需连接光纤就能集成 QKD 所需的信号,从而避免了使用传统散装光学装置的自由空间 QKD 系统所需的费力的内部对准过程。波分复用滤波器旨在复用来自 785 纳米量子和 1550 纳米同步通道的单模信号,以实现空间模式匹配,同时保持偏振关系。测得的插入损耗和隔离度分别为:785 nm 1.8 dB 和 32.6 dB,1550 nm 0.7 dB 和 28.3 dB。我们还评估了拟议系统的 QKD 性能。在 100 MHz 的工作速度下,筛分密钥率和量子比特错误率分别为 1.6 Mbps 和 0.62%,使我们的系统可与使用散装光学器件进行信道集成的传统系统相媲美。
{"title":"Free-space quantum key distribution transmitter system using WDM filter for channel integration","authors":"Minchul Kim,&nbsp;Kyongchun Lim,&nbsp;Joong-Seon Choe,&nbsp;Byung-Seok Choi,&nbsp;Kap-Joong Kim,&nbsp;Ju Hee Baek,&nbsp;Chun Ju Youn","doi":"10.4218/etrij.2024-0142","DOIUrl":"https://doi.org/10.4218/etrij.2024-0142","url":null,"abstract":"<p>In this study, we report a transmitter system for free-space quantum key distribution (QKD) using the BB84 protocol, which does not require an internal alignment process, by using a wavelength-division multiplexing (WDM) filter and polarization-encoding module. With a custom-made WDM filter, the signals required for QKD can be integrated by simply connecting fibers, thus avoiding the laborious internal alignment required for free-space QKD systems using conventional bulk-optic setups. The WDM filter is designed to multiplex the single-mode signals from 785-nm quantum and 1550-nm synchronization channels for spatial-mode matching while maintaining the polarization relations. The measured insertion loss and isolation are 1.8 dB and 32.6 dB for 785 nm and 0.7 dB and 28.3 dB for 1550 nm, respectively. We also evaluate the QKD performance of the proposed system. The sifted key rate and quantum bit error rate are 1.6 Mbps and 0.62%, respectively, at an operating speed of 100 MHz, rendering our system comparable to conventional systems using bulk-optic devices for channel integration.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"806-816"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0142","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142525388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fabrication of low-loss symmetrical rib waveguides based on x-cut lithium niobate on insulator for integrated quantum photonics 基于 x 切割铌酸锂绝缘体的低损耗对称肋波导的制造,用于集成量子光子学
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-17 DOI: 10.4218/etrij.2024-0137
Hong-Seok Kim, Guhwan Kim, Tetiana Slusar, Jinwoo Kim, Jiho Park, Jaegyu Park, Hyeon Hwang, Woojin Noh, Hansuek Lee, Min-Kyo Seo, Kiwon Moon, Jung Jin Ju

Lithium niobate on insulator (LNOI) is a promising material platform for applications in integrated quantum photonics. A low optical loss is crucial for preserving fragile quantum states. Therefore, in this study, we have fabricated LNOI rib waveguides with a low optical propagation loss of 0.16 dB/cm by optimizing the etching conditions for various parameters. The symmetry and smoothness of the waveguides on x-cut LNOI are improved by employing a shallow etching process. The proposed method is expected to facilitate the development of on-chip quantum photonic devices based on LNOI.

绝缘体铌酸锂(LNOI)是一种应用于集成量子光子学的前景广阔的材料平台。低光损耗对于保存脆弱的量子态至关重要。因此,在本研究中,我们通过优化蚀刻条件的各种参数,制作出了光传播损耗低至 0.16 dB/cm 的 LNOI 肋波导。通过采用浅层蚀刻工艺,提高了 x 切割 LNOI 上波导的对称性和光滑度。所提出的方法有望促进基于 LNOI 的片上量子光子器件的开发。
{"title":"Fabrication of low-loss symmetrical rib waveguides based on x-cut lithium niobate on insulator for integrated quantum photonics","authors":"Hong-Seok Kim,&nbsp;Guhwan Kim,&nbsp;Tetiana Slusar,&nbsp;Jinwoo Kim,&nbsp;Jiho Park,&nbsp;Jaegyu Park,&nbsp;Hyeon Hwang,&nbsp;Woojin Noh,&nbsp;Hansuek Lee,&nbsp;Min-Kyo Seo,&nbsp;Kiwon Moon,&nbsp;Jung Jin Ju","doi":"10.4218/etrij.2024-0137","DOIUrl":"https://doi.org/10.4218/etrij.2024-0137","url":null,"abstract":"<p>Lithium niobate on insulator (LNOI) is a promising material platform for applications in integrated quantum photonics. A low optical loss is crucial for preserving fragile quantum states. Therefore, in this study, we have fabricated LNOI rib waveguides with a low optical propagation loss of 0.16 dB/cm by optimizing the etching conditions for various parameters. The symmetry and smoothness of the waveguides on \u0000<span></span><math>\u0000 <mi>x</mi></math>-cut LNOI are improved by employing a shallow etching process. The proposed method is expected to facilitate the development of on-chip quantum photonic devices based on LNOI.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"783-792"},"PeriodicalIF":1.3,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0137","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal execution of logical Hadamard with low-space overhead in rotated surface code 在旋转曲面代码中以低空间开销优化逻辑哈达玛的执行
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-15 DOI: 10.4218/etrij.2024-0129
Sang-Min Lee, Ki-Sung Jin, Soo-Cheol Oh, Jin-Ho On, Gyu-Il Cha

Fault-tolerant quantum computation requires error-correcting codes that enable reliable universal quantum operations. This study introduces a novel approach that executes the logical Hadamard with low-space requirements while preserving the original definition of logical operators within the framework of the rotated surface codes. Our method leverages a boundary deformation method to rotate the logical qubit transformed by transversal Hadamard. Following this, the original encoding of the logical qubit is reinstated through logical flip-and-shift operations. The estimated space–time cost for a logical Hadamard operation with a code distance d is 5d2 + 3d2. The efficiency enhancement of the proposed method is approximately four times greater than those of previous approaches, regardless of the code distance. Unlike the traditional method, implementing a logical Hadamard requires only two patches instead of seven. Furthermore, the proposed method ensures the parallelism of quantum circuits by preventing interferences between adjacent logical data qubits.

容错量子计算需要能实现可靠的通用量子运算的纠错码。本研究介绍了一种新方法,它能以低空间要求执行逻辑哈达玛,同时在旋转曲面代码框架内保留逻辑算子的原始定义。我们的方法利用边界变形法旋转由横向哈达玛转换的逻辑量子比特。之后,通过逻辑翻转和移位操作恢复逻辑量子位的原始编码。代码距离为 d 的逻辑哈达玛操作的时空成本估计为 5d2 + 3d2。与之前的方法相比,无论代码距离如何,拟议方法的效率提高了约四倍。与传统方法不同的是,实现逻辑哈达玛运算只需要两个补丁,而不是七个。此外,提出的方法通过防止相邻逻辑数据量子比特之间的干扰,确保了量子电路的并行性。
{"title":"Optimal execution of logical Hadamard with low-space overhead in rotated surface code","authors":"Sang-Min Lee,&nbsp;Ki-Sung Jin,&nbsp;Soo-Cheol Oh,&nbsp;Jin-Ho On,&nbsp;Gyu-Il Cha","doi":"10.4218/etrij.2024-0129","DOIUrl":"https://doi.org/10.4218/etrij.2024-0129","url":null,"abstract":"<p>Fault-tolerant quantum computation requires error-correcting codes that enable reliable universal quantum operations. This study introduces a novel approach that executes the logical Hadamard with low-space requirements while preserving the original definition of logical operators within the framework of the rotated surface codes. Our method leverages a boundary deformation method to rotate the logical qubit transformed by transversal Hadamard. Following this, the original encoding of the logical qubit is reinstated through logical flip-and-shift operations. The estimated space–time cost for a logical Hadamard operation with a code distance d is 5<i>d</i><sup>2</sup> + 3<i>d</i><sup>2</sup>. The efficiency enhancement of the proposed method is approximately four times greater than those of previous approaches, regardless of the code distance. Unlike the traditional method, implementing a logical Hadamard requires only two patches instead of seven. Furthermore, the proposed method ensures the parallelism of quantum circuits by preventing interferences between adjacent logical data qubits.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"759-773"},"PeriodicalIF":1.3,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0129","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asynchronous interface circuit for nonlinear connectivity in multicore spiking neural networks 多核尖峰神经网络非线性连接的异步接口电路
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-12 DOI: 10.4218/etrij.2024-0135
Sung-Eun Kim, Kwang-Il Oh, Taewook Kang, Sukho Lee, Hyuk Kim, Mi-Jeong Park, Jae-Jin Lee

To expand the scale of spiking neural networks (SNNs), an interface circuit that supports multiple SNN cores is essential. This circuit should be designed using an asynchronous approach to leverage characteristics of SNNs similar to those of the human brain. However, the absence of a global clock presents timing issues during implementation. Hence, we propose an intermediate latching template to establish asynchronous nonlinear connectivity with multipipeline processing between multiple SNN cores. We design arbitration and distribution blocks in the interface circuit based on the proposed template and fabricate an interface circuit that supports four SNN cores using a full-custom approach in a 28-nm CMOS (complementary metal–oxide–semiconductor) FDSOI (fully depleted silicon on insulator) process. The proposed template can enhance throughput in the interface circuit by up to 53% compared with the conventional asynchronous template. The interface circuit transmits spikes while consuming 1.7 and 3.7 pJ of power, supporting 606 and 59 Mevent/s in intrachip and interchip communications, respectively.

要扩大尖峰神经网络(SNN)的规模,必须有一个支持多个 SNN 内核的接口电路。这种电路的设计应采用异步方法,以充分利用尖峰神经网络与人脑类似的特性。然而,由于没有全局时钟,在实现过程中会出现时序问题。因此,我们提出了一种中间锁存模板,用于在多个 SNN 内核之间建立异步非线性连接和多线处理。我们根据提出的模板设计了接口电路中的仲裁和分配块,并在 28 纳米 CMOS(互补金属氧化物半导体)FDSOI(绝缘体上全耗尽硅)工艺中采用全定制方法制造了支持四个 SNN 内核的接口电路。与传统的异步模板相比,所提出的模板可将接口电路的吞吐量提高 53%。接口电路在消耗 1.7 和 3.7 pJ 功率的情况下传输尖峰,在芯片内和芯片间通信中分别支持 606 和 59 Mevent/s。
{"title":"Asynchronous interface circuit for nonlinear connectivity in multicore spiking neural networks","authors":"Sung-Eun Kim,&nbsp;Kwang-Il Oh,&nbsp;Taewook Kang,&nbsp;Sukho Lee,&nbsp;Hyuk Kim,&nbsp;Mi-Jeong Park,&nbsp;Jae-Jin Lee","doi":"10.4218/etrij.2024-0135","DOIUrl":"https://doi.org/10.4218/etrij.2024-0135","url":null,"abstract":"<p>To expand the scale of spiking neural networks (SNNs), an interface circuit that supports multiple SNN cores is essential. This circuit should be designed using an asynchronous approach to leverage characteristics of SNNs similar to those of the human brain. However, the absence of a global clock presents timing issues during implementation. Hence, we propose an intermediate latching template to establish asynchronous nonlinear connectivity with multipipeline processing between multiple SNN cores. We design arbitration and distribution blocks in the interface circuit based on the proposed template and fabricate an interface circuit that supports four SNN cores using a full-custom approach in a 28-nm CMOS (complementary metal–oxide–semiconductor) FDSOI (fully depleted silicon on insulator) process. The proposed template can enhance throughput in the interface circuit by up to 53% compared with the conventional asynchronous template. The interface circuit transmits spikes while consuming 1.7 and 3.7 pJ of power, supporting 606 and 59 Mevent/s in intrachip and interchip communications, respectively.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"878-889"},"PeriodicalIF":1.3,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0135","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
XEM: Tensor accelerator for AB21 supercomputing artificial intelligence processor XEM:用于 AB21 超级计算人工智能处理器的张量加速器
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-12 DOI: 10.4218/etrij.2024-0141
Won Jeon, Mi Young Lee, Joo Hyun Lee, Chun-Gi Lyuh

As computing systems become increasingly larger, high-performance computing (HPC) is gaining importance. In particular, as hyperscale artificial intelligence (AI) applications, such as large language models emerge, HPC has become important even in the field of AI. Important operations in hyperscale AI and HPC are mainly linear algebraic operations based on tensors. An AB21 supercomputing AI processor has been proposed to accelerate such applications. This study proposes a XEM accelerator to accelerate linear algebraic operations in an AB21 processor effectively. The XEM accelerator has outer product-based parallel floating-point units that can efficiently process tensor operations. We provide hardware details of the XEM architecture and introduce new instructions for controlling the XEM accelerator. Additionally, hardware characteristic analyses based on chip fabrication and simulator-based functional verification are conducted. In the future, the performance and functionalities of the XEM accelerator will be verified using an AB21 processor.

随着计算系统变得越来越大,高性能计算(HPC)的重要性也与日俱增。特别是随着超大规模人工智能(AI)应用(如大型语言模型)的出现,高性能计算甚至在人工智能领域也变得非常重要。超大规模人工智能和 HPC 中的重要运算主要是基于张量的线性代数运算。为加速此类应用,有人提出了一种 AB21 超级计算人工智能处理器。本研究提出了一种 XEM 加速器,以有效加速 AB21 处理器中的线性代数运算。XEM 加速器具有基于外积的并行浮点运算单元,可高效处理张量运算。我们提供了 XEM 架构的硬件细节,并介绍了用于控制 XEM 加速器的新指令。此外,我们还进行了基于芯片制造的硬件特性分析和基于模拟器的功能验证。未来,我们将使用 AB21 处理器验证 XEM 加速器的性能和功能。
{"title":"XEM: Tensor accelerator for AB21 supercomputing artificial intelligence processor","authors":"Won Jeon,&nbsp;Mi Young Lee,&nbsp;Joo Hyun Lee,&nbsp;Chun-Gi Lyuh","doi":"10.4218/etrij.2024-0141","DOIUrl":"https://doi.org/10.4218/etrij.2024-0141","url":null,"abstract":"<p>As computing systems become increasingly larger, high-performance computing (HPC) is gaining importance. In particular, as hyperscale artificial intelligence (AI) applications, such as large language models emerge, HPC has become important even in the field of AI. Important operations in hyperscale AI and HPC are mainly linear algebraic operations based on tensors. An AB21 supercomputing AI processor has been proposed to accelerate such applications. This study proposes a XEM accelerator to accelerate linear algebraic operations in an AB21 processor effectively. The XEM accelerator has outer product-based parallel floating-point units that can efficiently process tensor operations. We provide hardware details of the XEM architecture and introduce new instructions for controlling the XEM accelerator. Additionally, hardware characteristic analyses based on chip fabrication and simulator-based functional verification are conducted. In the future, the performance and functionalities of the XEM accelerator will be verified using an AB21 processor.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"839-850"},"PeriodicalIF":1.3,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0141","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantum electrodynamical formulation of photochemical acid generation and its implications on optical lithography 光化学酸生成的量子电动力学公式及其对光学光刻技术的影响
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-12 DOI: 10.4218/etrij.2024-0127
Seungjin Lee

The photochemical acid generation is refined from the first principles of quantum electrodynamics. First, we briefly review the formulation of the quantum theory of light based on the quantum electrodynamics framework to establish the probability of acid generation at a given spacetime point. The quantum mechanical acid generation is then combined with the deprotection mechanism to obtain a probabilistic description of the deprotection density directly related to feature formation in a photoresist. A statistical analysis of the random deprotection density is presented to reveal the leading characteristics of stochastic feature formation.

光化学酸生成是从量子电动力学的第一原理提炼出来的。首先,我们简要回顾了基于量子电动力学框架的光量子理论的表述,以确定在给定时空点酸生成的概率。然后将量子力学酸生成与去保护机制相结合,得到与光刻胶中特征形成直接相关的去保护密度的概率描述。通过对随机去保护密度的统计分析,揭示了随机特征形成的主要特征。
{"title":"Quantum electrodynamical formulation of photochemical acid generation and its implications on optical lithography","authors":"Seungjin Lee","doi":"10.4218/etrij.2024-0127","DOIUrl":"https://doi.org/10.4218/etrij.2024-0127","url":null,"abstract":"<p>The photochemical acid generation is refined from the first principles of quantum electrodynamics. First, we briefly review the formulation of the quantum theory of light based on the quantum electrodynamics framework to establish the probability of acid generation at a given spacetime point. The quantum mechanical acid generation is then combined with the deprotection mechanism to obtain a probabilistic description of the deprotection density directly related to feature formation in a photoresist. A statistical analysis of the random deprotection density is presented to reveal the leading characteristics of stochastic feature formation.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"774-782"},"PeriodicalIF":1.3,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0127","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance improvement of microLED-to-microLED visible light communication using reverse bias 利用反向偏置改进微led到微led可见光通信的性能
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-12 DOI: 10.4218/etrij.2023-0484
Bo-Guen Kim, Sung-Man Kim

LED-to-LED visible light communication (VLC), which uses LEDs not only for the transmitter but also for the receiver, is being studied as an efficient optical wireless communication technology that uses LED lighting infrastructure. In this paper, we investigate microLED-to-LED VLC, which uses microLEDs as both the transmitter and receiver. In particular, we conducted a study to improve the performance of microLED-to-microLED VLC. For this, we measured the performance depending on the transmitter and receiver LED color combination. In addition, the effects of zero bias and reverse bias at the receiver LED were investigated. We also investigated the improvement in the reverse bias when applying a transimpedance amplifier to the receiver LED. Finally, we experimentally demonstrated a data rate of 360 kbps in the microLED-to-microLED VLC.

LED 对 LED 可见光通信(VLC)不仅将 LED 用作发射器,还将其用作接收器,作为一种利用 LED 照明基础设施的高效光无线通信技术,目前正在对其进行研究。在本文中,我们研究了同时使用微型 LED 作为发射器和接收器的微型 LED 对 LED 可见光通信(VLC)。我们特别研究了如何提高 microLED 对 microLED VLC 的性能。为此,我们测量了发射器和接收器 LED 颜色组合的性能。此外,我们还研究了接收器 LED 的零偏压和反向偏压的影响。我们还研究了在接收器 LED 上应用跨阻放大器对反向偏置的改善。最后,我们通过实验演示了微型 LED 到微型 LED VLC 的 360 kbps 数据传输速率。
{"title":"Performance improvement of microLED-to-microLED visible light communication using reverse bias","authors":"Bo-Guen Kim,&nbsp;Sung-Man Kim","doi":"10.4218/etrij.2023-0484","DOIUrl":"https://doi.org/10.4218/etrij.2023-0484","url":null,"abstract":"<p>LED-to-LED visible light communication (VLC), which uses LEDs not only for the transmitter but also for the receiver, is being studied as an efficient optical wireless communication technology that uses LED lighting infrastructure. In this paper, we investigate microLED-to-LED VLC, which uses microLEDs as both the transmitter and receiver. In particular, we conducted a study to improve the performance of microLED-to-microLED VLC. For this, we measured the performance depending on the transmitter and receiver LED color combination. In addition, the effects of zero bias and reverse bias at the receiver LED were investigated. We also investigated the improvement in the reverse bias when applying a transimpedance amplifier to the receiver LED. Finally, we experimentally demonstrated a data rate of 360 kbps in the microLED-to-microLED VLC.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 2","pages":"270-277"},"PeriodicalIF":1.3,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0484","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143835982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixed-mode SNN crossbar array with embedded dummy switch and mid-node pre-charge scheme 采用嵌入式假开关和中节点预充电方案的混合模式 SNN 横杆阵列
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-12 DOI: 10.4218/etrij.2024-0120
Kwang-Il Oh, Hyuk Kim, Taewook Kang, Sung-Eun Kim, Jae-Jin Lee, Byung-Do Yang

This paper presents a membrane computation error-minimized mixed-mode spiking neural network (SNN) crossbar array. Our approach involves implementing an embedded dummy switch scheme and a mid-node pre-charge scheme to construct a high-precision current-mode synapse. We effectively suppressed charge sharing between membrane capacitors and the parasitic capacitance of synapses that results in membrane computation error. A 400 × 20 SNN crossbar prototype chip is fabricated via a 28-nm FDSOI CMOS process, and 20 MNIST patterns with their sizes reduced to 20 × 20 pixels are successfully recognized under 411 μW of power consumed. Moreover, the peak-to-peak deviation of the normalized output spike count measured from the 21 fabricated SNN prototype chips is within 16.5% from the ideal value, including sample-wise random variations.

本文介绍了一种膜计算误差最小化混合模式尖峰神经网络(SNN)横杆阵列。我们的方法包括实施嵌入式假开关方案和中节点预充电方案,以构建高精度电流模式突触。我们有效地抑制了膜电容之间的电荷共享以及导致膜计算误差的突触寄生电容。我们采用 28 纳米 FDSOI CMOS 工艺制造了 400 × 20 SNN 横条原型芯片,并成功识别了 20 个尺寸缩小为 20 × 20 像素的 MNIST 图案,功耗仅为 411 μW。此外,从 21 个已制造的 SNN 原型芯片测得的归一化输出尖峰计数的峰峰值偏差与理想值的偏差在 16.5% 以内,其中包括样本随机变量。
{"title":"Mixed-mode SNN crossbar array with embedded dummy switch and mid-node pre-charge scheme","authors":"Kwang-Il Oh,&nbsp;Hyuk Kim,&nbsp;Taewook Kang,&nbsp;Sung-Eun Kim,&nbsp;Jae-Jin Lee,&nbsp;Byung-Do Yang","doi":"10.4218/etrij.2024-0120","DOIUrl":"https://doi.org/10.4218/etrij.2024-0120","url":null,"abstract":"<p>This paper presents a membrane computation error-minimized mixed-mode spiking neural network (SNN) crossbar array. Our approach involves implementing an embedded dummy switch scheme and a mid-node pre-charge scheme to construct a high-precision current-mode synapse. We effectively suppressed charge sharing between membrane capacitors and the parasitic capacitance of synapses that results in membrane computation error. A 400 × 20 SNN crossbar prototype chip is fabricated via a 28-nm FDSOI CMOS process, and 20 MNIST patterns with their sizes reduced to 20 × 20 pixels are successfully recognized under 411 μW of power consumed. Moreover, the peak-to-peak deviation of the normalized output spike count measured from the 21 fabricated SNN prototype chips is within 16.5% from the ideal value, including sample-wise random variations.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"865-877"},"PeriodicalIF":1.3,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0120","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ETRI Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1