首页 > 最新文献

ETRI Journal最新文献

英文 中文
An end-to-end joint learning scheme of image compression and quality enhancement with improved entropy minimization 基于改进熵最小化的图像压缩和质量增强的端到端联合学习方案
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-05-27 DOI: 10.4218/etrij.2023-0275
Jooyoung Lee, Seunghyun Cho, Munchurl Kim

Recently, learned image compression methods based on entropy minimization have achieved superior results compared with conventional image codecs such as BPG and JPEG2000. However, they leverage single Gaussian models, which have a limited ability to approximate various irregular distributions of transformed latent representations, resulting in suboptimal coding efficiency. Furthermore, existing methods focus on constructing effective entropy models, rather than utilizing modern architectural techniques. In this paper, we propose a novel joint learning scheme called JointIQ-Net that incorporates image compression and quality enhancement technologies with improved entropy minimization based on a newly adopted Gaussian mixture model. We also exploit global context to estimate the distributions of latent representations precisely. The results of extensive experiments demonstrate that JointIQ-Net achieves remarkable performance improvements in terms of coding efficiency compared with existing learned image compression methods and conventional codecs. To the best of our knowledge, ours is the first learned image compression method that outperforms VVC intra-coding in terms of both PSNR and MS-SSIM.

近年来,与传统的BPG和JPEG2000等图像编解码器相比,基于熵最小化的学习图像压缩方法取得了更好的效果。然而,它们利用单一高斯模型,该模型具有有限的能力来近似变换后的潜在表示的各种不规则分布,导致次优编码效率。此外,现有方法侧重于构建有效的熵模型,而不是利用现代体系结构技术。在本文中,我们提出了一种新的联合学习方案,称为JointIQ-Net,它结合了图像压缩和质量增强技术以及基于新采用的高斯混合模型改进的熵最小化。我们还利用全局上下文来精确估计潜在表征的分布。大量的实验结果表明,与现有的学习图像压缩方法和传统编解码器相比,JointIQ-Net在编码效率方面取得了显著的提高。据我们所知,我们的方法是第一个在PSNR和MS-SSIM方面都优于VVC内编码的学习图像压缩方法。
{"title":"An end-to-end joint learning scheme of image compression and quality enhancement with improved entropy minimization","authors":"Jooyoung Lee,&nbsp;Seunghyun Cho,&nbsp;Munchurl Kim","doi":"10.4218/etrij.2023-0275","DOIUrl":"https://doi.org/10.4218/etrij.2023-0275","url":null,"abstract":"<p>Recently, learned image compression methods based on entropy minimization have achieved superior results compared with conventional image codecs such as BPG and JPEG2000. However, they leverage single Gaussian models, which have a limited ability to approximate various irregular distributions of transformed latent representations, resulting in suboptimal coding efficiency. Furthermore, existing methods focus on constructing effective entropy models, rather than utilizing modern architectural techniques. In this paper, we propose a novel joint learning scheme called JointIQ-Net that incorporates image compression and quality enhancement technologies with improved entropy minimization based on a newly adopted Gaussian mixture model. We also exploit global context to estimate the distributions of latent representations precisely. The results of extensive experiments demonstrate that JointIQ-Net achieves remarkable performance improvements in terms of coding efficiency compared with existing learned image compression methods and conventional codecs. To the best of our knowledge, ours is the first learned image compression method that outperforms VVC intra-coding in terms of both PSNR and MS-SSIM.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 6","pages":"935-949"},"PeriodicalIF":1.3,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0275","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142862260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved VoWiFi cell capacity using A-MPDU for frame aggregation in sixth-generation WLAN standard 在第六代无线局域网标准中使用 A-MPDU 进行帧聚合,提高 VoWiFi 小区容量
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-05-20 DOI: 10.4218/etrij.2023-0333
Ayes Chinmay, Hemanta Kumar Pati

The rapid progress in wireless communication technology and proliferation of multimedia applications, including voice over WiFi (VoWiFi), demand exploration of innovative approaches to enhance the network performance and quality of service. We propose a technique for enhancing the cell capacity of wireless local area network (WLAN) access point that provides VoWiFi service in the sixth-generation WLAN standard. The proposed technique uses the aggregate media access control protocol data unit (A-MPDU) for frame aggregation with constant bit rate (CBR) traffic in the WiFi 6 standard (i.e., IEEE 802.11ax). On the other hand, the retransmission of voice packets substantially deteriorates the VoWiFi cell capacity. We compare the results obtained from the use of WiFi 6 with currently existing WLAN standards, such as IEEE 802.11b/g/n/ac. This comparison focuses on distributed coordination function interframe spacing (DIFS) and arbitration interframe spacing (AIFS) using CBR traffic. Using our technique, we can increase the VoWiFi cell capacity for CBR traffic by 24.25% and 25.20% when using DIFS and AIFS, respectively, while considering the A-MPDU frame aggregation technique.

随着无线通信技术的飞速发展和包括 WiFi 语音(VoWiFi)在内的多媒体应用的激增,需要探索创新方法来提高网络性能和服务质量。我们提出了一种在第六代无线局域网标准中增强提供 VoWiFi 服务的无线局域网(WLAN)接入点小区容量的技术。所提出的技术在 WiFi 6 标准(即 IEEE 802.11ax)中使用聚合媒体访问控制协议数据单元(A-MPDU)对恒定比特率(CBR)流量进行帧聚合。另一方面,语音数据包的重传大大降低了 VoWiFi 小区的容量。我们比较了使用 WiFi 6 和现有无线局域网标准(如 IEEE 802.11b/g/n/ac)所获得的结果。比较的重点是使用 CBR 流量的分布式协调功能帧间距(DIFS)和仲裁帧间距(AIFS)。使用我们的技术,在考虑到 A-MPDU 帧聚合技术的情况下,使用 DIFS 和 AIFS 时可将 CBR 流量的 VoWiFi 小区容量分别提高 24.25% 和 25.20%。
{"title":"Improved VoWiFi cell capacity using A-MPDU for frame aggregation in sixth-generation WLAN standard","authors":"Ayes Chinmay,&nbsp;Hemanta Kumar Pati","doi":"10.4218/etrij.2023-0333","DOIUrl":"10.4218/etrij.2023-0333","url":null,"abstract":"<p>The rapid progress in wireless communication technology and proliferation of multimedia applications, including voice over WiFi (VoWiFi), demand exploration of innovative approaches to enhance the network performance and quality of service. We propose a technique for enhancing the cell capacity of wireless local area network (WLAN) access point that provides VoWiFi service in the sixth-generation WLAN standard. The proposed technique uses the aggregate media access control protocol data unit (A-MPDU) for frame aggregation with constant bit rate (CBR) traffic in the WiFi 6 standard (i.e., IEEE 802.11ax). On the other hand, the retransmission of voice packets substantially deteriorates the VoWiFi cell capacity. We compare the results obtained from the use of WiFi 6 with currently existing WLAN standards, such as IEEE 802.11b/g/n/ac. This comparison focuses on distributed coordination function interframe spacing (DIFS) and arbitration interframe spacing (AIFS) using CBR traffic. Using our technique, we can increase the VoWiFi cell capacity for CBR traffic by 24.25% and 25.20% when using DIFS and AIFS, respectively, while considering the A-MPDU frame aggregation technique.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 6","pages":"965-976"},"PeriodicalIF":1.3,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0333","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141119983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-complexity patch projection method for efficient and lightweight point-cloud compression 用于高效、轻量级点云压缩的低复杂度补丁投影法
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-05-15 DOI: 10.4218/etrij.2023-0242
Sungryeul Rhyu, Junsik Kim, Gwang Hoon Park, Kyuheon Kim

The point cloud provides viewers with intuitive geometric understanding but requires a huge amount of data. Moving Picture Experts Group (MPEG) has developed video-based point-cloud compression in the range of 300–700. As the compression rate increases, the complexity increases to the extent that it takes 101.36 s to compress one frame in an experimental environment using a personal computer. To realize real-time point-cloud compression processing, the direct patch projection (DPP) method proposed herein simplifies the complex patch segmentation process by classifying and projecting points according to their geometric positions. The DPP method decreases the complexity of the patch segmentation from 25.75 s to 0.10 s per frame, and the entire process becomes 8.76 times faster than the conventional one. Consequently, this proposed DPP method yields similar peak signal-to-noise ratio (PSNR) outcomes to those of the conventional method at reduced times (4.7–5.5 times) at the cost of bitrate overhead. The objective and subjective results show that the proposed DPP method can be considered when low-complexity requirements are required in lightweight device environments.

点云为观众提供了直观的几何理解,但需要大量数据。移动图像专家组(MPEG)开发了基于视频的点云压缩技术,压缩率范围为 300-700。随着压缩率的提高,复杂性也随之增加,在实验环境中使用个人电脑压缩一帧图像需要 101.36 秒。为了实现实时点云压缩处理,本文提出的直接补丁投影(DPP)方法通过根据点的几何位置对点进行分类和投影,简化了复杂的补丁分割过程。DPP 方法将斑块分割的复杂度从每帧 25.75 秒降至 0.10 秒,整个过程比传统方法快 8.76 倍。因此,所提出的 DPP 方法以比特率开销为代价,在缩短时间(4.7-5.5 倍)的情况下获得了与传统方法相似的峰值信噪比(PSNR)结果。客观和主观结果表明,在轻量级设备环境中需要低复杂度要求时,可以考虑采用所提出的 DPP 方法。
{"title":"Low-complexity patch projection method for efficient and lightweight point-cloud compression","authors":"Sungryeul Rhyu,&nbsp;Junsik Kim,&nbsp;Gwang Hoon Park,&nbsp;Kyuheon Kim","doi":"10.4218/etrij.2023-0242","DOIUrl":"10.4218/etrij.2023-0242","url":null,"abstract":"<p>The point cloud provides viewers with intuitive geometric understanding but requires a huge amount of data. Moving Picture Experts Group (MPEG) has developed video-based point-cloud compression in the range of 300–700. As the compression rate increases, the complexity increases to the extent that it takes 101.36 s to compress one frame in an experimental environment using a personal computer. To realize real-time point-cloud compression processing, the direct patch projection (DPP) method proposed herein simplifies the complex patch segmentation process by classifying and projecting points according to their geometric positions. The DPP method decreases the complexity of the patch segmentation from 25.75 s to 0.10 s per frame, and the entire process becomes 8.76 times faster than the conventional one. Consequently, this proposed DPP method yields similar peak signal-to-noise ratio (PSNR) outcomes to those of the conventional method at reduced times (4.7–5.5 times) at the cost of bitrate overhead. The objective and subjective results show that the proposed DPP method can be considered when low-complexity requirements are required in lightweight device environments.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 4","pages":"683-696"},"PeriodicalIF":1.3,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0242","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140974659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of high-efficiency rectenna for microwave power wireless transmission systems 设计用于微波功率无线传输系统的高效整流器天线
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-05-14 DOI: 10.4218/etrij.2023-0290
Jia-Xiang Chen, Zhong-Hua Ma, Chen Li, Meng-Nan Wang, Hai-Tao Xing, Wei-Qian Liang, Yan-Feng Jiang

Radio power transmission is studied herein. A rectenna with high-efficiency microwave power transmission is proposed. Improvement on the traditional Yagi antenna structure is implemented by using a rectenna. In this design, the antenna is fed by a coplanar waveguide, and the impedance bandwidth is widened by adding a set of radiation dipoles. A set of parasitic directors is used to enhance the directionality of the antenna. A rectifier circuit based on a power-recovery network is designed using an open branch-line network with harmonic rejection characteristics as a low-pass filter. The rectifier network topology is adopted with a diode in parallel connection to increase the output voltage of the circuit. The reflected power generated by the impedance mismatch can be recycled by the power-recovery network. Thus, the rectifying efficiency can be improved. Finally, experimental data show that the radiofrequency–direct current power conversion efficiency of the rectenna can be as high as 77.5% at the operating frequency 2.45 GHz and the input power 12 dBm. The proposed rectenna can be potentially used for the power supply of various wireless sensor nodes with high efficiency.

本文对无线电功率传输进行了研究。本文提出了一种具有高效微波功率传输功能的整流天线。利用整流天线对传统的八木天线结构进行了改进。在该设计中,天线由共面波导馈电,并通过增加一组辐射偶极子来拓宽阻抗带宽。一组寄生导向器用于增强天线的方向性。利用具有谐波抑制特性的开放式支线网络作为低通滤波器,设计了基于功率恢复网络的整流电路。整流网络拓扑结构采用二极管并联,以提高电路的输出电压。阻抗失配产生的反射功率可由功率回收网络回收利用。因此,整流效率得以提高。最后,实验数据显示,在工作频率为 2.45 GHz、输入功率为 12 dBm 时,整流天线的射频-直流电功率转换效率高达 77.5%。所提出的整流天线可用于为各种无线传感器节点高效供电。
{"title":"Design of high-efficiency rectenna for microwave power wireless transmission systems","authors":"Jia-Xiang Chen,&nbsp;Zhong-Hua Ma,&nbsp;Chen Li,&nbsp;Meng-Nan Wang,&nbsp;Hai-Tao Xing,&nbsp;Wei-Qian Liang,&nbsp;Yan-Feng Jiang","doi":"10.4218/etrij.2023-0290","DOIUrl":"10.4218/etrij.2023-0290","url":null,"abstract":"<p>Radio power transmission is studied herein. A rectenna with high-efficiency microwave power transmission is proposed. Improvement on the traditional Yagi antenna structure is implemented by using a rectenna. In this design, the antenna is fed by a coplanar waveguide, and the impedance bandwidth is widened by adding a set of radiation dipoles. A set of parasitic directors is used to enhance the directionality of the antenna. A rectifier circuit based on a power-recovery network is designed using an open branch-line network with harmonic rejection characteristics as a low-pass filter. The rectifier network topology is adopted with a diode in parallel connection to increase the output voltage of the circuit. The reflected power generated by the impedance mismatch can be recycled by the power-recovery network. Thus, the rectifying efficiency can be improved. Finally, experimental data show that the radiofrequency–direct current power conversion efficiency of the rectenna can be as high as 77.5% at the operating frequency 2.45 GHz and the input power 12 dBm. The proposed rectenna can be potentially used for the power supply of various wireless sensor nodes with high efficiency.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 6","pages":"1103-1112"},"PeriodicalIF":1.3,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0290","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140980107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An efficient dual layer data aggregation scheme in clustered wireless sensor networks 集群无线传感器网络中的高效双层数据聚合方案
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-05-06 DOI: 10.4218/etrij.2023-0214
Fenting Yang, Zhen Xu, Lei Yang

In wireless sensor network (WSN) monitoring systems, redundant data from sluggish environmental changes and overlapping sensing ranges can increase the volume of data sent by nodes, degrade the efficiency of information collection, and lead to the death of sensor nodes. To reduce the energy consumption of sensor nodes and prolong the life of WSNs, this study proposes a dual layer intracluster data fusion scheme based on ring buffer. To reduce redundant data and temporary anomalous data while guaranteeing the temporal coherence of data, the source nodes employ a binarized similarity function and sliding quartile detection based on the ring buffer. Based on the improved support degree function of weighted Pearson distance, the cluster head node performs a weighted fusion on the data received from the source nodes. Experimental results reveal that the scheme proposed in this study has clear advantages in three aspects: the number of remaining nodes, residual energy, and the number of packets transmitted. The data fusion of the proposed scheme is confined to the data fusion of the same attribute environment parameters.

在无线传感器网络(WSN)监测系统中,由于环境变化缓慢和传感范围重叠而产生的冗余数据会增加节点发送的数据量,降低信息收集效率,并导致传感器节点死亡。为了降低传感器节点的能耗,延长 WSN 的寿命,本研究提出了一种基于环形缓冲区的双层簇内数据融合方案。为了减少冗余数据和临时异常数据,同时保证数据的时间一致性,源节点采用了基于环形缓冲区的二值化相似度函数和滑动四分位检测。簇首节点根据改进的加权皮尔逊距离支持度函数,对从源节点接收到的数据进行加权融合。实验结果表明,本研究提出的方案在剩余节点数、剩余能量和数据包传输数三个方面具有明显优势。所提方案的数据融合仅限于相同属性环境参数的数据融合。
{"title":"An efficient dual layer data aggregation scheme in clustered wireless sensor networks","authors":"Fenting Yang,&nbsp;Zhen Xu,&nbsp;Lei Yang","doi":"10.4218/etrij.2023-0214","DOIUrl":"10.4218/etrij.2023-0214","url":null,"abstract":"<p>In wireless sensor network (WSN) monitoring systems, redundant data from sluggish environmental changes and overlapping sensing ranges can increase the volume of data sent by nodes, degrade the efficiency of information collection, and lead to the death of sensor nodes. To reduce the energy consumption of sensor nodes and prolong the life of WSNs, this study proposes a dual layer intracluster data fusion scheme based on ring buffer. To reduce redundant data and temporary anomalous data while guaranteeing the temporal coherence of data, the source nodes employ a binarized similarity function and sliding quartile detection based on the ring buffer. Based on the improved support degree function of weighted Pearson distance, the cluster head node performs a weighted fusion on the data received from the source nodes. Experimental results reveal that the scheme proposed in this study has clear advantages in three aspects: the number of remaining nodes, residual energy, and the number of packets transmitted. The data fusion of the proposed scheme is confined to the data fusion of the same attribute environment parameters.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 4","pages":"604-618"},"PeriodicalIF":1.3,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0214","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141006889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Writer verification using feature selection based on genetic algorithm: A case study on handwritten Bangla dataset 利用基于遗传算法的特征选择进行作家验证:手写孟加拉语数据集案例研究
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-28 DOI: 10.4218/etrij.2023-0188
Jaya Paul, Kalpita Dutta, Anasua Sarkar, Kaushik Roy, Nibaran Das

Author verification is challenging because of the diversity in writing styles. We propose an enhanced handwriting verification method that combines handcrafted and automatically extracted features. The method uses a genetic algorithm to reduce the dimensionality of the feature set. We consider offline Bangla handwriting content and evaluate the proposed method using handcrafted features with a simple logistic regression, radial basis function network, and sequential minimal optimization as well as automatically extracted features using a convolutional neural network. The handcrafted features outperform the automatically extracted ones, achieving an average verification accuracy of 94.54% for 100 writers. The handcrafted features include Radon transform, histogram of oriented gradients, local phase quantization, and local binary patterns from interwriter and intrawriter content. The genetic algorithm reduces the feature dimensionality and selects salient features using a support vector machine. The top five experimental results are obtained from the optimal feature set selected using a consensus strategy. Comparisons with other methods and features confirm the satisfactory results.

由于书写风格的多样性,作者验证具有挑战性。我们提出了一种结合手工制作和自动提取特征的增强型手写验证方法。该方法使用遗传算法来降低特征集的维度。我们考虑了离线孟加拉语手写内容,并使用简单逻辑回归、径向基函数网络和顺序最小优化等手工特征以及卷积神经网络自动提取的特征对所提出的方法进行了评估。手工创建的特征优于自动提取的特征,在 100 位作家中实现了 94.54% 的平均验证准确率。手工特征包括拉顿变换、定向梯度直方图、局部相位量化以及来自作家间和作家内内容的局部二进制模式。遗传算法降低了特征维度,并使用支持向量机选择突出特征。实验结果的前五名来自使用共识策略选出的最佳特征集。与其他方法和特征的比较证实了结果令人满意。
{"title":"Writer verification using feature selection based on genetic algorithm: A case study on handwritten Bangla dataset","authors":"Jaya Paul,&nbsp;Kalpita Dutta,&nbsp;Anasua Sarkar,&nbsp;Kaushik Roy,&nbsp;Nibaran Das","doi":"10.4218/etrij.2023-0188","DOIUrl":"https://doi.org/10.4218/etrij.2023-0188","url":null,"abstract":"<p>Author verification is challenging because of the diversity in writing styles. We propose an enhanced handwriting verification method that combines handcrafted and automatically extracted features. The method uses a genetic algorithm to reduce the dimensionality of the feature set. We consider offline Bangla handwriting content and evaluate the proposed method using handcrafted features with a simple logistic regression, radial basis function network, and sequential minimal optimization as well as automatically extracted features using a convolutional neural network. The handcrafted features outperform the automatically extracted ones, achieving an average verification accuracy of 94.54% for 100 writers. The handcrafted features include Radon transform, histogram of oriented gradients, local phase quantization, and local binary patterns from interwriter and intrawriter content. The genetic algorithm reduces the feature dimensionality and selects salient features using a support vector machine. The top five experimental results are obtained from the optimal feature set selected using a consensus strategy. Comparisons with other methods and features confirm the satisfactory results.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 4","pages":"648-659"},"PeriodicalIF":1.3,"publicationDate":"2024-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0188","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141980503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved contrastive learning model via identification of false-negatives in self-supervised learning 通过识别自我监督学习中的假阴性来改进对比学习模型
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-16 DOI: 10.4218/etrij.2023-0285
Joonsun Auh, Changsik Cho, Seon-tae Kim

Self-supervised learning is a method that learns the data representation through unlabeled data. It is efficient because it learns from large-scale unlabeled data and through continuous research, performance comparable to supervised learning has been reached. Contrastive learning, a type of self-supervised learning algorithm, utilizes data similarity to perform instance-level learning within an embedding space. However, it suffers from the problem of false-negatives, which are the misclassification of data class during training the data representation. They result in loss of information and deteriorate the performance of the model. This study employed cosine similarity and temperature simultaneously to identify false-negatives and mitigate their impact to improve the performance of the contrastive learning model. The proposed method exhibited a performance improvement of up to 2.7% compared with the existing algorithm on the CIFAR-100 dataset. Improved performance on other datasets such as CIFAR-10 and ImageNet was also observed.

自监督学习是一种通过无标记数据来学习数据表示的方法。它的高效之处在于能从大规模无标记数据中学习,而且通过不断的研究,其性能已可与监督学习相媲美。对比学习是一种自监督学习算法,它利用数据相似性在嵌入空间内进行实例级学习。然而,它也存在假阴性的问题,即在训练数据表示时对数据类别的错误分类。它们会导致信息丢失并降低模型的性能。本研究同时使用余弦相似度和温度来识别假阴性并减轻其影响,从而提高对比学习模型的性能。在 CIFAR-100 数据集上,与现有算法相比,拟议方法的性能提高了 2.7%。在 CIFAR-10 和 ImageNet 等其他数据集上的性能也有所提高。
{"title":"Improved contrastive learning model via identification of false-negatives in self-supervised learning","authors":"Joonsun Auh,&nbsp;Changsik Cho,&nbsp;Seon-tae Kim","doi":"10.4218/etrij.2023-0285","DOIUrl":"10.4218/etrij.2023-0285","url":null,"abstract":"<p>Self-supervised learning is a method that learns the data representation through unlabeled data. It is efficient because it learns from large-scale unlabeled data and through continuous research, performance comparable to supervised learning has been reached. Contrastive learning, a type of self-supervised learning algorithm, utilizes data similarity to perform instance-level learning within an embedding space. However, it suffers from the problem of false-negatives, which are the misclassification of data class during training the data representation. They result in loss of information and deteriorate the performance of the model. This study employed cosine similarity and temperature simultaneously to identify false-negatives and mitigate their impact to improve the performance of the contrastive learning model. The proposed method exhibited a performance improvement of up to 2.7% compared with the existing algorithm on the CIFAR-100 dataset. Improved performance on other datasets such as CIFAR-10 and ImageNet was also observed.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 6","pages":"1020-1029"},"PeriodicalIF":1.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0285","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140697047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative autoencoder to prevent overregularization of variational autoencoder 防止变分自动编码器过度规则化的生成自动编码器
IF 1.4 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-12 DOI: 10.4218/etrij.2023-0375
YoungMin Ko, SunWoo Ko, YoungSoo Kim
In machine learning, data scarcity is a common problem, and generative models have the potential to solve it. The variational autoencoder is a generative model that performs variational inference to estimate a low-dimensional posterior distribution given high-dimensional data. Specifically, it optimizes the evidence lower bound from regularization and reconstruction terms, but the two terms are imbalanced in general. If the reconstruction error is not sufficiently small to belong to the population, the generative model performance cannot be guaranteed. We propose a generative autoencoder (GAE) that uses an autoencoder to first minimize the reconstruction error and then estimate the distribution using latent vectors mapped onto a lower dimension through the encoder. We compare the Fréchet inception distances scores of the proposed GAE and nine other variational autoencoders on the MNIST, Fashion MNIST, CIFAR10, and SVHN datasets. The proposed GAE consistently outperforms the other methods on the MNIST (44.30), Fashion MNIST (196.34), and SVHN (77.53) datasets.
在机器学习中,数据稀缺是一个常见问题,而生成模型有可能解决这一问题。变分自动编码器是一种生成模型,它通过变分推理来估计给定高维数据的低维后验分布。具体来说,它优化正则化和重构项的证据下限,但这两个项一般是不平衡的。如果重构误差不够小,不属于群体,生成模型的性能就无法保证。我们提出了一种生成式自动编码器(GAE),它使用自动编码器首先使重构误差最小化,然后使用通过编码器映射到较低维度上的潜向量来估计分布。我们在 MNIST、Fashion MNIST、CIFAR10 和 SVHN 数据集上比较了所提出的 GAE 和其他九种变异自动编码器的弗雷谢特截距得分。在 MNIST(44.30)、Fashion MNIST(196.34)和 SVHN(77.53)数据集上,所提出的 GAE 始终优于其他方法。
{"title":"Generative autoencoder to prevent overregularization of variational autoencoder","authors":"YoungMin Ko, SunWoo Ko, YoungSoo Kim","doi":"10.4218/etrij.2023-0375","DOIUrl":"https://doi.org/10.4218/etrij.2023-0375","url":null,"abstract":"In machine learning, data scarcity is a common problem, and generative models have the potential to solve it. The variational autoencoder is a generative model that performs variational inference to estimate a low-dimensional posterior distribution given high-dimensional data. Specifically, it optimizes the evidence lower bound from regularization and reconstruction terms, but the two terms are imbalanced in general. If the reconstruction error is not sufficiently small to belong to the population, the generative model performance cannot be guaranteed. We propose a generative autoencoder (GAE) that uses an autoencoder to first minimize the reconstruction error and then estimate the distribution using latent vectors mapped onto a lower dimension through the encoder. We compare the Fréchet inception distances scores of the proposed GAE and nine other variational autoencoders on the MNIST, Fashion MNIST, CIFAR10, and SVHN datasets. The proposed GAE consistently outperforms the other methods on the MNIST (44.30), Fashion MNIST (196.34), and SVHN (77.53) datasets.","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"33 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140596643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Background music monitoring framework and dataset for TV broadcast audio 电视广播音频背景音乐监测框架和数据集
IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-12 DOI: 10.4218/etrij.2023-0249
Hyemi Kim, Junghyun Kim, Jihyun Park, Seongwoo Kim, Chanjin Park, Wonyoung Yoo

Music identification is widely regarded as a solved problem for music searching in quiet environments, but its performance tends to degrade in TV broadcast audio owing to the presence of dialogue or sound effects. In addition, constructing an accurate dataset for measuring the performance of background music monitoring in TV broadcast audio is challenging. We propose a framework for monitoring background music by automatic identification and introduce a background music cue sheet. The framework comprises three main components: music identification, music–speech separation, and music detection. In addition, we introduce the Cue-K-Drama dataset, which includes reference songs, audio tracks from 60 episodes of five Korean TV drama series, and corresponding cue sheets that provide the start and end timestamps of background music. Experimental results on the constructed and existing datasets demonstrate that the proposed framework, which incorporates music identification with music–speech separation and music detection, effectively enhances TV broadcast audio monitoring.

音乐识别被广泛认为是解决安静环境下音乐搜索的一个难题,但在电视广播音频中,由于对话或音效的存在,音乐识别的性能往往会下降。此外,构建一个准确的数据集来衡量电视广播音频中背景音乐监测的性能也很有挑战性。我们提出了一个通过自动识别监控背景音乐的框架,并引入了背景音乐提示表。该框架由三个主要部分组成:音乐识别、音乐语音分离和音乐检测。此外,我们还引入了 Cue-K-Drama 数据集,其中包括参考歌曲、五部韩国电视剧 60 集的音轨以及提供背景音乐开始和结束时间戳的相应提示表。在所构建的数据集和现有数据集上的实验结果表明,所提出的框架将音乐识别与音乐-语音分离和音乐检测结合在一起,有效地增强了电视广播音频监测功能。
{"title":"Background music monitoring framework and dataset for TV broadcast audio","authors":"Hyemi Kim,&nbsp;Junghyun Kim,&nbsp;Jihyun Park,&nbsp;Seongwoo Kim,&nbsp;Chanjin Park,&nbsp;Wonyoung Yoo","doi":"10.4218/etrij.2023-0249","DOIUrl":"10.4218/etrij.2023-0249","url":null,"abstract":"<p>Music identification is widely regarded as a solved problem for music searching in quiet environments, but its performance tends to degrade in TV broadcast audio owing to the presence of dialogue or sound effects. In addition, constructing an accurate dataset for measuring the performance of background music monitoring in TV broadcast audio is challenging. We propose a framework for monitoring background music by automatic identification and introduce a background music cue sheet. The framework comprises three main components: music identification, music–speech separation, and music detection. In addition, we introduce the Cue-K-Drama dataset, which includes reference songs, audio tracks from 60 episodes of five Korean TV drama series, and corresponding cue sheets that provide the start and end timestamps of background music. Experimental results on the constructed and existing datasets demonstrate that the proposed framework, which incorporates music identification with music–speech separation and music detection, effectively enhances TV broadcast audio monitoring.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 4","pages":"697-707"},"PeriodicalIF":1.3,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0249","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140596641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A neural network framework based on ConvNeXt for side-channel hardware Trojan detection 基于 ConvNeXt 的侧信道硬件木马检测神经网络框架
IF 1.4 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-08 DOI: 10.4218/etrij.2023-0448
Yuchan Gao, Jing Su, Jia Li, Shenglong Wang, Chao Li
Researchers in the field of hardware security have been dedicated to the study of hardware Trojan detection. Among the various approaches, side-channel detection methods are widely used because of their high detection accuracy and fewer constraints. However, most side-channel detection methods cannot make full use of side-channel information. In this paper, we propose a framework that utilizes the continuous wavelet transform to convert time-series information and employs an improved ConvNeXt network to detect hardware Trojans. This detection framework first converts one-dimensional time-series information into a two-dimensional time–frequency map using the continuous wavelet transform to leverage frequency information in electromagnetic side-channel signals. Then, the two-dimensional time–frequency map is fed into the improved ConvNeXt network, which increases the weight of the informative parts in the two-dimensional time–frequency map and enhances detection efficiency. The results indicate that the method proposed in this paper significantly improves the accuracy of hardware Trojan detection.
硬件安全领域的研究人员一直致力于硬件木马检测的研究。在各种方法中,侧信道检测方法因其检测精度高、限制少而被广泛使用。然而,大多数侧信道检测方法无法充分利用侧信道信息。本文提出了一种利用连续小波变换转换时间序列信息的框架,并采用改进的 ConvNeXt 网络来检测硬件木马。该检测框架首先利用连续小波变换将一维时间序列信息转换为二维时频图,以充分利用电磁侧信道信号中的频率信息。然后,将二维时频图输入改进的 ConvNeXt 网络,增加二维时频图中信息部分的权重,提高检测效率。结果表明,本文提出的方法显著提高了硬件木马检测的准确性。
{"title":"A neural network framework based on ConvNeXt for side-channel hardware Trojan detection","authors":"Yuchan Gao, Jing Su, Jia Li, Shenglong Wang, Chao Li","doi":"10.4218/etrij.2023-0448","DOIUrl":"https://doi.org/10.4218/etrij.2023-0448","url":null,"abstract":"Researchers in the field of hardware security have been dedicated to the study of hardware Trojan detection. Among the various approaches, side-channel detection methods are widely used because of their high detection accuracy and fewer constraints. However, most side-channel detection methods cannot make full use of side-channel information. In this paper, we propose a framework that utilizes the continuous wavelet transform to convert time-series information and employs an improved ConvNeXt network to detect hardware Trojans. This detection framework first converts one-dimensional time-series information into a two-dimensional time–frequency map using the continuous wavelet transform to leverage frequency information in electromagnetic side-channel signals. Then, the two-dimensional time–frequency map is fed into the improved ConvNeXt network, which increases the weight of the informative parts in the two-dimensional time–frequency map and enhances detection efficiency. The results indicate that the method proposed in this paper significantly improves the accuracy of hardware Trojan detection.","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"69 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140596752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ETRI Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1