首页 > 最新文献

ICT Express最新文献

英文 中文
Artificial intelligence in industrial heat exchanger fouling prediction: A 20-year systematic review of AI, ML, and DL approaches 工业热交换器结垢预测中的人工智能:人工智能、机器学习和深度学习方法的20年系统回顾
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-12-11 DOI: 10.1016/j.icte.2025.12.003
Abdul Wahid Soomro , Miss Laiha Mat Kiah , Rafidah Md Noor , Salim Newaz Kazi , Kaleemullah Shaikh , Wajahat Ahmed Khan , Ihsan Ali
Fouling in heat exchangers (HXs) affects various industries by lowering efficiency and increasing costs. Traditional fouling-prediction models often do not reflect important mechanistic information and thus become very complex and less reliable. The applications of artificial intelligence (AI), machine learning (ML), and deep learning (DL) open new frontiers, as these techniques can model complex correlations and work with large volumes of data. This review synthesizes 51 articles published between 2005 and June 2025, outlining key trends, persistent research limitations, and emerging directions. Models such as artificial neural networks (ANNs)/deep neural networks (DNNs) and Gaussian process regression (GPR) deliver the optimal results in terms of accurate prediction.
换热器结垢降低了效率,增加了成本,影响了各行各业。传统的污垢预测模型往往不能反映重要的机理信息,因而变得非常复杂和不可靠。人工智能(AI)、机器学习(ML)和深度学习(DL)的应用开辟了新的领域,因为这些技术可以模拟复杂的相关性并处理大量数据。本综述综合了2005年至2025年6月间发表的51篇文章,概述了主要趋势、持续的研究局限性和新兴方向。人工神经网络(ann)/深度神经网络(dnn)和高斯过程回归(GPR)等模型在准确预测方面提供了最佳结果。
{"title":"Artificial intelligence in industrial heat exchanger fouling prediction: A 20-year systematic review of AI, ML, and DL approaches","authors":"Abdul Wahid Soomro ,&nbsp;Miss Laiha Mat Kiah ,&nbsp;Rafidah Md Noor ,&nbsp;Salim Newaz Kazi ,&nbsp;Kaleemullah Shaikh ,&nbsp;Wajahat Ahmed Khan ,&nbsp;Ihsan Ali","doi":"10.1016/j.icte.2025.12.003","DOIUrl":"10.1016/j.icte.2025.12.003","url":null,"abstract":"<div><div>Fouling in heat exchangers (HXs) affects various industries by lowering efficiency and increasing costs. Traditional fouling-prediction models often do not reflect important mechanistic information and thus become very complex and less reliable. The applications of artificial intelligence (AI), machine learning (ML), and deep learning (DL) open new frontiers, as these techniques can model complex correlations and work with large volumes of data. This review synthesizes 51 articles published between 2005 and June 2025, outlining key trends, persistent research limitations, and emerging directions. Models such as artificial neural networks (ANNs)/deep neural networks (DNNs) and Gaussian process regression (GPR) deliver the optimal results in terms of accurate prediction.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 92-110"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Partial encryption-based Shamir secret sharing for low-latency and secure networks 面向低延迟和安全网络的基于部分加密的Shamir秘密共享
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-12-17 DOI: 10.1016/j.icte.2025.12.006
Sun-Jin Lee, So-Eun Jeon, Il-Gu Lee
Advances in next-generation wireless network technology is boosting the demand for ultra-low latency and reliable data-transmission technologies. Particularly, industries are adopting robust end-to-end encryption to enhance security. However, the delays inherent in conventional encryption methods make them unsuitable for meeting these demands. Therefore, this paper presents the Shamir secret sharing–partial encryption (SSS–PE) method, which encrypts only the minimal secret fragments within an SSS environment wherein data are distributed and transmitted. Experimental results indicate that SSS–PE improves secrecy throughput by 1.86 times and reduces latency by 1.38 times compared to the Advanced Encryption Standard.
下一代无线网络技术的进步推动了对超低延迟和可靠数据传输技术的需求。特别是,行业正在采用强大的端到端加密来增强安全性。然而,传统加密方法固有的延迟使得它们不适合满足这些要求。因此,本文提出了Shamir秘密共享-部分加密(SSS - pe)方法,该方法在数据分布和传输的SSS环境中只对最小的秘密片段进行加密。实验结果表明,与高级加密标准相比,ss - pe的保密吞吐量提高了1.86倍,延迟降低了1.38倍。
{"title":"Partial encryption-based Shamir secret sharing for low-latency and secure networks","authors":"Sun-Jin Lee,&nbsp;So-Eun Jeon,&nbsp;Il-Gu Lee","doi":"10.1016/j.icte.2025.12.006","DOIUrl":"10.1016/j.icte.2025.12.006","url":null,"abstract":"<div><div>Advances in next-generation wireless network technology is boosting the demand for ultra-low latency and reliable data-transmission technologies. Particularly, industries are adopting robust end-to-end encryption to enhance security. However, the delays inherent in conventional encryption methods make them unsuitable for meeting these demands. Therefore, this paper presents the Shamir secret sharing–partial encryption (SSS–PE) method, which encrypts only the minimal secret fragments within an SSS environment wherein data are distributed and transmitted. Experimental results indicate that SSS–PE improves secrecy throughput by 1.86 times and reduces latency by 1.38 times compared to the Advanced Encryption Standard.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 168-174"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selective channel inversion protocol for over-the-air computation 无线计算的选择信道反转协议
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-11-28 DOI: 10.1016/j.icte.2025.11.016
Hyunwoo Jung , Jung-Bin Kim
This paper introduces a novel selective channel inversion (SCI) protocol for over-the-air computation (AirComp) networks. The proposed SCI protocol reduces the overhead associated with channel state information (CSI) by broadcasting a predefined channel threshold. Furthermore, it improves mean squared error (MSE) performance by permitting transmission only from sensor nodes with favorable channel conditions. For an arbitrary channel distribution, an exact closed-form expression for the MSE is derived, from which an asymptotic expression for a large number of sensor nodes is obtained. Numerical results demonstrate that the asymptotic expression closely matches the exact result, even for a relatively small number of sensor nodes. An optimization problem is formulated to determine the optimal channel threshold under Nakagami-m fading channels, and it is analytically proven that the problem is convex. An algorithm for dynamically determining the threshold under arbitrary channel distributions is also presented. Numerical results demonstrate that SCI AirComp outperforms CI AirComp in scenarios with low signal-to-noise ratio (SNR), high fading severity, and a large number of sensor nodes.
2018 The Korean Institute of Communications and Information Sciences. Publishing Services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
本文介绍了一种用于空中计算(AirComp)网络的选择性信道反演(SCI)协议。提出的SCI协议通过广播一个预定义的通道阈值来减少与通道状态信息(CSI)相关的开销。此外,它只允许从具有有利信道条件的传感器节点传输,从而提高了均方误差(MSE)性能。对于任意信道分布,导出了MSE的精确封闭表达式,并由此得到了大量传感器节点的渐近表达式。数值结果表明,即使对于相对较少的传感器节点,该渐近表达式也与精确结果非常接近。提出了在Nakagami-m衰落信道下确定最优信道阈值的优化问题,并解析证明了该问题的凸性。提出了在任意信道分布下动态确定阈值的算法。数值结果表明,SCI AirComp在低信噪比(SNR)、高衰落严重程度和大量传感器节点场景下优于CI AirComp。2018韩国通信与信息科学研究所。这是一篇基于CC by-nc-nd许可(http://creativecommons.org/licenses/by-nc-nd/4.0/)的开放获取文章。
{"title":"Selective channel inversion protocol for over-the-air computation","authors":"Hyunwoo Jung ,&nbsp;Jung-Bin Kim","doi":"10.1016/j.icte.2025.11.016","DOIUrl":"10.1016/j.icte.2025.11.016","url":null,"abstract":"<div><div>This paper introduces a novel selective channel inversion (SCI) protocol for over-the-air computation (AirComp) networks. The proposed SCI protocol reduces the overhead associated with channel state information (CSI) by broadcasting a predefined channel threshold. Furthermore, it improves mean squared error (MSE) performance by permitting transmission only from sensor nodes with favorable channel conditions. For an arbitrary channel distribution, an exact closed-form expression for the MSE is derived, from which an asymptotic expression for a large number of sensor nodes is obtained. Numerical results demonstrate that the asymptotic expression closely matches the exact result, even for a relatively small number of sensor nodes. An optimization problem is formulated to determine the optimal channel threshold under Nakagami-<span><math><mi>m</mi></math></span> fading channels, and it is analytically proven that the problem is convex. An algorithm for dynamically determining the threshold under arbitrary channel distributions is also presented. Numerical results demonstrate that SCI AirComp outperforms CI AirComp in scenarios with low signal-to-noise ratio (SNR), high fading severity, and a large number of sensor nodes.</div><div>2018 The Korean Institute of Communications and Information Sciences. Publishing Services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (<span><span>http://creativecommons.org/licenses/by-nc-nd/4.0/</span><svg><path></path></svg></span>).</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 6-12"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid-YOLO: Lightweight Mamba-Transformer Hybrid with multi-scale fusion for real-world traffic detection Hybrid- yolo:轻量级Mamba-Transformer混合动力车,具有多尺度融合,用于现实世界的流量检测
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-09-05 DOI: 10.1016/j.icte.2025.09.002
Hongqing Wang , Jun Kit Chaw , Marizuana Mat Daud , Liantao Shi , Nannan Huang , Tin Tin Ting , Liuzhen Pu
Vehicle detection in complex traffic scenes remains challenging due to frequent occlusions, lighting variations, and extreme weather. We present Hybrid-YOLO, a real-time detection framework that unifies Mamba-based state space modeling, Transformer-driven global attention, and multi-scale feature fusion to achieve high accuracy at low computational cost. At its core, Hybrid-YOLO introduces a Dynamic Residual Stem (DR Stem) for adaptive feature calibration, a Hexa-Scan Selective Block (HSSBlock) for six-directional structural perception, and a Selective State Space Model (SSM) for efficient long-range dependency modeling. A Cross-Stage Scales Feature Extraction (CSSFE) module enriches spatial semantics for small-object detection, while a Sparse-Queries Cascade Self-Attention (SCS) module focuses computation on informative regions, enhancing robustness to clutter and background noise. Extensive experiments on KITTI, BDD100K, and IITM-HeTra show that Hybrid-YOLO achieves 90.11 [email protected] at 66.3 FPS, surpassing state-of-the-art methods in both accuracy and efficiency, and offering a promising solution for real-world intelligent transportation systems.
由于频繁的闭塞、光照变化和极端天气,复杂交通场景中的车辆检测仍然具有挑战性。我们提出了Hybrid-YOLO,这是一种实时检测框架,它将基于mamba的状态空间建模、变压器驱动的全局关注和多尺度特征融合结合在一起,以低计算成本实现高精度。Hybrid-YOLO的核心是引入了一个用于自适应特征校准的动态残差干(DR Stem),一个用于六方向结构感知的Hexa-Scan选择性块(HSSBlock),以及一个用于高效远程依赖建模的选择性状态空间模型(SSM)。跨阶段尺度特征提取(CSSFE)模块丰富了小目标检测的空间语义,而稀疏查询级联自注意(SCS)模块将计算重点放在信息区域上,增强了对杂波和背景噪声的鲁棒性。在KITTI, BDD100K和IITM-HeTra上进行的大量实验表明,Hybrid-YOLO在66.3 FPS下达到90.11 [email protected],在准确性和效率方面都超过了最先进的方法,为现实世界的智能交通系统提供了一个有前途的解决方案。
{"title":"Hybrid-YOLO: Lightweight Mamba-Transformer Hybrid with multi-scale fusion for real-world traffic detection","authors":"Hongqing Wang ,&nbsp;Jun Kit Chaw ,&nbsp;Marizuana Mat Daud ,&nbsp;Liantao Shi ,&nbsp;Nannan Huang ,&nbsp;Tin Tin Ting ,&nbsp;Liuzhen Pu","doi":"10.1016/j.icte.2025.09.002","DOIUrl":"10.1016/j.icte.2025.09.002","url":null,"abstract":"<div><div>Vehicle detection in complex traffic scenes remains challenging due to frequent occlusions, lighting variations, and extreme weather. We present Hybrid-YOLO, a real-time detection framework that unifies Mamba-based state space modeling, Transformer-driven global attention, and multi-scale feature fusion to achieve high accuracy at low computational cost. At its core, Hybrid-YOLO introduces a Dynamic Residual Stem (DR Stem) for adaptive feature calibration, a Hexa-Scan Selective Block (HSSBlock) for six-directional structural perception, and a Selective State Space Model (SSM) for efficient long-range dependency modeling. A Cross-Stage Scales Feature Extraction (CSSFE) module enriches spatial semantics for small-object detection, while a Sparse-Queries Cascade Self-Attention (SCS) module focuses computation on informative regions, enhancing robustness to clutter and background noise. Extensive experiments on KITTI, BDD100K, and IITM-HeTra show that Hybrid-YOLO achieves 90.11 [email protected] at 66.3 FPS, surpassing state-of-the-art methods in both accuracy and efficiency, and offering a promising solution for real-world intelligent transportation systems.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 214-222"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward storage-aware learning with compressed data an empirical exploratory study on JPEG 基于压缩数据的存储感知学习在JPEG上的实证探索性研究
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-12-18 DOI: 10.1016/j.icte.2025.12.005
Kichang Lee , JeongGil Ko , Songkuk Kim , JaeYeon Park
On-device machine learning is fundamentally constrained by limited storage, especially in continuous data collection scenarios where sensor or vision streams accumulate rapidly. This paper empirically investigates storage-aware learning, characterizing the trade-off between data quantity and data quality under lossy compression. Using the CIFAR-10 dataset as a controlled benchmark, we systematically vary both the amount and the fidelity of training data to understand their joint impact on model performance. Our results reveal that (1) neither maximizing quantity nor quality alone yields optimal accuracy, emphasizing that the optimal trade-off between them depends nonlinearly on the available storage budget, and (2) data samples exhibit differential sensitivity to compression, motivating a sample-wise adaptive compression policy. These findings challenge uniform data-retention strategies such as naive data dropping or fixed-rate compression, and establish a foundation for adaptive, storage-efficient learning systems on resource-limited devices. This work opens new directions toward generalizable, storage-aware on-device intelligence.
设备上的机器学习从根本上受到有限存储的限制,特别是在传感器或视觉流快速积累的连续数据收集场景中。本文对存储感知学习进行了实证研究,描述了有损压缩下数据数量和数据质量之间的权衡关系。使用CIFAR-10数据集作为受控基准,我们系统地改变了训练数据的数量和保真度,以了解它们对模型性能的共同影响。我们的研究结果表明:(1)单独最大化数量和质量都不能产生最佳精度,强调两者之间的最佳权衡非线性地依赖于可用的存储预算;(2)数据样本对压缩表现出不同的敏感性,从而激发了样本自适应压缩策略。这些发现挑战了统一的数据保留策略,如原始数据丢弃或固定速率压缩,并为资源有限设备上的自适应、存储高效学习系统奠定了基础。这项工作为可推广的、存储感知的设备智能开辟了新的方向。
{"title":"Toward storage-aware learning with compressed data an empirical exploratory study on JPEG","authors":"Kichang Lee ,&nbsp;JeongGil Ko ,&nbsp;Songkuk Kim ,&nbsp;JaeYeon Park","doi":"10.1016/j.icte.2025.12.005","DOIUrl":"10.1016/j.icte.2025.12.005","url":null,"abstract":"<div><div>On-device machine learning is fundamentally constrained by limited storage, especially in continuous data collection scenarios where sensor or vision streams accumulate rapidly. This paper empirically investigates storage-aware learning, characterizing the trade-off between data quantity and data quality under lossy compression. Using the CIFAR-10 dataset as a controlled benchmark, we systematically vary both the amount and the fidelity of training data to understand their joint impact on model performance. Our results reveal that (1) neither maximizing quantity nor quality alone yields optimal accuracy, emphasizing that the optimal trade-off between them depends nonlinearly on the available storage budget, and (2) data samples exhibit differential sensitivity to compression, motivating a sample-wise adaptive compression policy. These findings challenge uniform data-retention strategies such as naive data dropping or fixed-rate compression, and establish a foundation for adaptive, storage-efficient learning systems on resource-limited devices. This work opens new directions toward generalizable, storage-aware on-device intelligence.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 50-54"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Macroeconomic context-aware graph topology learning for stock price forecasting using graph neural network 基于图神经网络的宏观经济上下文感知图拓扑学习股票价格预测
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-12-03 DOI: 10.1016/j.icte.2025.12.002
Amna Sarwar , Fizza Bukhari , Asma Sattar , Maryam Bukhari , Zahoor ur Rehman , Sungwoo Park , Seungmin Rho
Stock price forecasting is a crucial challenge in FinTech industries, with implications that extend to algorithmic trading. In recent studies, Graph Neural Networks (GNNs) are employed for prediction; however, they are still limited to involving macroeconomic contexts. Hence, in this study, a novel GNN-based method for stock price forecasting is designed, with a graph building structure influenced by macroeconomic variables, namely inflation, interest rate, and GDP growth regimes. Our model captures the relationships between stocks on the basis of regime-specific, macro-driven static graphs along with an LSTM model. The proposed approach outperforms existing methods and provides a new viewpoint on stock forecasting.
股价预测是金融科技行业的一个关键挑战,其影响延伸到算法交易。在最近的研究中,图神经网络(GNNs)被用于预测;然而,它们仍然局限于涉及宏观经济背景。因此,本研究设计了一种新的基于gnn的股票价格预测方法,该方法采用受宏观经济变量(即通货膨胀、利率和GDP增长机制)影响的图构建结构。我们的模型基于特定制度、宏观驱动的静态图以及LSTM模型来捕获股票之间的关系。该方法优于现有方法,为股票预测提供了新的视角。
{"title":"Macroeconomic context-aware graph topology learning for stock price forecasting using graph neural network","authors":"Amna Sarwar ,&nbsp;Fizza Bukhari ,&nbsp;Asma Sattar ,&nbsp;Maryam Bukhari ,&nbsp;Zahoor ur Rehman ,&nbsp;Sungwoo Park ,&nbsp;Seungmin Rho","doi":"10.1016/j.icte.2025.12.002","DOIUrl":"10.1016/j.icte.2025.12.002","url":null,"abstract":"<div><div>Stock price forecasting is a crucial challenge in FinTech industries, with implications that extend to algorithmic trading. In recent studies, Graph Neural Networks (GNNs) are employed for prediction; however, they are still limited to involving macroeconomic contexts. Hence, in this study, a novel GNN-based method for stock price forecasting is designed, with a graph building structure influenced by macroeconomic variables, namely inflation, interest rate, and GDP growth regimes. Our model captures the relationships between stocks on the basis of regime-specific, macro-driven static graphs along with an LSTM model. The proposed approach outperforms existing methods and provides a new viewpoint on stock forecasting.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 13-19"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight-to-diffusion framework for semantic image communications 语义图像通信的轻量化扩散框架
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-12-29 DOI: 10.1016/j.icte.2025.12.013
Thien Huynh-The , Toan Van Nguyen , Phuong Luu Vo , Huu-Tai Nguyen
We introduce LDSeCom, a novel lightweight-to-diffusion framework for semantic image communication. LDSeCom addresses bandwidth constraints by developing LSNet, a lightweight, loop-based segmentation model at the sender, and an improved diffusion model guided by our AFM-Net at the receiver. LSNet efficiently compresses images into semantic maps, while AFM-Net’s adaptive feature modulation ensures high-quality image reconstruction. On benchmark datasets, our LSNet achieves competitive accuracy with only 0.5M parameters, while our diffusion model improves image reconstruction quality by up to 28.51% mFID. The framework enables high-fidelity results from semantic maps compressed to 1/80 of the original size, proving its efficiency for bandwidth-constrained scenarios.
我们介绍了LDSeCom,一种用于语义图像通信的新型轻量化扩散框架。LDSeCom通过开发LSNet来解决带宽限制问题,LSNet是一种轻量级的、基于环路的发送端分割模型,而在接收端则是一种由AFM-Net指导的改进的扩散模型。LSNet有效地将图像压缩成语义映射,而AFM-Net的自适应特征调制确保了高质量的图像重建。在基准数据集上,我们的LSNet仅用0.5M参数就达到了具有竞争力的精度,而我们的扩散模型将图像重建质量提高了28.51%的mFID。该框架可以将语义映射压缩到原始大小的1/80,从而实现高保真结果,证明了其在带宽受限场景下的效率。
{"title":"A lightweight-to-diffusion framework for semantic image communications","authors":"Thien Huynh-The ,&nbsp;Toan Van Nguyen ,&nbsp;Phuong Luu Vo ,&nbsp;Huu-Tai Nguyen","doi":"10.1016/j.icte.2025.12.013","DOIUrl":"10.1016/j.icte.2025.12.013","url":null,"abstract":"<div><div>We introduce LDSeCom, a novel lightweight-to-diffusion framework for semantic image communication. LDSeCom addresses bandwidth constraints by developing LSNet, a lightweight, loop-based segmentation model at the sender, and an improved diffusion model guided by our AFM-Net at the receiver. LSNet efficiently compresses images into semantic maps, while AFM-Net’s adaptive feature modulation ensures high-quality image reconstruction. On benchmark datasets, our LSNet achieves competitive accuracy with only 0.5M parameters, while our diffusion model improves image reconstruction quality by up to 28.51% mFID. The framework enables high-fidelity results from semantic maps compressed to <span><math><mrow><mn>1</mn><mo>/</mo><mn>80</mn></mrow></math></span> of the original size, proving its efficiency for bandwidth-constrained scenarios.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 175-179"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Action recognition: A comprehensive survey of tasks, methods, and challenges 行动识别:对任务、方法和挑战的全面调查
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-11-29 DOI: 10.1016/j.icte.2025.11.015
Sehwan Heo , Junbeom Moon , Soon Ki Jung
Action recognition has emerged as a central research problem in computer vision, aiming to identify and understand human actions from video data. Over the past decade, the field has advanced from early convolutional approaches to sophisticated architectures capable of capturing complex spatio-temporal dependencies. This survey provides a comprehensive overview of action recognition across six major tasks: Action Classification, Temporal Action Localization, Spatio-temporal Action Localization, Temporal Action Segmentation, Online Action Detection, and Action Anticipation. For each task, we trace the methodological evolution from foundational models to recent state-of-the-art approaches, highlighting how key challenges such as long-range temporal modeling, viewpoint variation, boundary precision, over-segmentation, real-time inference, and future uncertainty have been addressed. We also reorganize benchmark results and evaluation metrics, presenting a unified perspective that facilitates fair comparisons and reproducible research. In addition, we analyze representative datasets, ranging from early benchmarks like UCF101 and HMDB51 to large-scale collections such as Kinetics, ActivityNet, and Epic-Kitchens, which have enabled rapid progress in both supervised and self-supervised learning. We discuss open issues and unresolved challenges, including the use of State Space Models for efficient sequence modeling, multimodal fusion techniques that dynamically assess modality reliability, synthetic data and weak supervision for reducing annotation costs, and fairness-aware frameworks that ensure ethical applicability. By consolidating a decade of progress, this survey offers a structured understanding of the action recognition landscape and aims to inspire further research toward robust, scalable, and responsible video understanding systems.
动作识别已经成为计算机视觉领域的一个核心研究问题,旨在从视频数据中识别和理解人类的动作。在过去的十年中,该领域已经从早期的卷积方法发展到能够捕获复杂时空依赖性的复杂架构。这项调查提供了六个主要任务的动作识别的全面概述:动作分类,时间动作定位,时空动作定位,时间动作分割,在线动作检测和动作预测。对于每个任务,我们追溯了从基础模型到最新技术方法的方法演变,强调了如何解决远程时间建模、视点变化、边界精度、过度分割、实时推理和未来不确定性等关键挑战。我们还重新组织基准结果和评估指标,提出统一的观点,促进公平比较和可重复的研究。此外,我们还分析了具有代表性的数据集,从早期的基准如UCF101和HMDB51到大型集合如Kinetics, ActivityNet和Epic-Kitchens,这些数据集在监督和自监督学习方面都取得了快速进展。我们讨论了开放的问题和未解决的挑战,包括使用状态空间模型进行有效的序列建模,动态评估模态可靠性的多模态融合技术,用于降低注释成本的合成数据和弱监督,以及确保伦理适用性的公平性意识框架。通过巩固十年来的进展,本调查提供了对动作识别前景的结构化理解,旨在激发对健壮、可扩展和负责任的视频理解系统的进一步研究。
{"title":"Action recognition: A comprehensive survey of tasks, methods, and challenges","authors":"Sehwan Heo ,&nbsp;Junbeom Moon ,&nbsp;Soon Ki Jung","doi":"10.1016/j.icte.2025.11.015","DOIUrl":"10.1016/j.icte.2025.11.015","url":null,"abstract":"<div><div>Action recognition has emerged as a central research problem in computer vision, aiming to identify and understand human actions from video data. Over the past decade, the field has advanced from early convolutional approaches to sophisticated architectures capable of capturing complex spatio-temporal dependencies. This survey provides a comprehensive overview of action recognition across six major tasks: Action Classification, Temporal Action Localization, Spatio-temporal Action Localization, Temporal Action Segmentation, Online Action Detection, and Action Anticipation. For each task, we trace the methodological evolution from foundational models to recent state-of-the-art approaches, highlighting how key challenges such as long-range temporal modeling, viewpoint variation, boundary precision, over-segmentation, real-time inference, and future uncertainty have been addressed. We also reorganize benchmark results and evaluation metrics, presenting a unified perspective that facilitates fair comparisons and reproducible research. In addition, we analyze representative datasets, ranging from early benchmarks like UCF101 and HMDB51 to large-scale collections such as Kinetics, ActivityNet, and Epic-Kitchens, which have enabled rapid progress in both supervised and self-supervised learning. We discuss open issues and unresolved challenges, including the use of State Space Models for efficient sequence modeling, multimodal fusion techniques that dynamically assess modality reliability, synthetic data and weak supervision for reducing annotation costs, and fairness-aware frameworks that ensure ethical applicability. By consolidating a decade of progress, this survey offers a structured understanding of the action recognition landscape and aims to inspire further research toward robust, scalable, and responsible video understanding systems.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 32-49"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal beamforming in over-the-air federated learning for efficient model aggregation 基于高效模型聚合的空中联合学习中最优波束形成
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-07-10 DOI: 10.1016/j.icte.2025.06.016
Sangwoo Choi , Minsik Kim , Daeyoung Park
Federated learning (FL) enables distributed model training while preserving privacy, but frequent updates from many devices create substantial communication challenges. Over-the-air computation (AirComp) offers a solution by aggregating updates directly over wireless channels through signal superposition, reducing overhead. However, AirComp can increase the mean squared error (MSE) of aggregated signals, affecting model accuracy. This paper introduces a beamforming optimization framework for AirComp in FL systems, jointly optimizing base station beamforming and device transmission scaling to minimize MSE. Two algorithms are developed: a high-performance convex method (Miso-CVX) and a lower-complexity subgradient method (Miso-Subgradient), both balancing signal misalignment and noise. Extensive simulations show improved aggregation accuracy, convergence speed, and robustness to channel variations.
联邦学习(FL)在保护隐私的同时支持分布式模型训练,但是来自许多设备的频繁更新会给通信带来重大挑战。空中计算(AirComp)提供了一种解决方案,通过信号叠加直接在无线信道上聚合更新,减少了开销。然而,AirComp会增加聚合信号的均方误差(MSE),影响模型的精度。本文介绍了一种用于FL系统中AirComp的波束形成优化框架,该框架联合优化基站波束形成和设备传输缩放,以最小化MSE。开发了两种算法:一种高性能的凸方法(Miso-CVX)和一种低复杂度的子梯度方法(Miso-Subgradient),两者都平衡了信号失调和噪声。大量的仿真表明,该方法提高了聚合精度、收敛速度和对信道变化的鲁棒性。
{"title":"Optimal beamforming in over-the-air federated learning for efficient model aggregation","authors":"Sangwoo Choi ,&nbsp;Minsik Kim ,&nbsp;Daeyoung Park","doi":"10.1016/j.icte.2025.06.016","DOIUrl":"10.1016/j.icte.2025.06.016","url":null,"abstract":"<div><div>Federated learning (FL) enables distributed model training while preserving privacy, but frequent updates from many devices create substantial communication challenges. Over-the-air computation (AirComp) offers a solution by aggregating updates directly over wireless channels through signal superposition, reducing overhead. However, AirComp can increase the mean squared error (MSE) of aggregated signals, affecting model accuracy. This paper introduces a beamforming optimization framework for AirComp in FL systems, jointly optimizing base station beamforming and device transmission scaling to minimize MSE. Two algorithms are developed: a high-performance convex method (Miso-CVX) and a lower-complexity subgradient method (Miso-Subgradient), both balancing signal misalignment and noise. Extensive simulations show improved aggregation accuracy, convergence speed, and robustness to channel variations.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 136-141"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer-based localization in UAV-RIS enabled non-terrestrial networks 基于变压器的无人机- ris非地面网络定位
IF 4.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-02-01 Epub Date: 2025-11-28 DOI: 10.1016/j.icte.2025.11.017
Seungseok Sin , Sangmi Moon , Cheol Hong Kim , Intae Hwang
Accurate localization is essential for next-generation wireless systems. Traditional millimeter-wave (mmWave) techniques rely heavily on line-of-sight (LOS) paths, which limits their performance in non-line-of-sight (NLOS) environments. To overcome this challenge, we propose a non-terrestrial network (NTN) framework that employs an unmanned aerial vehicle–mounted reconfigurable intelligent surface (UAV-RIS) in conjunction with a Transformer-based refinement model. Unlike conventional regression or filtering approaches, the Transformer leverages self-attention mechanisms to refine coarse geometric estimates. Simulations using the DeepMIMO dataset show that more than 90% of users achieve sub-meter localization accuracy, representing a 35% improvement over existing baselines. These results demonstrate the novelty and effectiveness of integrating RIS adaptability with Transformer-based learning to enable robust, high-precision localization.
© 2025 The Author(s). Published by Elsevier B.V. on behalf of The Korean Institute of Communications and Information Sciences. This is an open-access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
准确的定位对下一代无线系统至关重要。传统的毫米波(mmWave)技术严重依赖于视距(LOS)路径,这限制了它们在非视距(NLOS)环境中的性能。为了克服这一挑战,我们提出了一种非地面网络(NTN)框架,该框架采用了无人机安装的可重构智能表面(UAV-RIS)以及基于变压器的改进模型。与传统的回归或过滤方法不同,Transformer利用自关注机制来细化粗略的几何估计。使用DeepMIMO数据集的模拟表明,超过90%的用户实现了亚米级的定位精度,比现有基线提高了35%。这些结果证明了将RIS适应性与基于变压器的学习相结合以实现鲁棒、高精度定位的新颖性和有效性。©2025作者。由爱思唯尔B.V.代表韩国通信与信息科学研究所出版。这是一篇基于CC BY许可(http://creativecommons.org/licenses/by/4.0/)的开放获取文章。
{"title":"Transformer-based localization in UAV-RIS enabled non-terrestrial networks","authors":"Seungseok Sin ,&nbsp;Sangmi Moon ,&nbsp;Cheol Hong Kim ,&nbsp;Intae Hwang","doi":"10.1016/j.icte.2025.11.017","DOIUrl":"10.1016/j.icte.2025.11.017","url":null,"abstract":"<div><div>Accurate localization is essential for next-generation wireless systems. Traditional millimeter-wave (mmWave) techniques rely heavily on line-of-sight (LOS) paths, which limits their performance in non-line-of-sight (NLOS) environments. To overcome this challenge, we propose a non-terrestrial network (NTN) framework that employs an unmanned aerial vehicle–mounted reconfigurable intelligent surface (UAV-RIS) in conjunction with a Transformer-based refinement model. Unlike conventional regression or filtering approaches, the Transformer leverages self-attention mechanisms to refine coarse geometric estimates. Simulations using the DeepMIMO dataset show that more than 90% of users achieve sub-meter localization accuracy, representing a 35% improvement over existing baselines. These results demonstrate the novelty and effectiveness of integrating RIS adaptability with Transformer-based learning to enable robust, high-precision localization.</div><div>© 2025 The Author(s). Published by Elsevier B.V. on behalf of The Korean Institute of Communications and Information Sciences. This is an open-access article under the CC BY license (<span><span>http://creativecommons.org/licenses/by/4.0/</span><svg><path></path></svg></span>).</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"12 1","pages":"Pages 20-25"},"PeriodicalIF":4.2,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ICT Express
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1