首页 > 最新文献

IEEE Transactions on Mobile Computing最新文献

英文 中文
AdaDT: Adaptive Service Provision and Digital Twin Migration for ISAC-Assisted Edge Intelligence AdaDT: isac辅助边缘智能的自适应服务提供和数字孪生迁移
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-18 DOI: 10.1109/TMC.2025.3634128
Wenqiang Ma;Yi Yang;Wen Sun;Peng Wang;Lei Liu;Dusit Niyato;Victor C.M. Leung
Edge Intelligence (EI) combines edge computing and artificial intelligence to deliver low-latency and resource-efficient services. Integrated Sensing and Communication (ISAC) further empowers EI by enhancing edge perception and accelerating intelligent model training. However, integrating ISAC into EI complicates the coordination of dynamically varying sensing, communication, and computation resources, especially under device mobility and unpredictable network conditions, leading to degraded service performance. To address these coordination challenges and sustain high-quality service under mobility and dynamics, we aim to design an adaptive service provision framework that tightly couples real-time perception with intelligent decision-making at the edge. Specifically, we propose an adaptive service provision architecture for ISAC-assisted EI, where Digital Twins (DTs) hosted on edge servers represent edge devices and their contexts to enable accurate perception and intelligent decision-making, thereby enhancing the efficiency of ISAC-enabled services. By dynamically migrating DTs across edge servers based on device mobility and resource availability, the system supports continuous decision-making and seamless service delivery. We further integrate convex optimization for efficient multi-resource coordination and a Time-Varying Contextual Bandit (TVCB) algorithm to enable adaptive, context-aware DT migration in dynamic environments. Extensive simulations demonstrate that our approach significantly improves service quality, reliability, and adaptability in ISAC-assisted EI systems, reducing migration oscillations and overhead while achieving lower latency and higher utility compared with representative baselines.
边缘智能(EI)将边缘计算和人工智能相结合,提供低延迟和资源高效的服务。集成传感和通信(ISAC)通过增强边缘感知和加速智能模型训练,进一步增强了EI的能力。但是,将ISAC集成到EI中,使动态变化的感知、通信和计算资源的协调变得复杂,特别是在设备移动性和不可预测的网络条件下,导致业务性能下降。为了应对这些协调挑战并在移动性和动态性下保持高质量的服务,我们的目标是设计一个自适应的服务提供框架,将实时感知与边缘的智能决策紧密结合起来。具体而言,我们提出了一种isac辅助EI的自适应服务提供架构,其中托管在边缘服务器上的数字双胞胎(dt)代表边缘设备及其上下文,以实现准确的感知和智能决策,从而提高isac支持的服务的效率。通过基于设备移动性和资源可用性在边缘服务器之间动态迁移dt,系统支持持续决策和无缝服务交付。我们进一步集成了凸优化,以实现高效的多资源协调和时变上下文强盗(tvb)算法,以实现动态环境中自适应、上下文感知的DT迁移。大量的仿真表明,与代表性基线相比,我们的方法显著提高了isac辅助EI系统的服务质量、可靠性和适应性,减少了迁移振荡和开销,同时实现了更低的延迟和更高的效用。
{"title":"AdaDT: Adaptive Service Provision and Digital Twin Migration for ISAC-Assisted Edge Intelligence","authors":"Wenqiang Ma;Yi Yang;Wen Sun;Peng Wang;Lei Liu;Dusit Niyato;Victor C.M. Leung","doi":"10.1109/TMC.2025.3634128","DOIUrl":"https://doi.org/10.1109/TMC.2025.3634128","url":null,"abstract":"Edge Intelligence (EI) combines edge computing and artificial intelligence to deliver low-latency and resource-efficient services. Integrated Sensing and Communication (ISAC) further empowers EI by enhancing edge perception and accelerating intelligent model training. However, integrating ISAC into EI complicates the coordination of dynamically varying sensing, communication, and computation resources, especially under device mobility and unpredictable network conditions, leading to degraded service performance. To address these coordination challenges and sustain high-quality service under mobility and dynamics, we aim to design an adaptive service provision framework that tightly couples real-time perception with intelligent decision-making at the edge. Specifically, we propose an adaptive service provision architecture for ISAC-assisted EI, where Digital Twins (DTs) hosted on edge servers represent edge devices and their contexts to enable accurate perception and intelligent decision-making, thereby enhancing the efficiency of ISAC-enabled services. By dynamically migrating DTs across edge servers based on device mobility and resource availability, the system supports continuous decision-making and seamless service delivery. We further integrate convex optimization for efficient multi-resource coordination and a Time-Varying Contextual Bandit (TVCB) algorithm to enable adaptive, context-aware DT migration in dynamic environments. Extensive simulations demonstrate that our approach significantly improves service quality, reliability, and adaptability in ISAC-assisted EI systems, reducing migration oscillations and overhead while achieving lower latency and higher utility compared with representative baselines.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5775-5791"},"PeriodicalIF":9.2,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aerial Secure Collaborative Communications Under Eavesdropper Collusion in Low-Altitude Economy: A Generative Swarm Intelligent Approach 低空经济窃听者合谋下的空中安全协同通信:一种生成群智能方法
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/TMC.2025.3633953
Jiahui Li;Geng Sun;Qingqing Wu;Shuang Liang;Jiacheng Wang;Dusit Niyato;Dong In Kim
The rapid development of the low-altitude economy (LAE) has significantly increased the utilization of autonomous aerial vehicles (AAVs) in various applications, necessitating efficient and secure communication methods among AAV swarms. In this work, we aim to introduce distributed collaborative beamforming (DCB) into AAV swarms and handle the eavesdropper collusion by controlling the corresponding signal distributions. Specifically, we consider a two-way DCB-enabled aerial communication between two AAV swarms and construct these swarms as two AAV virtual antenna arrays. Then, we minimize the two-way known secrecy capacity and maximum sidelobe level to avoid information leakage from the known and unknown eavesdroppers, respectively. Simultaneously, we also minimize the energy consumption of AAVs when constructing virtual antenna arrays. Due to the conflicting relationships between secure performance and energy efficiency, we consider these objectives by formulating a multi-objective optimization problem, which is NP-hard and with a large number of decision variables. Accordingly, we design a novel generative swarm intelligence (GenSI) framework to solve the problem with less overhead, which contains a conditional variational autoencoder (CVAE)-based generative method and a proposed powerful swarm intelligence algorithm. In this framework, CVAE can collect expert solutions obtained by the swarm intelligence algorithm in other environment states to explore characteristics and patterns, thereby directly generating high-quality initial solutions in new environment factors for the swarm intelligence algorithm to search solution space efficiently. Simulation results show that the proposed swarm intelligence algorithm outperforms other state-of-the-art baseline algorithms, and the GenSI can achieve similar optimization results by using far fewer iterations than the ordinary swarm intelligence algorithm. Experimental tests demonstrate that introducing the CVAE mechanism achieves a 58.7% reduction in execution time, which enables the deployment of GenSI even on AAV platforms with limited computing power.
低空经济(LAE)的快速发展大大提高了自主飞行器(AAV)在各种应用中的利用率,需要高效、安全的AAV群之间的通信方法。在本研究中,我们将分布式协同波束形成(DCB)引入到AAV群中,并通过控制相应的信号分布来处理窃听者合谋。具体来说,我们考虑了两个AAV群之间的双向dcb支持空中通信,并将这些群构建为两个AAV虚拟天线阵列。然后,最小化双向已知保密容量和最大旁瓣电平,分别避免从已知窃听者和未知窃听者处泄露信息。同时,在构建虚拟天线阵列时,我们也尽量减少了aav的能量消耗。由于安全性能和能源效率之间的冲突关系,我们通过制定一个多目标优化问题来考虑这些目标,该问题是np困难的,具有大量的决策变量。为此,我们设计了一种新的基于条件变分自编码器(CVAE)的生成方法和一种强大的群体智能算法的生成群智能框架,以较少的开销来解决这一问题。在该框架中,CVAE可以收集群智能算法在其他环境状态下得到的专家解,探索特征和模式,从而直接生成新的环境因素下的高质量初始解,供群智能算法高效地搜索解空间。仿真结果表明,所提出的群智能算法优于其他最先进的基线算法,并且GenSI算法可以通过比普通群智能算法少得多的迭代获得相似的优化结果。实验测试表明,引入CVAE机制可以将执行时间缩短58.7%,即使在计算能力有限的AAV平台上也可以部署GenSI。
{"title":"Aerial Secure Collaborative Communications Under Eavesdropper Collusion in Low-Altitude Economy: A Generative Swarm Intelligent Approach","authors":"Jiahui Li;Geng Sun;Qingqing Wu;Shuang Liang;Jiacheng Wang;Dusit Niyato;Dong In Kim","doi":"10.1109/TMC.2025.3633953","DOIUrl":"https://doi.org/10.1109/TMC.2025.3633953","url":null,"abstract":"The rapid development of the low-altitude economy (LAE) has significantly increased the utilization of autonomous aerial vehicles (AAVs) in various applications, necessitating efficient and secure communication methods among AAV swarms. In this work, we aim to introduce distributed collaborative beamforming (DCB) into AAV swarms and handle the eavesdropper collusion by controlling the corresponding signal distributions. Specifically, we consider a two-way DCB-enabled aerial communication between two AAV swarms and construct these swarms as two AAV virtual antenna arrays. Then, we minimize the two-way known secrecy capacity and maximum sidelobe level to avoid information leakage from the known and unknown eavesdroppers, respectively. Simultaneously, we also minimize the energy consumption of AAVs when constructing virtual antenna arrays. Due to the conflicting relationships between secure performance and energy efficiency, we consider these objectives by formulating a multi-objective optimization problem, which is NP-hard and with a large number of decision variables. Accordingly, we design a novel generative swarm intelligence (GenSI) framework to solve the problem with less overhead, which contains a conditional variational autoencoder (CVAE)-based generative method and a proposed powerful swarm intelligence algorithm. In this framework, CVAE can collect expert solutions obtained by the swarm intelligence algorithm in other environment states to explore characteristics and patterns, thereby directly generating high-quality initial solutions in new environment factors for the swarm intelligence algorithm to search solution space efficiently. Simulation results show that the proposed swarm intelligence algorithm outperforms other state-of-the-art baseline algorithms, and the GenSI can achieve similar optimization results by using far fewer iterations than the ordinary swarm intelligence algorithm. Experimental tests demonstrate that introducing the CVAE mechanism achieves a 58.7% reduction in execution time, which enables the deployment of GenSI even on AAV platforms with limited computing power.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5758-5774"},"PeriodicalIF":9.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Patch Matter: Dual Modality Patch Contrastive for Non-Stationary Radio Signals 斑块物质:非平稳无线电信号的双模态斑块对比
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/TMC.2025.3633263
Jie Su;Yuting Jiang;Yuheng Ye;Zhenyu Wen;Taotao Li;Shibo He;Xiaoqin Zhang;Rajiv Ranjan
The emergence of abundant non-stationary radio signal (NSRS) data presents significant opportunities for applications in wireless communications, radar systems, remote sensing, and healthcare. While deep learning models have shown promise in capturing sequence dependencies, deriving generic and fine-grained representations of NSRS data remains challenging due to its complex, dynamic nature and the scarcity of labeled data. The NSRS data are often frequency-sensitive and exhibit minuscule inter-class distances, posing significant challenges for precise classification. To address these issues, we propose a novel Dual Modality Patch Contrastive (DMPC) framework. This framework leverages a stochastic patching paradigm for diverse local pattern extraction and a time-frequency cross-view optimization for frequency-sensitive feature mining. Furthermore, an Attentive Patch Aggregation (APA) mechanism enhances fine-grained inference under few-shot conditions through patch-level feature voting. Extensive experiments demonstrate the effectiveness of our approach in addressing the unique challenges of NSRS data.
大量非固定无线电信号(NSRS)数据的出现为无线通信、雷达系统、遥感和医疗保健等领域的应用提供了重要的机会。虽然深度学习模型在捕获序列依赖关系方面表现出了希望,但由于其复杂性、动态性和标记数据的稀缺性,派生NSRS数据的通用和细粒度表示仍然具有挑战性。NSRS数据通常是频率敏感的,并且显示极小的类间距离,这对精确分类构成了重大挑战。为了解决这些问题,我们提出了一个新的双模态补丁对比(DMPC)框架。该框架利用随机修补范式进行多种局部模式提取,并利用时频交叉视图优化进行频率敏感特征挖掘。此外,细心补丁聚合(APA)机制通过补丁级特征投票增强了在少镜头条件下的细粒度推理。大量的实验证明了我们的方法在解决NSRS数据的独特挑战方面的有效性。
{"title":"Patch Matter: Dual Modality Patch Contrastive for Non-Stationary Radio Signals","authors":"Jie Su;Yuting Jiang;Yuheng Ye;Zhenyu Wen;Taotao Li;Shibo He;Xiaoqin Zhang;Rajiv Ranjan","doi":"10.1109/TMC.2025.3633263","DOIUrl":"https://doi.org/10.1109/TMC.2025.3633263","url":null,"abstract":"The emergence of abundant non-stationary radio signal (NSRS) data presents significant opportunities for applications in wireless communications, radar systems, remote sensing, and healthcare. While deep learning models have shown promise in capturing sequence dependencies, deriving generic and fine-grained representations of NSRS data remains challenging due to its complex, dynamic nature and the scarcity of labeled data. The NSRS data are often frequency-sensitive and exhibit minuscule inter-class distances, posing significant challenges for precise classification. To address these issues, we propose a novel <bold>D</b>ual <bold>M</b>odality <bold>P</b>atch <bold>C</b>ontrastive (<bold>DMPC</b>) framework. This framework leverages a stochastic patching paradigm for diverse local pattern extraction and a time-frequency cross-view optimization for frequency-sensitive feature mining. Furthermore, an Attentive Patch Aggregation (APA) mechanism enhances fine-grained inference under few-shot conditions through patch-level feature voting. Extensive experiments demonstrate the effectiveness of our approach in addressing the unique challenges of NSRS data.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5936-5951"},"PeriodicalIF":9.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RF-MAE: A Self-Supervised Adaptive Frequency Masked Autoencoder With Radio-Frequency Signal Processing Applications RF-MAE:一种具有射频信号处理应用的自监督自适应频率掩码自编码器
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/TMC.2025.3633287
Zhongyi Wen;Zhikai Zhai;Yatong Wang;Qiang Li;Wei Zhang;Huaizong Shao
Radio-frequency (RF) signal processing has seen significant advancements with the advent of deep learning, providing more accurate and efficient solutions for tasks such as signal classification and generation. However, most existing methods are heavily dependent on large labeled datasets, which are often scarce and costly to obtain in real-world RF environments. Furthermore, these approaches tend to be task-specific, limiting their ability to generalize across various RF applications. To address these challenges, this paper proposes RF-MAE, a self-supervised adaptive frequency masked autoencoder. RF-MAE leverages self-supervised learning (SSL) to capture intrinsic patterns from large-scale unlabeled RF data. Central to RF-MAE is a novel Adaptive Frequency Masked (AFM) strategy, which dynamically masks frequency components based on their energy distribution. Supported by a robust theoretical foundation, AFM ensures the model focuses on the most informative signal components, thereby enhancing generalization across RF tasks. By pretraining on unlabeled data and fine-tuning on specific tasks, RF-MAE significantly reduces the reliance on labeled datasets while improving adaptability across diverse RF signal processing tasks. Experimental results demonstrate that RF-MAE consistently outperforms traditional models, underscoring its potential to generalize across tasks and deliver superior performance in a wide range of RF signal applications.
随着深度学习的出现,射频(RF)信号处理取得了重大进展,为信号分类和生成等任务提供了更准确、更高效的解决方案。然而,大多数现有方法严重依赖于大型标记数据集,而这些数据集在实际射频环境中往往稀缺且成本高昂。此外,这些方法往往是特定于任务的,限制了它们在各种RF应用程序中的泛化能力。为了解决这些问题,本文提出了一种自监督自适应掩频自编码器RF-MAE。RF- mae利用自监督学习(SSL)从大规模未标记的RF数据中捕获内在模式。RF-MAE的核心是一种新的自适应频率掩蔽(AFM)策略,该策略根据频率分量的能量分布动态掩蔽频率分量。在强大的理论基础的支持下,AFM确保模型专注于最具信息量的信号成分,从而增强RF任务的泛化。通过对未标记数据的预训练和对特定任务的微调,RF- mae显著降低了对标记数据集的依赖,同时提高了不同RF信号处理任务的适应性。实验结果表明,RF- mae始终优于传统模型,强调其在跨任务推广的潜力,并在广泛的RF信号应用中提供卓越的性能。
{"title":"RF-MAE: A Self-Supervised Adaptive Frequency Masked Autoencoder With Radio-Frequency Signal Processing Applications","authors":"Zhongyi Wen;Zhikai Zhai;Yatong Wang;Qiang Li;Wei Zhang;Huaizong Shao","doi":"10.1109/TMC.2025.3633287","DOIUrl":"https://doi.org/10.1109/TMC.2025.3633287","url":null,"abstract":"Radio-frequency (RF) signal processing has seen significant advancements with the advent of deep learning, providing more accurate and efficient solutions for tasks such as signal classification and generation. However, most existing methods are heavily dependent on large labeled datasets, which are often scarce and costly to obtain in real-world RF environments. Furthermore, these approaches tend to be task-specific, limiting their ability to generalize across various RF applications. To address these challenges, this paper proposes RF-MAE, a self-supervised adaptive frequency masked autoencoder. RF-MAE leverages self-supervised learning (SSL) to capture intrinsic patterns from large-scale unlabeled RF data. Central to RF-MAE is a novel Adaptive Frequency Masked (AFM) strategy, which dynamically masks frequency components based on their energy distribution. Supported by a robust theoretical foundation, AFM ensures the model focuses on the most informative signal components, thereby enhancing generalization across RF tasks. By pretraining on unlabeled data and fine-tuning on specific tasks, RF-MAE significantly reduces the reliance on labeled datasets while improving adaptability across diverse RF signal processing tasks. Experimental results demonstrate that RF-MAE consistently outperforms traditional models, underscoring its potential to generalize across tasks and deliver superior performance in a wide range of RF signal applications.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5920-5935"},"PeriodicalIF":9.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward an Ultra-Lightweight Text Representation: Causal Convolutional Networks With Feature Self-Enhancement Mechanisms 迈向超轻量文本表示:具有特征自增强机制的因果卷积网络
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/TMC.2025.3633250
Xiaoyan Liu;Peng Yang;Xingyu Liu;Zhenqi Wang;Zijian Bai
Text representation models, such as Transformers and RNNs, are foundational to natural language processing research and play a crucial role in a wide range of downstream tasks. However, most existing large models rely on the self-attention mechanism, which involves frequent token interactions, a large parameter scale, and substantial hardware and data resource requirements—posing significant challenges for deployment on mobile and edge devices with limited computing power, high communication latency, or intermittent connectivity. To address these practical constraints, this study proposes an ultra-lightweight text representation model based on Causal Recurrent Convolutional Networks (CausalRCN), specifically engineered for efficient inference in resource-constrained mobile environments. Rather than introducing new atomic modules, our innovation lies in the systematic integration of causal convolution and self-enhancement mechanisms into a compact recurrent architecture that eliminates attention-driven computation entirely. By replacing self-attention with parallelized causal convolutions and recurrent feature propagation, the model achieves dramatically lower computational complexity and memory footprint, enabling real-time inference on commodity edge hardware. The design leverages local causality to approximate global contextual dependencies and employs feature self-enhancement to strengthen nonlinear expressiveness, effectively balancing accuracy and efficiency. Experimental results show that with only 666 K parameters, the proposed model achieves over 10× higher throughput and 50% less memory usage compared to competitive baselines, while improving accuracy by more than 1%. Validated on real-world platforms including Raspberry Pi 4B and Qualcomm Snapdragon Gen 1, the system demonstrates up to a 90× speedup over ALBERT in end-to-end latency, underscoring its suitability for mobile intelligent computing.
文本表示模型,如Transformers和rnn,是自然语言处理研究的基础,在广泛的下游任务中起着至关重要的作用。然而,大多数现有的大型模型依赖于自关注机制,这涉及频繁的令牌交互、大的参数规模以及大量的硬件和数据资源需求,这对在计算能力有限、通信延迟高或间歇性连接的移动和边缘设备上的部署构成了重大挑战。为了解决这些实际限制,本研究提出了一种基于因果循环卷积网络(CausalRCN)的超轻量级文本表示模型,该模型专门用于在资源受限的移动环境中进行高效推理。我们的创新不是引入新的原子模块,而是将因果卷积和自我增强机制系统地集成到一个紧凑的循环架构中,从而完全消除了注意力驱动的计算。通过用并行因果卷积和循环特征传播取代自关注,该模型显著降低了计算复杂度和内存占用,实现了在商品边缘硬件上的实时推理。设计利用局部因果关系来近似全局上下文依赖关系,并采用特征自增强来加强非线性表达,有效地平衡了准确性和效率。实验结果表明,与竞争基准相比,仅使用666 K个参数,该模型的吞吐量提高了10倍以上,内存使用减少了50%,准确率提高了1%以上。在包括Raspberry Pi 4B和Qualcomm Snapdragon Gen 1在内的实际平台上进行了验证,该系统在端到端延迟方面的速度比ALBERT提高了90倍,强调了其对移动智能计算的适用性。
{"title":"Toward an Ultra-Lightweight Text Representation: Causal Convolutional Networks With Feature Self-Enhancement Mechanisms","authors":"Xiaoyan Liu;Peng Yang;Xingyu Liu;Zhenqi Wang;Zijian Bai","doi":"10.1109/TMC.2025.3633250","DOIUrl":"https://doi.org/10.1109/TMC.2025.3633250","url":null,"abstract":"Text representation models, such as Transformers and RNNs, are foundational to natural language processing research and play a crucial role in a wide range of downstream tasks. However, most existing large models rely on the self-attention mechanism, which involves frequent token interactions, a large parameter scale, and substantial hardware and data resource requirements—posing significant challenges for deployment on mobile and edge devices with limited computing power, high communication latency, or intermittent connectivity. To address these practical constraints, this study proposes an ultra-lightweight text representation model based on Causal Recurrent Convolutional Networks (CausalRCN), specifically engineered for efficient inference in resource-constrained mobile environments. Rather than introducing new atomic modules, our innovation lies in the systematic integration of causal convolution and self-enhancement mechanisms into a compact recurrent architecture that eliminates attention-driven computation entirely. By replacing self-attention with parallelized causal convolutions and recurrent feature propagation, the model achieves dramatically lower computational complexity and memory footprint, enabling real-time inference on commodity edge hardware. The design leverages local causality to approximate global contextual dependencies and employs feature self-enhancement to strengthen nonlinear expressiveness, effectively balancing accuracy and efficiency. Experimental results show that with only 666 K parameters, the proposed model achieves over 10× higher throughput and 50% less memory usage compared to competitive baselines, while improving accuracy by more than 1%. Validated on real-world platforms including Raspberry Pi 4B and Qualcomm Snapdragon Gen 1, the system demonstrates up to a 90× speedup over ALBERT in end-to-end latency, underscoring its suitability for mobile intelligent computing.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5890-5904"},"PeriodicalIF":9.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scene Graph-Aided Probabilistic Semantic Communication for Image Transmission 面向图像传输的场景图辅助概率语义通信
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/TMC.2025.3633495
Chen Zhu;Siyun Liang;Zhouxiang Zhao;Jianrong Bao;Zhaohui Yang;Zhaoyang Zhang;Dusit Niyato
Semantic communication emphasizes the transmission of meaning rather than raw symbols. It offers a promising solution to alleviate network congestion and improve transmission efficiency. In this paper, we propose a wireless image communication framework that employs probability graphs as shared semantic knowledge base among distributed users. High-level image semantics are represented via scene graphs, and a two-stage compression algorithm is devised to remove predictable components based on learned conditional and co-occurrence probabilities. At the transmitter, the algorithm filters redundant relations and entity pairs, while at the receiver, semantic recovery leverages the same probability graphs to reconstruct omitted information. For further research, we also put forward a multi-round semantic compression algorithm with its theoretical performance analysis. Simulation results demonstrate that our semantic-aware scheme achieves superior transmission throughput and satiable semantic alignment, validating the efficacy of leveraging high-level semantics for image communication.
语义交际强调的是意义的传递,而不是单纯的符号。它为缓解网络拥塞和提高传输效率提供了一种很有前途的解决方案。本文提出了一种利用概率图作为分布式用户间共享语义知识库的无线图像通信框架。通过场景图表示高级图像语义,并设计了一种两阶段压缩算法,根据学习到的条件和共现概率去除可预测的组件。在发送端,算法过滤冗余关系和实体对,而在接收端,语义恢复利用相同的概率图来重建遗漏的信息。为了进一步研究,我们还提出了一种多轮语义压缩算法,并对其进行了理论性能分析。仿真结果表明,我们的语义感知方案实现了优越的传输吞吐量和令人满意的语义对齐,验证了利用高级语义进行图像通信的有效性。
{"title":"Scene Graph-Aided Probabilistic Semantic Communication for Image Transmission","authors":"Chen Zhu;Siyun Liang;Zhouxiang Zhao;Jianrong Bao;Zhaohui Yang;Zhaoyang Zhang;Dusit Niyato","doi":"10.1109/TMC.2025.3633495","DOIUrl":"https://doi.org/10.1109/TMC.2025.3633495","url":null,"abstract":"Semantic communication emphasizes the transmission of meaning rather than raw symbols. It offers a promising solution to alleviate network congestion and improve transmission efficiency. In this paper, we propose a wireless image communication framework that employs probability graphs as shared semantic knowledge base among distributed users. High-level image semantics are represented via scene graphs, and a two-stage compression algorithm is devised to remove predictable components based on learned conditional and co-occurrence probabilities. At the transmitter, the algorithm filters redundant relations and entity pairs, while at the receiver, semantic recovery leverages the same probability graphs to reconstruct omitted information. For further research, we also put forward a multi-round semantic compression algorithm with its theoretical performance analysis. Simulation results demonstrate that our semantic-aware scheme achieves superior transmission throughput and satiable semantic alignment, validating the efficacy of leveraging high-level semantics for image communication.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5905-5919"},"PeriodicalIF":9.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-Architecture Knowledge Distillation for Deep Joint Source-Channel Coding 深度联合信源信道编码的跨架构知识精馏
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/TMC.2025.3633266
Simin Dai;Xuechen Chen;Xiaoheng Deng;Siyu Lin
Deep learning-based joint source-channel coding (DeepJSCC) has shown significant benefits in emerging semantic and task-oriented communications, providing a promising solution for reducing latency and bandwidth requirements in next-generation mobile networks. However, its deployment on resource-constrained devices is limited by model complexity. Devices with varying computational capacities require models of distinct architectures and complexity levels, motivating the design of a cross-architecture model compression scheme for DeepJSCC. In this paper, we propose a cross-architecture knowledge distillation framework called CAKDJSCC for heterogeneous DeepJSCC models. Specifically, we design a teaching assistant network with feature fusion modules (FFMs) that dynamically perceive architecture gaps between teacher and student models, thereby generating student-adaptive feature representations to alleviate feature space misalignment caused by architectural inconsistencies. In addition, we introduce a conditional information bottleneck (CIB) loss to optimize the distillation process, which prevents students from overfitting to teacher-specific inductive biases while enhancing knowledge transfer efficiency in cross-architecture scenarios. Extensive experiments demonstrate that our approach significantly improves the student model’s reconstruction accuracy and perceptual quality without increasing the inference latency while minimizing the performance degradation during model compression.
基于深度学习的联合源信道编码(DeepJSCC)在新兴的语义和面向任务的通信中显示出显著的优势,为减少下一代移动网络的延迟和带宽要求提供了一个有前途的解决方案。然而,其在资源受限设备上的部署受到模型复杂性的限制。具有不同计算能力的设备需要不同架构和复杂级别的模型,这激发了DeepJSCC跨架构模型压缩方案的设计。本文针对异构DeepJSCC模型,提出了一种名为CAKDJSCC的跨架构知识蒸馏框架。具体来说,我们设计了一个带有特征融合模块(ffm)的助教网络,该网络动态感知教师和学生模型之间的架构差距,从而生成学生自适应的特征表示,以缓解由于架构不一致而导致的特征空间不对齐。此外,我们引入了条件信息瓶颈(CIB)损失来优化蒸馏过程,从而防止学生过度拟合教师特定的归纳偏差,同时提高跨架构场景中的知识转移效率。大量的实验表明,我们的方法在不增加推理延迟的情况下显著提高了学生模型的重建精度和感知质量,同时最大限度地降低了模型压缩过程中的性能下降。
{"title":"Cross-Architecture Knowledge Distillation for Deep Joint Source-Channel Coding","authors":"Simin Dai;Xuechen Chen;Xiaoheng Deng;Siyu Lin","doi":"10.1109/TMC.2025.3633266","DOIUrl":"https://doi.org/10.1109/TMC.2025.3633266","url":null,"abstract":"Deep learning-based joint source-channel coding (DeepJSCC) has shown significant benefits in emerging semantic and task-oriented communications, providing a promising solution for reducing latency and bandwidth requirements in next-generation mobile networks. However, its deployment on resource-constrained devices is limited by model complexity. Devices with varying computational capacities require models of distinct architectures and complexity levels, motivating the design of a cross-architecture model compression scheme for DeepJSCC. In this paper, we propose a cross-architecture knowledge distillation framework called CAKDJSCC for heterogeneous DeepJSCC models. Specifically, we design a teaching assistant network with feature fusion modules (FFMs) that dynamically perceive architecture gaps between teacher and student models, thereby generating student-adaptive feature representations to alleviate feature space misalignment caused by architectural inconsistencies. In addition, we introduce a conditional information bottleneck (CIB) loss to optimize the distillation process, which prevents students from overfitting to teacher-specific inductive biases while enhancing knowledge transfer efficiency in cross-architecture scenarios. Extensive experiments demonstrate that our approach significantly improves the student model’s reconstruction accuracy and perceptual quality without increasing the inference latency while minimizing the performance degradation during model compression.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5683-5699"},"PeriodicalIF":9.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Runtime Reliability Anomaly Detection for Edge Services Rejuvenation 边缘服务恢复的分层运行时可靠性异常检测
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-14 DOI: 10.1109/TMC.2025.3632794
Lei Wang;Jiyuan Liu;Qiang He;Feifei Chen;Xiaoyu Xia
Multi-access Edge Computing (MEC) deploys computation and storage resources at the network edge, enabling devices to process data and requests on nearby edge services. This reduces data transmission latency and network congestion. However, due to edge servers’ volatile running status and limited resources, the reliability of edge services deployed on them fluctuates over time. This may lead to concept drifts in edge services’ real-time reliability streaming data. A severe negative drift may indicate a runtime reliability anomaly in an edge service, which often impacts users’ Quality of Experience (QoE). To ensure edge services’ reliability, this paper proposes CS-Detection, a hierarchical approach for detecting runtime reliability anomalies based on concept drift. CS-Detection employs the compressed sensing technique to sample complex and large-scale reliability streaming data. It employs a new technique that combines Variational AutoEncoder and Energy-Based Generative Adversarial Network (E2BGAN) to estimate the anomaly level of edge services by calculating the reconstruction error and discriminant error of compressed real-time reliability streaming data. To demonstrate the usefulness of CS-Detection in ensuring the QoE of MEC systems, we present CPRest, a coordinated checkpoint-based effective rejuvenation approach for restoring the normal operation of edge services affected by runtime reliability anomalies. CPRest classifies detection results into four levels and adjusts the edge services’ restart trigger time accordingly. Comprehensive experiments conducted on real-world datasets demonstrate the effectiveness and efficiency of CS-Detection compared to state-of-the-art approaches.
MEC (Multi-access Edge Computing)将计算和存储资源部署在网络边缘,使设备能够在附近的边缘服务上处理数据和请求。这减少了数据传输延迟和网络拥塞。但是,由于边缘服务器的运行状态不稳定,资源有限,部署在边缘服务器上的边缘服务的可靠性会随着时间的推移而波动。这可能会导致边缘服务的实时可靠性流数据的概念漂移。严重的负漂移可能表明边缘服务的运行时可靠性异常,这通常会影响用户的体验质量(QoE)。为了保证边缘服务的可靠性,本文提出了一种基于概念漂移的分层检测运行时可靠性异常的CS-Detection方法。CS-Detection采用压缩感知技术对复杂、大规模的可靠性流数据进行采样。采用变分自动编码器和基于能量的生成对抗网络(E2BGAN)相结合的新技术,通过计算压缩后的实时可靠性流数据的重构误差和判别误差来估计边缘服务的异常水平。为了证明cs检测在确保MEC系统的QoE方面的有用性,我们提出了CPRest,一种基于协调检查点的有效恢复方法,用于恢复受运行时可靠性异常影响的边缘服务的正常运行。CPRest将检测结果分为4个级别,并相应调整边缘业务重启触发时间。在真实世界数据集上进行的综合实验表明,与最先进的方法相比,CS-Detection的有效性和效率。
{"title":"Hierarchical Runtime Reliability Anomaly Detection for Edge Services Rejuvenation","authors":"Lei Wang;Jiyuan Liu;Qiang He;Feifei Chen;Xiaoyu Xia","doi":"10.1109/TMC.2025.3632794","DOIUrl":"https://doi.org/10.1109/TMC.2025.3632794","url":null,"abstract":"Multi-access Edge Computing (MEC) deploys computation and storage resources at the network edge, enabling devices to process data and requests on nearby edge services. This reduces data transmission latency and network congestion. However, due to edge servers’ volatile running status and limited resources, the reliability of edge services deployed on them fluctuates over time. This may lead to concept drifts in edge services’ real-time reliability streaming data. A severe negative drift may indicate a runtime reliability anomaly in an edge service, which often impacts users’ Quality of Experience (QoE). To ensure edge services’ reliability, this paper proposes CS-Detection, a hierarchical approach for detecting runtime reliability anomalies based on concept drift. CS-Detection employs the compressed sensing technique to sample complex and large-scale reliability streaming data. It employs a new technique that combines Variational Auto<underline>E</u>ncoder and <underline>E</u>nergy-<underline>B</u>ased <underline>G</u>enerative <underline>A</u>dversarial <underline>N</u>etwork (E2BGAN) to estimate the anomaly level of edge services by calculating the reconstruction error and discriminant error of compressed real-time reliability streaming data. To demonstrate the usefulness of CS-Detection in ensuring the QoE of MEC systems, we present CPRest, a coordinated checkpoint-based effective rejuvenation approach for restoring the normal operation of edge services affected by runtime reliability anomalies. CPRest classifies detection results into four levels and adjusts the edge services’ restart trigger time accordingly. Comprehensive experiments conducted on real-world datasets demonstrate the effectiveness and efficiency of CS-Detection compared to state-of-the-art approaches.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5807-5823"},"PeriodicalIF":9.2,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AFedLF: Adaptive Layer Freezing of Foundation Models in Heterogeneous Federated Learning 异构联邦学习中基础模型的自适应层冻结
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-14 DOI: 10.1109/TMC.2025.3632655
Yue Zeng;Jie Zhang;Song Guo;Baoliu Ye;Zhihao Qu;Zicong Hong;Bin Tang;Jinyu Chen;Junlong Zhou;Jiaying Yu
The rise of pre-trained foundation models (FMs) has popularized the trend of fine-tuning FMs to fit downstream tasks, while Federated Learning (FL) has become the de-facto approach for training distributed data with privacy-preservation. However, fine-tuning FMs in FL faces overwhelming overheads due to its bulky nature. While freezing parameters in FM have the potential to accelerate FL training, existing freezing strategies statically freeze parameters on specified or already converged layers, incur severe accuracy degradation, and resource-inefficiency in heterogeneous environments. In this paper, we propose AFedLF, an adaptive freezing framework for FM in FL, to accelerate its wall-clock time for convergence without losing its final accuracy. However, this poses great challenges, as different freezing strategies lead to different accuracy gains and time overheads, while unfreezing more layers may bring marginal accuracy gains but significant time overheads. To address this challenge, AFedLF mathematically establishes a correlation between the freezing strategy and the accuracy gain and time overhead, and allocates adaptive freezing strategies to clients, based on our insight that unfreezing more layers on devices with strong computation and communication capabilities helps improve resource efficiency. Besides, AFedLF incorporates our well-designed intermediate result caching scheme with constant approximation ratios utilizing the limited storage capacity on mobile devices to cache intermediate results to skip forward propagation, further saving wall-clock time. Finally, we implemented AFedLF using an open-source FL benchmark, and extensive trace-driven experimental results showed that AFedLF accelerates wall-clock time by up to 6.1× compared to state-of-the-art solutions, without sacrificing accuracy.
预训练基础模型(FMs)的兴起已经普及了微调FMs以适应下游任务的趋势,而联邦学习(FL)已经成为训练具有隐私保护的分布式数据的事实上的方法。然而,在FL中微调fm由于其庞大的性质而面临着巨大的开销。虽然FM中的冻结参数有可能加速FL训练,但现有的冻结策略在指定或已经收敛的层上静态冻结参数,导致严重的精度下降,并且在异构环境中资源效率低下。本文提出了一种自适应冻结框架AFedLF,在不损失最终精度的前提下,加快了其时钟时间的收敛速度。然而,这带来了巨大的挑战,因为不同的冻结策略会导致不同的精度增益和时间开销,而解冻更多的层可能会带来边际精度增益,但会带来显著的时间开销。为了应对这一挑战,AFedLF在数学上建立了冻结策略与精度增益和时间开销之间的相关性,并根据我们的见解,在具有强大计算和通信能力的设备上解冻更多层有助于提高资源效率,为客户分配自适应冻结策略。此外,AFedLF结合了我们精心设计的中间结果缓存方案,采用恒定的近似比率,利用移动设备有限的存储容量缓存中间结果,跳过前向传播,进一步节省了时钟时间。最后,我们使用开源的FL基准实现了AFedLF,广泛的跟踪驱动实验结果表明,与最先进的解决方案相比,AFedLF在不牺牲精度的情况下将挂钟时间加快了6.1倍。
{"title":"AFedLF: Adaptive Layer Freezing of Foundation Models in Heterogeneous Federated Learning","authors":"Yue Zeng;Jie Zhang;Song Guo;Baoliu Ye;Zhihao Qu;Zicong Hong;Bin Tang;Jinyu Chen;Junlong Zhou;Jiaying Yu","doi":"10.1109/TMC.2025.3632655","DOIUrl":"https://doi.org/10.1109/TMC.2025.3632655","url":null,"abstract":"The rise of pre-trained foundation models (FMs) has popularized the trend of fine-tuning FMs to fit downstream tasks, while Federated Learning (FL) has become the de-facto approach for training distributed data with privacy-preservation. However, fine-tuning FMs in FL faces overwhelming overheads due to its bulky nature. While freezing parameters in FM have the potential to accelerate FL training, existing freezing strategies statically freeze parameters on specified or already converged layers, incur severe accuracy degradation, and resource-inefficiency in heterogeneous environments. In this paper, we propose AFedLF, an adaptive freezing framework for FM in FL, to accelerate its wall-clock time for convergence without losing its final accuracy. However, this poses great challenges, as different freezing strategies lead to different accuracy gains and time overheads, while unfreezing more layers may bring marginal accuracy gains but significant time overheads. To address this challenge, AFedLF mathematically establishes a correlation between the freezing strategy and the accuracy gain and time overhead, and allocates adaptive freezing strategies to clients, based on our insight that unfreezing more layers on devices with strong computation and communication capabilities helps improve resource efficiency. Besides, AFedLF incorporates our well-designed intermediate result caching scheme with constant approximation ratios utilizing the limited storage capacity on mobile devices to cache intermediate results to skip forward propagation, further saving wall-clock time. Finally, we implemented AFedLF using an open-source FL benchmark, and extensive trace-driven experimental results showed that AFedLF accelerates wall-clock time by up to 6.1× compared to state-of-the-art solutions, without sacrificing accuracy.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5666-5682"},"PeriodicalIF":9.2,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wi-ViTAL: Domain Generalization of Wireless Human Activity Recognition Using Linear Attention Vision Transformer With Adversarial Learning Wi-ViTAL:基于线性注意力视觉转换器和对抗学习的无线人体活动识别的领域推广
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-14 DOI: 10.1109/TMC.2025.3632752
Yeqin Li;David Chieng;Boon Giin Lee;Chiew Foong Kwong;Kian Ming Lim;Shuyu Li
The learning-based, passive, device-free wireless human activity recognition (WHAR) systems still face significant challenges, especially in real-world deployments. Environmental differences and domain diversities cause signals collected in the source domain to have a different distribution from those in the target domain, and this affects the accuracy. To achieve domain generalization (DG), a multi-scale linear attention vision transformer (ViT) based feature extractor and domain adversarial learning with Wasserstein distance are proposed. By aligning both marginal and conditional distributions across different source domains, the adversarial learning reduces the differences between trained and unseen domains. As a result, the extracted features become domain-invariant in the latent space, ensuring accuracy is preserved in new or unseen domains. Extensive evaluations using commercial IEEE 802.11ac routers with human activity data collected over different days, environments, human subjects, and obstacle configurations show that the proposed Wi-ViTAL achieves 97.57% average accuracy for five-label classification and more than 76% for eight-label classification in unseen domains. Wi-ViTAL also demonstrates an overall DG improvement compared to other recent benchmarks.
基于学习、无源、无设备的无线人类活动识别(WHAR)系统仍然面临重大挑战,特别是在实际部署中。环境的差异和域的多样性会导致源域和目标域采集到的信号分布不同,从而影响精度。为了实现领域泛化,提出了一种基于多尺度线性注意视觉变换的特征提取器和基于Wasserstein距离的领域对抗学习。通过对齐不同源域的边际分布和条件分布,对抗性学习减少了训练域和未见域之间的差异。因此,提取的特征在潜在空间中成为域不变的,确保在新的或未见过的域中保持准确性。使用商用IEEE 802.11ac路由器对不同日期、环境、人类受试者和障碍物配置收集的人类活动数据进行了广泛的评估,结果表明,在未知领域,所提出的Wi-ViTAL在五标签分类方面的平均准确率达到97.57%,在八标签分类方面的平均准确率超过76%。与最近的其他基准测试相比,Wi-ViTAL还显示了总体DG的改进。
{"title":"Wi-ViTAL: Domain Generalization of Wireless Human Activity Recognition Using Linear Attention Vision Transformer With Adversarial Learning","authors":"Yeqin Li;David Chieng;Boon Giin Lee;Chiew Foong Kwong;Kian Ming Lim;Shuyu Li","doi":"10.1109/TMC.2025.3632752","DOIUrl":"https://doi.org/10.1109/TMC.2025.3632752","url":null,"abstract":"The learning-based, passive, device-free wireless human activity recognition (WHAR) systems still face significant challenges, especially in real-world deployments. Environmental differences and domain diversities cause signals collected in the source domain to have a different distribution from those in the target domain, and this affects the accuracy. To achieve domain generalization (DG), a multi-scale linear attention vision transformer (ViT) based feature extractor and domain adversarial learning with Wasserstein distance are proposed. By aligning both marginal and conditional distributions across different source domains, the adversarial learning reduces the differences between trained and unseen domains. As a result, the extracted features become domain-invariant in the latent space, ensuring accuracy is preserved in new or unseen domains. Extensive evaluations using commercial IEEE 802.11ac routers with human activity data collected over different days, environments, human subjects, and obstacle configurations show that the proposed Wi-ViTAL achieves 97.57% average accuracy for five-label classification and more than 76% for eight-label classification in unseen domains. Wi-ViTAL also demonstrates an overall DG improvement compared to other recent benchmarks.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 4","pages":"5792-5806"},"PeriodicalIF":9.2,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Mobile Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1