首页 > 最新文献

IEEE Transactions on Mobile Computing最新文献

英文 中文
Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks 基于量子多智能体强化学习的空-空-地集成网络协同移动接入
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-08-18 DOI: 10.1109/TMC.2025.3599683
Gyu Seon Kim;Yeryeong Cho;Jaehyun Chung;Soohyun Park;Soyi Jung;Zhu Han;Joongheon Kim
Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance uncrewed aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for providing cooperatively global access sustainability and energy efficiency. However, as the number of CubeSats and HALE-UAVs, increases, the scheduling dimension of each ground station (GS) increases. As a result, each GS can fall into the curse of dimensionality, and this challenge becomes one major hurdle for efficient global access. Therefore, this paper provides a quantum multi-agent reinforcement Learning (QMARL)-based method for scheduling between GSs and CubeSats/HALE-UAVs in order to improve global access availability and energy efficiency. The main reason why the QMARL-based scheduler can be beneficial is that the algorithm facilitates a logarithmic-scale reduction in scheduling action dimensions, which is one critical feature as the number of CubeSats and HALE-UAVs expands. Additionally, individual GSs have different traffic demands depending on their locations and characteristics, thus it is essential to provide differentiated access services. The superiority of the proposed scheduler is validated through data-intensive experiments in realistic CubeSat/HALE-UAV settings.
仅通过立方体卫星实现全球空间-空气-地面综合网络(SAGIN)的接入存在重大挑战,例如特定区域(例如极地地区)的接入可持续性限制以及立方体卫星的能源效率限制。为了解决这些问题,高空长航时无人驾驶飞行器(hale - uav)可以补充立方体卫星的这些缺点,以提供合作的全球访问可持续性和能源效率。然而,随着立方体卫星和hale - uav数量的增加,每个地面站(GS)的调度维度增加。因此,每个GS都可能陷入维度的诅咒,这一挑战成为有效的全球访问的一个主要障碍。因此,本文提出了一种基于量子多智能体强化学习(QMARL)的gps和CubeSats/ hale - uav之间调度方法,以提高全局访问可用性和能源效率。基于qmarl的调度程序之所以有益,主要原因是该算法有助于调度操作维度的对数尺度减少,这是立方体卫星和hale - uav数量增加时的一个关键特征。此外,各gps的位置和特点不同,会有不同的流量需求,因此必须提供差异化的接达服务。在实际的CubeSat/HALE-UAV设置中,通过数据密集型实验验证了所提出的调度程序的优越性。
{"title":"Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks","authors":"Gyu Seon Kim;Yeryeong Cho;Jaehyun Chung;Soohyun Park;Soyi Jung;Zhu Han;Joongheon Kim","doi":"10.1109/TMC.2025.3599683","DOIUrl":"https://doi.org/10.1109/TMC.2025.3599683","url":null,"abstract":"Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance uncrewed aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for providing cooperatively global access sustainability and energy efficiency. However, as the number of CubeSats and HALE-UAVs, increases, the scheduling dimension of each ground station (GS) increases. As a result, each GS can fall into the curse of dimensionality, and this challenge becomes one major hurdle for efficient global access. Therefore, this paper provides a quantum multi-agent reinforcement Learning (QMARL)-based method for scheduling between GSs and CubeSats/HALE-UAVs in order to improve global access availability and energy efficiency. The main reason why the QMARL-based scheduler can be beneficial is that the algorithm facilitates a logarithmic-scale reduction in scheduling action dimensions, which is one critical feature as the number of CubeSats and HALE-UAVs expands. Additionally, individual GSs have different traffic demands depending on their locations and characteristics, thus it is essential to provide differentiated access services. The superiority of the proposed scheduler is validated through data-intensive experiments in realistic CubeSat/HALE-UAV settings.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 1","pages":"1200-1218"},"PeriodicalIF":9.2,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DRDST: Low-Latency DAG Consensus Through Robust Dynamic Sharding and Tree-Broadcasting for IoV DRDST:通过鲁棒动态分片和树广播实现IoV低延迟DAG共识
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-08-18 DOI: 10.1109/TMC.2025.3599385
Runhua Chen;Haoxiang Luo;Gang Sun;Hongfang Yu;Dusit Niyato;Schahram Dustdar
The Internet of Vehicles (IoV) is emerging as a pivotal technology for enhancing traffic management and safety. Its rapid development demands solutions for enhanced communication efficiency and reduced latency. However, traditional centralized networks struggle to meet these demands, prompting the exploration of decentralized solutions such as blockchain. Addressing blockchain’s scalability challenges posed by the growing number of nodes and transactions calls for innovative solutions, among which sharding stands out as a pivotal approach to significantly enhance blockchain throughput. However, existing schemes still face challenges related to a) the impact of vehicle mobility on blockchain consensus, especially for cross-shard transaction; and b) the strict requirements of low latency consensus in a highly dynamic network. In this paper, we propose a DAG (Directed Acyclic Graph) consensus leveraging Robust Dynamic Sharding and Tree-broadcasting (DRDST) to address these challenges. Specifically, we first develop a standard for evaluating the network stability of nodes, combined with the nodes’ trust values, to propose a novel robust sharding model that is solved through the design of the Genetic Sharding Algorithm (GSA). Then, we optimize the broadcast latency of the whole sharded network by improving the tree-broadcasting to minimize the maximum broadcast latency within each shard. On this basis, we also design a DAG consensus scheme based on an improved hashgraph protocol, which can efficiently handle cross-shard transactions. Finally, the simulation proves the proposed scheme is superior to the comparison schemes in latency, throughput, consensus success rate, and node traffic load.
车联网(IoV)正在成为加强交通管理和安全的关键技术。它的快速发展需要提高通信效率和降低延迟的解决方案。然而,传统的集中式网络难以满足这些需求,这促使人们探索区块链等去中心化解决方案。随着节点和事务数量的增加,区块链的可扩展性面临挑战,需要创新的解决方案,其中分片是显著提高区块链吞吐量的关键方法。然而,现有方案仍然面临以下挑战:a)车辆移动性对区块链共识的影响,特别是对于跨分片交易;b)高动态网络对低延迟共识的严格要求。在本文中,我们提出了一种DAG(有向无环图)共识,利用鲁棒动态分片和树广播(DRDST)来解决这些挑战。具体而言,我们首先制定了一个评估节点网络稳定性的标准,结合节点的信任值,提出了一种新的鲁棒分片模型,该模型通过设计遗传分片算法(GSA)来解决。然后,我们通过改进树广播来优化整个分片网络的广播延迟,以最小化每个分片内的最大广播延迟。在此基础上,我们还设计了一种基于改进哈希图协议的DAG共识方案,可以有效地处理跨分片事务。最后,通过仿真验证了该方案在时延、吞吐量、一致性成功率和节点流量负载等方面优于比较方案。
{"title":"DRDST: Low-Latency DAG Consensus Through Robust Dynamic Sharding and Tree-Broadcasting for IoV","authors":"Runhua Chen;Haoxiang Luo;Gang Sun;Hongfang Yu;Dusit Niyato;Schahram Dustdar","doi":"10.1109/TMC.2025.3599385","DOIUrl":"https://doi.org/10.1109/TMC.2025.3599385","url":null,"abstract":"The Internet of Vehicles (IoV) is emerging as a pivotal technology for enhancing traffic management and safety. Its rapid development demands solutions for enhanced communication efficiency and reduced latency. However, traditional centralized networks struggle to meet these demands, prompting the exploration of decentralized solutions such as blockchain. Addressing blockchain’s scalability challenges posed by the growing number of nodes and transactions calls for innovative solutions, among which sharding stands out as a pivotal approach to significantly enhance blockchain throughput. However, existing schemes still face challenges related to <italic>a</i>) the impact of vehicle mobility on blockchain consensus, especially for cross-shard transaction; and <italic>b</i>) the strict requirements of low latency consensus in a highly dynamic network. In this paper, we propose a DAG (Directed Acyclic Graph) consensus leveraging Robust Dynamic Sharding and Tree-broadcasting (DRDST) to address these challenges. Specifically, we first develop a standard for evaluating the network stability of nodes, combined with the nodes’ trust values, to propose a novel robust sharding model that is solved through the design of the Genetic Sharding Algorithm (GSA). Then, we optimize the broadcast latency of the whole sharded network by improving the tree-broadcasting to minimize the maximum broadcast latency within each shard. On this basis, we also design a DAG consensus scheme based on an improved hashgraph protocol, which can efficiently handle cross-shard transactions. Finally, the simulation proves the proposed scheme is superior to the comparison schemes in latency, throughput, consensus success rate, and node traffic load.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 1","pages":"1236-1253"},"PeriodicalIF":9.2,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Security-Enhanced Spatial Range Query Over Large-Scale Encrypted Mobile Cloud Datasets 大规模加密移动云数据集的安全增强空间范围查询
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-08-15 DOI: 10.1109/TMC.2025.3599519
Yinbin Miao;Jiaqi Yu;Jiliang Li;Xinghua Li;Jun Feng;Zhiquan Liu;Robert H. Deng
Privacy-preserving spatial range query allows users to obtain valid data based on specific spatial attributes or geographical location while ensuring privacy. However, many existing Privacy-Preserving Spatial Range Query (PSRQ) schemes generally face the problems of low query efficiency and insufficient security when dealing with large-scale mobile cloud data sets, and it is difficult to resist Indistinguishability under Chosen-Plaintext Attack (IND-CPA). To solve these challenges, we first propose an Efficient and Secure Spatial Range Query scheme (ESSRQ), which is based on a dual mobile cloud architecture by integrating Geohash algorithm, Circular Shift Coalesce Zero-Sum Garbled Bloom Filter (CSC-ZGBF) and Symmetric Homomorphic Encryption (SHE), achieving a constant search complexity. However, ESSRQ cannot protect the access patterns, where the cloud server still has the potential to infer attacks based on the index position and even obtain plaintext queries. On this basis, we further propose an extended scheme ESSRQ-PIR, which introduces Private Information Retrieval (PIR) into single mobile cloud-based architecture, effectively prevents the leakage of access patterns, enhances the security of ESSRQ and can also realize efficient query on large-scale cloud datasets. Formal security analysis proves that our proposed schemes are secure against IND-CPA, and extensive experiments demonstrate that our schemes improve the query efficiency by up to nearly 20 times when compared with previous solutions. These features make the proposed schemes particularly suitable for privacy-preserving spatial queries in mobile cloud computing environments.
保护隐私的空间范围查询允许用户在保证隐私的同时,根据特定的空间属性或地理位置获取有效的数据。然而,现有的许多保隐私空间范围查询(PSRQ)方案在处理大规模移动云数据集时普遍存在查询效率低、安全性不足的问题,且难以抵抗选择明文攻击(IND-CPA)下的不可分辨性。为了解决这些挑战,我们首先提出了一种高效安全的空间距离查询方案(ESSRQ),该方案基于双移动云架构,通过集成Geohash算法、圆移位合并零和乱码布隆滤波器(CSC-ZGBF)和对称同态加密(SHE),实现了恒定的搜索复杂度。然而,ESSRQ不能保护访问模式,云服务器仍然有可能根据索引位置推断攻击,甚至获得明文查询。在此基础上,我们进一步提出了一种扩展方案ESSRQ-PIR,该方案将私有信息检索(Private Information Retrieval, PIR)引入到单个基于移动云的架构中,有效地防止了访问模式的泄漏,增强了ESSRQ的安全性,也可以实现对大规模云数据集的高效查询。正式的安全性分析证明了我们提出的方案对IND-CPA是安全的,大量的实验表明,我们的方案与以前的方案相比,查询效率提高了近20倍。这些特征使得所提出的方案特别适合于移动云计算环境中保护隐私的空间查询。
{"title":"Security-Enhanced Spatial Range Query Over Large-Scale Encrypted Mobile Cloud Datasets","authors":"Yinbin Miao;Jiaqi Yu;Jiliang Li;Xinghua Li;Jun Feng;Zhiquan Liu;Robert H. Deng","doi":"10.1109/TMC.2025.3599519","DOIUrl":"https://doi.org/10.1109/TMC.2025.3599519","url":null,"abstract":"Privacy-preserving spatial range query allows users to obtain valid data based on specific spatial attributes or geographical location while ensuring privacy. However, many existing Privacy-Preserving Spatial Range Query (PSRQ) schemes generally face the problems of low query efficiency and insufficient security when dealing with large-scale mobile cloud data sets, and it is difficult to resist Indistinguishability under Chosen-Plaintext Attack (IND-CPA). To solve these challenges, we first propose an Efficient and Secure Spatial Range Query scheme (ESSRQ), which is based on a dual mobile cloud architecture by integrating Geohash algorithm, Circular Shift Coalesce Zero-Sum Garbled Bloom Filter (CSC-ZGBF) and Symmetric Homomorphic Encryption (SHE), achieving a constant search complexity. However, ESSRQ cannot protect the access patterns, where the cloud server still has the potential to infer attacks based on the index position and even obtain plaintext queries. On this basis, we further propose an extended scheme ESSRQ-PIR, which introduces Private Information Retrieval (PIR) into single mobile cloud-based architecture, effectively prevents the leakage of access patterns, enhances the security of ESSRQ and can also realize efficient query on large-scale cloud datasets. Formal security analysis proves that our proposed schemes are secure against IND-CPA, and extensive experiments demonstrate that our schemes improve the query efficiency by up to nearly 20 times when compared with previous solutions. These features make the proposed schemes particularly suitable for privacy-preserving spatial queries in mobile cloud computing environments.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 1","pages":"1184-1199"},"PeriodicalIF":9.2,"publicationDate":"2025-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WaveRoRA: Wavelet Rotary Route Attention for Multivariate Time Series Forecasting 多变量时间序列预测的小波旋转路径关注
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-08-14 DOI: 10.1109/TMC.2025.3599406
Aobo Liang;Yan Sun;Nadra Guizani
Various sensors of Internet of Things (IoT) generate massive amounts of mobile traffic data, forming multivariate time series (MTS). Accurate forecasting of MTS facilitates the enhancement of proactive autoscaling and resource allocation in edge networks. While recent Transformer-based models (Transformers) have achieved significant success in MTS forecasting (MTSF), they tend to rely solely on either time-domain or frequency-domain features, which captures inadequate trends and periodic characteristics. To this end, we propose a wavelet learning framework that seamlessly integrates wavelet transforms with Transformers to benefit from time and frequency characteristics. We design a mixing-splitting architecture to model multi-scale wavelet coefficients and utilizes the attention mechanism to capture inter-series dependencies in the wavelet domain. However, the vanilla softmax self-attention (SA) is high-computational-cost and its smoothing effect diminishes the contrast between strong and weak variable correlations. Therefore, we propose a novel attention mechanism: Rotary Route Attention (RoRA). RoRA incorporates rotary positional embeddings to enhance feature diversity and introduces a small number of routing tokens $r$ to aggregate information from the $KV$ matrices and redistribute it to the $Q$ matrix. Such design strengthens interactions among strongly correlated variables while mitigating the impact of weakly correlated noise. We further propose WaveRoRA, a unified model that leverages RoRA capturing inter-series dependencies in the wavelet domain. We conduct extensive experiments on eight real-world datasets. The results indicate that WaveRoRA outperforms existing state-of-the-art models while maintaining lower computational costs.
物联网的各种传感器产生大量的移动流量数据,形成多元时间序列(MTS)。MTS的准确预测有助于增强边缘网络中的主动自缩放和资源分配。虽然最近基于变压器的模型(变压器)在MTS预测(MTSF)中取得了显著的成功,但它们往往仅仅依赖于时域或频域特征,这捕获了不充分的趋势和周期性特征。为此,我们提出了一个小波学习框架,将小波变换与变压器无缝集成,以受益于时间和频率特性。我们设计了一种混合分裂结构来建模多尺度小波系数,并利用注意机制来捕获小波域内的序列间依赖关系。然而,香草软最大自注意(SA)的计算成本高,其平滑效果减弱了强弱变量相关性之间的对比。因此,我们提出一种新的注意机制:旋转路径注意(RoRA)。RoRA采用旋转位置嵌入来增强特征多样性,并引入少量路由令牌$r$来聚合来自$KV$矩阵的信息并将其重新分配到$Q$矩阵。这样的设计加强了强相关变量之间的相互作用,同时减轻了弱相关噪声的影响。我们进一步提出了WaveRoRA,这是一个利用RoRA捕获小波域序列间依赖关系的统一模型。我们在八个真实世界的数据集上进行了广泛的实验。结果表明,WaveRoRA在保持较低计算成本的同时,性能优于现有的最先进模型。
{"title":"WaveRoRA: Wavelet Rotary Route Attention for Multivariate Time Series Forecasting","authors":"Aobo Liang;Yan Sun;Nadra Guizani","doi":"10.1109/TMC.2025.3599406","DOIUrl":"https://doi.org/10.1109/TMC.2025.3599406","url":null,"abstract":"Various sensors of Internet of Things (IoT) generate massive amounts of mobile traffic data, forming multivariate time series (MTS). Accurate forecasting of MTS facilitates the enhancement of proactive autoscaling and resource allocation in edge networks. While recent Transformer-based models (Transformers) have achieved significant success in MTS forecasting (MTSF), they tend to rely solely on either time-domain or frequency-domain features, which captures inadequate trends and periodic characteristics. To this end, we propose a wavelet learning framework that seamlessly integrates wavelet transforms with Transformers to benefit from time and frequency characteristics. We design a mixing-splitting architecture to model multi-scale wavelet coefficients and utilizes the attention mechanism to capture inter-series dependencies in the wavelet domain. However, the vanilla softmax self-attention (SA) is high-computational-cost and its smoothing effect diminishes the contrast between strong and weak variable correlations. Therefore, we propose a novel attention mechanism: Rotary Route Attention (RoRA). RoRA incorporates rotary positional embeddings to enhance feature diversity and introduces a small number of routing tokens <inline-formula><tex-math>$r$</tex-math></inline-formula> to aggregate information from the <inline-formula><tex-math>$KV$</tex-math></inline-formula> matrices and redistribute it to the <inline-formula><tex-math>$Q$</tex-math></inline-formula> matrix. Such design strengthens interactions among strongly correlated variables while mitigating the impact of weakly correlated noise. We further propose WaveRoRA, a unified model that leverages RoRA capturing inter-series dependencies in the wavelet domain. We conduct extensive experiments on eight real-world datasets. The results indicate that WaveRoRA outperforms existing state-of-the-art models while maintaining lower computational costs.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 1","pages":"1287-1301"},"PeriodicalIF":9.2,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-Temporal Pyramid-Based Multi-Scale Data Completion in Sparse Crowdsensing 基于时空金字塔的稀疏众感多尺度数据补全
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-08-14 DOI: 10.1109/TMC.2025.3599322
Wenbin Liu;Hao Du;En Wang;Jiajian Lv;Weiting Liu;Bo Yang;Jie Wu
Sparse Crowdsensing has emerged as a crucial and flexible method for collecting spatio-temporal data in various applications, such as traffic management, environmental monitoring, and disaster response. By recruiting users and utilizing their diverse mobile devices, this approach often results in data that is both sparse and multi-scale, complicating the data completion process. Although numerous data completion algorithms have been developed to address data sparsity, most assume that the collected data is of the same or similar scale, rendering them ineffective for multi-scale data. To overcome this limitation, in this paper, we propose a spatio-temporal pyramid-based multi-scale data completion framework in Sparse Crowdsensing. The basic idea is to leverage a pyramid structure to efficiently capture the complex interrelations between different scales. We first develop a Spatial-Temporal Pyramid Construction Module (ST-PC) to handle multi-scale inputs, and then propose a Spatial-Temporal Pyramid Attention Mechanism (ST-PAM) to capture multi-scale correlations while reducing computational complexity. Furthermore, our method incorporates cross-scale constraints to optimize completion performance. Extensive experiments on four real-world spatio-temporal datasets demonstrate the effectiveness of our framework in multi-scale data completion.
在交通管理、环境监测和灾害响应等各种应用中,稀疏众感已经成为收集时空数据的一种重要而灵活的方法。通过招募用户并利用他们不同的移动设备,这种方法通常会导致数据既稀疏又多尺度,使数据完成过程复杂化。尽管已经开发了许多数据补全算法来解决数据稀疏性问题,但大多数算法都假设收集的数据具有相同或相似的规模,这使得它们对多尺度数据无效。为了克服这一局限性,本文提出了一种基于时空金字塔的稀疏众感多尺度数据补全框架。其基本思想是利用金字塔结构来有效地捕捉不同尺度之间复杂的相互关系。我们首先开发了一个时空金字塔构建模块(ST-PC)来处理多尺度输入,然后提出了一个时空金字塔注意机制(ST-PAM)来捕获多尺度相关性,同时降低计算复杂度。此外,我们的方法结合了跨尺度约束来优化完井性能。在四个真实时空数据集上的大量实验证明了我们的框架在多尺度数据补全方面的有效性。
{"title":"Spatio-Temporal Pyramid-Based Multi-Scale Data Completion in Sparse Crowdsensing","authors":"Wenbin Liu;Hao Du;En Wang;Jiajian Lv;Weiting Liu;Bo Yang;Jie Wu","doi":"10.1109/TMC.2025.3599322","DOIUrl":"https://doi.org/10.1109/TMC.2025.3599322","url":null,"abstract":"Sparse Crowdsensing has emerged as a crucial and flexible method for collecting spatio-temporal data in various applications, such as traffic management, environmental monitoring, and disaster response. By recruiting users and utilizing their diverse mobile devices, this approach often results in data that is both sparse and multi-scale, complicating the data completion process. Although numerous data completion algorithms have been developed to address data sparsity, most assume that the collected data is of the same or similar scale, rendering them ineffective for multi-scale data. To overcome this limitation, in this paper, we propose a spatio-temporal pyramid-based multi-scale data completion framework in Sparse Crowdsensing. The basic idea is to leverage a pyramid structure to efficiently capture the complex interrelations between different scales. We first develop a Spatial-Temporal Pyramid Construction Module (ST-PC) to handle multi-scale inputs, and then propose a Spatial-Temporal Pyramid Attention Mechanism (ST-PAM) to capture multi-scale correlations while reducing computational complexity. Furthermore, our method incorporates cross-scale constraints to optimize completion performance. Extensive experiments on four real-world spatio-temporal datasets demonstrate the effectiveness of our framework in multi-scale data completion.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 1","pages":"1270-1286"},"PeriodicalIF":9.2,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-Tier Submodel Partition Framework for Enhancing UAV Swarm Robustness in Forest Fire Detection 增强森林火灾探测无人机群鲁棒性的两层子模型划分框架
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-08-14 DOI: 10.1109/TMC.2025.3599384
Xingyu Li;Wenzhe Zhang;Linfeng Liu;Ping Wang
The deployment of Unmanned Aerial Vehicle (UAV) swarm for Forest Fire Detection (FFD) missions presents unique challenges, e.g., the early forest fires are difficult to identify due to environment diversity and feature complexity, especially when some UAVs could be destroyed in harsh environments. To address these challenges, UAV swarm-based FFD missions can leverage advanced deep learning techniques, where online model updates, robustness, and communication overhead control become crucial for ensuring the effectiveness and adaptability of these missions. In this paper, we propose a Two-tier Submodel Partition Framework (TSPF) to enhance the robustness of UAV swarm conducting FFD missions. TSPF utilizes online model updates to adapt to diverse mission environments, thus strengthening the generalization capability of the model. In addition, a graph coloring method, an intragroup backup mechanism, and a Dynamic Server Selection (DSS) mechanism for the grouping are employed to enhance the robustness of FFD missions when some UAVs are destroyed, hence maintaining the high performance of FFD missions in harsh environments. Moreover, TSPF enables submodel updates by aggregating the parameters of selected layers within/between UAV groups, thereby effectively reducing the model parameter uploads (communication overhead) in model training. Experimental evaluations demonstrate that our proposed TSPF significantly improves the detection accuracy of forest fires, enhances the robustness of FFD missions against the destruction of some UAVs, and reduces the communication overhead in FFD missions.
在森林火灾探测(FFD)任务中部署无人机群面临着独特的挑战,例如,由于环境多样性和特征复杂性,早期森林火灾难以识别,特别是当一些无人机可能在恶劣环境中被摧毁时。为了应对这些挑战,基于无人机群的FFD任务可以利用先进的深度学习技术,其中在线模型更新、鲁棒性和通信开销控制对于确保这些任务的有效性和适应性至关重要。为了提高无人机群执行FFD任务的鲁棒性,提出了一种两层子模型划分框架(TSPF)。TSPF利用在线模型更新来适应不同的任务环境,从而增强了模型的泛化能力。此外,采用图着色方法、组内备份机制和分组动态服务器选择(DSS)机制增强FFD任务在部分无人机被摧毁时的鲁棒性,从而保持FFD任务在恶劣环境下的高性能。此外,TSPF通过聚合无人机组内/组间选定层的参数实现子模型更新,从而有效减少模型训练中的模型参数上传(通信开销)。实验结果表明,该算法显著提高了森林火灾的探测精度,增强了FFD任务对某些无人机破坏的鲁棒性,降低了FFD任务的通信开销。
{"title":"Two-Tier Submodel Partition Framework for Enhancing UAV Swarm Robustness in Forest Fire Detection","authors":"Xingyu Li;Wenzhe Zhang;Linfeng Liu;Ping Wang","doi":"10.1109/TMC.2025.3599384","DOIUrl":"https://doi.org/10.1109/TMC.2025.3599384","url":null,"abstract":"The deployment of Unmanned Aerial Vehicle (UAV) swarm for Forest Fire Detection (FFD) missions presents unique challenges, e.g., the early forest fires are difficult to identify due to environment diversity and feature complexity, especially when some UAVs could be destroyed in harsh environments. To address these challenges, UAV swarm-based FFD missions can leverage advanced deep learning techniques, where online model updates, robustness, and communication overhead control become crucial for ensuring the effectiveness and adaptability of these missions. In this paper, we propose a Two-tier Submodel Partition Framework (TSPF) to enhance the robustness of UAV swarm conducting FFD missions. TSPF utilizes online model updates to adapt to diverse mission environments, thus strengthening the generalization capability of the model. In addition, a graph coloring method, an intragroup backup mechanism, and a Dynamic Server Selection (DSS) mechanism for the grouping are employed to enhance the robustness of FFD missions when some UAVs are destroyed, hence maintaining the high performance of FFD missions in harsh environments. Moreover, TSPF enables submodel updates by aggregating the parameters of selected layers within/between UAV groups, thereby effectively reducing the model parameter uploads (communication overhead) in model training. Experimental evaluations demonstrate that our proposed TSPF significantly improves the detection accuracy of forest fires, enhances the robustness of FFD missions against the destruction of some UAVs, and reduces the communication overhead in FFD missions.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 1","pages":"1169-1183"},"PeriodicalIF":9.2,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DRAM: Digital Twin-Driven Double-Layer Reverse Auction Method for Multi-Platform Vehicular Crowdsensing 数字双驱动双层逆向拍卖多平台车辆众测方法
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-31 DOI: 10.1109/TMC.2025.3594488
Zhenning Wang;Yue Cao;Huan Zhou;Xiaokang Zhou;Jiawen Kang;Houbing Song
Recently, For-Hire Vehicles (FHVs) have emerged as major players in Vehicular CrowdSensing (VCS). However, the heterogeneity of tasks issued by Data Requesters (DRs) and the heterogeneity of sensors equipped on FHVs under different Vehicle Platforms (VPs) bring difficulties to task allocation and execution. It can be concluded that it is important to reasonably analyze the relationship among DRs, VPs, and FHVs, as well as to motivate VPs and FHVs to complete sensing tasks. Therefore, taking advantage of the real-time simulation and intelligent decision-making of Digital Twins (DT), this paper proposes a DT-driven Double-layer Reverse Auction Method (DRAM). In the first layer, the reverse auction is established between each DR and VPs, and in the second layer, the reverse auction is established between each VP and FHVs. Meanwhile, we also introduce a sensing fairness index to ensure the sensing balance of different sub-regions and consider it in the DRAM process. Here, the idea of backward induction is used to solve the above problems, with the goal of minimizing the overhead of winning VP and the average overhead of all DRs. Finally, the effectiveness of the DRAM proposed in this paper is verified based on the real data set. Compared with the baseline method, DRAM can reduce the average overhead of DR by about 4%-25%. Meanwhile, in terms of sensing fairness, it can be improved by up to 55%.
最近,出租车辆(fhv)已成为车辆众传感(VCS)的主要参与者。然而,在不同的车辆平台下,数据请求者(dr)发出的任务的异构性以及fhv上配备的传感器的异构性给任务的分配和执行带来了困难。由此可见,合理分析dr、VPs和FHVs之间的关系,激励VPs和FHVs完成传感任务是非常重要的。因此,利用数字孪生(DT)的实时仿真和智能决策,本文提出了一种数字孪生驱动的双层反向拍卖方法(DRAM)。第一层在每个DR和VP之间建立反向拍卖,第二层在每个VP和fhv之间建立反向拍卖。同时,我们还引入了感知公平性指标来保证不同子区域的感知平衡,并在DRAM过程中加以考虑。在这里,使用逆向归纳的思想来解决上述问题,目标是最小化赢得VP的开销和所有dr的平均开销。最后,基于实际数据集验证了本文提出的DRAM的有效性。与基线方法相比,DRAM可以减少DR的平均开销约4%-25%。同时,在感知公平性方面,它可以提高高达55%。
{"title":"DRAM: Digital Twin-Driven Double-Layer Reverse Auction Method for Multi-Platform Vehicular Crowdsensing","authors":"Zhenning Wang;Yue Cao;Huan Zhou;Xiaokang Zhou;Jiawen Kang;Houbing Song","doi":"10.1109/TMC.2025.3594488","DOIUrl":"https://doi.org/10.1109/TMC.2025.3594488","url":null,"abstract":"Recently, For-Hire Vehicles (FHVs) have emerged as major players in Vehicular CrowdSensing (VCS). However, the heterogeneity of tasks issued by Data Requesters (DRs) and the heterogeneity of sensors equipped on FHVs under different Vehicle Platforms (VPs) bring difficulties to task allocation and execution. It can be concluded that it is important to reasonably analyze the relationship among DRs, VPs, and FHVs, as well as to motivate VPs and FHVs to complete sensing tasks. Therefore, taking advantage of the real-time simulation and intelligent decision-making of Digital Twins (DT), this paper proposes a DT-driven <underline>D</u>ouble-layer <underline>R</u>everse <underline>A</u>uction <underline>M</u>ethod (DRAM). In the first layer, the reverse auction is established between each DR and VPs, and in the second layer, the reverse auction is established between each VP and FHVs. Meanwhile, we also introduce a sensing fairness index to ensure the sensing balance of different sub-regions and consider it in the DRAM process. Here, the idea of backward induction is used to solve the above problems, with the goal of minimizing the overhead of winning VP and the average overhead of all DRs. Finally, the effectiveness of the DRAM proposed in this paper is verified based on the real data set. Compared with the baseline method, DRAM can reduce the average overhead of DR by about 4%-25%. Meanwhile, in terms of sensing fairness, it can be improved by up to 55%.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 12","pages":"13725-13742"},"PeriodicalIF":9.2,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145442694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User Context Generation for Large Language Models From Mobile Sensing Data 基于移动传感数据的大型语言模型的用户上下文生成
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-28 DOI: 10.1109/TMC.2025.3591561
Rui Xing;Zhenzhe Zheng;Fan Wu;Guihai Chen
Large language models (LLMs) exhibit remarkable capabilities in natural language understanding and generation. However, the accuracy of the inference depends deeply on the contexts of queries, especially for personal services. Abundant mobile sensing data collected by sensors embedded in smart devices can proactively capture real-time user contexts. However, raw sensing data are low-quality (e.g., existing missing data and data redundancy) and are incapable of providing accurate contexts. In this work, we present ConGen, a user context generation framework for LLMs, aiming at prompting users’ contexts through their implicit mobile sensing information. ConGen integrates two components: refined data completion and multi-granularity context compression. Specifically, the refined data completion couples data-centric feature selection by leveraging the eXplainable AI (XAI) method into the data imputation model to generate fewer but more informative features for efficient and effective context generation. Additionally, we implement multi-granularity context compression, reducing timestep- and context-level data redundancy while further elevating context quality. Experiment results show that ConGen can generate more accurate context, surpassing competitive baselines by 1.3%-8.3% in context inference on all four datasets. Moreover, context compression significantly reduces redundancy to $1/70sim 1/40$ of the original data amount, and further improves the context accuracy. Finally, the enhanced performance of LLMs, as demonstrated by both quantitative and qualitative evaluations of prompting ConGen-generated user contexts, underscores the effectiveness of ConGen.
大型语言模型(llm)在自然语言理解和生成方面表现出非凡的能力。然而,推理的准确性在很大程度上取决于查询的上下文,特别是对于个人服务。智能设备中嵌入的传感器采集到丰富的移动传感数据,能够主动捕捉实时用户情境。然而,原始传感数据质量低(例如,现有的缺失数据和数据冗余),无法提供准确的上下文。在这项工作中,我们提出了ConGen,一个用于llm的用户上下文生成框架,旨在通过用户的隐式移动感知信息提示用户上下文。ConGen集成了两个组件:精细的数据完成和多粒度上下文压缩。具体来说,精细化的数据补全将以数据为中心的特征选择结合起来,利用可解释的人工智能(eXplainable AI, XAI)方法与数据输入模型相结合,生成更少但信息量更大的特征,从而实现高效的上下文生成。此外,我们实现了多粒度上下文压缩,减少了时间步长和上下文级别的数据冗余,同时进一步提高了上下文质量。实验结果表明,ConGen可以生成更准确的上下文,在所有四个数据集上的上下文推理都比竞争基准高出1.3%-8.3%。此外,上下文压缩显著降低冗余到原始数据量的1/70 / 1/40,进一步提高了上下文的准确性。最后,通过对提示ConGen生成的用户上下文进行定量和定性评估,llm的性能得到了提高,这强调了ConGen的有效性。
{"title":"User Context Generation for Large Language Models From Mobile Sensing Data","authors":"Rui Xing;Zhenzhe Zheng;Fan Wu;Guihai Chen","doi":"10.1109/TMC.2025.3591561","DOIUrl":"https://doi.org/10.1109/TMC.2025.3591561","url":null,"abstract":"Large language models (LLMs) exhibit remarkable capabilities in natural language understanding and generation. However, the accuracy of the inference depends deeply on the contexts of queries, especially for personal services. Abundant mobile sensing data collected by sensors embedded in smart devices can proactively capture real-time user contexts. However, raw sensing data are low-quality (e.g., existing missing data and data redundancy) and are incapable of providing accurate contexts. In this work, we present ConGen, a user context generation framework for LLMs, aiming at prompting users’ contexts through their implicit mobile sensing information. ConGen integrates two components: refined data completion and multi-granularity context compression. Specifically, the refined data completion couples data-centric feature selection by leveraging the eXplainable AI (XAI) method into the data imputation model to generate fewer but more informative features for efficient and effective context generation. Additionally, we implement multi-granularity context compression, reducing timestep- and context-level data redundancy while further elevating context quality. Experiment results show that ConGen can generate more accurate context, surpassing competitive baselines by 1.3%-8.3% in context inference on all four datasets. Moreover, context compression significantly reduces redundancy to <inline-formula><tex-math>$1/70sim 1/40$</tex-math></inline-formula> of the original data amount, and further improves the context accuracy. Finally, the enhanced performance of LLMs, as demonstrated by both quantitative and qualitative evaluations of prompting ConGen-generated user contexts, underscores the effectiveness of ConGen.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 12","pages":"13678-13695"},"PeriodicalIF":9.2,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145442695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-Time MU-MIMO Beamforming With Limited Channel Samples in 5G Networks 5G网络中有限信道采样的实时MU-MIMO波束形成
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-25 DOI: 10.1109/TMC.2025.3592929
Shaoran Li;Nan Jiang;Chengzhang Li;Shiva Acharya;Yubo Wu;Weijun Xie;Wenjing Lou;Y. Thomas Hou
MU-MIMO beamforming is a key technology for 5G networks, relying on Channel State Information (CSI). However, in practice, the estimated CSI in reality is prone to uncertainty. Further, a MU-MIMO beamforming solution must be derived within a millisecond to be useful for real-time 5G applications. We present ReDBeam—a real-time data-driven beamforming solution for MU-MIMO using limited CSI data samples. The main novelties of ReDBeam are a parallel algorithm and an optimized GPU implementation. ReDBeam delivers a MU-MIMO beamforming solution within 1 millisecond to meet the probabilistic data rate requirements from the users, and minimize a base station’s power consumption. Through extensive experiments, we show that ReDBeam consistently meets the stringent 1-millisecond real-time requirement and is orders of magnitude faster than other state-of-the-art algorithms. ReDBeam conclusively demonstrates that MU-MIMO beamforming with data rate requirements can be achieved in real-time using only limited CSI data samples.
MU-MIMO波束形成是5G网络的关键技术,它依赖于信道状态信息(CSI)。然而,在实践中,现实中估计的CSI容易存在不确定性。此外,必须在一毫秒内推导出MU-MIMO波束形成解决方案,才能用于实时5G应用。我们提出了redbeam -一种实时数据驱动的MU-MIMO波束形成解决方案,使用有限的CSI数据样本。ReDBeam的主要创新点是并行算法和优化的GPU实现。ReDBeam在1毫秒内提供MU-MIMO波束形成解决方案,以满足用户的概率数据速率要求,并最大限度地降低基站的功耗。通过广泛的实验,我们表明ReDBeam始终满足严格的1毫秒实时要求,并且比其他最先进的算法快几个数量级。ReDBeam最终证明,仅使用有限的CSI数据样本就可以实现具有数据速率要求的MU-MIMO波束形成。
{"title":"Real-Time MU-MIMO Beamforming With Limited Channel Samples in 5G Networks","authors":"Shaoran Li;Nan Jiang;Chengzhang Li;Shiva Acharya;Yubo Wu;Weijun Xie;Wenjing Lou;Y. Thomas Hou","doi":"10.1109/TMC.2025.3592929","DOIUrl":"https://doi.org/10.1109/TMC.2025.3592929","url":null,"abstract":"MU-MIMO beamforming is a key technology for 5G networks, relying on Channel State Information (CSI). However, in practice, the estimated CSI in reality is prone to uncertainty. Further, a MU-MIMO beamforming solution must be derived within a millisecond to be useful for real-time 5G applications. We present ReDBeam—a real-time data-driven beamforming solution for MU-MIMO using limited CSI data samples. The main novelties of ReDBeam are a parallel algorithm and an optimized GPU implementation. ReDBeam delivers a MU-MIMO beamforming solution within 1 millisecond to meet the probabilistic data rate requirements from the users, and minimize a base station’s power consumption. Through extensive experiments, we show that ReDBeam consistently meets the stringent 1-millisecond real-time requirement and is orders of magnitude faster than other state-of-the-art algorithms. ReDBeam conclusively demonstrates that MU-MIMO beamforming with data rate requirements can be achieved in real-time using only limited CSI data samples.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 12","pages":"13549-13566"},"PeriodicalIF":9.2,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145442707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FluidEdge: Expediting Serverless Machine Learning Inference via Bottleneck-Aware Auto-Scaling on Edge SoCs FluidEdge:通过边缘soc上的瓶颈感知自动缩放加速无服务器机器学习推理
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-24 DOI: 10.1109/TMC.2025.3592334
Borui Li;Tiange Xia;Weilong Wang;Jingyuan Zhang;Shuai Wang;Chenhong Cao;Zheng Dong;Shuai Wang
Mobile applications based on machine learning (ML) are increasingly relying on offloading to the edge devices for low-latency, resource-efficient computation. Applying serverless computing for these ML applications on the edge offers a promising solution for handling dynamic workloads while meeting user-specified latency service-level objectives (SLOs). However, existing serverless frameworks, with their coarse-grained data parallelism and rigid model partitioning, are inadequate for ML inference on widely adopted edge System-on-Chip (SoC) devices. This paper presents FluidEdge, an edge-native serverless inference framework. FluidEdge identifies bottleneck operators in ML models and addresses them through a novel fine-grained intra-function latency-sensitive auto-scaling approach that dynamically scales inference bottlenecks during online serving. Additionally, it employs inter-function scaling to further prevent latency SLO violations and leverages the unified memory of edge SoCs for efficient data sharing during inference. Experimental results demonstrate that FluidEdge achieves a 37.4% latency improvement and 67.3% -87.6% SLO violation reduction compared to best-performed state-of-the-art serverless inference frameworks.
基于机器学习(ML)的移动应用程序越来越依赖于卸载到边缘设备,以实现低延迟、资源高效的计算。在边缘为这些ML应用程序应用无服务器计算提供了一种很有前途的解决方案,可以处理动态工作负载,同时满足用户指定的延迟服务水平目标(slo)。然而,现有的无服务器框架,由于其粗粒度的数据并行性和严格的模型划分,不足以在广泛采用的边缘系统芯片(SoC)设备上进行机器学习推理。本文介绍了FluidEdge,一个边缘本地无服务器推理框架。FluidEdge识别ML模型中的瓶颈操作符,并通过一种新颖的细粒度功能内部延迟敏感的自动缩放方法来解决它们,该方法可以在在线服务期间动态缩放推理瓶颈。此外,它采用功能间扩展来进一步防止延迟SLO违规,并利用边缘soc的统一内存在推理期间进行有效的数据共享。实验结果表明,与性能最好的最先进的无服务器推理框架相比,FluidEdge实现了37.4%的延迟改进和67.3% -87.6%的SLO违规减少。
{"title":"FluidEdge: Expediting Serverless Machine Learning Inference via Bottleneck-Aware Auto-Scaling on Edge SoCs","authors":"Borui Li;Tiange Xia;Weilong Wang;Jingyuan Zhang;Shuai Wang;Chenhong Cao;Zheng Dong;Shuai Wang","doi":"10.1109/TMC.2025.3592334","DOIUrl":"https://doi.org/10.1109/TMC.2025.3592334","url":null,"abstract":"Mobile applications based on machine learning (ML) are increasingly relying on offloading to the edge devices for low-latency, resource-efficient computation. Applying serverless computing for these ML applications on the edge offers a promising solution for handling dynamic workloads while meeting user-specified latency service-level objectives (SLOs). However, existing serverless frameworks, with their coarse-grained data parallelism and rigid model partitioning, are inadequate for ML inference on widely adopted edge System-on-Chip (SoC) devices. This paper presents FluidEdge, an edge-native serverless inference framework. FluidEdge identifies bottleneck operators in ML models and addresses them through a novel fine-grained intra-function latency-sensitive auto-scaling approach that dynamically scales inference bottlenecks during online serving. Additionally, it employs inter-function scaling to further prevent latency SLO violations and leverages the unified memory of edge SoCs for efficient data sharing during inference. Experimental results demonstrate that FluidEdge achieves a 37.4% latency improvement and 67.3% -87.6% SLO violation reduction compared to best-performed state-of-the-art serverless inference frameworks.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 12","pages":"13586-13599"},"PeriodicalIF":9.2,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145442687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Mobile Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1