首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
Hierarchical Spatial-Temporal Masked Contrast for Skeleton Action Recognition 用于骨骼动作识别的分层时空掩蔽对比技术
Pub Date : 2024-07-17 DOI: 10.1109/TAI.2024.3430260
Wenming Cao;Aoyu Zhang;Zhihai He;Yicha Zhang;Xinpeng Yin
In the field of 3-D action recognition, self-supervised learning has shown promising results but remains a challenging task. Previous approaches to motion modeling often relied on selecting features solely from the temporal or spatial domain, which limited the extraction of higher-level semantic information. Additionally, traditional one-to-one approaches in multilevel comparative learning overlooked the relationships between different levels, hindering the learning representation of the model. To address these issues, we propose the hierarchical spatial-temporal masked network (HSTM) for learning 3-D action representations. HSTM introduces a novel masking method that operates simultaneously in both the temporal and spatial dimensions. This approach leverages semantic relevance to identify meaningful regions in time and space, guiding the masking process based on semantic richness. This guidance is crucial for learning useful feature representations effectively. Furthermore, to enhance the learning of potential features, we introduce cross-level distillation (CLD) to extend the comparative learning approach. By training the model with two types of losses simultaneously, each level of the multilevel comparative learning process can be guided by levels rich in semantic information. This allows for more effective supervision of comparative learning, leading to improved performance. Extensive experiments conducted on the NTU-60, NTU-120, and PKU-MMD datasets demonstrate the effectiveness of our proposed framework. The learned action representations exhibit strong transferability and achieve state-of-the-art results.
在三维动作识别领域,自监督学习已经取得了可喜的成果,但仍然是一项具有挑战性的任务。以往的运动建模方法通常只依赖于从时间或空间域中选择特征,这限制了对更高层次语义信息的提取。此外,多层次比较学习中传统的一对一方法忽略了不同层次之间的关系,阻碍了模型的学习表示。为了解决这些问题,我们提出了用于学习三维动作表征的分层时空遮蔽网络(HSTM)。HSTM 引入了一种在时间和空间维度上同时运行的新型遮蔽方法。这种方法利用语义相关性来识别时间和空间中的有意义区域,并根据语义丰富程度来指导屏蔽过程。这种指导对于有效学习有用的特征表征至关重要。此外,为了加强对潜在特征的学习,我们引入了跨层次蒸馏(CLD)来扩展比较学习方法。通过同时用两类损失对模型进行训练,多层次比较学习过程中的每个层次都能得到语义信息丰富的层次的指导。这样就能更有效地监督比较学习,从而提高性能。在 NTU-60、NTU-120 和 PKU-MMD 数据集上进行的广泛实验证明了我们提出的框架的有效性。学习到的动作表征具有很强的可移植性,并取得了最先进的结果。
{"title":"Hierarchical Spatial-Temporal Masked Contrast for Skeleton Action Recognition","authors":"Wenming Cao;Aoyu Zhang;Zhihai He;Yicha Zhang;Xinpeng Yin","doi":"10.1109/TAI.2024.3430260","DOIUrl":"https://doi.org/10.1109/TAI.2024.3430260","url":null,"abstract":"In the field of 3-D action recognition, self-supervised learning has shown promising results but remains a challenging task. Previous approaches to motion modeling often relied on selecting features solely from the temporal or spatial domain, which limited the extraction of higher-level semantic information. Additionally, traditional one-to-one approaches in multilevel comparative learning overlooked the relationships between different levels, hindering the learning representation of the model. To address these issues, we propose the hierarchical spatial-temporal masked network (HSTM) for learning 3-D action representations. HSTM introduces a novel masking method that operates simultaneously in both the temporal and spatial dimensions. This approach leverages semantic relevance to identify meaningful regions in time and space, guiding the masking process based on semantic richness. This guidance is crucial for learning useful feature representations effectively. Furthermore, to enhance the learning of potential features, we introduce cross-level distillation (CLD) to extend the comparative learning approach. By training the model with two types of losses simultaneously, each level of the multilevel comparative learning process can be guided by levels rich in semantic information. This allows for more effective supervision of comparative learning, leading to improved performance. Extensive experiments conducted on the NTU-60, NTU-120, and PKU-MMD datasets demonstrate the effectiveness of our proposed framework. The learned action representations exhibit strong transferability and achieve state-of-the-art results.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5801-5814"},"PeriodicalIF":0.0,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Study of Enhancing Federated Learning on Non-IID Data With Server Learning 通过服务器学习加强非 IID 数据上的联合学习的研究。
Pub Date : 2024-07-17 DOI: 10.1109/TAI.2024.3430250
Van Sy Mai;Richard J. La;Tao Zhang
Federated learning (FL) has emerged as a means of distributed learning using local data stored at clients with a coordinating server. Recent studies showed that FL can suffer from poor performance and slower convergence when training data at the clients are not independent and identically distributed (IID). Here, we consider auxiliary server learning (SL) as a complementary approach to improving the performance of FL on non-IID data. Our analysis and experiments show that this approach can achieve significant improvements in both model accuracy and convergence time even when the dataset utilized by the server is small and its distribution differs from that of the clients’ aggregate data. Moreover, experimental results suggest that auxiliary SL delivers benefits when employed together with other techniques proposed to mitigate the performance degradation of FL on non-IID data.
联合学习(FL)是一种利用存储在客户端的本地数据与协调服务器进行分布式学习的方法。最近的研究表明,当客户端的训练数据不是独立且同分布的(IID)时,FL 的性能会变差,收敛速度也会变慢。在此,我们考虑将辅助服务器学习作为一种补充方法,以提高 FL 在非独立同分布数据上的性能。我们的分析和实验表明,即使服务器使用的数据集很小,而且其分布与客户端的总数据分布不同,这种方法也能显著提高模型的准确性和收敛时间。此外,实验结果表明,当辅助服务器学习与其他技术一起使用时,能有效缓解 FL 在非 IID 数据上的性能下降问题。
{"title":"A Study of Enhancing Federated Learning on Non-IID Data With Server Learning","authors":"Van Sy Mai;Richard J. La;Tao Zhang","doi":"10.1109/TAI.2024.3430250","DOIUrl":"10.1109/TAI.2024.3430250","url":null,"abstract":"Federated learning (FL) has emerged as a means of distributed learning using local data stored at clients with a coordinating server. Recent studies showed that FL can suffer from poor performance and slower convergence when training data at the clients are not independent and identically distributed (IID). Here, we consider auxiliary server learning (SL) as a \u0000<italic>complementary</i>\u0000 approach to improving the performance of FL on non-IID data. Our analysis and experiments show that this approach can achieve significant improvements in both model accuracy and convergence time even when the dataset utilized by the server is small and its distribution differs from that of the clients’ aggregate data. Moreover, experimental results suggest that auxiliary SL delivers benefits when employed together with other techniques proposed to mitigate the performance degradation of FL on non-IID data.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5589-5604"},"PeriodicalIF":0.0,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Observer-Based Adaptive Fuzzy Control for Singular Systems with Nonlinear Perturbation and Actuator Saturation 基于观测器的自适应模糊控制,用于具有非线性扰动和致动器饱和的奇异系统
Pub Date : 2024-07-16 DOI: 10.1109/TAI.2024.3429052
Qingtan Meng;Qian Ma
This article investigates the adaptive fuzzy control problem for singular systems with actuator saturation and nonlinear perturbation, where the system consists of two coupled differential and algebraic subsystems. To cope with the actuator saturation, a new auxiliary system whose order is the same as the differential subsystem is introduced. With the help of the backstepping method and adaptive fuzzy control method, an observer-based adaptive output feedback tracking control approach is utilized. Under the designed controller, it is proved that the closed-loop system is impulse-free and regular, and all the involved signals are bounded. Furthermore, it is ensured that the tracking error can be adjusted by the errors between the control inputs and the corresponding saturated inputs, as well as the design parameters. Finally, simulation studies demonstrate the validity of the control approach.
本文研究了具有执行器饱和和非线性扰动的奇异系统的自适应模糊控制问题,该系统由两个耦合的微分和代数子系统组成。为了应对执行器饱和,引入了一个新的辅助系统,其阶数与微分子系统相同。在反步法和自适应模糊控制方法的帮助下,采用了基于观测器的自适应输出反馈跟踪控制方法。在所设计的控制器下,闭环系统证明是无脉冲和规则的,所有参与信号都是有界的。此外,还确保了跟踪误差可通过控制输入和相应饱和输入之间的误差以及设计参数进行调整。最后,模拟研究证明了控制方法的有效性。
{"title":"Observer-Based Adaptive Fuzzy Control for Singular Systems with Nonlinear Perturbation and Actuator Saturation","authors":"Qingtan Meng;Qian Ma","doi":"10.1109/TAI.2024.3429052","DOIUrl":"https://doi.org/10.1109/TAI.2024.3429052","url":null,"abstract":"This article investigates the adaptive fuzzy control problem for singular systems with actuator saturation and nonlinear perturbation, where the system consists of two coupled differential and algebraic subsystems. To cope with the actuator saturation, a new auxiliary system whose order is the same as the differential subsystem is introduced. With the help of the backstepping method and adaptive fuzzy control method, an observer-based adaptive output feedback tracking control approach is utilized. Under the designed controller, it is proved that the closed-loop system is impulse-free and regular, and all the involved signals are bounded. Furthermore, it is ensured that the tracking error can be adjusted by the errors between the control inputs and the corresponding saturated inputs, as well as the design parameters. Finally, simulation studies demonstrate the validity of the control approach.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 10","pages":"5090-5099"},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial Intelligence-Driven Framework for Augmented Reality Markerless Navigation in Knee Surgery 人工智能驱动的膝关节手术增强现实无标记导航框架
Pub Date : 2024-07-16 DOI: 10.1109/TAI.2024.3429048
Xue Hu;Fabrizio Cutolo;Hisham Iqbal;Johann Henckel;Ferdinando Rodriguez y Baena
Conventional orthopedic navigation systems depend on marker-based tracking, which may introduce additional skin incisions, increase the risk and discomfort for the patient, and entail increased workflow complexity. The guidance is conveyed via 2-D monitors, which may distract the surgeon and increase the cognitive burden. This study presents an artificial intelligence (AI)—driven surgical navigation framework for knee replacement surgery. The system comprises an augmented reality (AR) interface that combines an occlusions-robust deep learning-based markerless bone tracking and registration algorithm with a commercial HoloLens 2 headset calibrated for the user's perspective on both eyes. The feasibility of such a system in navigating a bone drilling task is investigated with an experienced orthopedic surgeon on three cadaveric knees under realistic operating room (OR) conditions. After registering an implant model to computed tomography (CT) scans, the preoperative plans are determined based on the location of the fixation pins. Navigation accuracy is quantified using a highly accurate optical tracking system. The achieved drilling error is 7.88 $pm$ 2.41 mm in translation and 7.36 $pm$ 1.77${}^{boldsymbol{circ}}$ in orientation. The results demonstrate the viability of integrating AI and AR technology to navigate knee surgery.
传统的骨科导航系统依赖于基于标记的跟踪,这可能会带来额外的皮肤切口,增加病人的风险和不适感,并增加工作流程的复杂性。导引通过二维显示器传递,这可能会分散外科医生的注意力,增加认知负担。本研究提出了一种人工智能(AI)驱动的膝关节置换手术导航框架。该系统包括一个增强现实(AR)界面,它将基于闭塞的深度学习无标记骨追踪和配准算法与根据用户双眼视角校准的商用 HoloLens 2 头显相结合。在真实的手术室(OR)条件下,由一名经验丰富的骨科医生对三个尸体膝关节进行骨钻孔任务导航,研究了这种系统的可行性。将植入物模型与计算机断层扫描(CT)扫描结果进行比对后,根据固定钉的位置确定术前计划。使用高精度光学跟踪系统对导航精度进行量化。钻孔误差在平移时为 7.88 $pm$ 2.41 mm,在定位时为 7.36 $pm$ 1.77${}^{boldsymbol{circ}}$。这些结果证明了将人工智能和 AR 技术整合到膝关节手术导航中的可行性。
{"title":"Artificial Intelligence-Driven Framework for Augmented Reality Markerless Navigation in Knee Surgery","authors":"Xue Hu;Fabrizio Cutolo;Hisham Iqbal;Johann Henckel;Ferdinando Rodriguez y Baena","doi":"10.1109/TAI.2024.3429048","DOIUrl":"https://doi.org/10.1109/TAI.2024.3429048","url":null,"abstract":"Conventional orthopedic navigation systems depend on marker-based tracking, which may introduce additional skin incisions, increase the risk and discomfort for the patient, and entail increased workflow complexity. The guidance is conveyed via 2-D monitors, which may distract the surgeon and increase the cognitive burden. This study presents an artificial intelligence (AI)—driven surgical navigation framework for knee replacement surgery. The system comprises an augmented reality (AR) interface that combines an occlusions-robust deep learning-based markerless bone tracking and registration algorithm with a commercial HoloLens 2 headset calibrated for the user's perspective on both eyes. The feasibility of such a system in navigating a bone drilling task is investigated with an experienced orthopedic surgeon on three cadaveric knees under realistic operating room (OR) conditions. After registering an implant model to computed tomography (CT) scans, the preoperative plans are determined based on the location of the fixation pins. Navigation accuracy is quantified using a highly accurate optical tracking system. The achieved drilling error is 7.88 \u0000<inline-formula><tex-math>$pm$</tex-math></inline-formula>\u0000 2.41 mm in translation and 7.36 \u0000<inline-formula><tex-math>$pm$</tex-math></inline-formula>\u0000 1.77\u0000<inline-formula><tex-math>${}^{boldsymbol{circ}}$</tex-math></inline-formula>\u0000 in orientation. The results demonstrate the viability of integrating AI and AR technology to navigate knee surgery.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 10","pages":"5205-5215"},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10599938","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation 用于分类和分段的全局注意力引导双域点云特征学习
Pub Date : 2024-07-16 DOI: 10.1109/TAI.2024.3429050
Zihao Li;Pan Gao;Kang You;Chuan Yan;Manoranjan Paul
Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a global attention-guided dual-domain feature learning network (GAD) to address the above-mentioned issues. We first devise the contextual position-enhanced transformer (CPT) module, which is armed with an improved global attention mechanism, to produces a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the dual-domain K-nearest neighbor feature fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules.
以往的研究已经证明了基于点的神经模型在点云分析任务中的有效性。然而,为原始点坐标生成高效输入嵌入仍是一个关键问题。此外,另一个问题在于邻近聚合的效率有限,而邻近聚合是网络干系中的关键组成部分。在本文中,我们提出了一种全局注意力引导的双域特征学习网络(GAD)来解决上述问题。我们首先设计了上下文位置增强变换器(CPT)模块,该模块采用改进的全局注意力机制,生成全局感知输入嵌入,作为后续聚合的指导。然后,级联双域 K 近邻特征融合(DKFF),通过新颖的双域特征学习(既重视局部几何关系,又重视长距离语义联系)进行有效的特征聚合。在多个点云分析任务(如分类、部件分割和场景语义分割)上的广泛实验证明了所提方法的卓越性能和所设计模块的功效。
{"title":"Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation","authors":"Zihao Li;Pan Gao;Kang You;Chuan Yan;Manoranjan Paul","doi":"10.1109/TAI.2024.3429050","DOIUrl":"10.1109/TAI.2024.3429050","url":null,"abstract":"Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a global attention-guided dual-domain feature learning network (GAD) to address the above-mentioned issues. We first devise the contextual position-enhanced transformer (CPT) module, which is armed with an improved global attention mechanism, to produces a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the dual-domain K-nearest neighbor feature fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 10","pages":"5167-5178"},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141655081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy Scheduling Optimization for Microgrids Based on Partially Observable Markov Game 基于部分可观测马尔可夫博弈的微电网能源调度优化
Pub Date : 2024-07-16 DOI: 10.1109/TAI.2024.3428510
Jiakai Gong;Nuo Yu;Fen Han;Bin Tang;Haolong Wu;Yuan Ge
Microgrids (MGs) are essential for enhancing energy efficiency and minimizing power usage through the regulation of energy storage systems. Nevertheless, privacy-related concerns obstruct the real-time precise regulation of these systems due to unavailable state-of-charge (SOC) data. This article introduces a self-adaptive energy scheduling optimization framework for MGs that operates without SOC information, utilizing a partially observable Markov game (POMG) to decrease energy usage. Furthermore, to develop an optimal energy scheduling strategy, a MG system optimization approach using recurrent multiagent deep deterministic policy gradient (RMADDPG) is presented. This method is evaluated against other existing techniques such as MADDPG, deterministic recurrent policy gradient (DRPG), and independent Q-learning (IQL), demonstrating reductions in electrical energy consumption by 4.29%, 5.56%, and 12.95%, respectively, according to simulation outcomes.
微电网(MGs)对于通过调节储能系统提高能源效率和最大限度地减少用电量至关重要。然而,由于无法获得充电状态(SOC)数据,与隐私相关的问题阻碍了这些系统的实时精确调节。本文介绍了在没有 SOC 信息的情况下运行的 MG 自适应能量调度优化框架,利用部分可观测马尔可夫博弈(POMG)来减少能量使用。此外,为了制定最佳能源调度策略,还介绍了一种使用递归多代理深度确定性策略梯度(RMADDPG)的 MG 系统优化方法。该方法与其他现有技术(如 MADDPG、确定性递归策略梯度(DRPG)和独立 Q-learning(IQL))进行了对比评估,根据模拟结果,电能消耗分别减少了 4.29%、5.56% 和 12.95%。
{"title":"Energy Scheduling Optimization for Microgrids Based on Partially Observable Markov Game","authors":"Jiakai Gong;Nuo Yu;Fen Han;Bin Tang;Haolong Wu;Yuan Ge","doi":"10.1109/TAI.2024.3428510","DOIUrl":"https://doi.org/10.1109/TAI.2024.3428510","url":null,"abstract":"Microgrids (MGs) are essential for enhancing energy efficiency and minimizing power usage through the regulation of energy storage systems. Nevertheless, privacy-related concerns obstruct the real-time precise regulation of these systems due to unavailable state-of-charge (SOC) data. This article introduces a self-adaptive energy scheduling optimization framework for MGs that operates without SOC information, utilizing a partially observable Markov game (POMG) to decrease energy usage. Furthermore, to develop an optimal energy scheduling strategy, a MG system optimization approach using recurrent multiagent deep deterministic policy gradient (RMADDPG) is presented. This method is evaluated against other existing techniques such as MADDPG, deterministic recurrent policy gradient (DRPG), and independent Q-learning (IQL), demonstrating reductions in electrical energy consumption by 4.29%, 5.56%, and 12.95%, respectively, according to simulation outcomes.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5371-5380"},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
U-Park: A User-Centric Smart Parking Recommendation System for Electric Shared Micromobility Services U-Park:以用户为中心的电动共享微型交通服务智能停车推荐系统
Pub Date : 2024-07-16 DOI: 10.1109/TAI.2024.3428513
Sen Yan;Noel E. O’Connor;Mingming Liu
Electric shared micromobility services (ESMSs) has become a vital element within the mobility as a service framework, contributing to sustainable transportation systems. However, existing ESMS face notable design challenges such as shortcomings in integration, transparency, and user-centered approaches, resulting in increased operational costs and decreased service quality. A key operational issue for ESMS revolves around parking, particularly ensuring the availability of parking spaces as users approach their destinations. For instance, a recent study illustrated that nearly 13% of shared e-bike users in Dublin, Ireland, encounter difficulties parking their e-bikes due to inadequate planning and guidance. In response, we introduce U-Park, a user-centric smart parking recommendation system designed for ESMS, providing tailored recommendations to users by analyzing their historical mobility data, trip trajectory, and parking space availability. We present the system architecture, implement it, and evaluate its performance using real-world data from an Irish-based shared e-bike provider, MOBY Bikes. Our results illustrate U-Park's ability to predict a user's destination within a shared e-bike system, achieving an approximate accuracy rate of over 97.60%, all without requiring direct user input. Experiments have proven that this predictive capability empowers U-Park to suggest the optimal parking station to users based on the availability of predicted parking spaces, improving the probability of obtaining a parking spot by 24.91% on average and 29.66% on maximum when parking availability is limited.
电动共享微型交通服务(ESMS)已成为交通即服务框架中的一个重要元素,为可持续交通系统做出了贡献。然而,现有的 ESMS 在设计上面临着明显的挑战,如在整合、透明度和以用户为中心的方法上存在不足,导致运营成本增加和服务质量下降。ESMS 的一个关键运营问题是停车问题,尤其是在用户接近目的地时确保停车位的可用性。例如,最近的一项研究表明,爱尔兰都柏林近 13% 的共享电动自行车用户在停放电动自行车时遇到困难,原因是规划和引导不足。为此,我们介绍了 U-Park,这是一个以用户为中心、专为 ESMS 设计的智能停车推荐系统,通过分析用户的历史移动数据、行程轨迹和停车位可用性,为用户提供量身定制的推荐。我们介绍了该系统的架构、实施方法,并使用爱尔兰共享电动自行车提供商 MOBY Bikes 的真实数据对其性能进行了评估。我们的结果表明,U-Park 能够预测用户在共享电动自行车系统中的目的地,准确率超过 97.60%,而且无需用户直接输入。实验证明,这种预测能力使 U-Park 能够根据预测停车位的可用性向用户推荐最佳停车站,在停车位有限的情况下,获得停车位的概率平均提高了 24.91%,最高提高了 29.66%。
{"title":"U-Park: A User-Centric Smart Parking Recommendation System for Electric Shared Micromobility Services","authors":"Sen Yan;Noel E. O’Connor;Mingming Liu","doi":"10.1109/TAI.2024.3428513","DOIUrl":"https://doi.org/10.1109/TAI.2024.3428513","url":null,"abstract":"Electric shared micromobility services (ESMSs) has become a vital element within the mobility as a service framework, contributing to sustainable transportation systems. However, existing ESMS face notable design challenges such as shortcomings in integration, transparency, and user-centered approaches, resulting in increased operational costs and decreased service quality. A key operational issue for ESMS revolves around parking, particularly ensuring the availability of parking spaces as users approach their destinations. For instance, a recent study illustrated that nearly 13% of shared e-bike users in Dublin, Ireland, encounter difficulties parking their e-bikes due to inadequate planning and guidance. In response, we introduce U-Park, a user-centric smart parking recommendation system designed for ESMS, providing tailored recommendations to users by analyzing their historical mobility data, trip trajectory, and parking space availability. We present the system architecture, implement it, and evaluate its performance using real-world data from an Irish-based shared e-bike provider, MOBY Bikes. Our results illustrate U-Park's ability to predict a user's destination within a shared e-bike system, achieving an approximate accuracy rate of over 97.60%, all without requiring direct user input. Experiments have proven that this predictive capability empowers U-Park to suggest the optimal parking station to users based on the availability of predicted parking spaces, improving the probability of obtaining a parking spot by 24.91% on average and 29.66% on maximum when parking availability is limited.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 10","pages":"5179-5193"},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10599560","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Artificial Intelligence Publication Information IEEE Transactions on Artificial Intelligence 出版信息
Pub Date : 2024-07-16 DOI: 10.1109/TAI.2024.3422574
{"title":"IEEE Transactions on Artificial Intelligence Publication Information","authors":"","doi":"10.1109/TAI.2024.3422574","DOIUrl":"https://doi.org/10.1109/TAI.2024.3422574","url":null,"abstract":"","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 7","pages":"C2-C2"},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10599851","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141725577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Human-in-the-Middle Attack Against Object Detection Systems 针对物体检测系统的中间人攻击
Pub Date : 2024-07-15 DOI: 10.1109/TAI.2024.3428520
Han Wu;Sareh Rowlands;Johan Wahlström
Object detection systems using deep learning models have become increasingly popular in robotics thanks to the rising power of central processing units (CPUs) and graphics processing units (GPUs) in embedded systems. However, these models are susceptible to adversarial attacks. While some attacks are limited by strict assumptions on access to the detection system, we propose a novel hardware attack inspired by Man-in-the-Middle attacks in cryptography. This attack generates a universal adversarial perturbations (UAPs) and injects the perturbation between the universal serial bus (USB) camera and the detection system via a hardware attack. Besides, prior research is misled by an evaluation metric that measures the model accuracy rather than the attack performance. In combination with our proposed evaluation metrics, we significantly increased the strength of adversarial perturbations. These findings raise serious concerns for applications of deep learning models in safety-critical systems, such as autonomous driving.
由于嵌入式系统中中央处理器(CPU)和图形处理器(GPU)的性能不断提升,使用深度学习模型的物体检测系统在机器人领域越来越受欢迎。然而,这些模型容易受到恶意攻击。有些攻击受限于访问检测系统的严格假设,而我们提出的新型硬件攻击则受密码学中的 "中间人 "攻击启发。这种攻击会产生一种通用对抗扰动(UAPs),并通过硬件攻击将扰动注入通用串行总线(USB)摄像头和检测系统之间。此外,先前的研究还受到了衡量模型准确性而非攻击性能的评估指标的误导。结合我们提出的评估指标,我们大大提高了对抗性扰动的强度。这些发现引起了人们对深度学习模型在自动驾驶等安全关键系统中应用的严重担忧。
{"title":"A Human-in-the-Middle Attack Against Object Detection Systems","authors":"Han Wu;Sareh Rowlands;Johan Wahlström","doi":"10.1109/TAI.2024.3428520","DOIUrl":"https://doi.org/10.1109/TAI.2024.3428520","url":null,"abstract":"Object detection systems using deep learning models have become increasingly popular in robotics thanks to the rising power of central processing units (CPUs) and graphics processing units (GPUs) in embedded systems. However, these models are susceptible to adversarial attacks. While some attacks are limited by strict assumptions on access to the detection system, we propose a novel hardware attack inspired by Man-in-the-Middle attacks in cryptography. This attack generates a universal adversarial perturbations (UAPs) and injects the perturbation between the universal serial bus (USB) camera and the detection system via a hardware attack. Besides, prior research is misled by an evaluation metric that measures the model accuracy rather than the attack performance. In combination with our proposed evaluation metrics, we significantly increased the strength of adversarial perturbations. These findings raise serious concerns for applications of deep learning models in safety-critical systems, such as autonomous driving.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 10","pages":"4884-4892"},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
360° High-Resolution Depth Estimation via Uncertainty-Aware Structural Knowledge Transfer 通过不确定性感知结构知识转移实现 360° 高分辨率深度估算
Pub Date : 2024-07-12 DOI: 10.1109/TAI.2024.3427068
Zidong Cao;Hao Ai;Athanasios V. Vasilakos;Lin Wang
To predict high-resolution (HR) omnidirectional depth maps, existing methods typically leverage HR omnidirectional image (ODI) as the input via fully supervised learning. However, in practice, taking HR ODI as input is undesired due to resource-constrained devices. In addition, depth maps are often with lower resolution than color images. Therefore, in this article, we explore for the first time to estimate the HR omnidirectional depth directly from a low-resolution (LR) ODI, when no HR depth ground truth (GT) map is available. Our key idea is to transfer the scene structural knowledge from the HR image modality and the corresponding LR depth maps to achieve the goal of HR depth estimation without any extra inference cost. Specifically, we introduce ODI super-resolution (SR) as an auxiliary task and train both tasks collaboratively in a weakly supervised manner to boost the performance of HR depth estimation. The ODI SR task extracts the scene structural knowledge via uncertainty estimation. Buttressed by this, a scene structural knowledge transfer (SSKT) module is proposed with two key components. First, we employ a cylindrical implicit interpolation function (CIIF) to learn cylindrical neural interpolation weights for feature up-sampling and share the parameters of CIIFs between the two tasks. Then, we propose a feature distillation (FD) loss that provides extra structural regularization to help the HR depth estimation task learn more scene structural knowledge. Extensive experiments demonstrate that our weakly supervised method outperforms baseline methods, and even achieves comparable performance with the fully supervised methods.
为了预测高分辨率(HR)全向深度图,现有方法通常通过完全监督学习,利用 HR 全向图像(ODI)作为输入。然而,在实际应用中,由于设备资源有限,将高分辨率全向图像作为输入是不可取的。此外,深度图的分辨率通常低于彩色图像。因此,在本文中,我们首次探索了在没有高清深度地面实况(GT)图的情况下,直接从低分辨率(LR)ODI 估算高清全向深度的方法。我们的主要想法是从高分辨率图像模式和相应的低分辨率深度图中转移场景结构知识,以实现高分辨率深度估计的目标,而无需任何额外的推理成本。具体来说,我们引入 ODI 超分辨率(SR)作为辅助任务,并以弱监督的方式对两个任务进行协同训练,以提高 HR 深度估计的性能。ODI SR 任务通过不确定性估计提取场景结构知识。在此基础上,我们提出了一个场景结构知识转移(SSKT)模块,该模块由两个关键部分组成。首先,我们采用圆柱隐式插值函数(CIIF)来学习用于特征上采样的圆柱神经插值权重,并在两个任务之间共享 CIIF 的参数。然后,我们提出了一种提供额外结构正则化的特征蒸馏(FD)损失,以帮助 HR 深度估计任务学习更多场景结构知识。大量实验证明,我们的弱监督方法优于基线方法,甚至达到了与完全监督方法相当的性能。
{"title":"360° High-Resolution Depth Estimation via Uncertainty-Aware Structural Knowledge Transfer","authors":"Zidong Cao;Hao Ai;Athanasios V. Vasilakos;Lin Wang","doi":"10.1109/TAI.2024.3427068","DOIUrl":"https://doi.org/10.1109/TAI.2024.3427068","url":null,"abstract":"To predict high-resolution (HR) omnidirectional depth maps, existing methods typically leverage HR omnidirectional image (ODI) as the input via fully supervised learning. However, in practice, taking HR ODI as input is undesired due to resource-constrained devices. In addition, depth maps are often with lower resolution than color images. Therefore, in this article, we explore for the first time to estimate the HR omnidirectional depth directly from a low-resolution (LR) ODI, when no HR depth ground truth (GT) map is available. Our key idea is to transfer the scene structural knowledge from the HR image modality and the corresponding LR depth maps to achieve the goal of HR depth estimation without any extra inference cost. Specifically, we introduce ODI super-resolution (SR) as an auxiliary task and train both tasks collaboratively in a weakly supervised manner to boost the performance of HR depth estimation. The ODI SR task extracts the scene structural knowledge via uncertainty estimation. Buttressed by this, a scene structural knowledge transfer (SSKT) module is proposed with two key components. First, we employ a cylindrical implicit interpolation function (CIIF) to learn cylindrical neural interpolation weights for feature up-sampling and share the parameters of CIIFs between the two tasks. Then, we propose a feature distillation (FD) loss that provides extra structural regularization to help the HR depth estimation task learn more scene structural knowledge. Extensive experiments demonstrate that our weakly supervised method outperforms baseline methods, and even achieves comparable performance with the fully supervised methods.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5392-5402"},"PeriodicalIF":0.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1