IEEE Access最新文献_第7页

Adaptive Fault-Tolerant Thrust Allocation for Underwater Vehicles With Resource Constraints 具有资源约束的水下航行器自适应容错推力分配

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649761

Waseem Akram;Muhayy Ud Din;Tarek Taha;Irfan Hussain

Thruster allocation is critical for the reliable operation of underwater vehicles, particularly under actuator degradation, power limitations, and thermal stress. Existing methods, such as pseudo-inverse or standard quadratic programming (QP)-based approaches, mainly minimize allocation error or energy consumption but often overlook real-time degradation and resource constraints. In this paper, we propose an adaptive fault-tolerant thrust allocation framework integrated with a PID plus Sliding Mode Control (PID+SMC) law for robust trajectory tracking. The approach leverages convex optimization to simultaneously enforce: 1) residual-driven health adaptation that down-weights degraded thrusters online; 2) power-aware allocation ensuring operation within a global energy budget; and 3) thermal-aware constraints that actively prevent overheating. A lightweight residual filter continuously updates thruster health indices, enabling rapid reallocation under faults and efficiency loss. Simulation results across nominal, power-limited, thermal-limited, faulted, and combined scenarios show that the proposed method reduces trajectory tracking error by up to 4.3% and completely eliminates power and thermal violations compared to conventional baselines. This unified framework establishes a foundation for real-time, safety-aware thruster management in marine robotics.

推进器的配置对于水下航行器的可靠运行至关重要，特别是在执行器退化、功率限制和热应力的情况下。现有的方法，如基于伪逆或标准二次规划（QP）的方法，主要是最小化分配误差或能量消耗，但往往忽略了实时退化和资源约束。本文提出了一种集成PID+滑模控制（PID+SMC）律的自适应容错推力分配框架，用于鲁棒轨迹跟踪。该方法利用凸优化来同时执行：1)残差驱动的健康适应，在线降低退化推进器的权重；2)电力感知分配，确保在全球能源预算范围内运行；3)热感知约束，主动防止过热。轻量级残留过滤器不断更新推进器健康指数，在故障和效率损失下实现快速重新分配。在标称、功率限制、热限制、故障和组合场景下的仿真结果表明，与传统基线相比，该方法将轨迹跟踪误差降低了4.3%，并完全消除了功率和热违规。这个统一的框架为船舶机器人的实时、安全感知推进器管理奠定了基础。

{"title":"Adaptive Fault-Tolerant Thrust Allocation for Underwater Vehicles With Resource Constraints","authors":"Waseem Akram;Muhayy Ud Din;Tarek Taha;Irfan Hussain","doi":"10.1109/ACCESS.2025.3649761","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649761","url":null,"abstract":"Thruster allocation is critical for the reliable operation of underwater vehicles, particularly under actuator degradation, power limitations, and thermal stress. Existing methods, such as pseudo-inverse or standard quadratic programming (QP)-based approaches, mainly minimize allocation error or energy consumption but often overlook real-time degradation and resource constraints. In this paper, we propose an adaptive fault-tolerant thrust allocation framework integrated with a PID plus Sliding Mode Control (PID+SMC) law for robust trajectory tracking. The approach leverages convex optimization to simultaneously enforce: 1) residual-driven health adaptation that down-weights degraded thrusters online; 2) power-aware allocation ensuring operation within a global energy budget; and 3) thermal-aware constraints that actively prevent overheating. A lightweight residual filter continuously updates thruster health indices, enabling rapid reallocation under faults and efficiency loss. Simulation results across nominal, power-limited, thermal-limited, faulted, and combined scenarios show that the proposed method reduces trajectory tracking error by up to 4.3% and completely eliminates power and thermal violations compared to conventional baselines. This unified framework establishes a foundation for real-time, safety-aware thruster management in marine robotics.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"1341-1357"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318569","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Common-Mode Voltage Elimination and Zero-Sequence Circulating Current Reduction of Parallel Back-to-Back Converters 并联背对背变换器的共模电压消除和零序循环电流减小

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649536

Reza Farajpour;Mojtaba Sarparast;Jafar Adabi;Mohammad Rezanejad;Edris Pouresmaeil

Parallel back-to-back converters are highly demanded in many high-power applications such as adjustable speed drive (ASD) systems, which reduce harmonics and improve the power factor and reliability compared to single two-level converters. It is evident that common-mode voltage (CMV) is the root cause of many challenges in ASD systems, such as shaft voltage and bearing damage, which may reduce equipment lifespan. On the other hand, Zero Sequence Circulating Current (ZSCC) leads to an additional current of switches which increases power loss and decreases the current capacity of converters. Simultaneous reduction of these two critical issues has to be considered in any switching strategy. In this regard, this paper presents a switching strategy based on a modified three-level space vector modulation scheme, which completely eliminates the common-mode voltage (CMV = 0 V). Moreover, the proposed switching sequence keeps the ZSCC within a low-amplitude and fully symmetric ripple, ensuring controlled circulating-current behavior without requiring any additional hardware. The method also generates a three-level line voltage and achieves an input-current THD of 3.92%. The simulation and experimental results confirm the effectiveness of the proposed approach.

并联背靠背转换器在许多大功率应用中都有很高的需求，例如可调速驱动（ASD）系统，与单个双电平转换器相比，它可以减少谐波并提高功率因数和可靠性。很明显，共模电压（CMV）是ASD系统中许多挑战的根本原因，例如轴电压和轴承损坏，这可能会降低设备的使用寿命。另一方面，零序循环电流（ZSCC）导致开关的额外电流，这增加了功率损耗，降低了变换器的电流容量。在任何切换策略中都必须考虑同时减少这两个关键问题。为此，本文提出了一种基于改进的三电平空间矢量调制方案的开关策略，完全消除了共模电压（CMV = 0 V）。此外，所提出的开关序列使ZSCC保持在低幅度和完全对称的纹波内，确保控制循环电流行为，而不需要任何额外的硬件。该方法还产生了三电平线电压，并实现了3.92%的输入电流THD。仿真和实验结果验证了该方法的有效性。

{"title":"Common-Mode Voltage Elimination and Zero-Sequence Circulating Current Reduction of Parallel Back-to-Back Converters","authors":"Reza Farajpour;Mojtaba Sarparast;Jafar Adabi;Mohammad Rezanejad;Edris Pouresmaeil","doi":"10.1109/ACCESS.2025.3649536","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649536","url":null,"abstract":"Parallel back-to-back converters are highly demanded in many high-power applications such as adjustable speed drive (ASD) systems, which reduce harmonics and improve the power factor and reliability compared to single two-level converters. It is evident that common-mode voltage (CMV) is the root cause of many challenges in ASD systems, such as shaft voltage and bearing damage, which may reduce equipment lifespan. On the other hand, Zero Sequence Circulating Current (ZSCC) leads to an additional current of switches which increases power loss and decreases the current capacity of converters. Simultaneous reduction of these two critical issues has to be considered in any switching strategy. In this regard, this paper presents a switching strategy based on a modified three-level space vector modulation scheme, which completely eliminates the common-mode voltage (CMV = 0 V). Moreover, the proposed switching sequence keeps the ZSCC within a low-amplitude and fully symmetric ripple, ensuring controlled circulating-current behavior without requiring any additional hardware. The method also generates a three-level line voltage and achieves an input-current THD of 3.92%. The simulation and experimental results confirm the effectiveness of the proposed approach.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"1256-1268"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318331","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing Stock Market Prediction With Hybrid Deep Learning: Integrating LSTM, Transformer Attention, Federated Learning, and Sentiment Analysis 用混合深度学习增强股票市场预测：集成LSTM、变压器注意、联邦学习和情绪分析

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649668

Yousef Nejatbakhsh;Malihe Aliasgari

Accurate stock market prediction remains a critical yet challenging task due to the highly non-linear, volatile, and sentiment-driven nature of financial markets. In this paper, we present a hybrid deep learning framework that integrates long-short-term memory (LSTM) networks with Transformer-based attention mechanisms, sentiment analysis from financial news, and a privacy-preserving Federated Learning (FL) strategy. First, we benchmark traditional forecasting approaches, including ARIMA, SARIMAX, Prophet, Random Forest, and Support Vector Regression, against the baseline LSTM models. Our results show that LSTMs consistently outperform conventional methods in capturing temporal dependencies. To further enhance predictive accuracy, we incorporate Transformer attention to improve long-range dependency modeling and apply sentiment analysis using FinBERT-tone to embed market sentiment signals into the model. Finally, we simulate a Federated Learning environment, enabling decentralized model training without sharing raw financial data, thus addressing privacy concerns in the financial domain. Experimental results in ten major technology companies (Tesla, Apple, Amazon, Microsoft, Google, etc.) demonstrate that our hybrid model achieves superior short-term forecasting performance, with an average

$R^{2}$

variance score of 0.91 across ten major technology companies and a trend precision of

$65.36~%$

, demonstrating strong prediction performance for short-term stock forecasting. These findings highlight the potential of combining deep sequential models, attention mechanisms, and privacy-sensitive training strategies for robust and secure stock market forecasting.

由于金融市场具有高度非线性、波动性和情绪驱动的特性，准确的股市预测仍然是一项关键而具有挑战性的任务。在本文中，我们提出了一个混合深度学习框架，该框架将长短期记忆（LSTM）网络与基于transformer的注意力机制、金融新闻的情绪分析和保护隐私的联邦学习（FL）策略集成在一起。首先，我们将传统的预测方法，包括ARIMA、SARIMAX、Prophet、Random Forest和支持向量回归，与基线LSTM模型进行比较。我们的结果表明，lstm在捕获时间依赖性方面始终优于传统方法。为了进一步提高预测的准确性，我们结合了Transformer注意力来改进远程依赖关系建模，并使用FinBERT-tone应用情绪分析将市场情绪信号嵌入到模型中。最后，我们模拟了一个联邦学习环境，在不共享原始财务数据的情况下实现分散的模型训练，从而解决了金融领域的隐私问题。在10家主要科技公司（特斯拉、苹果、亚马逊、微软、b谷歌等）的实验结果表明，我们的混合模型具有较好的短期预测性能，10家主要科技公司的平均$R^{2}$方差得分为0.91，趋势精度为$65.36~%$，对短期股票预测具有较强的预测性能。这些发现强调了将深度序列模型、注意力机制和隐私敏感训练策略结合起来进行稳健和安全的股市预测的潜力。

{"title":"Enhancing Stock Market Prediction With Hybrid Deep Learning: Integrating LSTM, Transformer Attention, Federated Learning, and Sentiment Analysis","authors":"Yousef Nejatbakhsh;Malihe Aliasgari","doi":"10.1109/ACCESS.2025.3649668","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649668","url":null,"abstract":"Accurate stock market prediction remains a critical yet challenging task due to the highly non-linear, volatile, and sentiment-driven nature of financial markets. In this paper, we present a hybrid deep learning framework that integrates long-short-term memory (LSTM) networks with Transformer-based attention mechanisms, sentiment analysis from financial news, and a privacy-preserving Federated Learning (FL) strategy. First, we benchmark traditional forecasting approaches, including ARIMA, SARIMAX, Prophet, Random Forest, and Support Vector Regression, against the baseline LSTM models. Our results show that LSTMs consistently outperform conventional methods in capturing temporal dependencies. To further enhance predictive accuracy, we incorporate Transformer attention to improve long-range dependency modeling and apply sentiment analysis using FinBERT-tone to embed market sentiment signals into the model. Finally, we simulate a Federated Learning environment, enabling decentralized model training without sharing raw financial data, thus addressing privacy concerns in the financial domain. Experimental results in ten major technology companies (Tesla, Apple, Amazon, Microsoft, Google, etc.) demonstrate that our hybrid model achieves superior short-term forecasting performance, with an average <inline-formula> <tex-math>$R^{2}$ </tex-math></inline-formula> variance score of 0.91 across ten major technology companies and a trend precision of <inline-formula> <tex-math>$65.36~%$ </tex-math></inline-formula>, demonstrating strong prediction performance for short-term stock forecasting. These findings highlight the potential of combining deep sequential models, attention mechanisms, and privacy-sensitive training strategies for robust and secure stock market forecasting.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"3926-3942"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318561","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robust and Efficient Autonomous Charging Station for Uncrewed Aerial Vehicles Under Large Landing Inaccuracies 大着陆误差下无人飞行器的鲁棒高效自主充电站

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649751

Jeongwoo Son;Chansu Kim;Sang Hoon Kang

This paper proposes an electrical contact-based autonomous charging station for uncrewed aerial vehicles (UAVs) that reliably initiates charging regardless of landing position and orientation inaccuracies. Unlike existing UAV charging methods – which may suffer from efficiency losses due to wireless power transfer or require mechanical actuators, specially shaped structures, or diode bridges – the proposed autonomous charging station uses modular units with Hall-effect sensors to detect a magnet mounted on the UAV’s positive charging electrode. Thus, the proposed charging station was designed to allow direct electrical contact without rectifier diodes or actuators, reducing unnecessary losses. Across all 832 possible landing poses of the UAV, the power transfer efficiency exceeded 98.34% – surpassing the 91.02% reported in prior work; in outdoor repeated-flight tests, charging initiated and succeeded in all trials (30/30) with randomized landing positions and orientations. Preliminary field trials at a 765-kV substation demonstrated feasibility under elevated electromagnetic interference. These results highlight the robustness of the proposed system to substantial landing inaccuracies, providing a strong foundation for prolonged, unattended UAV missions in demanding real-world environments.

本文提出了一种基于电接触的无人驾驶飞行器（uav）自主充电站，该充电站可以在着陆位置和方向不准确的情况下可靠地启动充电。与现有的无人机充电方法不同——由于无线电力传输或需要机械致动器、特殊形状的结构或二极管桥，可能会遭受效率损失——拟议的自主充电站使用带有霍尔效应传感器的模块化单元来检测安装在无人机正电荷电极上的磁铁。因此，拟议的充电站被设计成允许直接电接触而不需要整流二极管或致动器，从而减少不必要的损失。在所有832种可能的着陆姿态中，动力传输效率超过98.34%，超过了先前工作报告的91.02%；在室外重复飞行试验中，随机着陆位置和方向的所有试验（30/30）都启动了充电并成功。在765千伏变电站进行的初步现场试验证明了在高强度电磁干扰下的可行性。这些结果突出了所提出的系统对大量着陆不准确性的鲁棒性，为在苛刻的现实环境中进行长时间无人值守的无人机任务提供了坚实的基础。

{"title":"Robust and Efficient Autonomous Charging Station for Uncrewed Aerial Vehicles Under Large Landing Inaccuracies","authors":"Jeongwoo Son;Chansu Kim;Sang Hoon Kang","doi":"10.1109/ACCESS.2025.3649751","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649751","url":null,"abstract":"This paper proposes an electrical contact-based autonomous charging station for uncrewed aerial vehicles (UAVs) that reliably initiates charging regardless of landing position and orientation inaccuracies. Unlike existing UAV charging methods – which may suffer from efficiency losses due to wireless power transfer or require mechanical actuators, specially shaped structures, or diode bridges – the proposed autonomous charging station uses modular units with Hall-effect sensors to detect a magnet mounted on the UAV’s positive charging electrode. Thus, the proposed charging station was designed to allow direct electrical contact without rectifier diodes or actuators, reducing unnecessary losses. Across all 832 possible landing poses of the UAV, the power transfer efficiency exceeded 98.34% – surpassing the 91.02% reported in prior work; in outdoor repeated-flight tests, charging initiated and succeeded in all trials (30/30) with randomized landing positions and orientations. Preliminary field trials at a 765-kV substation demonstrated feasibility under elevated electromagnetic interference. These results highlight the robustness of the proposed system to substantial landing inaccuracies, providing a strong foundation for prolonged, unattended UAV missions in demanding real-world environments.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"4027-4037"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318570","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adaptive Robotic Behavior in Industrial Human–Robot Collaboration: A Systematic Review of Taxonomies, Enabling Mechanisms, and Research Frontiers 工业人机协作中的自适应机器人行为：分类、使能机制和研究前沿的系统回顾

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649702

Bsher Karbouj;Rajwinder Garha;Konstantin KeßLer;Jörg Krüger

The inherent variability in human performance introduces stochastic perturbations into manufacturing environments, undermining the seamless coordination required for effective human-robot collaboration (HRC) systems. While human cognitive flexibility enhances adaptability, it simultaneously acts as a source of operational uncertainty, complicating the modeling and optimization of integrated robotic systems. Given these challenges, there is an urgent need to substantially expand the adaptability of robotic systems through real-time detection, algorithmic analysis and dynamic behavioral adjustments in response to human performance fluctuations. The systematic development of such systems capable of precisely detecting task-specific variations, analyzing them via advanced AI algorithms and adapting their behavior accordingly remains a critical focus of contemporary research. To evaluate progress in this domain, this study conducts a systematic literature review, synthesizing advancements across 124 publications and identifying underexplored research frontiers. The findings reveal a persistent misalignment between current technical capabilities and the requirements of adaptive collaboration in dynamic industrial environments. Key gaps include the absence of explainable AI frameworks for transparent decision-making, limited generalizability of adaptive control architectures and a lack of proactive strategies that anticipate rather than merely react to performance deviations.

人类表现的内在可变性将随机扰动引入制造环境，破坏了有效人机协作（HRC）系统所需的无缝协调。虽然人类的认知灵活性增强了适应性，但它同时也是操作不确定性的来源，使集成机器人系统的建模和优化复杂化。鉴于这些挑战，迫切需要通过实时检测、算法分析和动态行为调整来响应人类表现的波动，从而大幅扩大机器人系统的适应性。这种系统的系统开发能够精确检测特定任务的变化，通过先进的人工智能算法对其进行分析，并相应地调整其行为，这仍然是当代研究的关键焦点。为了评估这一领域的进展，本研究进行了系统的文献综述，综合了124篇出版物的进展，并确定了未被探索的研究前沿。研究结果揭示了当前技术能力与动态工业环境中适应性协作需求之间的持续错位。主要差距包括缺乏可解释的人工智能透明决策框架，自适应控制体系结构的可泛化性有限，以及缺乏预测而不仅仅是对性能偏差做出反应的主动策略。

{"title":"Adaptive Robotic Behavior in Industrial Human–Robot Collaboration: A Systematic Review of Taxonomies, Enabling Mechanisms, and Research Frontiers","authors":"Bsher Karbouj;Rajwinder Garha;Konstantin KeßLer;Jörg Krüger","doi":"10.1109/ACCESS.2025.3649702","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649702","url":null,"abstract":"The inherent variability in human performance introduces stochastic perturbations into manufacturing environments, undermining the seamless coordination required for effective human-robot collaboration (HRC) systems. While human cognitive flexibility enhances adaptability, it simultaneously acts as a source of operational uncertainty, complicating the modeling and optimization of integrated robotic systems. Given these challenges, there is an urgent need to substantially expand the adaptability of robotic systems through real-time detection, algorithmic analysis and dynamic behavioral adjustments in response to human performance fluctuations. The systematic development of such systems capable of precisely detecting task-specific variations, analyzing them via advanced AI algorithms and adapting their behavior accordingly remains a critical focus of contemporary research. To evaluate progress in this domain, this study conducts a systematic literature review, synthesizing advancements across 124 publications and identifying underexplored research frontiers. The findings reveal a persistent misalignment between current technical capabilities and the requirements of adaptive collaboration in dynamic industrial environments. Key gaps include the absence of explainable AI frameworks for transparent decision-making, limited generalizability of adaptive control architectures and a lack of proactive strategies that anticipate rather than merely react to performance deviations.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"1398-1422"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318881","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Design and Implementation of ROS2 Security Module for Performance and Security Harmonization 面向性能与安全协调的ROS2安全模块的设计与实现

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649485

Jeong Hyeon Park;Hong Seong Park

The security of Robot Operating System 2 (ROS 2) is crucial for ensuring the safety of individual robotic systems and the reliability of the environments in which they operate. Although Secure ROS 2 (SROS2) enhances security via Data Distribution Service (DDS) Security for authentication, access control, and data encryption, it has several limitations. The complexity of managing security artifacts and the degradation of communication performance are two primary concerns. To address these challenges, we propose the ROS 2 Security (ROS2Sec) module, which enhances ROS 2 security while minimizing communication performance degradation. ROS2Sec introduces centralized authentication management and group-based access control, thus reducing the number of security artifacts from seven for SROS2 to only three, thereby simplifying management. Additionally, ROS2Sec employs the Advanced Encryption Standard in Galois/Counter Mode at the ROS 2 message level to optimize data confidentiality while minimizing overhead. Experimental results demonstrate that the proposed ROS2Sec prevents unauthorized access by malicious ROS 2 nodes, reduces the mean communication latency by approximately 9% compared with that of SROS2, and maintains stable performance even as the number of subscribers increases. These findings confirm that ROS2Sec balances security and communication performance and is a practical solution for secure and efficient ROS 2-based robotic systems.

机器人操作系统2 （ROS 2）的安全性对于确保单个机器人系统的安全性及其运行环境的可靠性至关重要。尽管安全ROS2 （SROS2）通过数据分发服务（DDS）安全性来增强身份验证、访问控制和数据加密的安全性，但它有几个限制。管理安全构件的复杂性和通信性能的下降是两个主要问题。为了解决这些挑战，我们提出了ROS2安全性（ROS2Sec）模块，该模块增强了ROS2安全性，同时最大限度地降低了通信性能的下降。ROS2Sec引入了集中的身份验证管理和基于组的访问控制，从而将安全构件的数量从SROS2的7个减少到只有3个，从而简化了管理。此外，ROS2Sec在ros2消息级别采用伽罗瓦/计数器模式的高级加密标准，以优化数据机密性，同时最大限度地减少开销。实验结果表明，所提出的ROS2Sec可以防止恶意ROS2节点的未经授权访问，与SROS2相比，平均通信延迟减少了约9%，并且即使用户数量增加也能保持稳定的性能。这些发现证实，ROS2Sec平衡了安全性和通信性能，是安全高效的基于ROS2的机器人系统的实用解决方案。

{"title":"Design and Implementation of ROS2 Security Module for Performance and Security Harmonization","authors":"Jeong Hyeon Park;Hong Seong Park","doi":"10.1109/ACCESS.2025.3649485","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649485","url":null,"abstract":"The security of Robot Operating System 2 (ROS 2) is crucial for ensuring the safety of individual robotic systems and the reliability of the environments in which they operate. Although Secure ROS 2 (SROS2) enhances security via Data Distribution Service (DDS) Security for authentication, access control, and data encryption, it has several limitations. The complexity of managing security artifacts and the degradation of communication performance are two primary concerns. To address these challenges, we propose the ROS 2 Security (ROS2Sec) module, which enhances ROS 2 security while minimizing communication performance degradation. ROS2Sec introduces centralized authentication management and group-based access control, thus reducing the number of security artifacts from seven for SROS2 to only three, thereby simplifying management. Additionally, ROS2Sec employs the Advanced Encryption Standard in Galois/Counter Mode at the ROS 2 message level to optimize data confidentiality while minimizing overhead. Experimental results demonstrate that the proposed ROS2Sec prevents unauthorized access by malicious ROS 2 nodes, reduces the mean communication latency by approximately 9% compared with that of SROS2, and maintains stable performance even as the number of subscribers increases. These findings confirm that ROS2Sec balances security and communication performance and is a practical solution for secure and efficient ROS 2-based robotic systems.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"1287-1296"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318342","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Quantifying the Economic Loss and Operational Implications of Air Pollution on Grid-Connected PV Systems in the Arabian Peninsula: A Machine Learning-Based Analysis 量化阿拉伯半岛空气污染对并网光伏系统的经济损失和运行影响：基于机器学习的分析

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-30 DOI: 10.1109/ACCESS.2025.3649764

Mohammed A. Bou-Rabee;Fajer M. Alelaj;Hussain Al-Sairfi

The Arabian Peninsula is one of the regions in the world with the highest potential for solar energy development in the sense that the levels of solar irradiance are very high, making it an ideal site to install photovoltaic (PV) systems. However, this huge potential is seriously undermined by enduring environmental problems, especially the high levels of airborne particulate matter and anthropogenic pollutants that together reduce the intensity of the sun and contribute to the rapid soiling of PV modules. Though the current literature has largely concentrated on the quantification of the technical performance instability of PV systems caused by soiling phenomena, the associated extensive analysis in terms of quantifying the technical losses into quantifiable economic and operational effects on a regional level is strikingly lacking in the literature. This study builds a combined machine learning system in order to quantitatively measure the economic losses attributable to air pollution for utility-scale, grid-connected PV systems across the Gulf Cooperation Council (GCC) member states. We developed and strictly tested a Random Forest regression model with a remarkable coefficient of determination (R 2) of 98.24 that was trained on a large dataset of meteorological parameters, real-time air pollution data (PM2.52.5, PM1010, SO22, NO22, O33, CO) and the most important operational features of solar panels that occurred between 2018 and 2020. With the help of this validated predictive model, we conducted an advanced counterfactual analysis by modeling the potential power production under hypothetical conditions of clean atmosphere benchmarks. We show that air pollution is one of the contributors to the loss of 8.5% to 12.3% of annual energy production in the region; or more simply, it is a significant loss that can translate to huge financial fines, which can significantly affect the economics of projects. In addition, we propose a new data-driven predictive cleaning scheduling algorithm that proves capable of cutting operational expenditures (OPEX) by up to 25 percent relative to traditional calendar-driven cleaning schedules. The findings are empirically based, critical to renewable energy investors, utility grid operators, and policymakers, and they categorically highlight the significant economic necessity to set up wide-ranging air pollution reduction measures, even as operation and maintenance (O&M) protocols in arid, dust-prone geographical settings run to their optimum.

阿拉伯半岛是世界上太阳能发展潜力最大的地区之一，因为太阳辐照度非常高，使其成为安装光伏系统的理想地点。然而，这种巨大的潜力被长期存在的环境问题严重破坏，尤其是空气中高浓度的颗粒物和人为污染物，它们共同降低了太阳的强度，并导致光伏组件迅速被污染。虽然目前的文献主要集中在对由污染现象引起的光伏系统技术性能不稳定性的量化，但在将技术损失量化为可量化的区域经济和运营影响方面，相关的广泛分析在文献中明显缺乏。本研究建立了一个联合机器学习系统，以定量衡量海湾合作委员会（GCC）成员国公用事业规模的并网光伏系统因空气污染造成的经济损失。我们开发并严格测试了随机森林回归模型，该模型具有显著的决定系数（r2）为98.24，该模型基于大型数据集，包括气象参数、实时空气污染数据（pm2.5 52.5、PM1010、SO22、NO22、O33、CO）以及2018年至2020年间太阳能电池板的最重要运行特征。借助这一经过验证的预测模型，我们通过对清洁大气基准假设条件下的潜在发电量进行建模，进行了先进的反事实分析。研究表明，空气污染是造成该地区年能源产量损失8.5%至12.3%的原因之一；或者更简单地说，这是一项重大损失，可能会转化为巨额罚款，这可能会严重影响项目的经济效益。此外，我们提出了一种新的数据驱动的预测清洁调度算法，与传统的日历驱动的清洁计划相比，该算法能够将运营支出（OPEX）削减高达25%。这些发现是基于经验的，对可再生能源投资者、公用事业电网运营商和政策制定者至关重要，它们明确强调了建立广泛的空气污染减少措施的重大经济必要性，即使在干旱、易沙尘的地理环境中，运营和维护（O&M）协议达到了最佳状态。

{"title":"Quantifying the Economic Loss and Operational Implications of Air Pollution on Grid-Connected PV Systems in the Arabian Peninsula: A Machine Learning-Based Analysis","authors":"Mohammed A. Bou-Rabee;Fajer M. Alelaj;Hussain Al-Sairfi","doi":"10.1109/ACCESS.2025.3649764","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649764","url":null,"abstract":"The Arabian Peninsula is one of the regions in the world with the highest potential for solar energy development in the sense that the levels of solar irradiance are very high, making it an ideal site to install photovoltaic (PV) systems. However, this huge potential is seriously undermined by enduring environmental problems, especially the high levels of airborne particulate matter and anthropogenic pollutants that together reduce the intensity of the sun and contribute to the rapid soiling of PV modules. Though the current literature has largely concentrated on the quantification of the technical performance instability of PV systems caused by soiling phenomena, the associated extensive analysis in terms of quantifying the technical losses into quantifiable economic and operational effects on a regional level is strikingly lacking in the literature. This study builds a combined machine learning system in order to quantitatively measure the economic losses attributable to air pollution for utility-scale, grid-connected PV systems across the Gulf Cooperation Council (GCC) member states. We developed and strictly tested a Random Forest regression model with a remarkable coefficient of determination (R 2) of 98.24 that was trained on a large dataset of meteorological parameters, real-time air pollution data (PM2.52.5, PM1010, SO22, NO22, O33, CO) and the most important operational features of solar panels that occurred between 2018 and 2020. With the help of this validated predictive model, we conducted an advanced counterfactual analysis by modeling the potential power production under hypothetical conditions of clean atmosphere benchmarks. We show that air pollution is one of the contributors to the loss of 8.5% to 12.3% of annual energy production in the region; or more simply, it is a significant loss that can translate to huge financial fines, which can significantly affect the economics of projects. In addition, we propose a new data-driven predictive cleaning scheduling algorithm that proves capable of cutting operational expenditures (OPEX) by up to 25 percent relative to traditional calendar-driven cleaning schedules. The findings are empirically based, critical to renewable energy investors, utility grid operators, and policymakers, and they categorically highlight the significant economic necessity to set up wide-ranging air pollution reduction measures, even as operation and maintenance (O&M) protocols in arid, dust-prone geographical settings run to their optimum.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"4180-4188"},"PeriodicalIF":3.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11318572","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

NefroAI: An Explainable and Real-Time Framework for Predicting Chronic Kidney Disease Using Diverse Machine Learning Models and Different Feature Selection Techniques NefroAI：使用多种机器学习模型和不同特征选择技术预测慢性肾脏疾病的可解释和实时框架

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-29 DOI: 10.1109/ACCESS.2025.3649006

Md Sadi Al Huda;Esrath Kanon;Md. Shahidul Khan Pappo;Md. Asraf Ali;Nasim Ahmed

Chronic Kidney Disease (CKD) is a serious health condition that progresses silently, often going undiagnosed until it reaches critical stages. Early detection is vital, but traditional diagnostic methods can be expensive, slow, and out of reach for many people. In this study, we introduce NefroAI, a real-time, explainable framework designed to predict CKD using both machine learning and deep learning techniques. We worked with three diverse datasets sourced from the UCI Machine Learning Repository and Kaggle to ensure strong generalization across populations. Our main contributions include applying thorough data preprocessing like missing value imputation, SMOTE for balancing, normalization, and using hybrid feature selection and hyperparameter tuning techniques. We also prioritized explainability, incorporating SHAP and LIME to make model decisions transparent, and deployed the model using Streamlit for easy real-time access with Real-time CKD prediction in Risk Indicators and a Download report. We implemented eight machine learning models and three deep learning models in this study. Among these models, Random Forest, K-Nearest Neighbors, and Support Vector Machine achieved 100.0% accuracy on Dataset A; SVM performed best on Dataset B with 98.7% accuracy; Random Forest, Decision Tree, K-Nearest Neighbors, AdaBoost, Logistic Regression, Naive Bayes, Support Vector Machine, XGBoost, ANN, LSTM, and RNN achieved 100% accuracy on Dataset C, showing strong consistency across datasets. While our results are promising, exploring collaborative learning across multiple data sources as a next step can enhance privacy and improve model generalizability.

慢性肾脏疾病（CKD）是一种严重的健康状况，悄无声息地发展，通常直到达到关键阶段才被诊断出来。早期检测至关重要，但传统的诊断方法可能昂贵、缓慢，而且对许多人来说遥不可及。在本研究中，我们介绍了NefroAI，这是一个实时的、可解释的框架，旨在使用机器学习和深度学习技术来预测CKD。我们使用了来自UCI机器学习存储库和Kaggle的三个不同的数据集，以确保在人群中有很强的泛化能力。我们的主要贡献包括应用全面的数据预处理，如缺失值输入、用于平衡、规范化的SMOTE，以及使用混合特征选择和超参数调优技术。我们还优先考虑了可解释性，结合SHAP和LIME使模型决策透明，并使用Streamlit部署模型，以便在风险指标和下载报告中轻松实时访问实时CKD预测。我们在本研究中实现了8个机器学习模型和3个深度学习模型。其中，随机森林模型、k近邻模型和支持向量机模型在数据集A上的准确率达到100.0%；SVM在数据集B上表现最好，准确率为98.7%；随机森林、决策树、k近邻、AdaBoost、逻辑回归、朴素贝叶斯、支持向量机、XGBoost、ANN、LSTM和RNN在数据集C上实现了100%的准确率，显示出数据集之间很强的一致性。虽然我们的结果很有希望，但下一步探索跨多个数据源的协作学习可以增强隐私性并提高模型的可泛化性。

{"title":"NefroAI: An Explainable and Real-Time Framework for Predicting Chronic Kidney Disease Using Diverse Machine Learning Models and Different Feature Selection Techniques","authors":"Md Sadi Al Huda;Esrath Kanon;Md. Shahidul Khan Pappo;Md. Asraf Ali;Nasim Ahmed","doi":"10.1109/ACCESS.2025.3649006","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649006","url":null,"abstract":"Chronic Kidney Disease (CKD) is a serious health condition that progresses silently, often going undiagnosed until it reaches critical stages. Early detection is vital, but traditional diagnostic methods can be expensive, slow, and out of reach for many people. In this study, we introduce NefroAI, a real-time, explainable framework designed to predict CKD using both machine learning and deep learning techniques. We worked with three diverse datasets sourced from the UCI Machine Learning Repository and Kaggle to ensure strong generalization across populations. Our main contributions include applying thorough data preprocessing like missing value imputation, SMOTE for balancing, normalization, and using hybrid feature selection and hyperparameter tuning techniques. We also prioritized explainability, incorporating SHAP and LIME to make model decisions transparent, and deployed the model using Streamlit for easy real-time access with Real-time CKD prediction in Risk Indicators and a Download report. We implemented eight machine learning models and three deep learning models in this study. Among these models, Random Forest, K-Nearest Neighbors, and Support Vector Machine achieved 100.0% accuracy on Dataset A; SVM performed best on Dataset B with 98.7% accuracy; Random Forest, Decision Tree, K-Nearest Neighbors, AdaBoost, Logistic Regression, Naive Bayes, Support Vector Machine, XGBoost, ANN, LSTM, and RNN achieved 100% accuracy on Dataset C, showing strong consistency across datasets. While our results are promising, exploring collaborative learning across multiple data sources as a next step can enhance privacy and improve model generalizability.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"10939-10976"},"PeriodicalIF":3.6,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11316603","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ViT-ResNet Fusion: An Explainable Hybrid Framework for High-Accuracy Multiclass Lung Disease Classification in Chest X-Rays viti - resnet融合：一种可解释的混合框架，用于胸部x光片中高精度多类别肺部疾病分类

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-29 DOI: 10.1109/ACCESS.2025.3649109

Rahul Deva;Arvind Dagur

Chest radiographs are widely used to clinically diagnose thoracic diseases such as lung opacity, pneumonia, and coronavirus disease 2019 due to their cost-effectiveness and easy access. However, accurate interpretation remains challenging because of overlapping anatomical structures, low contrast, and subtle disease manifestations. Classical deep learning models are effective but often exhibit overfitting, and weak interpretability, which restricts their clinical applicability. This paper presents an attention-driven hybrid framework that integrates contrast-limited adaptive histogram equalization-based enhancement with a dual-backbone architecture combining a vision transformer and a residual network. The vision transformer captures global contextual dependencies, while the residual network extracts local discriminative features. The concatenated representations are classified using a multi-layer perceptron and optimized end-to-end with the AdamW optimizer and a step learning rate scheduler. To improve transparency, gradient-weighted class activation mapping is used to highlight disease-relevant regions in chest radiographs. Experimental evaluation highlights that the proposed framework achieves 98.54% classification accuracy, outperforming state-of-the-art models, including EfficientNet 96.20%, DenseNet 97.50%, and a fine-tuned vision transformer 95.79%. To ensure generalizability, cross-dataset validation was conducted and trained on COVID-19 Radiography Dataset. The same model was tested on an independent COVID–Pneumonia–Normal chest X-ray dataset which achieved an accuracy of 94.07% and demonstrated good performance over heterogeneous imaging sources. These findings confirm the proposed framework’s robustness, interpretability, and suitability for real-time clinical decision support in both pandemic and routine diagnostic settings.

胸部x线片因其成本效益高且易于获取，被广泛用于临床诊断肺部混浊、肺炎和2019冠状病毒病等胸部疾病。然而，由于重叠的解剖结构，低对比度和微妙的疾病表现，准确的解释仍然具有挑战性。经典的深度学习模型是有效的，但往往表现出过拟合和弱可解释性，这限制了它们的临床适用性。本文提出了一种注意力驱动的混合框架，该框架将基于对比度限制的自适应直方图均衡化增强与结合视觉变压器和残差网络的双主干架构相结合。视觉转换器捕获全局上下文依赖关系，而残差网络提取局部判别特征。使用多层感知器对连接的表示进行分类，并使用AdamW优化器和步进学习率调度器进行端到端优化。为了提高透明度，使用梯度加权类激活映射来突出胸片上与疾病相关的区域。实验评估表明，所提出的框架达到了98.54%的分类准确率，优于最先进的模型，包括效率网96.20%,DenseNet 97.50%和微调视觉变压器95.79%。为了确保通用性，对COVID-19放射学数据集进行了跨数据集验证和培训。在独立的covid -肺炎-正常胸部x线数据集上对同一模型进行了测试，准确率达到94.07%，在异构成像源上表现良好。这些发现证实了所提出的框架在大流行和常规诊断环境中对实时临床决策支持的稳健性、可解释性和适用性。

{"title":"ViT-ResNet Fusion: An Explainable Hybrid Framework for High-Accuracy Multiclass Lung Disease Classification in Chest X-Rays","authors":"Rahul Deva;Arvind Dagur","doi":"10.1109/ACCESS.2025.3649109","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649109","url":null,"abstract":"Chest radiographs are widely used to clinically diagnose thoracic diseases such as lung opacity, pneumonia, and coronavirus disease 2019 due to their cost-effectiveness and easy access. However, accurate interpretation remains challenging because of overlapping anatomical structures, low contrast, and subtle disease manifestations. Classical deep learning models are effective but often exhibit overfitting, and weak interpretability, which restricts their clinical applicability. This paper presents an attention-driven hybrid framework that integrates contrast-limited adaptive histogram equalization-based enhancement with a dual-backbone architecture combining a vision transformer and a residual network. The vision transformer captures global contextual dependencies, while the residual network extracts local discriminative features. The concatenated representations are classified using a multi-layer perceptron and optimized end-to-end with the AdamW optimizer and a step learning rate scheduler. To improve transparency, gradient-weighted class activation mapping is used to highlight disease-relevant regions in chest radiographs. Experimental evaluation highlights that the proposed framework achieves 98.54% classification accuracy, outperforming state-of-the-art models, including EfficientNet 96.20%, DenseNet 97.50%, and a fine-tuned vision transformer 95.79%. To ensure generalizability, cross-dataset validation was conducted and trained on COVID-19 Radiography Dataset. The same model was tested on an independent COVID–Pneumonia–Normal chest X-ray dataset which achieved an accuracy of 94.07% and demonstrated good performance over heterogeneous imaging sources. These findings confirm the proposed framework’s robustness, interpretability, and suitability for real-time clinical decision support in both pandemic and routine diagnostic settings.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"1358-1372"},"PeriodicalIF":3.6,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11316598","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-Scene Dataset and Object Detector for Outside Blind Individual Identification 基于多场景数据集和目标检测器的外盲个体识别

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access

Pub Date : 2025-12-29 DOI: 10.1109/ACCESS.2025.3649265

Haotian Ji;Israel Mendonça;Masayoshi Aritsugi

In the field of Computer Vision (CV), assistive navigation systems for visually impaired individuals have garnered significant attention in recent years. However, most existing solutions rely on wearable devices or mobile platforms, which often face limitations in cost, deployment, and robustness in complex outdoor environments. This paper proposes a practical approach to public space recognition for the visually impaired. The proposed recognition method can be widely applied to existing public video surveillance systems. We created a specialized image dataset tailored for recognition by visually impaired individuals. At the same time, for recognition tasks involving nighttime environments and partially occluded targets, we propose a composite framework that integrates various grouped convolutions and image enhancement networks. Compared to the original baseline detection models, our models outperform on our created dataset by 1.5% average precision (e.g., from 94.1% to 95.6% for YOLOv8x), while reducing parameters by up to 35.7% (e.g., from 56.9M to 36.6M for YOLOv11x). Furthermore, our models also achieve over 0.8% AP and a parameter reduction exceeding 10% compared to the original baseline models on ExDARK dataset.

在计算机视觉（CV）领域，视障人士的辅助导航系统近年来受到了广泛的关注。然而，大多数现有解决方案依赖于可穿戴设备或移动平台，这些解决方案往往面临成本、部署和复杂户外环境下的鲁棒性方面的限制。本文提出了一种实用的视障人士公共空间识别方法。该方法可广泛应用于现有的公共视频监控系统中。我们为视障人士创建了一个专门的图像数据集。同时，对于涉及夜间环境和部分遮挡目标的识别任务，我们提出了一种集成各种分组卷积和图像增强网络的复合框架。与原始基线检测模型相比，我们的模型在我们创建的数据集上的平均精度高出1.5%（例如，YOLOv8x的平均精度从94.1%提高到95.6%），同时将参数降低了35.7%（例如，YOLOv11x的平均精度从569米降低到366米）。此外，与ExDARK数据集上的原始基线模型相比，我们的模型也实现了超过0.8%的AP和超过10%的参数减少。

{"title":"Multi-Scene Dataset and Object Detector for Outside Blind Individual Identification","authors":"Haotian Ji;Israel Mendonça;Masayoshi Aritsugi","doi":"10.1109/ACCESS.2025.3649265","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3649265","url":null,"abstract":"In the field of Computer Vision (CV), assistive navigation systems for visually impaired individuals have garnered significant attention in recent years. However, most existing solutions rely on wearable devices or mobile platforms, which often face limitations in cost, deployment, and robustness in complex outdoor environments. This paper proposes a practical approach to public space recognition for the visually impaired. The proposed recognition method can be widely applied to existing public video surveillance systems. We created a specialized image dataset tailored for recognition by visually impaired individuals. At the same time, for recognition tasks involving nighttime environments and partially occluded targets, we propose a composite framework that integrates various grouped convolutions and image enhancement networks. Compared to the original baseline detection models, our models outperform on our created dataset by 1.5% average precision (e.g., from 94.1% to 95.6% for YOLOv8x), while reducing parameters by up to 35.7% (e.g., from 56.9M to 36.6M for YOLOv11x). Furthermore, our models also achieve over 0.8% AP and a parameter reduction exceeding 10% compared to the original baseline models on ExDARK dataset.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"1423-1438"},"PeriodicalIF":3.6,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11317963","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0