首页 > 最新文献

Concurrency and Computation-Practice & Experience最新文献

英文 中文
RFG-YOLO: Accurate and Lightweight Detection of Multi-Scale Defects on Metallic Surfaces RFG-YOLO:金属表面多尺度缺陷的精确、轻量化检测
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-19 DOI: 10.1002/cpe.70563
Jiale Geng, Ming Li, Chengjun Tang

Surface defect detection is critical in industrial applications, especially for multi-scale small object detection under complex backgrounds, low resolution, and real-time constraints. Traditional YOLO-based detectors face three key limitations: single-path backbones struggle to balance fine-grained details and high-level semantics simultaneously, conventional feature pyramids cause small-object feature dilution through simple additive fusion, and detection heads lack explicit quality estimation for diverse defect morphologies. To address these challenges, we propose RFG-YOLO, a lightweight detection network. The proposed method introduces three key innovations: DPFNet employs dual-path parallel processing with PSHGNetv2 and YOLOv11 backbone to extract fine-grained details and semantic context simultaneously, eliminating the single-path bottleneck; MSWFPN utilizes weighted fusion modules including MFM, SCFM, and CSP-SMSFB to prevent small-object feature dilution through adaptive multi-scale integration; RQCD incorporates location quality estimation with RepConv to refine bounding box predictions for diverse defect shapes, improving localization precision across scales. Experiments on NEU-DET, GEAR-DET, and GC10-DET demonstrate that RFG-YOLO achieves mAP@50 improvements of 4.1%, 5.77%, and 4.1%, respectively, while reducing model parameters by 60.5% and computational cost by 50.8%. The model achieves a runtime speed of 156.0 frames per second on a GPU. These results validate that RFG-YOLO strikes an optimal balance between detection accuracy and computational efficiency, rendering it highly suitable for real-time defect detection in industrial settings.

表面缺陷检测在工业应用中至关重要,特别是在复杂背景、低分辨率和实时约束下的多尺度小目标检测。传统的基于yolo的检测器面临三个关键限制:单路径主干难以同时平衡细粒度细节和高级语义;传统的特征金字塔通过简单的加法融合导致小目标特征稀释;检测头缺乏对各种缺陷形态的明确质量估计。为了应对这些挑战,我们提出了轻量级检测网络RFG-YOLO。该方法引入了三个关键创新:DPFNet采用PSHGNetv2和YOLOv11主干的双路径并行处理,同时提取细粒度细节和语义上下文,消除了单路径瓶颈;MSWFPN利用加权融合模块,包括MFM、SCFM和CSP-SMSFB,通过自适应多尺度融合防止小目标特征稀释;RQCD将定位质量估计与RepConv结合起来,以改进不同缺陷形状的边界框预测,提高跨尺度的定位精度。在NEU-DET、GEAR-DET和GC10-DET上的实验表明,RFG-YOLO分别实现了4.1%、5.77%和4.1%的mAP@50改进,模型参数降低了60.5%,计算成本降低了50.8%。该模型在GPU上实现了每秒156.0帧的运行速度。这些结果验证了RFG-YOLO在检测精度和计算效率之间取得了最佳平衡,使其非常适合工业环境中的实时缺陷检测。
{"title":"RFG-YOLO: Accurate and Lightweight Detection of Multi-Scale Defects on Metallic Surfaces","authors":"Jiale Geng,&nbsp;Ming Li,&nbsp;Chengjun Tang","doi":"10.1002/cpe.70563","DOIUrl":"https://doi.org/10.1002/cpe.70563","url":null,"abstract":"<div>\u0000 \u0000 <p>Surface defect detection is critical in industrial applications, especially for multi-scale small object detection under complex backgrounds, low resolution, and real-time constraints. Traditional YOLO-based detectors face three key limitations: single-path backbones struggle to balance fine-grained details and high-level semantics simultaneously, conventional feature pyramids cause small-object feature dilution through simple additive fusion, and detection heads lack explicit quality estimation for diverse defect morphologies. To address these challenges, we propose RFG-YOLO, a lightweight detection network. The proposed method introduces three key innovations: DPFNet employs dual-path parallel processing with PSHGNetv2 and YOLOv11 backbone to extract fine-grained details and semantic context simultaneously, eliminating the single-path bottleneck; MSWFPN utilizes weighted fusion modules including MFM, SCFM, and CSP-SMSFB to prevent small-object feature dilution through adaptive multi-scale integration; RQCD incorporates location quality estimation with RepConv to refine bounding box predictions for diverse defect shapes, improving localization precision across scales. Experiments on NEU-DET, GEAR-DET, and GC10-DET demonstrate that RFG-YOLO achieves mAP@50 improvements of 4.1%, 5.77%, and 4.1%, respectively, while reducing model parameters by 60.5% and computational cost by 50.8%. The model achieves a runtime speed of 156.0 frames per second on a GPU. These results validate that RFG-YOLO strikes an optimal balance between detection accuracy and computational efficiency, rendering it highly suitable for real-time defect detection in industrial settings.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146099431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GFSAI: An Adaptive Factorized Sparse Approximate Inverse Preconditioning Algorithm on GPU 一种基于GPU的自适应分解稀疏近似逆预处理算法
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-18 DOI: 10.1002/cpe.70552
Yizhou Wang, Yige Zhang, Jiaquan Gao

The factorized sparse approximate inverse (FSAI) preconditioner has been proven to be effective in accelerating the convergence of iterative methods. Due to the high cost of constructing the FSAI preconditioner, accelerating it on graphics processing unit (GPU) has attracted considerable attention. However, despite the development of some existing FSAI preconditioning algorithms on GPU, their performance will significantly decrease when they encounter matrix types that are not suitable for them. This motivates us to investigate how to design an effective FSAI preconditioning algorithm on GPU. In this paper, we propose an adaptive FSAI preconditioning algorithm on GPU, called GFSAI-Adaptive, to address the above problem. In GFSAI-Adaptive, first, two adaptive thread allocation strategies are proposed for two special types of SPD matrices to ensure that the allocated threads can be fully utilized. Second, based on the proposed two thread allocation strategies, two FSAI kernels, called GFSAII and GFSAIII, are presented. Third, we construct a new graph convolutional network, and thus propose a search engine to select the optimal kernel from GFSAII and GFSAIII for matrices that do not belong to two special types based on it. Experimental results show that our proposed GFSAI-Adaptive is effective and outperforms a popular preconditioning algorithm in the public CUSPARSE library and a recent parallel static FSAI preconditioning algorithm on GPU.

因式稀疏近似逆(FSAI)预调节器在加速迭代方法收敛方面是有效的。由于构建FSAI预调节器的成本较高,在图形处理器(GPU)上加速FSAI预调节器已引起人们的广泛关注。然而,尽管现有的一些FSAI预处理算法在GPU上得到了发展,但当它们遇到不适合它们的矩阵类型时,它们的性能会显著下降。这促使我们研究如何在GPU上设计一种有效的FSAI预处理算法。为了解决上述问题,本文提出了一种基于GPU的自适应FSAI预处理算法,称为GFSAI-Adaptive。在gfsa - adaptive中,首先针对两种特殊类型的SPD矩阵提出了两种自适应的线程分配策略,以保证分配的线程能够得到充分利用;其次,基于所提出的两种线程分配策略,提出了GFSAII和GFSAIII两个FSAI内核。第三,我们构造了一个新的图卷积网络,并在此基础上提出了一个搜索引擎,对不属于两种特殊类型的矩阵从GFSAII和GFSAIII中选择最优核。实验结果表明,本文提出的FSAI- adaptive算法是有效的,并且优于公共CUSPARSE库中流行的预处理算法和最近在GPU上的并行静态FSAI预处理算法。
{"title":"GFSAI: An Adaptive Factorized Sparse Approximate Inverse Preconditioning Algorithm on GPU","authors":"Yizhou Wang,&nbsp;Yige Zhang,&nbsp;Jiaquan Gao","doi":"10.1002/cpe.70552","DOIUrl":"https://doi.org/10.1002/cpe.70552","url":null,"abstract":"<div>\u0000 \u0000 <p>The factorized sparse approximate inverse (FSAI) preconditioner has been proven to be effective in accelerating the convergence of iterative methods. Due to the high cost of constructing the FSAI preconditioner, accelerating it on graphics processing unit (GPU) has attracted considerable attention. However, despite the development of some existing FSAI preconditioning algorithms on GPU, their performance will significantly decrease when they encounter matrix types that are not suitable for them. This motivates us to investigate how to design an effective FSAI preconditioning algorithm on GPU. In this paper, we propose an adaptive FSAI preconditioning algorithm on GPU, called GFSAI-Adaptive, to address the above problem. In GFSAI-Adaptive, first, two adaptive thread allocation strategies are proposed for two special types of SPD matrices to ensure that the allocated threads can be fully utilized. Second, based on the proposed two thread allocation strategies, two FSAI kernels, called GFSAII and GFSAIII, are presented. Third, we construct a new graph convolutional network, and thus propose a search engine to select the optimal kernel from GFSAII and GFSAIII for matrices that do not belong to two special types based on it. Experimental results show that our proposed GFSAI-Adaptive is effective and outperforms a popular preconditioning algorithm in the public CUSPARSE library and a recent parallel static FSAI preconditioning algorithm on GPU.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146091097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Ensemble Stacking Regression Model for Brown Plant Hopper Pest Population Prediction 基于集合叠加回归模型的褐飞虱种群预测
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-15 DOI: 10.1002/cpe.70538
M. K. Shwetha,  Nagarathna, V. Ravikumar

The management of agricultural pests is crucial for the security of the food supply. The Brown Plant Hopper (BPH) harms rice fields and reduces productivity, which is a major concern. Accurate forecasting and early detection of BPH population dynamics based on environmental parameters are essential for the quick and efficient application of pest control strategies. This study investigates the use of machine learning methods like bagging, boosting, voting, and ensemble regression techniques to predict the population of BPH. The proposed ensemble stacking regression model uses Extreme gradient boosting (XGBoost), Random Forest (RF), and Extremely Randomized Trees (ERT) as base models with a meta-model Decision Tree (DT), and it is utilized to estimate the pest population. Several existing models are compared with the proposed model in terms of R-squared, Adjusted R-squared, RMSE as well as MSE. The outcomes show that these machine learning models are capable of accurately and successfully predicting the BPH population, which makes the proposed model an important tool for pest population prediction in rice farming.

农业有害生物的治理对粮食供应安全至关重要。褐飞虱(BPH)危害稻田,降低生产力,是人们关注的主要问题。基于环境参数的褐飞虱种群动态准确预测和早期发现是快速有效地实施害虫防治策略的必要条件。本研究探讨了使用机器学习方法,如bagging, boosting, voting和集成回归技术来预测BPH的数量。该模型以极端梯度增强(XGBoost)、随机森林(RF)和极端随机树(ERT)为基础模型,结合元模型决策树(DT)对害虫种群进行估计。从r平方、调整后r平方、RMSE和MSE等方面比较了几种现有模型。结果表明,这些机器学习模型能够准确、成功地预测BPH种群,这使得该模型成为水稻种植中害虫种群预测的重要工具。
{"title":"An Ensemble Stacking Regression Model for Brown Plant Hopper Pest Population Prediction","authors":"M. K. Shwetha,&nbsp; Nagarathna,&nbsp;V. Ravikumar","doi":"10.1002/cpe.70538","DOIUrl":"https://doi.org/10.1002/cpe.70538","url":null,"abstract":"<div>\u0000 <p>The management of agricultural pests is crucial for the security of the food supply. The Brown Plant Hopper (BPH) harms rice fields and reduces productivity, which is a major concern. Accurate forecasting and early detection of BPH population dynamics based on environmental parameters are essential for the quick and efficient application of pest control strategies. This study investigates the use of machine learning methods like bagging, boosting, voting, and ensemble regression techniques to predict the population of BPH. The proposed ensemble stacking regression model uses Extreme gradient boosting (XGBoost), Random Forest (RF), and Extremely Randomized Trees (ERT) as base models with a meta-model Decision Tree (DT), and it is utilized to estimate the pest population. Several existing models are compared with the proposed model in terms of R-squared, Adjusted R-squared, RMSE as well as MSE. The outcomes show that these machine learning models are capable of accurately and successfully predicting the BPH population, which makes the proposed model an important tool for pest population prediction in rice farming.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146002418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Joint Fusion Framework Integrating Traditional and Deep Image Features for Improved Knee Osteoarthritis Grading 融合传统和深度图像特征的关节融合框架用于改善膝关节骨性关节炎分级
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-14 DOI: 10.1002/cpe.70546
Usame Yilmaz, Fatma Z. Solak

Knee osteoarthritis (KOA) grading from x-ray images is important for supporting effective treatment planning. Yet it remains difficult due to the disease's complex presentation. Subtle anatomical changes add to the challenge. The subjectivity of manual evaluation further complicates the process. Such challenges highlight the importance of automated, objective, and reproducible computer-aided systems capable of leveraging complementary sources of information. In line with this, a joint fusion framework was developed to integrate optimized traditional image features with deep learning representations obtained from multiple pre-trained convolutional neural network models. Traditional features, including morphological, statistical, texture-based, and other clinically relevant descriptors, provide interpretable insights into bone structure and tissue characteristics. In parallel, deep features capture intricate spatial patterns and semantic details beyond the reach of manual modeling. For improved discrimination and efficiency, analysis of variance and linear discriminant analysis were used for selecting traditional features, while principal component analysis was applied to deep features to retain 85% variance. The two feature sets were combined using a Joint Fusion Type II approach, and class imbalance was mitigated through synthetic minority oversampling. A neural network was trained to capture interdependencies between these features. Experimental results on a benchmark KOA dataset indicated that fusion with VGG16 deep features achieved 85.39% accuracy, outperforming individual feature-based approaches. The framework maintained relatively high accuracy across all five KOA grades, including borderline cases, indicating its potential for consistent and clinically relevant KOA grading.

膝关节骨关节炎(KOA)的x线图像分级是重要的支持有效的治疗计划。然而,由于这种疾病的复杂表现,它仍然很困难。细微的解剖变化增加了挑战。人工评估的主观性使这一过程进一步复杂化。这些挑战突出了自动化、客观和可复制的计算机辅助系统的重要性,这些系统能够利用互补的信息源。基于此,开发了一种联合融合框架,将优化后的传统图像特征与多个预训练卷积神经网络模型获得的深度学习表征进行融合。传统的特征,包括形态学、统计学、基于纹理和其他临床相关的描述符,提供了对骨结构和组织特征的可解释的见解。与此同时,深度特征捕获复杂的空间模式和语义细节,超出了人工建模的范围。为提高识别效率,传统特征选择采用方差分析和线性判别分析,深层特征选择采用主成分分析,保留85%方差。使用Joint Fusion Type II方法将两个特征集结合起来,并通过合成少数派过采样来缓解类不平衡。神经网络被训练来捕捉这些特征之间的相互依赖关系。在KOA基准数据集上的实验结果表明,与VGG16深度特征的融合准确率达到85.39%,优于基于单个特征的方法。该框架在所有五个KOA分级(包括边缘病例)中保持相对较高的准确性,表明其具有一致和临床相关的KOA分级的潜力。
{"title":"A Joint Fusion Framework Integrating Traditional and Deep Image Features for Improved Knee Osteoarthritis Grading","authors":"Usame Yilmaz,&nbsp;Fatma Z. Solak","doi":"10.1002/cpe.70546","DOIUrl":"https://doi.org/10.1002/cpe.70546","url":null,"abstract":"<div>\u0000 \u0000 <p>Knee osteoarthritis (KOA) grading from x-ray images is important for supporting effective treatment planning. Yet it remains difficult due to the disease's complex presentation. Subtle anatomical changes add to the challenge. The subjectivity of manual evaluation further complicates the process. Such challenges highlight the importance of automated, objective, and reproducible computer-aided systems capable of leveraging complementary sources of information. In line with this, a joint fusion framework was developed to integrate optimized traditional image features with deep learning representations obtained from multiple pre-trained convolutional neural network models. Traditional features, including morphological, statistical, texture-based, and other clinically relevant descriptors, provide interpretable insights into bone structure and tissue characteristics. In parallel, deep features capture intricate spatial patterns and semantic details beyond the reach of manual modeling. For improved discrimination and efficiency, analysis of variance and linear discriminant analysis were used for selecting traditional features, while principal component analysis was applied to deep features to retain 85% variance. The two feature sets were combined using a Joint Fusion Type II approach, and class imbalance was mitigated through synthetic minority oversampling. A neural network was trained to capture interdependencies between these features. Experimental results on a benchmark KOA dataset indicated that fusion with VGG16 deep features achieved 85.39% accuracy, outperforming individual feature-based approaches. The framework maintained relatively high accuracy across all five KOA grades, including borderline cases, indicating its potential for consistent and clinically relevant KOA grading.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145969851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cybersecurity-Driven Strategy: Resilient Base Stations Deployment for Robust Open RAN 5G/6G Networks 网络安全驱动战略:面向强健开放RAN 5G/6G网络的弹性基站部署
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-12 DOI: 10.1002/cpe.70524
Ibtihal A. Alablani, Mohammed J. F. Alenazi

The proliferation of Open Radio Access Network (O-RAN) architectures in 5G/6G networks introduces unprecedented cybersecurity challenges. Strategic base station deployment constitutes a fundamental determinant of network security posture and cyberattack resilience. In this paper, a novel cybersecurity-driven deployment strategy for resilient base station positioning using an intelligent Resilient Ant Colony Optimization (iResACO) algorithm. The algorithm integrates security considerations directly into deployment optimization, employing bio-inspired collective intelligence to discover patterns that balance coverage efficiency with attack resilience. Through extensive simulations in a 3.6 km ×$$ times $$ 3.6 km urban environment in Riyadh, Saudi Arabia, experimental results demonstrate superior performance achieving 92.04% overall effectiveness with 96.0% coverage probability and 100% critical infrastructure protection. Under various cyberattack scenarios ranging from random to coordinated sophisticated attacks, the algorithm maintains coverage above 87% while preserving complete protection of critical facilities. The proposed approach provides a practical framework for deploying secure, resilient 5G/6G networks capable of withstanding evolving cyber threats while ensuring uninterrupted service to essential infrastructure.

开放无线接入网(O-RAN)架构在5G/6G网络中的扩散带来了前所未有的网络安全挑战。战略性基站部署是网络安全态势和网络攻击抵御能力的基本决定因素。本文采用智能弹性蚁群优化(iResACO)算法,提出了一种新的网络安全驱动的弹性基站定位部署策略。该算法将安全考虑直接集成到部署优化中,采用生物启发的集体智能来发现平衡覆盖效率和攻击弹性的模式。通过在沙特阿拉伯利雅得3.6 km × $$ times $$ 3.6 km的城市环境中进行的大量模拟,实验结果表明性能优异,达到92.04% overall effectiveness with 96.0% coverage probability and 100% critical infrastructure protection. Under various cyberattack scenarios ranging from random to coordinated sophisticated attacks, the algorithm maintains coverage above 87% while preserving complete protection of critical facilities. The proposed approach provides a practical framework for deploying secure, resilient 5G/6G networks capable of withstanding evolving cyber threats while ensuring uninterrupted service to essential infrastructure.
{"title":"Cybersecurity-Driven Strategy: Resilient Base Stations Deployment for Robust Open RAN 5G/6G Networks","authors":"Ibtihal A. Alablani,&nbsp;Mohammed J. F. Alenazi","doi":"10.1002/cpe.70524","DOIUrl":"https://doi.org/10.1002/cpe.70524","url":null,"abstract":"<div>\u0000 \u0000 <p>The proliferation of Open Radio Access Network (O-RAN) architectures in 5G/6G networks introduces unprecedented cybersecurity challenges. Strategic base station deployment constitutes a fundamental determinant of network security posture and cyberattack resilience. In this paper, a novel cybersecurity-driven deployment strategy for resilient base station positioning using an intelligent Resilient Ant Colony Optimization (iResACO) algorithm. The algorithm integrates security considerations directly into deployment optimization, employing bio-inspired collective intelligence to discover patterns that balance coverage efficiency with attack resilience. Through extensive simulations in a 3.6 km <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mo>×</mo>\u0000 </mrow>\u0000 <annotation>$$ times $$</annotation>\u0000 </semantics></math> 3.6 km urban environment in Riyadh, Saudi Arabia, experimental results demonstrate superior performance achieving 92.04% overall effectiveness with 96.0% coverage probability and 100% critical infrastructure protection. Under various cyberattack scenarios ranging from random to coordinated sophisticated attacks, the algorithm maintains coverage above 87% while preserving complete protection of critical facilities. The proposed approach provides a practical framework for deploying secure, resilient 5G/6G networks capable of withstanding evolving cyber threats while ensuring uninterrupted service to essential infrastructure.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145969907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Communication Frequency in Megatron-LM: Experimental Insights Applied to Heterogeneous Distributed Training Time Prediction Megatron-LM中的通信频率:应用于异构分布式训练时间预测的实验见解
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-12 DOI: 10.1002/cpe.70500
HaoRan Zhang, Yanzhao Feng, Zhengwei Chen, Yutong Tian, Xiaoli Zheng, Cong Liu, Sheng Wang, Jie Ren, Yucong Li, Rui Zhu

As model parameters increase exponentially, distributed training has become essential for advancing modern deep neural networks. Megatron-LM, an efficient distributed training framework developed by NVIDIA, enables the training of trillion-parameter models on thousands of GPUs by integrating tensor, pipeline, and data parallelism. Its computational efficiency has established it as a foundational tool for training large-scale models. Rapid identification of optimal parallel configurations for specific GPU clusters is critical for maximizing computational resource utilization, with training time prediction serving as a key evaluation metric. The high cost and limited availability of high-performance GPUs, particularly those based on NVIDIA architectures, have made the construction of large-scale heterogeneous clusters a practical solution to resource and cost constraints. However, existing prediction methods do not reliably or efficiently account for the computational and communication complexities inherent in heterogeneous GPU clusters. To address this gap, HATP (Heterogeneous-Aware Time Predictor) is introduced as a novel performance prediction method specifically designed for heterogeneous GPU clusters. For any given parallel configuration, HATP rapidly and accurately simulates execution times to inform the optimization of parallel strategies. To address communication differences among heterogeneous GPUs, comprehensive experimental analyses are conducted and analytical expressions are derived to characterize the communication frequency patterns in Megatron-LM's parallel strategies. This work presents the first systematic quantification of communication operations within Megatron-LM framework, ensuring that performance predictions remain highly accurate even in complex, heterogeneous environments. Furthermore, to account for computational differences among heterogeneous GPUs, a layer-level computational performance acquisition scheme is proposed to reduce the impact of fine-grained operator overlap and additional memory operations. Experimental results demonstrate that HATP achieves an average prediction accuracy of 97.41% in isomorphic environments, surpassing the current state-of-the-art method, ACEso. HATP also attains an average accuracy of 96.04% in heterogeneous data parallel and pipeline parallel configurations, representing the first extension of training time prediction capabilities to heterogeneous environments.

随着模型参数呈指数级增长,分布式训练对现代深度神经网络的发展至关重要。Megatron-LM是由NVIDIA开发的高效分布式训练框架,通过集成张量、管道和数据并行性,可以在数千个gpu上训练数万亿参数的模型。它的计算效率使其成为训练大规模模型的基础工具。快速识别特定GPU集群的最佳并行配置对于最大化计算资源利用率至关重要,训练时间预测是一个关键的评估指标。高性能gpu的高成本和有限的可用性,特别是基于NVIDIA架构的gpu,使得构建大规模异构集群成为解决资源和成本限制的实际解决方案。然而,现有的预测方法不能可靠或有效地解释异构GPU集群中固有的计算和通信复杂性。为了解决这一差距,HATP(异构感知时间预测器)作为一种专门为异构GPU集群设计的新型性能预测方法被引入。对于任何给定的并行配置,HATP快速准确地模拟执行时间,以通知并行策略的优化。为了解决异构gpu之间的通信差异,进行了全面的实验分析,并推导了表征Megatron-LM并行策略中通信频率模式的解析表达式。这项工作提出了Megatron-LM框架内通信操作的第一个系统量化,确保即使在复杂的异构环境中,性能预测也保持高度准确。此外,为了考虑异构gpu之间的计算差异,提出了一种层级计算性能获取方案,以减少细粒度运算符重叠和额外内存操作的影响。实验结果表明,在同构环境下,HATP的平均预测准确率达到97.41%,超过了目前最先进的ACEso方法。在异构数据并行和管道并行配置下,HATP的平均准确率达到96.04%,这是训练时间预测能力首次扩展到异构环境。
{"title":"Communication Frequency in Megatron-LM: Experimental Insights Applied to Heterogeneous Distributed Training Time Prediction","authors":"HaoRan Zhang,&nbsp;Yanzhao Feng,&nbsp;Zhengwei Chen,&nbsp;Yutong Tian,&nbsp;Xiaoli Zheng,&nbsp;Cong Liu,&nbsp;Sheng Wang,&nbsp;Jie Ren,&nbsp;Yucong Li,&nbsp;Rui Zhu","doi":"10.1002/cpe.70500","DOIUrl":"https://doi.org/10.1002/cpe.70500","url":null,"abstract":"<div>\u0000 \u0000 <p>As model parameters increase exponentially, distributed training has become essential for advancing modern deep neural networks. Megatron-LM, an efficient distributed training framework developed by NVIDIA, enables the training of trillion-parameter models on thousands of GPUs by integrating tensor, pipeline, and data parallelism. Its computational efficiency has established it as a foundational tool for training large-scale models. Rapid identification of optimal parallel configurations for specific GPU clusters is critical for maximizing computational resource utilization, with training time prediction serving as a key evaluation metric. The high cost and limited availability of high-performance GPUs, particularly those based on NVIDIA architectures, have made the construction of large-scale heterogeneous clusters a practical solution to resource and cost constraints. However, existing prediction methods do not reliably or efficiently account for the computational and communication complexities inherent in heterogeneous GPU clusters. To address this gap, HATP (Heterogeneous-Aware Time Predictor) is introduced as a novel performance prediction method specifically designed for heterogeneous GPU clusters. For any given parallel configuration, HATP rapidly and accurately simulates execution times to inform the optimization of parallel strategies. To address communication differences among heterogeneous GPUs, comprehensive experimental analyses are conducted and analytical expressions are derived to characterize the communication frequency patterns in Megatron-LM's parallel strategies. This work presents the first systematic quantification of communication operations within Megatron-LM framework, ensuring that performance predictions remain highly accurate even in complex, heterogeneous environments. Furthermore, to account for computational differences among heterogeneous GPUs, a layer-level computational performance acquisition scheme is proposed to reduce the impact of fine-grained operator overlap and additional memory operations. Experimental results demonstrate that HATP achieves an average prediction accuracy of 97.41% in isomorphic environments, surpassing the current state-of-the-art method, ACEso. HATP also attains an average accuracy of 96.04% in heterogeneous data parallel and pipeline parallel configurations, representing the first extension of training time prediction capabilities to heterogeneous environments.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Ensemble of Swarm-Based Evolutionary Learning Strategies for UAV Path-Planning Problem 无人机路径规划问题的基于群的进化学习策略集成
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-12 DOI: 10.1002/cpe.70536
Shrishti Chamoli, Anupam Yadav

The increasing application of unmanned aerial vehicles (UAVs) in diverse domains demands highly robust and autonomous path-planning algorithms capable of navigating complex and dynamic environments. To address the multifaceted challenges posed by obstacle avoidance, energy constraints, and environmental uncertainty, this work proposes an ensemble of learning strategies for the optimal path planning of UAVs. We introduce a modular particle swarm optimization and differential evolution (PSO-DE) ensemble framework and systematically investigate the impact of multiple learning and adaptation strategies, such as chaotic parameter adaptation, opposition-based learning (OBL), and a range of DE mutation schemes, to enhance the optimization process. We perform extensive experimentation across 16 carefully designed scenarios with varying complexity against ten competitive algorithms. We demonstrate that the integration of the PSO-DE hybrid with the opposition-based learning (OBLPSODE) achieves faster convergence while maintaining superior solution quality across all scenarios. The proposed OBLPSODE algorithm substantially outperforms other hybrid variants in both computational efficiency and path optimality, particularly excelling in cluttered environments where traditional algorithms often converge prematurely. Beyond algorithmic contributions, this work provides critical complexity analysis identifying obstacle-checking operations as the primary computational bottleneck in UAV path planning. The findings offer practical guidance for deploying UAVs in real-world applications and establish transferable design principles for developing adaptive meta-heuristics in complex optimization domains.

无人机在不同领域的应用越来越广泛,需要高度鲁棒和自主的路径规划算法,能够在复杂和动态的环境中导航。为了解决避障、能源约束和环境不确定性带来的多方面挑战,本研究提出了一套用于无人机最优路径规划的学习策略。引入模块化粒子群优化与差分进化(PSO-DE)集成框架,并系统研究了多种学习和适应策略(如混沌参数自适应、基于对手的学习(OBL)和一系列差分进化突变方案)对优化过程的影响。我们在16个精心设计的场景中对10种竞争算法进行了广泛的实验,这些场景具有不同的复杂性。我们证明了PSO-DE混合与基于对立的学习(OBLPSODE)的集成实现了更快的收敛,同时在所有场景中保持了卓越的解决方案质量。提出的OBLPSODE算法在计算效率和路径最优性方面都大大优于其他混合变体,特别是在传统算法经常过早收敛的混乱环境中表现出色。除了算法贡献之外,这项工作提供了关键的复杂性分析,将障碍物检查操作识别为无人机路径规划中的主要计算瓶颈。研究结果为在实际应用中部署无人机提供了实用指导,并为在复杂优化领域中开发自适应元启发式方法建立了可转移的设计原则。
{"title":"An Ensemble of Swarm-Based Evolutionary Learning Strategies for UAV Path-Planning Problem","authors":"Shrishti Chamoli,&nbsp;Anupam Yadav","doi":"10.1002/cpe.70536","DOIUrl":"https://doi.org/10.1002/cpe.70536","url":null,"abstract":"<div>\u0000 \u0000 <p>The increasing application of unmanned aerial vehicles (UAVs) in diverse domains demands highly robust and autonomous path-planning algorithms capable of navigating complex and dynamic environments. To address the multifaceted challenges posed by obstacle avoidance, energy constraints, and environmental uncertainty, this work proposes an ensemble of learning strategies for the optimal path planning of UAVs. We introduce a modular particle swarm optimization and differential evolution (PSO-DE) ensemble framework and systematically investigate the impact of multiple learning and adaptation strategies, such as chaotic parameter adaptation, opposition-based learning (OBL), and a range of DE mutation schemes, to enhance the optimization process. We perform extensive experimentation across 16 carefully designed scenarios with varying complexity against ten competitive algorithms. We demonstrate that the integration of the PSO-DE hybrid with the opposition-based learning (OBLPSODE) achieves faster convergence while maintaining superior solution quality across all scenarios. The proposed OBLPSODE algorithm substantially outperforms other hybrid variants in both computational efficiency and path optimality, particularly excelling in cluttered environments where traditional algorithms often converge prematurely. Beyond algorithmic contributions, this work provides critical complexity analysis identifying obstacle-checking operations as the primary computational bottleneck in UAV path planning. The findings offer practical guidance for deploying UAVs in real-world applications and establish transferable design principles for developing adaptive meta-heuristics in complex optimization domains.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SCF-Net: Spatial-Channel Fusion and Feature Refinement for Vessel Re-Identification SCF-Net:空间通道融合和特征细化用于舰船再识别
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-12 DOI: 10.1002/cpe.70545
Gangzhu Lin, Yongguo Ling, Yuting He, Wenhao Shao, Shaozi Li, Hongfeng Xu

Vessel re-identification (ReID) plays a critical role in maritime surveillance by matching vessels across different camera views. Compared with person or vehicle ReID, vessel ReID faces unique challenges due to subtle interclass differences and large intraclass variations caused by viewpoint changes. These issues are further exacerbated by the highly similar appearances of vessels and the lack of fine-grained identity cues commonly found in other ReID tasks. To address these challenges, we propose a spatial-channel fusion network (SCF-Net), a dual-branch deep framework that integrates a spatial-channel fusion (SCF) module and a feature refinement and alignment (FRA) module. The SCF module captures interdependent relationships between spatial and channel dimensions, enabling the network to emphasize discriminative regions while suppressing irrelevant background information. The FRA module refines high-dimensional embeddings into a compact representation and enforces intraclass similarity via a learnable multilayer perceptron (MLP) and a supervised mean squared error (MSE) loss. By jointly optimizing the two branches and the FRA output, SCF-Net effectively learns both interclass discrimination and intraclass compactness. Extensive experiments demonstrate that SCF-Net achieves competitive performance on public vessel ReID benchmarks, highlighting its effectiveness in handling subtle interclass differences and large intraclass variations.

船舶再识别(ReID)通过在不同的摄像机视图中匹配船舶,在海上监视中起着至关重要的作用。与人或车辆的ReID相比,船舶ReID面临着独特的挑战,因为它具有微妙的类间差异和视点变化导致的较大的类内差异。这些问题进一步加剧了血管高度相似的外观,以及缺乏其他ReID任务中常见的细粒度身份线索。为了应对这些挑战,我们提出了一个空间信道融合网络(SCF- net),这是一个双分支深度框架,集成了空间信道融合(SCF)模块和特征细化和对齐(FRA)模块。SCF模块捕获空间和通道维度之间的相互依存关系,使网络能够强调区别区域,同时抑制无关的背景信息。FRA模块将高维嵌入细化为紧凑的表示,并通过可学习的多层感知器(MLP)和监督均方误差(MSE)损失来增强类内相似性。通过联合优化两个分支和FRA输出,SCF-Net有效地学习了类间判别和类内紧密性。大量实验表明,SCF-Net在公共船舶ReID基准测试中取得了具有竞争力的性能,突出了其在处理微妙的船级间差异和较大的船级内变化方面的有效性。
{"title":"SCF-Net: Spatial-Channel Fusion and Feature Refinement for Vessel Re-Identification","authors":"Gangzhu Lin,&nbsp;Yongguo Ling,&nbsp;Yuting He,&nbsp;Wenhao Shao,&nbsp;Shaozi Li,&nbsp;Hongfeng Xu","doi":"10.1002/cpe.70545","DOIUrl":"https://doi.org/10.1002/cpe.70545","url":null,"abstract":"<div>\u0000 \u0000 <p>Vessel re-identification (ReID) plays a critical role in maritime surveillance by matching vessels across different camera views. Compared with person or vehicle ReID, vessel ReID faces unique challenges due to subtle interclass differences and large intraclass variations caused by viewpoint changes. These issues are further exacerbated by the highly similar appearances of vessels and the lack of fine-grained identity cues commonly found in other ReID tasks. To address these challenges, we propose a spatial-channel fusion network (SCF-Net), a dual-branch deep framework that integrates a spatial-channel fusion (SCF) module and a feature refinement and alignment (FRA) module. The SCF module captures interdependent relationships between spatial and channel dimensions, enabling the network to emphasize discriminative regions while suppressing irrelevant background information. The FRA module refines high-dimensional embeddings into a compact representation and enforces intraclass similarity via a learnable multilayer perceptron (MLP) and a supervised mean squared error (MSE) loss. By jointly optimizing the two branches and the FRA output, SCF-Net effectively learns both interclass discrimination and intraclass compactness. Extensive experiments demonstrate that SCF-Net achieves competitive performance on public vessel ReID benchmarks, highlighting its effectiveness in handling subtle interclass differences and large intraclass variations.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145986979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent Tunnel Collapse Prediction Using Multi-Modal Gaussian Cross-Attention Fusion (MGCAF): Integration of TBM Parameters and Geological Radar Data 基于多模态高斯交叉注意融合(MGCAF)的隧道塌陷智能预测:TBM参数与地质雷达数据的集成
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-11 DOI: 10.1002/cpe.70542
Youliang Chen, Wencan Guan, Rafig Azzam, Suran Wang, Yungui Pan, Chao Yan

Tunnel face instability prediction represents a critical technical challenge in underground engineering, particularly during tunnel boring machine (TBM) excavation under complex geological conditions. This study proposes the Multi-modal Gaussian Cross-Attention Fusion (MGCAF) algorithm, which integrates physics-constrained Gaussian processes with cross-attention mechanisms to achieve intelligent tunnel collapse prediction. The MGCAF framework reconstructs the traditional prediction paradigm by treating earth pressure balance chamber pressure as the primary prediction target rather than an input parameter, while incorporating first-principles constraints of TBM cutting mechanisms into kernel function design. The algorithm employs a dual-pathway architecture that fuses TBM operational parameters through temporal modeling, processes geological radar images via deep feature extraction, and achieves cross-modal information fusion through physics-constrained cross-attention mechanisms. Dynamic kernel optimization enables real-time adaptive parameter adjustment through multi-source gradient feedback. Validation results based on the Yinsong Water Diversion Tunnel project (20 km length, 9 collapse events) demonstrate that the algorithm achieves high-precision prediction with R2 = 0.8330, successfully predicting major collapse locations with approximately 20-m accuracy. Comparative analysis against baseline methods (Transformer, Gaussian Process, Random Forest, XGBoost) indicates that MGCAF exhibits superior performance in engineering reliability (0.95) and ROC-AUC (0.765) metrics. Generalization testing on the 2025 Los Angeles Wilmington Sewage Outfall Tunnel confirms the algorithm's cross-domain applicability. Ablation experiments reveal that the cross-attention mechanism serves as the primary performance driver, while uncertainty quantification provides interpretable risk assessment for TBM operations in heterogeneous geological environments.

隧道工作面失稳预测是地下工程,特别是复杂地质条件下隧道掘进机开挖中的一个关键技术难题。本研究提出了多模态高斯交叉注意融合(MGCAF)算法,该算法将物理约束的高斯过程与交叉注意机制相结合,实现隧道塌陷智能预测。MGCAF框架重构了传统的预测范式,将土压力平衡腔室压力作为主要预测目标而非输入参数,并将掘进机切削机构的第一性原理约束纳入核函数设计。该算法采用双路径架构,通过时间建模融合掘进机运行参数,通过深度特征提取处理地质雷达图像,通过物理约束的交叉关注机制实现跨模态信息融合。动态核优化通过多源梯度反馈实现参数的实时自适应调整。基于银松引水隧洞工程(长度20 km,塌方事件9次)的验证结果表明,该算法预测精度较高,R2 = 0.8330,成功预测主要塌方位置,精度约为20 m。与基线方法(Transformer, Gaussian Process, Random Forest, XGBoost)的比较分析表明,MGCAF在工程可靠性(0.95)和ROC-AUC(0.765)指标上表现出优越的性能。对2025年洛杉矶威尔明顿污水排放隧道的泛化测试证实了该算法的跨域适用性。消融实验表明,交叉关注机制是主要的性能驱动因素,而不确定性量化为非均质地质环境下的TBM操作提供了可解释的风险评估。
{"title":"Intelligent Tunnel Collapse Prediction Using Multi-Modal Gaussian Cross-Attention Fusion (MGCAF): Integration of TBM Parameters and Geological Radar Data","authors":"Youliang Chen,&nbsp;Wencan Guan,&nbsp;Rafig Azzam,&nbsp;Suran Wang,&nbsp;Yungui Pan,&nbsp;Chao Yan","doi":"10.1002/cpe.70542","DOIUrl":"https://doi.org/10.1002/cpe.70542","url":null,"abstract":"<div>\u0000 \u0000 <p>Tunnel face instability prediction represents a critical technical challenge in underground engineering, particularly during tunnel boring machine (TBM) excavation under complex geological conditions. This study proposes the Multi-modal Gaussian Cross-Attention Fusion (MGCAF) algorithm, which integrates physics-constrained Gaussian processes with cross-attention mechanisms to achieve intelligent tunnel collapse prediction. The MGCAF framework reconstructs the traditional prediction paradigm by treating earth pressure balance chamber pressure as the primary prediction target rather than an input parameter, while incorporating first-principles constraints of TBM cutting mechanisms into kernel function design. The algorithm employs a dual-pathway architecture that fuses TBM operational parameters through temporal modeling, processes geological radar images via deep feature extraction, and achieves cross-modal information fusion through physics-constrained cross-attention mechanisms. Dynamic kernel optimization enables real-time adaptive parameter adjustment through multi-source gradient feedback. Validation results based on the Yinsong Water Diversion Tunnel project (20 km length, 9 collapse events) demonstrate that the algorithm achieves high-precision prediction with <i>R</i><sup>2</sup> = 0.8330, successfully predicting major collapse locations with approximately 20-m accuracy. Comparative analysis against baseline methods (Transformer, Gaussian Process, Random Forest, XGBoost) indicates that MGCAF exhibits superior performance in engineering reliability (0.95) and ROC-AUC (0.765) metrics. Generalization testing on the 2025 Los Angeles Wilmington Sewage Outfall Tunnel confirms the algorithm's cross-domain applicability. Ablation experiments reveal that the cross-attention mechanism serves as the primary performance driver, while uncertainty quantification provides interpretable risk assessment for TBM operations in heterogeneous geological environments.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145970035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OD-H-SABE: A Hierarchical Searchable Attribute-Based Encryption Scheme With Outsourced Decryption for Blockchain-Based Data Sharing OD-H-SABE:基于区块链的数据共享的分层可搜索属性加密方案和外包解密
IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-01-11 DOI: 10.1002/cpe.70510
Gaimei Gao, Yiqing Wei, Jingyue Wang, Chunxia Liu, Junji Li

The growing demand for data sharing in domains such as health care highlights limitations in existing solutions, including low search efficiency, coarse-grained access control, and heavy decryption overhead on users. To address these challenges, this paper proposes a hierarchical searchable attribute-based encryption scheme with outsourced decryption for blockchain-based data sharing (OD-H-SABE). OD-H-SABE introduces a hierarchical attribute structure alongside an outsourced decryption mechanism that offloads computationally intensive bilinear operations to the cloud server. Consequently, users only need to perform a single lightweight operation to complete decryption, significantly alleviating the computational burden. Furthermore, the scheme integrates searchable encryption with multi-keyword aggregate hashing, enabling efficient search with constant complexity regardless of the number of keywords. Leveraging the transparency and immutability of blockchain, smart contracts verify the integrity of results returned from the cloud server, ensuring data security and trustworthiness throughout the sharing process. Theoretical and experimental analyses demonstrate that OD-H-SABE achieves notable advantages over traditional schemes in terms of security, search efficiency, and computational overhead. For example, compared to MKS-VABE and BEM-ABSE, OD-H-SABE reduces encryption and user-side decryption overhead by approximately 20% and 42%, respectively. This makes it a practical and lightweight solution for constructing secure and efficient blockchain-based data-sharing platforms.

医疗保健等领域对数据共享的需求日益增长,这突出了现有解决方案的局限性,包括搜索效率低、粗粒度访问控制和用户繁重的解密开销。为了解决这些挑战,本文提出了一种分层可搜索的基于属性的加密方案,并将解密外包给基于区块链的数据共享(OD-H-SABE)。OD-H-SABE引入了分层属性结构和外包解密机制,将计算密集型双线性操作卸载到云服务器。因此,用户只需要执行一个轻量级的操作就可以完成解密,大大减轻了计算负担。此外,该方案将可搜索加密与多关键字聚合散列相结合,无论关键字数量多少,都能实现具有恒定复杂度的高效搜索。利用区块链的透明性和不可变性,智能合约验证从云服务器返回的结果的完整性,确保整个共享过程中的数据安全性和可信度。理论和实验分析表明,OD-H-SABE在安全性、搜索效率和计算开销方面比传统方案具有显著优势。例如,与MKS-VABE和BEM-ABSE相比,OD-H-SABE分别减少了大约20%和42%的加密和用户端解密开销。这使得它成为构建安全高效的基于区块链的数据共享平台的实用和轻量级解决方案。
{"title":"OD-H-SABE: A Hierarchical Searchable Attribute-Based Encryption Scheme With Outsourced Decryption for Blockchain-Based Data Sharing","authors":"Gaimei Gao,&nbsp;Yiqing Wei,&nbsp;Jingyue Wang,&nbsp;Chunxia Liu,&nbsp;Junji Li","doi":"10.1002/cpe.70510","DOIUrl":"https://doi.org/10.1002/cpe.70510","url":null,"abstract":"<div>\u0000 \u0000 <p>The growing demand for data sharing in domains such as health care highlights limitations in existing solutions, including low search efficiency, coarse-grained access control, and heavy decryption overhead on users. To address these challenges, this paper proposes a hierarchical searchable attribute-based encryption scheme with outsourced decryption for blockchain-based data sharing (OD-H-SABE). OD-H-SABE introduces a hierarchical attribute structure alongside an outsourced decryption mechanism that offloads computationally intensive bilinear operations to the cloud server. Consequently, users only need to perform a single lightweight operation to complete decryption, significantly alleviating the computational burden. Furthermore, the scheme integrates searchable encryption with multi-keyword aggregate hashing, enabling efficient search with constant complexity regardless of the number of keywords. Leveraging the transparency and immutability of blockchain, smart contracts verify the integrity of results returned from the cloud server, ensuring data security and trustworthiness throughout the sharing process. Theoretical and experimental analyses demonstrate that OD-H-SABE achieves notable advantages over traditional schemes in terms of security, search efficiency, and computational overhead. For example, compared to MKS-VABE and BEM-ABSE, OD-H-SABE reduces encryption and user-side decryption overhead by approximately 20% and 42%, respectively. This makes it a practical and lightweight solution for constructing secure and efficient blockchain-based data-sharing platforms.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"38 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2026-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145964366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Concurrency and Computation-Practice & Experience
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1