首页 > 最新文献

Journal of King Saud University-Computer and Information Sciences最新文献

英文 中文
Energy-efficient routing protocols for UWSNs: A comprehensive review of taxonomy, challenges, opportunities, future research directions, and machine learning perspectives 用于 UWSN 的高能效路由协议:对分类、挑战、机遇、未来研究方向和机器学习视角的全面回顾
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-23 DOI: 10.1016/j.jksuci.2024.102128

Underwater Wireless Sensor Networks (UWSNs) are essential for a number of environmental and oceanographic monitoring applications. However, they face different and more complex challenges than terrestrial wireless sensor networks (TWSNs). The main challenges faced by UWSNs are limited include high propagation delays, poor bandwidth, low throughput, and high energy consumption. Replacing sensor batteries in such networks becomes extremely difficult as they are usually deployed in remote areas where limited human interaction is possible. The unbalanced and inefficient usage of energy by various network nodes poses another issue, as it may reduce the applicability and feasibility of the network. Therefore, proposing Energy-Efficient Routing Protocols (E-ER-Ps) is crucial to improve the performance and lifespan of these networks. Due to the challenges mentioned earlier, this research presents an extensive analysis of several different E-ER-Ps intended for UWSNs. We compare contemporary approaches that use machine learning (ML) with conventional protocols, as ML-based approaches have shown significant potential in resolving the intricate challenges faced by UWSNs. This paper aims to present a critical review of different E-ER-Ps from various prospects for UWSNs. To better comprehend the structure and uses of these protocols, we provide an innovative taxonomy for their classification. While ML-based protocols are evaluated for their flexibility, predictive power, and overall efficiency advancements, traditional protocols are evaluated based on their routing tactics and energy-efficiency improvements. A thorough comparative analysis highlights the advantages, disadvantages, and possible uses for different protocols. Furthermore, a critical analysis of ML’s function, incorporating intelligent and adaptive routing approaches, is presented, highlighting the technology’s potential to completely alter UWSN management. To formulate and implement E-ER-Ps for UWSNs, the article concludes by highlighting the present obstacles, including the need for real-time flexibility, resilience to environmental alters, and interaction with pre-existing network infrastructures. The development of ML-based approaches, hybrid approaches that combine conventional and ML-based methodologies, and the design of protocols that can adapt dynamically to the changing circumstances of underwater habitats are highlighted as future research objectives. This research provides the foundation for future advancements in this crucial field by presenting a comprehensive overview of the state-of-the-art UWSN E-ER-Ps.

水下无线传感器网络(UWSN)对于许多环境和海洋学监测应用来说至关重要。然而,与地面无线传感器网络(TWSN)相比,水下无线传感器网络面临着不同且更加复杂的挑战。UWSN 面临的主要挑战包括传播延迟大、带宽差、吞吐量低和能耗高。在这类网络中更换传感器电池变得极为困难,因为它们通常部署在偏远地区,人与人之间的互动有限。各网络节点对能量的不平衡和低效使用也是一个问题,因为这可能会降低网络的适用性和可行性。因此,提出高能效路由协议(E-ER-Ps)对于提高这些网络的性能和寿命至关重要。鉴于前面提到的挑战,本研究对几种针对 UWSN 的不同 E-ER-Ps 进行了广泛分析。我们将使用机器学习(ML)的当代方法与传统协议进行了比较,因为基于 ML 的方法在解决 UWSN 面临的复杂挑战方面已显示出巨大潜力。本文旨在对 UWSNs 不同前景下的不同 E-ER-Ps 进行批判性评述。为了更好地理解这些协议的结构和用途,我们提供了一种创新的分类方法。在对基于 ML 的协议进行评估时,我们关注的是其灵活性、预测能力和整体效率的提高,而对传统协议的评估则基于其路由策略和能效的提高。全面的比较分析突出了不同协议的优缺点和可能用途。此外,还结合智能和自适应路由方法对 ML 的功能进行了批判性分析,强调了该技术彻底改变 UWSN 管理的潜力。为了制定和实施用于 UWSN 的 E-ER-Ps,文章最后强调了目前存在的障碍,包括对实时灵活性的需求、对环境变化的适应能力以及与现有网络基础设施的交互。文章强调了未来的研究目标,即开发基于 ML 的方法、结合传统方法和基于 ML 的方法的混合方法,以及设计能够动态适应水下栖息地不断变化的环境的协议。本研究通过对最先进的 UWSN E-ER-Ps 进行全面概述,为这一关键领域的未来发展奠定了基础。
{"title":"Energy-efficient routing protocols for UWSNs: A comprehensive review of taxonomy, challenges, opportunities, future research directions, and machine learning perspectives","authors":"","doi":"10.1016/j.jksuci.2024.102128","DOIUrl":"10.1016/j.jksuci.2024.102128","url":null,"abstract":"<div><p>Underwater Wireless Sensor Networks (UWSNs) are essential for a number of environmental and oceanographic monitoring applications. However, they face different and more complex challenges than terrestrial wireless sensor networks (TWSNs). The main challenges faced by UWSNs are limited include high propagation delays, poor bandwidth, low throughput, and high energy consumption. Replacing sensor batteries in such networks becomes extremely difficult as they are usually deployed in remote areas where limited human interaction is possible. The unbalanced and inefficient usage of energy by various network nodes poses another issue, as it may reduce the applicability and feasibility of the network. Therefore, proposing Energy-Efficient Routing Protocols (E-ER-Ps) is crucial to improve the performance and lifespan of these networks. Due to the challenges mentioned earlier, this research presents an extensive analysis of several different E-ER-Ps intended for UWSNs. We compare contemporary approaches that use machine learning (ML) with conventional protocols, as ML-based approaches have shown significant potential in resolving the intricate challenges faced by UWSNs. This paper aims to present a critical review of different E-ER-Ps from various prospects for UWSNs. To better comprehend the structure and uses of these protocols, we provide an innovative taxonomy for their classification. While ML-based protocols are evaluated for their flexibility, predictive power, and overall efficiency advancements, traditional protocols are evaluated based on their routing tactics and energy-efficiency improvements. A thorough comparative analysis highlights the advantages, disadvantages, and possible uses for different protocols. Furthermore, a critical analysis of ML’s function, incorporating intelligent and adaptive routing approaches, is presented, highlighting the technology’s potential to completely alter UWSN management. To formulate and implement E-ER-Ps for UWSNs, the article concludes by highlighting the present obstacles, including the need for real-time flexibility, resilience to environmental alters, and interaction with pre-existing network infrastructures. The development of ML-based approaches, hybrid approaches that combine conventional and ML-based methodologies, and the design of protocols that can adapt dynamically to the changing circumstances of underwater habitats are highlighted as future research objectives. This research provides the foundation for future advancements in this crucial field by presenting a comprehensive overview of the state-of-the-art UWSN E-ER-Ps.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002179/pdfft?md5=0ca24269fca8e21ff16074a33686ceaa&pid=1-s2.0-S1319157824002179-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141846029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Face forgery video detection based on expression key sequences 基于表情键序列的人脸伪造视频检测
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-23 DOI: 10.1016/j.jksuci.2024.102142

In order to minimize additional computational costs in detecting forged videos, and enhance detection accuracy, this paper employs dynamic facial expression sequences as key sequences, replacing original video sequences as inputs for the detection model. A spatio-temporal dual-branch detection network is designed based on the visual Transformer architecture. Specifically, this process involves three steps. Firstly, dynamic facial expression sequences are localized as key sequences using optical flow difference algorithms. Subsequently, the spatial branch network employs the focal self-attention mechanism to focus on dynamic features of expression-relevant regions and uses Factorization Machines to facilitate feature interaction among multiple key sequences. Meanwhile, the temporal branch network concentrates on learning the temporal inconsistency of optical flow differences between adjacent frames. Finally, a binary classification linear SVM combines the Softmax values from the two branch networks to provide the ultimate detection outcome. Experimental results on the Faceforensics++ dataset demonstrate: (a) replacing whole video sequences with facial expression key sequences effectively reduces training and detection time by nearly 80% and 90%, respectively; (b) compared to state-of-the-art methods involving random sequence/frame extraction and key frame extraction based on video compression techniques, the proposed approach in this paper presents a more competitive detection accuracy.

为了尽量减少检测伪造视频的额外计算成本,提高检测精度,本文采用动态面部表情序列作为关键序列,取代原始视频序列作为检测模型的输入。基于视觉变换器架构设计了时空双分支检测网络。具体来说,这一过程包括三个步骤。首先,使用光流差分算法将动态面部表情序列定位为关键序列。随后,空间分支网络利用焦点自我关注机制,关注表情相关区域的动态特征,并利用因式分解机促进多个关键序列之间的特征交互。同时,时间分支网络专注于学习相邻帧之间光流差异的时间不一致性。最后,二元分类线性 SVM 将两个分支网络的 Softmax 值结合起来,提供最终的检测结果。在 Faceforensics++ 数据集上的实验结果表明:(a) 用面部表情关键序列替换整个视频序列能有效地减少近 80% 的训练时间和 90% 的检测时间;(b) 与最先进的随机序列/帧提取方法和基于视频压缩技术的关键帧提取方法相比,本文提出的方法具有更有竞争力的检测精度。
{"title":"Face forgery video detection based on expression key sequences","authors":"","doi":"10.1016/j.jksuci.2024.102142","DOIUrl":"10.1016/j.jksuci.2024.102142","url":null,"abstract":"<div><p>In order to minimize additional computational costs in detecting forged videos, and enhance detection accuracy, this paper employs dynamic facial expression sequences as key sequences, replacing original video sequences as inputs for the detection model. A spatio-temporal dual-branch detection network is designed based on the visual Transformer architecture. Specifically, this process involves three steps. Firstly, dynamic facial expression sequences are localized as key sequences using optical flow difference algorithms. Subsequently, the spatial branch network employs the focal self-attention mechanism to focus on dynamic features of expression-relevant regions and uses Factorization Machines to facilitate feature interaction among multiple key sequences. Meanwhile, the temporal branch network concentrates on learning the temporal inconsistency of optical flow differences between adjacent frames. Finally, a binary classification linear SVM combines the Softmax values from the two branch networks to provide the ultimate detection outcome. Experimental results on the Faceforensics++ dataset demonstrate: (a) replacing whole video sequences with facial expression key sequences effectively reduces training and detection time by nearly 80% and 90%, respectively; (b) compared to state-of-the-art methods involving random sequence/frame extraction and key frame extraction based on video compression techniques, the proposed approach in this paper presents a more competitive detection accuracy.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002313/pdfft?md5=d3161c3d47c3e55bf622551f8213c551&pid=1-s2.0-S1319157824002313-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141845621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight citrus leaf disease detection model based on ARMS and cross-domain dynamic attention 基于 ARMS 和跨域动态注意力的轻量级柑橘叶病检测模型
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-23 DOI: 10.1016/j.jksuci.2024.102133

In citrus cultivation, Anthracnose, Scab, and Greasy Spot significantly impact yield and quality. Facing challenges in detecting small targets against complex orchard backgrounds with uneven lighting and obstructions, existing models suffer from low detection accuracy. This study introduces the YOLOv8n-CDDA citrus leaf disease detection model. The Cross-Domain Dynamic Attention (CDDA) mechanism deconstructs the backbone network’s input feature maps into sections, dynamically assigning spatial and channel attention weights to reconstruct critical information and capture the variations and weak semantic features of disease textures. The proposed Adaptive Random Mix-Cut Splicing (ARMS) image augmentation technique blends diseased leaf images with healthy citrus backgrounds, enhancing the diversity and number of background targets. To reduce computational and memory consumption, the network is streamlined through channel pruning; to compensate for the loss in accuracy from pruning, a teacher–assistant–student network format is used for knowledge distillation, where the student network learns from soft knowledge to improve disease recognition accuracy. Finally, Grad-CAM++ technology generates heatmaps of the detections, facilitating the visualization of effective features and deepening understanding of the model’s focus areas. Experimental results demonstrate that the YOLOv8n-CDDA model achieves an average accuracy of 90.89% in disease detection, with an average recall rate of 81.12%, and a mean Average Precision (mAP50) of 88.36%. Compared to the original YOLO v8n and current mainstream detection models such as YOLOv5s, SSD, and Faster-RCNN, the improvements in average accuracy are respectively 2.95%, 4.78%, 14.22%, and 21.01%; in average recall, 2.36%, 3.09%, 15.74%, and 23.27%; and in mAP50, 2.38%, 3.13%, 13.45%, and 20.91%. After pruning and distillation for lightweight adaptation, the YOLOv8n-CDDA model has a parameter size of 0.8M, requires 4.2 GFLOPs, weighs 2.0 MB, and operates at 45 fps. Compared to YOLOv8n, this represents a reduction of 2.2M in parameters, 3.9 GFLOPs, and 4 MB in model weight, with an increase of 7 fps in speed. This model exhibits exceptional performance in the complex environment of citrus leaf disease detection, providing robust technical support for citrus growth monitoring studies, and offering insights for disease detection in other crops as well.

在柑橘种植中,炭疽病、疮痂病和油脂斑病对产量和质量有很大影响。面对复杂的果园背景、不均匀的光照和障碍物,现有模型在检测小目标时面临挑战,检测精度较低。本研究介绍了 YOLOv8n-CDDA 柑橘叶病检测模型。跨域动态注意力(CDDA)机制将骨干网络的输入特征图解构为多个部分,动态分配空间和通道注意力权重以重构关键信息,并捕捉病害纹理的变化和弱语义特征。所提出的自适应随机混合剪切拼接(ARMS)图像增强技术将病叶图像与健康柑橘背景图像混合在一起,增强了背景目标的多样性和数量。为了减少计算量和内存消耗,通过通道剪枝精简了网络;为了弥补剪枝带来的准确率损失,采用了教师-助手-学生的网络形式进行知识提炼,其中学生网络从软知识中学习,以提高疾病识别准确率。最后,Grad-CAM++ 技术生成了检测的热图,促进了有效特征的可视化,加深了对模型重点领域的理解。实验结果表明,YOLOv8n-CDDA 模型的疾病检测平均准确率为 90.89%,平均召回率为 81.12%,平均精度(mAP50)为 88.36%。与最初的 YOLO v8n 和目前主流的检测模型(如 YOLOv5s、SSD 和 Faster-RCNN)相比,平均准确率分别提高了 2.95%、4.78%、14.22% 和 21.01%;平均召回率分别提高了 2.36%、3.09%、15.74% 和 23.27%;mAP50 分别提高了 2.38%、3.13%、13.45% 和 20.91%。经过剪枝和蒸馏以实现轻量级适配后,YOLOv8n-CDDA 模型的参数大小为 0.8M,需要 4.2 GFLOPs,重 2.0 MB,运行速度为 45 fps。与 YOLOv8n 相比,参数减少了 2.2M,需要 3.9 GFLOPs,模型重量减少了 4 MB,速度提高了 7 fps。该模型在柑橘叶病检测的复杂环境中表现出卓越的性能,为柑橘生长监测研究提供了强大的技术支持,同时也为其他作物的病害检测提供了启示。
{"title":"Lightweight citrus leaf disease detection model based on ARMS and cross-domain dynamic attention","authors":"","doi":"10.1016/j.jksuci.2024.102133","DOIUrl":"10.1016/j.jksuci.2024.102133","url":null,"abstract":"<div><p>In citrus cultivation, Anthracnose, Scab, and Greasy Spot significantly impact yield and quality. Facing challenges in detecting small targets against complex orchard backgrounds with uneven lighting and obstructions, existing models suffer from low detection accuracy. This study introduces the YOLOv8n-CDDA citrus leaf disease detection model. The Cross-Domain Dynamic Attention (CDDA) mechanism deconstructs the backbone network’s input feature maps into sections, dynamically assigning spatial and channel attention weights to reconstruct critical information and capture the variations and weak semantic features of disease textures. The proposed Adaptive Random Mix-Cut Splicing (ARMS) image augmentation technique blends diseased leaf images with healthy citrus backgrounds, enhancing the diversity and number of background targets. To reduce computational and memory consumption, the network is streamlined through channel pruning; to compensate for the loss in accuracy from pruning, a teacher–assistant–student network format is used for knowledge distillation, where the student network learns from soft knowledge to improve disease recognition accuracy. Finally, Grad-CAM++ technology generates heatmaps of the detections, facilitating the visualization of effective features and deepening understanding of the model’s focus areas. Experimental results demonstrate that the YOLOv8n-CDDA model achieves an average accuracy of 90.89% in disease detection, with an average recall rate of 81.12%, and a mean Average Precision (mAP50) of 88.36%. Compared to the original YOLO v8n and current mainstream detection models such as YOLOv5s, SSD, and Faster-RCNN, the improvements in average accuracy are respectively 2.95%, 4.78%, 14.22%, and 21.01%; in average recall, 2.36%, 3.09%, 15.74%, and 23.27%; and in mAP50, 2.38%, 3.13%, 13.45%, and 20.91%. After pruning and distillation for lightweight adaptation, the YOLOv8n-CDDA model has a parameter size of 0.8M, requires 4.2 GFLOPs, weighs 2.0 MB, and operates at 45 fps. Compared to YOLOv8n, this represents a reduction of 2.2M in parameters, 3.9 GFLOPs, and 4 MB in model weight, with an increase of 7 fps in speed. This model exhibits exceptional performance in the complex environment of citrus leaf disease detection, providing robust technical support for citrus growth monitoring studies, and offering insights for disease detection in other crops as well.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002222/pdfft?md5=d186db7e088c79129786ff12b138ee08&pid=1-s2.0-S1319157824002222-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141841316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review on software reliability prediction via swarm intelligence algorithms 通过蜂群智能算法预测软件可靠性的系统综述
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-20 DOI: 10.1016/j.jksuci.2024.102132

The widespread integration of software into all parts of our lives has led to the need for software of higher reliability. Ensuring reliable software usually necessitates some form of formal methods in the early stages of the development process which requires strenuous effort. Hence, researchers in the field of software reliability introduced Software Reliability Growth Models (SRGMs) as a relatively inexpensive approach to software reliability prediction. Conventional parameter estimation methods of SRGMs were ineffective and left more to be desired. Consequently, researchers sought out swarm intelligence to combat its flaws, resulting in significant improvements. While similar surveys exist within the domain, the surveys are broader in scope and do not cover many swarm intelligence algorithms. Moreover, the broader scope has resulted in the occasional omission of information regarding the design for reliability predictions. A more comprehensive survey containing 38 studies and 18 different swarm intelligence algorithms in the domain is presented. Each design proposed by the studies was systematically analyzed where relevant information including the measures used, datasets used, SRGMs used, and the effectiveness of each proposed design, were extracted and organized into tables and taxonomies to be able to identify the current trends within the domain. Some notable findings include the distance-based approach providing a high prediction accuracy and an increasing trend in hybridized variants of swarm intelligence algorithms designs to predict software reliability. Future researchers are encouraged to include Mean Square Error (MSE) or Root MSE as the measures offer the largest sample size for comparison.

随着软件广泛融入我们生活的方方面面,我们需要可靠性更高的软件。要确保软件的可靠性,通常需要在开发过程的早期阶段采用某种形式的正规方法,这需要付出艰苦的努力。因此,软件可靠性领域的研究人员引入了软件可靠性增长模型(SRGM),作为一种相对廉价的软件可靠性预测方法。传统的 SRGM 参数估计方法效果不佳,还有待改进。因此,研究人员寻找蜂群智能来克服其缺陷,从而取得了显著的改进。虽然该领域也有类似的调查,但调查范围更广,没有涵盖很多群智能算法。此外,由于范围较广,偶尔也会遗漏有关可靠性预测设计的信息。本文介绍了一项更为全面的调查,其中包含 38 项研究和 18 种不同的蜂群智能算法。对研究提出的每种设计都进行了系统分析,提取了相关信息,包括使用的测量方法、使用的数据集、使用的 SRGM 以及每种设计的有效性,并将其整理成表格和分类法,以便能够识别该领域的当前趋势。一些值得注意的发现包括:基于距离的方法可提供较高的预测准确性,以及预测软件可靠性的群集智能算法设计的混合变体呈上升趋势。我们鼓励未来的研究人员将均方误差 (MSE) 或根 MSE 纳入研究范围,因为这些指标提供了最大的样本量供比较。
{"title":"A systematic review on software reliability prediction via swarm intelligence algorithms","authors":"","doi":"10.1016/j.jksuci.2024.102132","DOIUrl":"10.1016/j.jksuci.2024.102132","url":null,"abstract":"<div><p>The widespread integration of software into all parts of our lives has led to the need for software of higher reliability. Ensuring reliable software usually necessitates some form of formal methods in the early stages of the development process which requires strenuous effort. Hence, researchers in the field of software reliability introduced Software Reliability Growth Models (SRGMs) as a relatively inexpensive approach to software reliability prediction. Conventional parameter estimation methods of SRGMs were ineffective and left more to be desired. Consequently, researchers sought out swarm intelligence to combat its flaws, resulting in significant improvements. While similar surveys exist within the domain, the surveys are broader in scope and do not cover many swarm intelligence algorithms. Moreover, the broader scope has resulted in the occasional omission of information regarding the design for reliability predictions. A more comprehensive survey containing 38 studies and 18 different swarm intelligence algorithms in the domain is presented. Each design proposed by the studies was systematically analyzed where relevant information including the measures used, datasets used, SRGMs used, and the effectiveness of each proposed design, were extracted and organized into tables and taxonomies to be able to identify the current trends within the domain. Some notable findings include the distance-based approach providing a high prediction accuracy and an increasing trend in hybridized variants of swarm intelligence algorithms designs to predict software reliability. Future researchers are encouraged to include Mean Square Error (MSE) or Root MSE as the measures offer the largest sample size for comparison.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002210/pdfft?md5=65281963d468eb6753881c759697abc2&pid=1-s2.0-S1319157824002210-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141846519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A many-to-many matching with externalities solution for parallel task offloading in IoT networks 物联网网络并行任务卸载的多对多匹配与外部性解决方案
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-18 DOI: 10.1016/j.jksuci.2024.102134

The efficient and timely execution of tasks is a fundamental challenge in the realm of future Internet of Things (IoT) networks. To address this challenge, fog devices are often deployed close to end devices to facilitate task processing on behalf of IoT nodes. One strategy for improving task computational delay is to employ parallel task offloading, in which tasks are subdivided into subtasks and sent to different fog devices for execution in parallel. However, allocating computational resources to fog nodes and mapping these resources to IoT subtasks is a key challenge in this area. This work models the parallel task offloading problem as a graph-matching problem and utilizes a many-to-many matching technique to achieve a stable mapping of IoT subtasks to fog node resources. Unfortunately, the proposed solution is subject to the problem of externalities due to the dynamic preference profiling of fog nodes. To address this issue, we employ an iterative algorithm to resolve any blocking pairs that may arise. Our results demonstrate that the proposed technique reduces the task latency by 29% as compared to other matching-based techniques available in the literature.

高效及时地执行任务是未来物联网(IoT)网络领域的一项基本挑战。为应对这一挑战,通常会在终端设备附近部署雾设备,以促进代表物联网节点的任务处理。改善任务计算延迟的一种策略是采用并行任务卸载,即将任务细分为子任务,并发送到不同的雾设备并行执行。然而,为雾节点分配计算资源并将这些资源映射到物联网子任务是这一领域的关键挑战。这项工作将并行任务卸载问题建模为图匹配问题,并利用多对多匹配技术实现物联网子任务与雾节点资源的稳定映射。遗憾的是,由于雾节点的动态偏好剖析,所提出的解决方案存在外部性问题。为了解决这个问题,我们采用了一种迭代算法来解决可能出现的任何阻塞对。我们的研究结果表明,与文献中其他基于匹配的技术相比,所提出的技术可将任务延迟时间缩短 29%。
{"title":"A many-to-many matching with externalities solution for parallel task offloading in IoT networks","authors":"","doi":"10.1016/j.jksuci.2024.102134","DOIUrl":"10.1016/j.jksuci.2024.102134","url":null,"abstract":"<div><p>The efficient and timely execution of tasks is a fundamental challenge in the realm of future Internet of Things (IoT) networks. To address this challenge, fog devices are often deployed close to end devices to facilitate task processing on behalf of IoT nodes. One strategy for improving task computational delay is to employ parallel task offloading, in which tasks are subdivided into subtasks and sent to different fog devices for execution in parallel. However, allocating computational resources to fog nodes and mapping these resources to IoT subtasks is a key challenge in this area. This work models the parallel task offloading problem as a graph-matching problem and utilizes a many-to-many matching technique to achieve a stable mapping of IoT subtasks to fog node resources. Unfortunately, the proposed solution is subject to the problem of externalities due to the dynamic preference profiling of fog nodes. To address this issue, we employ an iterative algorithm to resolve any blocking pairs that may arise. Our results demonstrate that the proposed technique reduces the task latency by 29% as compared to other matching-based techniques available in the literature.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002234/pdfft?md5=ca723de57705f68d89bad154b59605a4&pid=1-s2.0-S1319157824002234-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141960831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An optimized fusion of deep learning models for kidney stone detection from CT images 优化融合深度学习模型,从 CT 图像中检测肾结石
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-18 DOI: 10.1016/j.jksuci.2024.102130

Accurate diagnosis of kidney disease is crucial, as it is a significant health concern that demands precise identification for effective and appropriate treatment. Deep learning methods are increasingly recognized as valuable tools for disease diagnosis in the biomedical field. However, current models utilizing deep networks often encounter challenges of overfitting and low accuracy, necessitating further refinement for optimal performance. To overcome these challenges, this paper proposes the introduction of two ensemble models designed for kidney stone detection in CT images. The first model, called StackedEnsembleNet, is a two-level deep stack ensemble model that effectively integrates the predictions from four base models: InceptionV3, InceptionResNetV2, MobileNet, and Xception. By leveraging the collective knowledge of these models, StackedEnsembleNet improves the accuracy and reliability of kidney stone detection. The second model PSOWeightedAvgNet, leverages the Particle Swarm Optimization (PSO) algorithm to determine the optimal weights for the weighted average ensemble. Through PSO, this ensemble approach assigns optimized weights to each model during the ensembling process, effectively enhancing the performance by optimizing the combination of their predictions. Experimental results conducted on a large dataset of 1799 CT images demonstrate that both StackedEnsembleNet and PSOWeightedAvgNet outperform the individual base models, achieving high accuracy rates in kidney stone detection. Furthermore, additional experiments on an unseen dataset validate the models’ ability to generalize. The comparison with previous methods confirms the superior performance of the proposed ensemble models. The paper also presents Grad-CAM visualizations and error case analysis to provide insights into the decision-making processes of the models. By overcoming the limitations of existing deep learning models, StackedEnsembleNet and PSOWeightedAvgNet offer a promising approach for accurate kidney stone detection, contributing to improved diagnosis and treatment outcomes in the field of nephrology.

准确诊断肾脏疾病至关重要,因为肾脏疾病是一个重大的健康问题,需要精确识别才能进行有效和适当的治疗。深度学习方法越来越被认为是生物医学领域疾病诊断的重要工具。然而,目前利用深度网络的模型经常会遇到过拟合和准确率低的挑战,需要进一步改进才能获得最佳性能。为了克服这些挑战,本文提出了两个针对 CT 图像中肾结石检测的集合模型。第一个模型名为 StackedEnsembleNet,是一个两级深度堆栈集合模型,有效整合了四个基础模型的预测结果:InceptionV3、InceptionResNetV2、MobileNet 和 Xception。通过利用这些模型的集体知识,StackedEnsembleNet 提高了肾结石检测的准确性和可靠性。第二个模型 PSOWeightedAvgNet 利用粒子群优化(PSO)算法来确定加权平均集合的最佳权重。通过 PSO,这种集合方法可在集合过程中为每个模型分配优化的权重,通过优化组合这些模型的预测结果来有效提高性能。在一个包含 1799 张 CT 图像的大型数据集上进行的实验结果表明,StackedEnsembleNet 和 PSOWeightedAvgNet 的性能均优于单个基础模型,在肾结石检测中达到了很高的准确率。此外,在未见过的数据集上进行的其他实验也验证了模型的泛化能力。与以往方法的比较证实了所提出的集合模型的卓越性能。论文还介绍了 Grad-CAM 可视化和错误案例分析,以便深入了解模型的决策过程。通过克服现有深度学习模型的局限性,StackedEnsembleNet 和 PSOWeightedAvgNet 为准确检测肾结石提供了一种前景广阔的方法,有助于改善肾脏病学领域的诊断和治疗效果。
{"title":"An optimized fusion of deep learning models for kidney stone detection from CT images","authors":"","doi":"10.1016/j.jksuci.2024.102130","DOIUrl":"10.1016/j.jksuci.2024.102130","url":null,"abstract":"<div><p>Accurate diagnosis of kidney disease is crucial, as it is a significant health concern that demands precise identification for effective and appropriate treatment. Deep learning methods are increasingly recognized as valuable tools for disease diagnosis in the biomedical field. However, current models utilizing deep networks often encounter challenges of overfitting and low accuracy, necessitating further refinement for optimal performance. To overcome these challenges, this paper proposes the introduction of two ensemble models designed for kidney stone detection in CT images. The first model, called StackedEnsembleNet, is a two-level deep stack ensemble model that effectively integrates the predictions from four base models: InceptionV3, InceptionResNetV2, MobileNet, and Xception. By leveraging the collective knowledge of these models, StackedEnsembleNet improves the accuracy and reliability of kidney stone detection. The second model PSOWeightedAvgNet, leverages the Particle Swarm Optimization (PSO) algorithm to determine the optimal weights for the weighted average ensemble. Through PSO, this ensemble approach assigns optimized weights to each model during the ensembling process, effectively enhancing the performance by optimizing the combination of their predictions. Experimental results conducted on a large dataset of 1799 CT images demonstrate that both StackedEnsembleNet and PSOWeightedAvgNet outperform the individual base models, achieving high accuracy rates in kidney stone detection. Furthermore, additional experiments on an unseen dataset validate the models’ ability to generalize. The comparison with previous methods confirms the superior performance of the proposed ensemble models. The paper also presents Grad-CAM visualizations and error case analysis to provide insights into the decision-making processes of the models. By overcoming the limitations of existing deep learning models, StackedEnsembleNet and PSOWeightedAvgNet offer a promising approach for accurate kidney stone detection, contributing to improved diagnosis and treatment outcomes in the field of nephrology.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002192/pdfft?md5=49b54c2eb6fd0a154e0f96100151eede&pid=1-s2.0-S1319157824002192-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141728833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel image captioning model with visual-semantic similarities and visual representations re-weighting 具有视觉语义相似性和视觉表征重权的新型图像标题模型
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-14 DOI: 10.1016/j.jksuci.2024.102127

Image captioning, the task of generating descriptive sentences for images, has seen significant advancements by incorporating semantic information. However, previous studies employed semantic attribute detectors to extract predetermined attributes consistently applied at every time step, resulting in the use of irrelevant attributes to the linguistic context during words’ generation. Furthermore, the integration between semantic attributes and visual representations in previous works is considered superficial and ineffective, leading to the neglection of the rich visual-semantic connections affecting the captioning model’s performance. To address the limitations of previous models, we introduced a novel framework that adapts attribute usage based on contextual relevance and effectively utilizes the similarities between visual features and semantic attributes. Our framework includes an Attribute Detection Component (ADC) that predicts relevant attributes using visual features and attribute embeddings. The Attribute Prediction and Visual Weighting module (APVW) then dynamically adjusts these attributes and generates weights to refine the visual context vector, enhancing semantic alignment. Our approach demonstrated an average improvement of 3.30% in BLEU@1 and 5.24% in CIDEr on MS-COCO, and 6.55% in BLEU@1 and 25.72% in CIDEr on Flickr30K, during CIDEr optimization phase.

图像标题制作是为图像生成描述性句子的任务,通过纳入语义信息,图像标题制作取得了重大进展。然而,以往的研究采用语义属性检测器来提取预先确定的属性,并在每个时间步骤中持续应用,导致在生成词语时使用了与语言上下文无关的属性。此外,以往的研究认为语义属性和视觉表征之间的整合是肤浅和无效的,导致忽略了丰富的视觉-语义联系,影响了字幕模型的性能。为了解决以往模型的局限性,我们引入了一个新颖的框架,该框架可根据上下文相关性调整属性的使用,并有效利用视觉特征与语义属性之间的相似性。我们的框架包括一个属性检测组件(ADC),它能利用视觉特征和属性嵌入预测相关属性。然后,属性预测和视觉加权模块(APVW)会动态调整这些属性并生成权重,以完善视觉上下文向量,从而加强语义对齐。在 CIDEr 优化阶段,我们的方法在 MS-COCO 上平均提高了 3.30% 的 BLEU@1 和 5.24% 的 CIDEr,在 Flickr30K 上平均提高了 6.55% 的 BLEU@1 和 25.72% 的 CIDEr。
{"title":"A novel image captioning model with visual-semantic similarities and visual representations re-weighting","authors":"","doi":"10.1016/j.jksuci.2024.102127","DOIUrl":"10.1016/j.jksuci.2024.102127","url":null,"abstract":"<div><p>Image captioning, the task of generating descriptive sentences for images, has seen significant advancements by incorporating semantic information. However, previous studies employed semantic attribute detectors to extract predetermined attributes consistently applied at every time step, resulting in the use of irrelevant attributes to the linguistic context during words’ generation. Furthermore, the integration between semantic attributes and visual representations in previous works is considered superficial and ineffective, leading to the neglection of the rich visual-semantic connections affecting the captioning model’s performance. To address the limitations of previous models, we introduced a novel framework that adapts attribute usage based on contextual relevance and effectively utilizes the similarities between visual features and semantic attributes. Our framework includes an Attribute Detection Component (ADC) that predicts relevant attributes using visual features and attribute embeddings. The Attribute Prediction and Visual Weighting module (APVW) then dynamically adjusts these attributes and generates weights to refine the visual context vector, enhancing semantic alignment. Our approach demonstrated an average improvement of 3.30% in BLEU@1 and 5.24% in CIDEr on MS-COCO, and 6.55% in BLEU@1 and 25.72% in CIDEr on Flickr30K, during CIDEr optimization phase.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002167/pdfft?md5=a64ddf3f2ec61fdc99923155773d0fc6&pid=1-s2.0-S1319157824002167-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141712696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel deep CNN model with entropy coded sine cosine for corn disease classification 采用熵编码正弦余弦的新型深度 CNN 模型用于玉米疾病分类
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-14 DOI: 10.1016/j.jksuci.2024.102126

Corn diseases significantly impact crop yields, posing a major challenge to agricultural productivity. Early and accurate detection of these diseases is crucial for effective management and mitigation. Existing methods, mostly relying on analyzing corn leaves, often lack the precision to identify and classify a wide range of diseases under varying conditions. This study introduces a novel approach to detecting corn diseases using image processing and deep learning techniques, aiming to enhance detection accuracy through pre-processing, improved feature extraction and selection, and classification algorithms. A new deep Convolutional Neural Network (CNN) model named TreeNet, with 35 layers and 38 connections, is proposed. TreeNet is pre-trained using the Plant Village imaging dataset. For image pre-processing, the YCbCr color space is utilized to improve color representation and contrast. Feature extraction is performed using TreeNet and two pre-trained models, Darknet-53, and DenseNet-201, with features fused using a serial-based fusion method. The Entropy-coded Sine Cosine Algorithm is applied for feature selection, optimizing the feature set for classification. The selected features are used to train Support Vector Machine (SVM) and K-Nearest Neighbor (KNN) classifiers, with extensive experiments conducted using both 5-fold and 10-fold cross-validation, and feature sizes ranging from 200 to 1150. The proposed method achieves classification accuracy, precision, recall, and F1-score of 99.8%, 99%, 100%, and 99%, respectively, surpassing existing benchmarks. The integration of TreeNet with Darknet-53 and DenseNet-201, along with robust pre-processing and feature selection, significantly improves corn disease detection, highlighting the potential of advanced CNN architectures in agriculture.

玉米病害严重影响作物产量,对农业生产力构成重大挑战。及早准确地发现这些病害对于有效管理和缓解病害至关重要。现有方法大多依赖于分析玉米叶片,往往缺乏在不同条件下识别和分类各种病害的精度。本研究介绍了一种利用图像处理和深度学习技术检测玉米病害的新方法,旨在通过预处理、改进特征提取和选择以及分类算法来提高检测精度。研究提出了一种名为 TreeNet 的新型深度卷积神经网络(CNN)模型,该模型有 35 层和 38 个连接。TreeNet 使用植物村图像数据集进行预训练。在进行图像预处理时,使用 YCbCr 色彩空间来改善色彩表现和对比度。特征提取使用 TreeNet 和两个预训练模型 Darknet-53 和 DenseNet-201 进行,并使用基于序列的融合方法进行特征融合。熵编码正余弦算法用于特征选择,优化分类特征集。选定的特征用于训练支持向量机(SVM)和 K-近邻(KNN)分类器,并使用 5 倍和 10 倍交叉验证进行了大量实验,特征大小从 200 到 1150 不等。所提出的方法在分类准确率、精确度、召回率和 F1 分数上分别达到了 99.8%、99%、100% 和 99%,超过了现有的基准。TreeNet 与 Darknet-53 和 DenseNet-201 的集成,加上强大的预处理和特征选择,显著提高了玉米病害检测的效果,凸显了先进 CNN 架构在农业领域的潜力。
{"title":"A novel deep CNN model with entropy coded sine cosine for corn disease classification","authors":"","doi":"10.1016/j.jksuci.2024.102126","DOIUrl":"10.1016/j.jksuci.2024.102126","url":null,"abstract":"<div><p>Corn diseases significantly impact crop yields, posing a major challenge to agricultural productivity. Early and accurate detection of these diseases is crucial for effective management and mitigation. Existing methods, mostly relying on analyzing corn leaves, often lack the precision to identify and classify a wide range of diseases under varying conditions. This study introduces a novel approach to detecting corn diseases using image processing and deep learning techniques, aiming to enhance detection accuracy through pre-processing, improved feature extraction and selection, and classification algorithms. A new deep Convolutional Neural Network (CNN) model named TreeNet, with 35 layers and 38 connections, is proposed. TreeNet is pre-trained using the Plant Village imaging dataset. For image pre-processing, the YCbCr color space is utilized to improve color representation and contrast. Feature extraction is performed using TreeNet and two pre-trained models, Darknet-53, and DenseNet-201, with features fused using a serial-based fusion method. The Entropy-coded Sine Cosine Algorithm is applied for feature selection, optimizing the feature set for classification. The selected features are used to train Support Vector Machine (SVM) and K-Nearest Neighbor (KNN) classifiers, with extensive experiments conducted using both 5-fold and 10-fold cross-validation, and feature sizes ranging from 200 to 1150. The proposed method achieves classification accuracy, precision, recall, and F1-score of 99.8%, 99%, 100%, and 99%, respectively, surpassing existing benchmarks. The integration of TreeNet with Darknet-53 and DenseNet-201, along with robust pre-processing and feature selection, significantly improves corn disease detection, highlighting the potential of advanced CNN architectures in agriculture.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002155/pdfft?md5=43529fffcaeee3e790d439f86b92c4d2&pid=1-s2.0-S1319157824002155-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141704359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CFNet: Cross-scale fusion network for medical image segmentation CFNet:用于医学图像分割的跨尺度融合网络
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-10 DOI: 10.1016/j.jksuci.2024.102123

Learning multi-scale feature representations is essential for medical image segmentation. Most existing frameworks are based on U-shape architecture in which the high-resolution representation is recovered progressively by connecting different levels of the decoder with the low-resolution representation from the encoder. However, intrinsic defects in complementary feature fusion inhibit the U-shape from aggregating efficient global and discriminative features along object boundaries. While Transformer can help model the global features, their computation complexity limits the application in real-time medical scenarios. To address these issues, we propose a Cross-scale Fusion Network (CFNet), combining a cross-scale attention module and pyramidal module to fuse multi-stage/global context information. Specifically, we first utilize large kernel convolution to design the basic building block capable of extracting global and local information. Then, we propose a Bidirectional Atrous Spatial Pyramid Pooling (BiASPP), which employs atrous convolution in the bidirectional paths to capture various shapes and sizes of brain tumors. Furthermore, we introduce a cross-stage attention mechanism to reduce redundant information when merging features from two stages with different semantics. Extensive evaluation was performed on five medical image segmentation datasets: a 3D volumetric dataset, namely Brats benchmarks. CFNet-L achieves 85.74% and 90.98% dice score for Enhanced Tumor and Whole Tumor on Brats2018, respectively. Furthermore, our largest model CFNet-L outperformed other methods on 2D medical image. It achieved 71.95%, 82.79%, and 80.79% SE for STARE, DRIVE, and CHASEDB1, respectively. The code will be available at https://github.com/aminabenabid/CFNet

学习多尺度特征表示对医学图像分割至关重要。现有的大多数框架都基于 U 型结构,通过将解码器的不同层次与编码器的低分辨率表示连接起来,逐步恢复高分辨率表示。然而,互补特征融合的内在缺陷阻碍了 U 型结构沿对象边界聚合有效的全局特征和鉴别特征。虽然变换器可以帮助建立全局特征模型,但其计算复杂性限制了其在实时医疗场景中的应用。为了解决这些问题,我们提出了一种跨尺度融合网络(CFNet),它结合了跨尺度注意力模块和金字塔模块来融合多阶段/全局上下文信息。具体来说,我们首先利用大核卷积来设计能够提取全局和局部信息的基本构件。然后,我们提出了双向阿特柔斯空间金字塔池化(BiASPP),在双向路径中采用阿特柔斯卷积来捕捉各种形状和大小的脑肿瘤。此外,我们还引入了跨阶段关注机制,以便在合并来自两个不同语义阶段的特征时减少冗余信息。我们在五个医学图像分割数据集上进行了广泛的评估:一个三维体积数据集,即 Brats 基准。在 Brats2018 上,CFNet-L 对增强肿瘤和整体肿瘤的骰子得分分别达到 85.74% 和 90.98%。此外,我们的最大模型 CFNet-L 在二维医学图像上的表现优于其他方法。它对 STARE、DRIVE 和 CHASEDB1 的 SE 分别达到了 71.95%、82.79% 和 80.79%。代码可在 https://github.com/aminabenabid/CFNet
{"title":"CFNet: Cross-scale fusion network for medical image segmentation","authors":"","doi":"10.1016/j.jksuci.2024.102123","DOIUrl":"10.1016/j.jksuci.2024.102123","url":null,"abstract":"<div><p>Learning multi-scale feature representations is essential for medical image segmentation. Most existing frameworks are based on U-shape architecture in which the high-resolution representation is recovered progressively by connecting different levels of the decoder with the low-resolution representation from the encoder. However, intrinsic defects in complementary feature fusion inhibit the U-shape from aggregating efficient global and discriminative features along object boundaries. While Transformer can help model the global features, their computation complexity limits the application in real-time medical scenarios. To address these issues, we propose a Cross-scale Fusion Network (CFNet), combining a cross-scale attention module and pyramidal module to fuse multi-stage/global context information. Specifically, we first utilize large kernel convolution to design the basic building block capable of extracting global and local information. Then, we propose a Bidirectional Atrous Spatial Pyramid Pooling (BiASPP), which employs atrous convolution in the bidirectional paths to capture various shapes and sizes of brain tumors. Furthermore, we introduce a cross-stage attention mechanism to reduce redundant information when merging features from two stages with different semantics. Extensive evaluation was performed on five medical image segmentation datasets: a 3D volumetric dataset, namely Brats benchmarks. CFNet-L achieves 85.74% and 90.98% dice score for Enhanced Tumor and Whole Tumor on Brats2018, respectively. Furthermore, our largest model CFNet-L outperformed other methods on 2D medical image. It achieved 71.95%, 82.79%, and 80.79% SE for STARE, DRIVE, and CHASEDB1, respectively. The code will be available at <span><span>https://github.com/aminabenabid/CFNet</span><svg><path></path></svg></span></p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S131915782400212X/pdfft?md5=f9e4769e712ba1e0a899046089ca2727&pid=1-s2.0-S131915782400212X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141705259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ECG signal fusion reconstruction via hash autoencoder and margin semantic reinforcement 通过哈希自动编码器和边际语义强化进行心电信号融合重建
IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-01 DOI: 10.1016/j.jksuci.2024.102124

The ECG signal is often accompanied by noise, which can affect its shape characteristics, so it is important to perform signal de-noising. However, the commonly used signal noise reduction methods, such as wavelet or filter transformation, often prioritize high-frequency signals over low-frequency ones, leading to the loss of low-frequency band features or difficulties in capturing them. We propose a fusion reconstruction framework that combines hash autoencoder and margin semantic reinforcement to enhance low-frequency band features. Specifically, for labeled samples, margin semantic reinforcement identifies and corrects weight discrepancies among bands with similar waveforms but different labels to amplify the low-frequency signals associated with the label and reduce irrelevant ones. Meanwhile, hash autoencoder utilizes a semantic hash dictionary to reconstruct the original signal and mitigate noise pollution. For unlabeled samples, the hash autoencoder is utilized to generate pseudo-labels, followed by the reproduction of the aforementioned enhanced reconstruction process. The final step involves weighting the two types of signals, enhanced with margin semantics and hash autoencoder reconstruction, to achieve the reconstruction objective of the original signal, facilitating recognition and detection tasks. Experiments conducted on different classical classifiers demonstrate that the reconstructed ECG signals can significantly improve their performance.

心电信号通常伴有噪声,会影响其形状特征,因此进行信号去噪非常重要。然而,常用的信号降噪方法,如小波变换或滤波变换,往往优先考虑高频信号而非低频信号,导致低频段特征丢失或难以捕捉。我们提出了一种融合重构框架,将哈希自动编码器和边际语义强化相结合,以增强低频段特征。具体来说,对于有标签的样本,边际语义强化可以识别并纠正波形相似但标签不同的频带之间的权重差异,从而放大与标签相关的低频信号,减少不相关的信号。同时,哈希自动编码器利用语义哈希字典重建原始信号,减少噪声污染。对于无标签样本,哈希自动编码器被用来生成伪标签,然后再复制上述增强的重建过程。最后一步是对利用余量语义和哈希自动编码器重构增强的两类信号进行加权,以实现原始信号的重构目标,从而促进识别和检测任务的完成。在不同经典分类器上进行的实验表明,重构后的心电信号能显著提高分类器的性能。
{"title":"ECG signal fusion reconstruction via hash autoencoder and margin semantic reinforcement","authors":"","doi":"10.1016/j.jksuci.2024.102124","DOIUrl":"10.1016/j.jksuci.2024.102124","url":null,"abstract":"<div><p>The ECG signal is often accompanied by noise, which can affect its shape characteristics, so it is important to perform signal de-noising. However, the commonly used signal noise reduction methods, such as wavelet or filter transformation, often prioritize high-frequency signals over low-frequency ones, leading to the loss of low-frequency band features or difficulties in capturing them. We propose a fusion reconstruction framework that combines hash autoencoder and margin semantic reinforcement to enhance low-frequency band features. Specifically, for labeled samples, margin semantic reinforcement identifies and corrects weight discrepancies among bands with similar waveforms but different labels to amplify the low-frequency signals associated with the label and reduce irrelevant ones. Meanwhile, hash autoencoder utilizes a semantic hash dictionary to reconstruct the original signal and mitigate noise pollution. For unlabeled samples, the hash autoencoder is utilized to generate pseudo-labels, followed by the reproduction of the aforementioned enhanced reconstruction process. The final step involves weighting the two types of signals, enhanced with margin semantics and hash autoencoder reconstruction, to achieve the reconstruction objective of the original signal, facilitating recognition and detection tasks. Experiments conducted on different classical classifiers demonstrate that the reconstructed ECG signals can significantly improve their performance.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":null,"pages":null},"PeriodicalIF":5.2,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002131/pdfft?md5=cb705ac9ed204e1395389a7ec4365e45&pid=1-s2.0-S1319157824002131-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141639174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of King Saud University-Computer and Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1