首页 > 最新文献

International Journal of Intelligent Systems最新文献

英文 中文
Channel Estimation Algorithm Based on Spatial Direction Acquisition and Dynamic-Window Expansion in Massive MIMO System 大规模多输入多输出系统中基于空间方向采集和动态窗口扩展的信道估计算法
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-19 DOI: 10.1155/2024/7727469
Shufeng Li, Baoxin Su, Yiming Liu, Junwei Zhang, Minglei You

Millimeter-wave (mmWave) and massive multiple-input multiple-output (MIMO) technologies are critical in current and future communication research. They play an essential role in meeting the demands for high-capacity, high-speed, and low-latency communication brought about by technological advancements. However, existing mmWave channel estimation schemes rely on idealized common sparse channel support assumptions, and their performance significantly degrades when encountering beam squint scenarios. To address this issue, this paper introduces a dynamic support detection window (DSDW) algorithm. This algorithm dynamically adjusts the position and size of the window based on the received signal strength, thereby better capturing signal strength variations and obtaining a more complete set of signal supports. The DSDW algorithm can better capture and utilize the sparsity of the channel, improving the efficiency and accuracy of the channel state information acquisition. By combining the beam-split pattern (BSP) algorithm with the DSDW algorithm, this paper designs an effective method to address the inherent beam-spreading problem in mmWave scenarios. Simulation results are proposed to demonstrate the effectiveness of the BSP-DSDW algorithm.

毫米波(mmWave)和大规模多输入多输出(MIMO)技术在当前和未来的通信研究中至关重要。它们在满足技术进步带来的大容量、高速度和低延迟通信需求方面发挥着至关重要的作用。然而,现有的毫米波信道估计方案依赖于理想化的普通稀疏信道支持假设,当遇到波束斜视场景时,其性能会明显下降。为解决这一问题,本文介绍了一种动态支持检测窗(DSDW)算法。该算法可根据接收到的信号强度动态调整窗口的位置和大小,从而更好地捕捉信号强度变化,获得更完整的信号支持集。DSDW 算法能更好地捕捉和利用信道的稀疏性,提高信道状态信息获取的效率和准确性。通过将分束模式(BSP)算法与 DSDW 算法相结合,本文设计了一种有效的方法来解决毫米波场景中固有的波束扩散问题。仿真结果证明了 BSP-DSDW 算法的有效性。
{"title":"Channel Estimation Algorithm Based on Spatial Direction Acquisition and Dynamic-Window Expansion in Massive MIMO System","authors":"Shufeng Li,&nbsp;Baoxin Su,&nbsp;Yiming Liu,&nbsp;Junwei Zhang,&nbsp;Minglei You","doi":"10.1155/2024/7727469","DOIUrl":"https://doi.org/10.1155/2024/7727469","url":null,"abstract":"<div>\u0000 <p>Millimeter-wave (mmWave) and massive multiple-input multiple-output (MIMO) technologies are critical in current and future communication research. They play an essential role in meeting the demands for high-capacity, high-speed, and low-latency communication brought about by technological advancements. However, existing mmWave channel estimation schemes rely on idealized common sparse channel support assumptions, and their performance significantly degrades when encountering beam squint scenarios. To address this issue, this paper introduces a dynamic support detection window (DSDW) algorithm. This algorithm dynamically adjusts the position and size of the window based on the received signal strength, thereby better capturing signal strength variations and obtaining a more complete set of signal supports. The DSDW algorithm can better capture and utilize the sparsity of the channel, improving the efficiency and accuracy of the channel state information acquisition. By combining the beam-split pattern (BSP) algorithm with the DSDW algorithm, this paper designs an effective method to address the inherent beam-spreading problem in mmWave scenarios. Simulation results are proposed to demonstrate the effectiveness of the BSP-DSDW algorithm.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/7727469","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141730284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fine-Grained Passenger Load Prediction inside Metro Network via Smart Card Data 通过智能卡数据对地铁网络内的乘客负荷进行精细预测
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-18 DOI: 10.1155/2024/6643018
Xiancai Tian, Chen Zhang, Baihua Zheng

Metro system serves as the backbone for urban public transportation. Accurate passenger load prediction for the metro system plays a crucial role in metro service quality improvement, such as helping operators schedule train timetables and passengers plan their trips. However, existing works can only predict low-grained passenger flows of origin-destination (O-D) paths or inflows/outflows of each station but cannot predict passenger load distribution over the whole metro network. To this end, this paper proposes an end-to-end inference framework, PIPE, for passenger load prediction of every metro segment between two adjacent stations, by only utilizing smart card data. In particular, PIPE includes two modules. The first is the core. It formulates the travel time distribution of each metro segment as a truncated Gaussian distribution. Since there might be several possible routes for certain O-D paths, the population-level travel time distribution of these O-D paths would be a mixture of travel times of different routes. Considering the route preference may change over time, a dynamic truncated Gaussian mixture model is proposed for parameter inference of each truncated Gaussian distribution of each metro segment. The second module serves as the supplement, which compiles a bunch of methods for predicting passenger flows of O-D paths. Built upon them, PIPE is able to predict the travel time that future passengers of each O-D path will take for passing each metro segment and consequently can predict the passenger load of each metro segment in the short future. Numerical studies from Singapore’s metro system demonstrate the efficacy of our method.

地铁系统是城市公共交通的骨干。准确预测地铁系统的客流对提高地铁服务质量起着至关重要的作用,如帮助运营商安排列车时刻表和乘客规划行程。然而,现有的工作只能预测起点-终点(O-D)路径或每个车站流入/流出的低粒度客流,却无法预测整个地铁网络的客流分布。为此,本文提出了一种端到端推理框架 PIPE,只需利用智能卡数据,即可对相邻两站之间的每个地铁区间进行客流预测。具体而言,PIPE 包括两个模块。第一个是核心模块。它将每个地铁区间的旅行时间分布表述为截断高斯分布。由于某些 O-D 路径可能有多条可能的路线,因此这些 O-D 路径的人口级旅行时间分布将是不同路线旅行时间的混合。考虑到路线偏好可能会随时间发生变化,我们提出了一个动态截断高斯混合模型,用于推断每个地铁段截断高斯分布的参数。第二个模块作为补充,汇编了一系列预测 O-D 路径客流的方法。在这些方法的基础上,PIPE 能够预测每条 O-D 路径的未来乘客通过每个地铁段所需的旅行时间,从而预测每个地铁段在短期内的客流量。新加坡地铁系统的数值研究证明了我们方法的有效性。
{"title":"Fine-Grained Passenger Load Prediction inside Metro Network via Smart Card Data","authors":"Xiancai Tian,&nbsp;Chen Zhang,&nbsp;Baihua Zheng","doi":"10.1155/2024/6643018","DOIUrl":"https://doi.org/10.1155/2024/6643018","url":null,"abstract":"<div>\u0000 <p>Metro system serves as the backbone for urban public transportation. Accurate passenger load prediction for the metro system plays a crucial role in metro service quality improvement, such as helping operators schedule train timetables and passengers plan their trips. However, existing works can only predict low-grained passenger flows of origin-destination (O-D) paths or inflows/outflows of each station but cannot predict passenger load distribution over the whole metro network. To this end, this paper proposes an end-to-end inference framework, PIPE, for passenger load prediction of every metro segment between two adjacent stations, by only utilizing smart card data. In particular, PIPE includes two modules. The first is the core. It formulates the travel time distribution of each metro segment as a truncated Gaussian distribution. Since there might be several possible routes for certain O-D paths, the population-level travel time distribution of these O-D paths would be a mixture of travel times of different routes. Considering the route preference may change over time, a dynamic truncated Gaussian mixture model is proposed for parameter inference of each truncated Gaussian distribution of each metro segment. The second module serves as the supplement, which compiles a bunch of methods for predicting passenger flows of O-D paths. Built upon them, PIPE is able to predict the travel time that future passengers of each O-D path will take for passing each metro segment and consequently can predict the passenger load of each metro segment in the short future. Numerical studies from Singapore’s metro system demonstrate the efficacy of our method.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/6643018","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141730210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodality Data Augmentation Network for Arrhythmia Classification 用于心律失常分类的多模态数据增强网络
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-14 DOI: 10.1155/2024/9954821
Zhimin Xu, Mujun Zang, Tong Liu, Zhihao Wang, Shusen Zhou, Chanjuan Liu, Qingjun Wang

Arrhythmia is a prevalent cardiovascular disease, which has garnered widespread attention due to its age-related increases in mortality rates. In the analysis of arrhythmia, the electrocardiogram (ECG) plays an important role. Arrhythmia classification often suffers from a significant data imbalance issue due to the limited availability of data for certain arrhythmia categories. This imbalance problem significantly affects the classification performance of the model. To address this challenge, data augmentation emerges as a viable solution, aiming to neutralize the adverse effects of imbalanced datasets on the model. To this end, this paper proposes a novel Multimodality Data Augmentation Network (MM-DANet) for arrhythmia classification. The MM-DANet consists of two modules: the multimodality data matching-based data augmentation module and the multimodality feature encoding module. In the multimodality data matching-based data augmentation module, we expand the underrepresented arrhythmia categories to match the size of the largest category. Subsequently, the multimodality feature encoding module employs convolutional neural networks (CNN) to extract the modality-specific features from both signals and images and concatenate them for efficient and accurate classification. The MM-DANet was evaluated on the MIT-BIH Arrhythmia Database and achieving an accuracy of 98.83%, along with an average specificity of 98.87%, average sensitivity of 92.92%, average precision of 91.05%, and average F1_score of 91.96%. Furthermore, its performance was also assessed on the St. Petersburg INCART arrhythmia database and the MIT-BIH supraventricular arrhythmia database, yielding AUC values of 81.98% and 90.93%, respectively. These outstanding results not only underscore the effectiveness of MM-DANet but also indicate its potential for facilitating reliable automated analysis of arrhythmias.

心律失常是一种普遍存在的心血管疾病,由于其与年龄相关的死亡率上升而受到广泛关注。在心律失常的分析中,心电图(ECG)发挥着重要作用。由于某些心律失常类别的数据有限,心律失常分类往往存在严重的数据不平衡问题。这种不平衡问题严重影响了模型的分类性能。为应对这一挑战,数据扩增成为一种可行的解决方案,旨在消除不平衡数据集对模型的不利影响。为此,本文提出了一种用于心律失常分类的新型多模态数据增强网络(MM-DANet)。MM-DANet 由两个模块组成:基于多模态数据匹配的数据增强模块和多模态特征编码模块。在基于多模态数据匹配的数据扩增模块中,我们扩充了代表性不足的心律失常类别,使其与最大类别的规模相匹配。随后,多模态特征编码模块采用卷积神经网络(CNN)从信号和图像中提取特定模态特征,并将其串联起来,以实现高效准确的分类。MM-DANet 在 MIT-BIH 心律失常数据库上进行了评估,准确率达到 98.83%,平均特异性为 98.87%,平均灵敏度为 92.92%,平均精确度为 91.05%,平均 F1_score 为 91.96%。此外,还对圣彼得堡 INCART 心律失常数据库和 MIT-BIH 室上性心律失常数据库进行了性能评估,其 AUC 值分别为 81.98% 和 90.93%。这些出色的结果不仅强调了 MM-DANet 的有效性,还表明了它在促进可靠的心律失常自动分析方面的潜力。
{"title":"Multimodality Data Augmentation Network for Arrhythmia Classification","authors":"Zhimin Xu,&nbsp;Mujun Zang,&nbsp;Tong Liu,&nbsp;Zhihao Wang,&nbsp;Shusen Zhou,&nbsp;Chanjuan Liu,&nbsp;Qingjun Wang","doi":"10.1155/2024/9954821","DOIUrl":"https://doi.org/10.1155/2024/9954821","url":null,"abstract":"<div>\u0000 <p>Arrhythmia is a prevalent cardiovascular disease, which has garnered widespread attention due to its age-related increases in mortality rates. In the analysis of arrhythmia, the electrocardiogram (ECG) plays an important role. Arrhythmia classification often suffers from a significant data imbalance issue due to the limited availability of data for certain arrhythmia categories. This imbalance problem significantly affects the classification performance of the model. To address this challenge, data augmentation emerges as a viable solution, aiming to neutralize the adverse effects of imbalanced datasets on the model. To this end, this paper proposes a novel Multimodality Data Augmentation Network (MM-DANet) for arrhythmia classification. The MM-DANet consists of two modules: the multimodality data matching-based data augmentation module and the multimodality feature encoding module. In the multimodality data matching-based data augmentation module, we expand the underrepresented arrhythmia categories to match the size of the largest category. Subsequently, the multimodality feature encoding module employs convolutional neural networks (CNN) to extract the modality-specific features from both signals and images and concatenate them for efficient and accurate classification. The MM-DANet was evaluated on the MIT-BIH Arrhythmia Database and achieving an accuracy of 98.83%, along with an average specificity of 98.87%, average sensitivity of 92.92%, average precision of 91.05%, and average <i>F</i>1_score of 91.96%. Furthermore, its performance was also assessed on the St. Petersburg INCART arrhythmia database and the MIT-BIH supraventricular arrhythmia database, yielding AUC values of 81.98% and 90.93%, respectively. These outstanding results not only underscore the effectiveness of MM-DANet but also indicate its potential for facilitating reliable automated analysis of arrhythmias.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/9954821","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141624447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fixed-Time Stable Gradient Flows for Optimal Adaptive Control of Continuous-Time Nonlinear Systems 用于连续时间非线性系统优化自适应控制的固定时间稳定梯度流
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-12 DOI: 10.1155/2024/5241035
Mahdi Niroomand, Reihaneh Kardehi Moghaddam, Hamidreza Modares, Mohammad-Bagher Naghibi Sistani

This paper introduces an inclusive class of fixed-time stable continuous-time gradient flows (GFs). This class of GFs is then leveraged to learn optimal control solutions for nonlinear systems in fixed time. It is shown that the presented GF guarantees convergence within a fixed time from any initial condition to the exact minimum of functions that satisfy the Polyak–Łojasiewicz (PL) inequality. The presented fixed-time GF is then utilized to design fixed-time optimal adaptive control algorithms. To this end, a fixed-time reinforcement learning (RL) algorithm is developed on the basis of a single network adaptive critic (SNAC) to learn the solution to an infinite-horizon optimal control problem in a fixed-time convergent, online, adaptive, and forward-in-time manner. It is shown that the PL inequality in the presented RL algorithm amounts to a mild inequality condition on a few collected samples. This condition is much weaker than the standard persistence of excitation (PE) and finite duration PE that relies on a rank condition of a dataset. This is crucial for learning-enabled control systems as control systems can commit to learning an optimal controller from the beginning, in sharp contrast to existing results that rely on the PE and rank condition, and can only commit to learning after rich data samples are collected. Simulation results are provided to validate the performance and efficacy of the presented fixed-time RL algorithm.

本文介绍了一类固定时间稳定连续时间梯度流(GFs)。然后利用该类梯度流学习固定时间内非线性系统的最优控制解。研究表明,所提出的 GF 能保证在固定时间内从任意初始条件收敛到满足 Polyak-Łojasiewicz (PL) 不等式的函数的精确最小值。提出的固定时间 GF 可用于设计固定时间最优自适应控制算法。为此,我们在单网络自适应批判者(SNAC)的基础上开发了一种固定时间强化学习(RL)算法,以固定时间收敛、在线、自适应和实时前进的方式学习无限视距最优控制问题的解。研究表明,所提出的 RL 算法中的 PL 不等式等同于少数采集样本上的温和不等式条件。这一条件比依赖于数据集等级条件的标准持续激励(PE)和有限持续时间 PE 弱得多。这对于支持学习的控制系统至关重要,因为控制系统可以从一开始就致力于学习最优控制器,这与依赖于 PE 和等级条件的现有结果形成鲜明对比,后者只能在收集到丰富的数据样本后才能致力于学习。仿真结果验证了所介绍的固定时间 RL 算法的性能和功效。
{"title":"Fixed-Time Stable Gradient Flows for Optimal Adaptive Control of Continuous-Time Nonlinear Systems","authors":"Mahdi Niroomand,&nbsp;Reihaneh Kardehi Moghaddam,&nbsp;Hamidreza Modares,&nbsp;Mohammad-Bagher Naghibi Sistani","doi":"10.1155/2024/5241035","DOIUrl":"https://doi.org/10.1155/2024/5241035","url":null,"abstract":"<div>\u0000 <p>This paper introduces an inclusive class of fixed-time stable continuous-time gradient flows (GFs). This class of GFs is then leveraged to learn optimal control solutions for nonlinear systems in fixed time. It is shown that the presented GF guarantees convergence within a fixed time from any initial condition to the exact minimum of functions that satisfy the Polyak–Łojasiewicz (PL) inequality. The presented fixed-time GF is then utilized to design fixed-time optimal adaptive control algorithms. To this end, a fixed-time reinforcement learning (RL) algorithm is developed on the basis of a single network adaptive critic (SNAC) to learn the solution to an infinite-horizon optimal control problem in a fixed-time convergent, online, adaptive, and forward-in-time manner. It is shown that the PL inequality in the presented RL algorithm amounts to a mild inequality condition on a few collected samples. This condition is much weaker than the standard persistence of excitation (PE) and finite duration PE that relies on a rank condition of a dataset. This is crucial for learning-enabled control systems as control systems can commit to learning an optimal controller from the beginning, in sharp contrast to existing results that rely on the PE and rank condition, and can only commit to learning after rich data samples are collected. Simulation results are provided to validate the performance and efficacy of the presented fixed-time RL algorithm.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/5241035","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141608007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Attention Module for Image Recognition Systems in Autonomous Driving 自动驾驶图像识别系统的自适应注意力模块
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-12 DOI: 10.1155/2024/3934270
Ma Xianghua, Hu Kaitao, Sun Xiangyu, Shining Chen

Lightweight, high-performance networks are important in vision perception systems. Recent research on convolutional neural networks has shown that attention mechanisms can significantly improve the network performance. However, existing approaches either ignore the significance of using both types of attention mechanisms (channel and space) simultaneously or increase the model complexity. In this study, we propose the adaptive attention module (AAM), which is a truly lightweight yet effective module that comprises channel and spatial submodules to balance model performance and complexity. The AAM initially utilizes the channel submodule to generate intermediate channel-refined features. In this module, an adaptive mechanism enables the model to autonomously learn the weights between features extracted by global max pooling and global average pooling to adapt to different stages of the model, thus enhancing performance. The spatial submodule employs a group-interact-aggregate strategy to enhance the expression of important features. It groups the intermediate channel-refined features along the channel dimension into multiple subfeatures for parallel processing and generates spatial attention feature descriptors and channelwise refined subfeatures for each subfeature; subsequently, it aggregates all the refined subfeatures and employs a “channel shuffle” operator to transfer information between different subfeatures, thereby generating the final refined features and adaptively emphasizing important regions. Additionally, AAM is a plug-and-play architectural unit that can be directly used to replace standard convolutions in various convolutional neural networks. Extensive tests on CIFAR-100, ImageNet-1k, BDD100K, and MS COCO demonstrate that AAM improves the baseline network performance under various models and tasks, thereby validating its versatility.

轻量级、高性能的网络在视觉感知系统中非常重要。最近对卷积神经网络的研究表明,注意力机制能显著提高网络性能。然而,现有的方法要么忽略了同时使用两种注意机制(通道和空间)的重要性,要么增加了模型的复杂性。在本研究中,我们提出了自适应注意力模块(AAM),这是一个真正轻量级但有效的模块,由通道和空间子模块组成,以平衡模型性能和复杂性。AAM 最初利用信道子模块生成中间信道提炼特征。在该模块中,自适应机制使模型能够自主学习全局最大池化和全局平均池化提取的特征之间的权重,以适应模型的不同阶段,从而提高性能。空间子模块采用分组-交互-聚合策略来增强重要特征的表达。它将沿通道维度的中间通道细化特征分组为多个子特征进行并行处理,并为每个子特征生成空间注意力特征描述符和通道细化子特征;随后,它汇总所有细化子特征,并采用 "通道洗牌 "算子在不同子特征之间传递信息,从而生成最终细化特征,并自适应地强调重要区域。此外,AAM 还是一个即插即用的架构单元,可直接用于替代各种卷积神经网络中的标准卷积。在 CIFAR-100、ImageNet-1k、BDD100K 和 MS COCO 上进行的广泛测试表明,在各种模型和任务下,AAM 都能提高基线网络的性能,从而验证了它的多功能性。
{"title":"Adaptive Attention Module for Image Recognition Systems in Autonomous Driving","authors":"Ma Xianghua,&nbsp;Hu Kaitao,&nbsp;Sun Xiangyu,&nbsp;Shining Chen","doi":"10.1155/2024/3934270","DOIUrl":"https://doi.org/10.1155/2024/3934270","url":null,"abstract":"<div>\u0000 <p>Lightweight, high-performance networks are important in vision perception systems. Recent research on convolutional neural networks has shown that attention mechanisms can significantly improve the network performance. However, existing approaches either ignore the significance of using both types of attention mechanisms (channel and space) simultaneously or increase the model complexity. In this study, we propose the adaptive attention module (AAM), which is a truly lightweight yet effective module that comprises channel and spatial submodules to balance model performance and complexity. The AAM initially utilizes the channel submodule to generate intermediate channel-refined features. In this module, an adaptive mechanism enables the model to autonomously learn the weights between features extracted by global max pooling and global average pooling to adapt to different stages of the model, thus enhancing performance. The spatial submodule employs a group-interact-aggregate strategy to enhance the expression of important features. It groups the intermediate channel-refined features along the channel dimension into multiple subfeatures for parallel processing and generates spatial attention feature descriptors and channelwise refined subfeatures for each subfeature; subsequently, it aggregates all the refined subfeatures and employs a “channel shuffle” operator to transfer information between different subfeatures, thereby generating the final refined features and adaptively emphasizing important regions. Additionally, AAM is a plug-and-play architectural unit that can be directly used to replace standard convolutions in various convolutional neural networks. Extensive tests on CIFAR-100, ImageNet-1k, BDD100K, and MS COCO demonstrate that AAM improves the baseline network performance under various models and tasks, thereby validating its versatility.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/3934270","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141596998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Spatial-Temporal Attention for Video Quality Evaluation 利用时空注意力进行视频质量评估
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-12 DOI: 10.1155/2024/5514627
Biwei Chi, Ruifang Su, Xinhui Chen

With the rapid development of media, the role of video quality assessment (VQA) is becoming increasingly significant. VQA has applications in many domains. For example, in the field of remote medical diagnosis, it can enhance the quality of video communication between doctors and patients. Besides, in sports broadcasting, it can improve video clarity. Within VQA, the human visual system (HVS) is a crucial component that should be taken into consideration. Considering that attention is guided by goal-driven and top-down factors, such as anticipated locations or some attractive frames within the video, we propose a blind VQA algorithm based on spatial-temporal attention model. Specifically, we first use two pretrained convolutional networks to extract low-level static-dynamic fusion features. Then, a spatial attention-guided model is established to get more representative features of frame-level quality perception. Next, through a temporal attention-guided model, the video-level features are obtained. Finally, the features are fed into a regression model to calculate the final video quality score. The experiments conducted on seven VQA databases reach the state-of-the-art performance, demonstrating the effectiveness of our proposed method.

随着媒体的飞速发展,视频质量评估(VQA)的作用越来越重要。视频质量评估在许多领域都有应用。例如,在远程医疗诊断领域,它可以提高医生和病人之间的视频通信质量。此外,在体育转播领域,它还能提高视频清晰度。在 VQA 中,人类视觉系统(HVS)是一个需要考虑的重要组成部分。考虑到注意力是由目标驱动和自上而下的因素引导的,例如视频中的预期位置或一些有吸引力的帧,我们提出了一种基于时空注意力模型的盲 VQA 算法。具体来说,我们首先使用两个预训练的卷积网络来提取低层次的静态-动态融合特征。然后,建立一个空间注意力引导模型,以获得更具代表性的帧级质量感知特征。接着,通过时间注意力引导模型,获得视频级特征。最后,将这些特征输入回归模型,计算出最终的视频质量得分。在七个 VQA 数据库上进行的实验达到了最先进的性能,证明了我们所提方法的有效性。
{"title":"Using Spatial-Temporal Attention for Video Quality Evaluation","authors":"Biwei Chi,&nbsp;Ruifang Su,&nbsp;Xinhui Chen","doi":"10.1155/2024/5514627","DOIUrl":"https://doi.org/10.1155/2024/5514627","url":null,"abstract":"<div>\u0000 <p>With the rapid development of media, the role of video quality assessment (VQA) is becoming increasingly significant. VQA has applications in many domains. For example, in the field of remote medical diagnosis, it can enhance the quality of video communication between doctors and patients. Besides, in sports broadcasting, it can improve video clarity. Within VQA, the human visual system (HVS) is a crucial component that should be taken into consideration. Considering that attention is guided by goal-driven and top-down factors, such as anticipated locations or some attractive frames within the video, we propose a blind VQA algorithm based on spatial-temporal attention model. Specifically, we first use two pretrained convolutional networks to extract low-level static-dynamic fusion features. Then, a spatial attention-guided model is established to get more representative features of frame-level quality perception. Next, through a temporal attention-guided model, the video-level features are obtained. Finally, the features are fed into a regression model to calculate the final video quality score. The experiments conducted on seven VQA databases reach the state-of-the-art performance, demonstrating the effectiveness of our proposed method.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/5514627","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141596992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LsAc ∗-MJ: A Low-Resource Consumption Reinforcement Learning Model for Mahjong Game LsAc ∗-MJ:麻将游戏的低资源消耗强化学习模型
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-10 DOI: 10.1155/2024/4558614
Xiali Li, Zhaoqi Wang, Bo Liu, Junxue Dai

This article proposes a novel Mahjong game model, LsAc -MJ, designed to address challenges posed by data scarcity, difficulty in leveraging contextual information, and the computational resource-intensive nature of self-play zero-shot learning. The model is applied to Japanese Mahjong for experiments. LsAc -MJ employs long short-term memory (LSTM) neural networks, utilizing hidden nodes to store and propagate contextual historical information, thereby enhancing decision accuracy. Additionally, the paper introduces an optimized Advantage Actor-Critic (A2C) algorithm incorporating an experience replay mechanism to enhance the model’s decision-making capabilities and mitigate convergence difficulties arising from strong data correlations. Furthermore, the paper presents a two-stage training approach for self-play deep reinforcement learning models guided by expert knowledge, thereby improving training efficiency. Extensive ablation experiments and performance comparisons demonstrate that, in contrast to other typical deep reinforcement learning models on the RLcard platform, the LsAc -MJ model consumes lower computational and time resources, has higher training efficiency, faster average decision time, higher win-rate, and stronger decision-making ability.

本文提出了一个新颖的麻将游戏模型LsAc ∗-MJ,旨在解决数据稀缺、难以利用上下文信息以及自娱自乐零点学习的计算资源密集型特性所带来的挑战。该模型应用于日本麻将进行实验。LsAc ∗-MJ采用了长短时记忆(LSTM)神经网络,利用隐藏节点来存储和传播上下文历史信息,从而提高了决策的准确性。此外,本文还介绍了一种优化的优势行动者-批评者(A2C)算法,该算法结合了经验重放机制,以增强模型的决策能力,并缓解因数据关联性强而导致的收敛困难。此外,本文还提出了一种以专家知识为指导的自播放深度强化学习模型的两阶段训练方法,从而提高了训练效率。大量的消融实验和性能对比表明,与 RLcard 平台上其他典型的深度强化学习模型相比,LsAc ∗-MJ 模型消耗的计算资源和时间资源更少,训练效率更高,平均决策时间更快,胜率更高,决策能力更强。
{"title":"LsAc ∗-MJ: A Low-Resource Consumption Reinforcement Learning Model for Mahjong Game","authors":"Xiali Li,&nbsp;Zhaoqi Wang,&nbsp;Bo Liu,&nbsp;Junxue Dai","doi":"10.1155/2024/4558614","DOIUrl":"https://doi.org/10.1155/2024/4558614","url":null,"abstract":"<div>\u0000 <p>This article proposes a novel Mahjong game model, LsAc <sup>∗</sup>-MJ, designed to address challenges posed by data scarcity, difficulty in leveraging contextual information, and the computational resource-intensive nature of self-play zero-shot learning. The model is applied to Japanese Mahjong for experiments. LsAc <sup>∗</sup>-MJ employs long short-term memory (LSTM) neural networks, utilizing hidden nodes to store and propagate contextual historical information, thereby enhancing decision accuracy. Additionally, the paper introduces an optimized Advantage Actor-Critic (A2C) algorithm incorporating an experience replay mechanism to enhance the model’s decision-making capabilities and mitigate convergence difficulties arising from strong data correlations. Furthermore, the paper presents a two-stage training approach for self-play deep reinforcement learning models guided by expert knowledge, thereby improving training efficiency. Extensive ablation experiments and performance comparisons demonstrate that, in contrast to other typical deep reinforcement learning models on the RLcard platform, the LsAc <sup>∗</sup>-MJ model consumes lower computational and time resources, has higher training efficiency, faster average decision time, higher win-rate, and stronger decision-making ability.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/4558614","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141584087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Reinforcement Learning-Based Multireconfigurable Intelligent Surface for MEC Offloading 基于深度强化学习的多可配置智能表面,用于 MEC 卸载
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-09 DOI: 10.1155/2024/2960447
Long Qu, An Huang, Junqi Pan, Cheng Dai, Sahil Garg, Mohammad Mehedi Hassan

Computational offloading in mobile edge computing (MEC) systems provides an efficient solution for resource-intensive applications on devices. However, the frequent communication between devices and edge servers increases the traffic within the network, thereby hindering significant improvements in latency. Furthermore, the benefits of MEC cannot be fully realized when the communication link utilized for offloading tasks experiences severe attenuation. Fortunately, reconfigurable intelligent surfaces (RISs) can mitigate propagation-induced impairments by adjusting the phase shifts imposed on the incident signals using their passive reflecting elements. This paper investigates the performance gains achieved by deploying multiple RISs in MEC systems under energy-constrained conditions to minimize the overall system latency. Considering the high coupling among variables such as the selection of multiple RISs, optimization of their phase shifts, transmit power, and MEC offloading volume, the problem is formulated as a nonconvex problem. We propose two approaches to address this problem. First, we employ an alternating optimization approach based on semidefinite relaxation (AO-SDR) to decompose the original problem into two subproblems, enabling the alternating optimization of multi-RIS communication and MEC offloading volume. Second, due to its capability to model and learn the optimal phase adjustment strategies adaptively in dynamic and uncertain environments, deep reinforcement learning (DRL) offers a promising approach to enhance the performance of phase optimization strategies. We leverage DRL to address the joint design of MEC-offloading volume and multi-RIS communication. Extensive simulations and numerical analysis results demonstrate that compared to conventional MEC systems without RIS assistance, the multi-RIS-assisted schemes based on the AO-SDR and DRL methods achieve a reduction in latency by 23.5% and 29.6%, respectively.

移动边缘计算(MEC)系统中的计算卸载为设备上的资源密集型应用提供了有效的解决方案。然而,设备与边缘服务器之间的频繁通信增加了网络内的流量,从而阻碍了延迟的显著改善。此外,当用于卸载任务的通信链路出现严重衰减时,MEC 的优势就无法充分体现。幸运的是,可重构智能表面(RIS)可以利用其无源反射元件调整施加在入射信号上的相移,从而减轻传播引起的衰减。本文研究了在能源受限条件下,通过在 MEC 系统中部署多个 RIS 来实现性能提升,从而最大限度地减少整个系统的延迟。考虑到多个 RIS 的选择、相移优化、发射功率和 MEC 卸载量等变量之间的高度耦合,该问题被表述为一个非凸问题。我们提出了两种方法来解决这一问题。首先,我们采用基于半定量松弛的交替优化方法(AO-SDR),将原始问题分解为两个子问题,从而实现多 RIS 通信和 MEC 卸载量的交替优化。其次,由于深度强化学习(DRL)能够在动态和不确定的环境中建模并自适应地学习最优相位调整策略,它为提高相位优化策略的性能提供了一种前景广阔的方法。我们利用 DRL 解决了 MEC 卸载量和多 RIS 通信的联合设计问题。广泛的仿真和数值分析结果表明,与没有 RIS 辅助的传统 MEC 系统相比,基于 AO-SDR 和 DRL 方法的多 RIS 辅助方案的延迟时间分别缩短了 23.5% 和 29.6%。
{"title":"Deep Reinforcement Learning-Based Multireconfigurable Intelligent Surface for MEC Offloading","authors":"Long Qu,&nbsp;An Huang,&nbsp;Junqi Pan,&nbsp;Cheng Dai,&nbsp;Sahil Garg,&nbsp;Mohammad Mehedi Hassan","doi":"10.1155/2024/2960447","DOIUrl":"https://doi.org/10.1155/2024/2960447","url":null,"abstract":"<div>\u0000 <p>Computational offloading in mobile edge computing (MEC) systems provides an efficient solution for resource-intensive applications on devices. However, the frequent communication between devices and edge servers increases the traffic within the network, thereby hindering significant improvements in latency. Furthermore, the benefits of MEC cannot be fully realized when the communication link utilized for offloading tasks experiences severe attenuation. Fortunately, reconfigurable intelligent surfaces (RISs) can mitigate propagation-induced impairments by adjusting the phase shifts imposed on the incident signals using their passive reflecting elements. This paper investigates the performance gains achieved by deploying multiple RISs in MEC systems under energy-constrained conditions to minimize the overall system latency. Considering the high coupling among variables such as the selection of multiple RISs, optimization of their phase shifts, transmit power, and MEC offloading volume, the problem is formulated as a nonconvex problem. We propose two approaches to address this problem. First, we employ an alternating optimization approach based on semidefinite relaxation (AO-SDR) to decompose the original problem into two subproblems, enabling the alternating optimization of multi-RIS communication and MEC offloading volume. Second, due to its capability to model and learn the optimal phase adjustment strategies adaptively in dynamic and uncertain environments, deep reinforcement learning (DRL) offers a promising approach to enhance the performance of phase optimization strategies. We leverage DRL to address the joint design of MEC-offloading volume and multi-RIS communication. Extensive simulations and numerical analysis results demonstrate that compared to conventional MEC systems without RIS assistance, the multi-RIS-assisted schemes based on the AO-SDR and DRL methods achieve a reduction in latency by 23.5% and 29.6%, respectively.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/2960447","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141583721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Long Short-Term Memory Learning Strategy for Object Tracking 用于物体跟踪的新型长短期记忆学习策略
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-05 DOI: 10.1155/2024/6632242
Qian Wang, Jian Yang, Hong Song

In this paper, a novel integrated long short-term memory (LSTM) network and dynamic update model are proposed for long-term object tracking in video images. The LSTM network tracking method is introduced to improve the effect of tracking failure caused by target occlusion. Stable tracking of the target is achieved using the LSTM method to predict the motion trajectory of the target when it is occluded and dynamically updating the tracking template. First, in target tracking, global average peak-to-correlation energy (GAPCE) is used to determine whether the tracking target is blocked or temporarily disappearing such that the follow-up response tracking strategy can be adjusted accordingly. Second, the data with target motion characteristics are utilized to train the designed LSTM model to obtain an offline model, which effectively predicts the motion trajectory during the period when the target is occluded or has disappeared. Therefore, it can be captured again when the target reappears. Finally, in the dynamic template adjustment stage, the historical information of the target movement is combined, and the corresponding value of the current target is compared with the historical response value to realize the dynamic adjustment of the target tracking template. Compared with the current mainstream efficient convolution operators, namely, the E.T.Track, ToMP, KeepTrack, and RTS algorithms, on the OTB100 and LaSOT datasets, the proposed algorithm increases the distance precision by 9.9% when the distance threshold is 5 pixels, increases the overlap success rate by 0.94% when the overlap threshold is 0.75, and decreases the center location error by 18.9%. The proposed method has higher tracking accuracy and robustness and is more suitable for long-term tracking of targets in actual scenarios than are the main approaches.

本文提出了一种新颖的集成长短期记忆(LSTM)网络和动态更新模型,用于视频图像中的长期目标跟踪。本文引入了 LSTM 网络跟踪方法,以改善目标遮挡导致的跟踪失败的影响。利用 LSTM 方法预测目标被遮挡时的运动轨迹,并动态更新跟踪模板,从而实现对目标的稳定跟踪。首先,在目标跟踪中,利用全局平均峰值相关能量(GAPCE)来判断跟踪目标是否受阻或暂时消失,从而相应地调整后续响应跟踪策略。其次,利用具有目标运动特征的数据来训练所设计的 LSTM 模型,从而获得离线模型,该模型可有效预测目标被遮挡或消失期间的运动轨迹。因此,当目标再次出现时,可以再次进行捕捉。最后,在动态模板调整阶段,结合目标运动的历史信息,将当前目标的对应值与历史响应值进行比较,实现目标跟踪模板的动态调整。在 OTB100 和 LaSOT 数据集上,与目前主流的高效卷积算子,即 E.T.Track、ToMP、KeepTrack 和 RTS 算法相比,当距离阈值为 5 像素时,所提算法的距离精度提高了 9.9%;当重叠阈值为 0.75 时,重叠成功率提高了 0.94%;中心位置误差降低了 18.9%。与其他主要方法相比,本文提出的方法具有更高的跟踪精度和鲁棒性,更适合在实际场景中对目标进行长期跟踪。
{"title":"A Novel Long Short-Term Memory Learning Strategy for Object Tracking","authors":"Qian Wang,&nbsp;Jian Yang,&nbsp;Hong Song","doi":"10.1155/2024/6632242","DOIUrl":"https://doi.org/10.1155/2024/6632242","url":null,"abstract":"<div>\u0000 <p>In this paper, a novel integrated long short-term memory (LSTM) network and dynamic update model are proposed for long-term object tracking in video images. The LSTM network tracking method is introduced to improve the effect of tracking failure caused by target occlusion. Stable tracking of the target is achieved using the LSTM method to predict the motion trajectory of the target when it is occluded and dynamically updating the tracking template. First, in target tracking, global average peak-to-correlation energy (GAPCE) is used to determine whether the tracking target is blocked or temporarily disappearing such that the follow-up response tracking strategy can be adjusted accordingly. Second, the data with target motion characteristics are utilized to train the designed LSTM model to obtain an offline model, which effectively predicts the motion trajectory during the period when the target is occluded or has disappeared. Therefore, it can be captured again when the target reappears. Finally, in the dynamic template adjustment stage, the historical information of the target movement is combined, and the corresponding value of the current target is compared with the historical response value to realize the dynamic adjustment of the target tracking template. Compared with the current mainstream efficient convolution operators, namely, the E.T.Track, ToMP, KeepTrack, and RTS algorithms, on the OTB100 and LaSOT datasets, the proposed algorithm increases the distance precision by 9.9% when the distance threshold is 5 pixels, increases the overlap success rate by 0.94% when the overlap threshold is 0.75, and decreases the center location error by 18.9%. The proposed method has higher tracking accuracy and robustness and is more suitable for long-term tracking of targets in actual scenarios than are the main approaches.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/6632242","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141536546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BrainNet: Precision Brain Tumor Classification with Optimized EfficientNet Architecture BrainNet:利用优化的 EfficientNet 架构进行精确脑肿瘤分类
IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-06-30 DOI: 10.1155/2024/3583612
Md Manowarul Islam, Md. Alamin Talukder, Md Ashraf Uddin, Arnisha Akhter, Majdi Khalid

Brain tumors significantly impact human health due to their complexity and the challenges in early detection and treatment. Accurate diagnosis is crucial for effective intervention, but existing methods often suffer from limitations in accuracy and efficiency. To address these challenges, this study presents a novel deep learning (DL) approach utilizing the EfficientNet family for enhanced brain tumor classification and detection. Leveraging a comprehensive dataset of 3064 T1-weighted CE MRI images, our methodology incorporates advanced preprocessing and augmentation techniques to optimize model performance. The experiments demonstrate that EfficientNetB(07) achieved 99.14%, 98.76%, 99.07%, 99.69%, 99.07%, 98.76%, 98.76%, and 99.07% accuracy, respectively. The pinnacle of our research is the EfficientNetB3 model, which demonstrated exceptional performance with an accuracy rate of 99.69%. This performance surpasses many existing state-of-the-art (SOTA) techniques, underscoring the efficacy of our approach. The precision of our high-accuracy DL model promises to improve diagnostic reliability and speed in clinical settings, facilitating earlier and more effective treatment strategies. Our findings suggest significant potential for improving patient outcomes in brain tumor diagnosis.

脑肿瘤因其复杂性以及早期检测和治疗方面的挑战而严重影响人类健康。准确诊断对于有效干预至关重要,但现有方法在准确性和效率方面往往存在局限性。为了应对这些挑战,本研究提出了一种利用 EfficientNet 系列的新型深度学习(DL)方法,用于增强脑肿瘤分类和检测。利用由 3064 张 T1 加权 CE MRI 图像组成的综合数据集,我们的方法采用了先进的预处理和增强技术来优化模型性能。实验证明,EfficientNetB(07)的准确率分别达到了99.14%、98.76%、99.07%、99.69%、99.07%、98.76%、98.76%和99.07%。我们研究的巅峰之作是 EfficientNetB3 模型,它的准确率高达 99.69%,表现出了卓越的性能。这一性能超越了许多现有的最先进(SOTA)技术,彰显了我们方法的功效。我们高精度 DL 模型的精确性有望提高临床诊断的可靠性和速度,从而促进更早、更有效的治疗策略。我们的研究结果表明,在脑肿瘤诊断方面,我们具有改善患者预后的巨大潜力。
{"title":"BrainNet: Precision Brain Tumor Classification with Optimized EfficientNet Architecture","authors":"Md Manowarul Islam,&nbsp;Md. Alamin Talukder,&nbsp;Md Ashraf Uddin,&nbsp;Arnisha Akhter,&nbsp;Majdi Khalid","doi":"10.1155/2024/3583612","DOIUrl":"https://doi.org/10.1155/2024/3583612","url":null,"abstract":"<div>\u0000 <p>Brain tumors significantly impact human health due to their complexity and the challenges in early detection and treatment. Accurate diagnosis is crucial for effective intervention, but existing methods often suffer from limitations in accuracy and efficiency. To address these challenges, this study presents a novel deep learning (DL) approach utilizing the EfficientNet family for enhanced brain tumor classification and detection. Leveraging a comprehensive dataset of 3064 T1-weighted CE MRI images, our methodology incorporates advanced preprocessing and augmentation techniques to optimize model performance. The experiments demonstrate that EfficientNetB(07) achieved 99.14%, 98.76%, 99.07%, 99.69%, 99.07%, 98.76%, 98.76%, and 99.07% accuracy, respectively. The pinnacle of our research is the EfficientNetB3 model, which demonstrated exceptional performance with an accuracy rate of 99.69%. This performance surpasses many existing state-of-the-art (SOTA) techniques, underscoring the efficacy of our approach. The precision of our high-accuracy DL model promises to improve diagnostic reliability and speed in clinical settings, facilitating earlier and more effective treatment strategies. Our findings suggest significant potential for improving patient outcomes in brain tumor diagnosis.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/3583612","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141488330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1