首页 > 最新文献

International Journal of Intelligent Systems最新文献

英文 中文
Semisupervised Vector Quantization in Visual SLAM Using HGCN 使用 HGCN 在视觉 SLAM 中进行半监督矢量量化
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-29 DOI: 10.1155/2024/9992159
Amir Zarringhalam, Saeid Shiry Ghidary, Ali Mohades, Seyed-Ali Sadegh-Zadeh

We present a novel vector quantization (VQ) module for the two state-of-the-art long-range simultaneous localization and mapping (SLAM) algorithms. The VQ task in SLAM is generally performed using unsupervised methods. We provide an alternative approach trough embedding a semisupervised hyperbolic graph convolutional neural network (HGCN) in the VQ step of the SLAM processes. The SLAM platforms we have utilized for this purpose are fast appearance-based mapping (FABMAP) and oriented fast and rotated short (ORB), both of which rely on extracting the features of the captured images in their loop closure detection (LCD) module. For the first time, we have considered the space formed by these SURF features, robust image descriptors, as a graph, enabling us to apply an HGCN in the VQ section which results in an improved LCD performance. The HGCN vector quantizes the SURF feature space, leading to a bag-of-word (BoW) representation construction of the images. This representation is subsequently used to determine LCD accuracy and recall. Our approaches in this study are referred to as HGCN-FABMAP and HGCN-ORB. The main advantage of using HGCN in the LCD section is that it scales linearly when the features are accumulated. The benchmarking experiments show the superiority of our methods in terms of both trajectory generation accuracy in small-scale paths and LCD accuracy and recall for large-scale problems.

我们为两种最先进的远距离同步定位与映射(SLAM)算法提出了一种新颖的矢量量化(VQ)模块。SLAM 中的向量量化任务通常使用无监督方法执行。我们提供了一种替代方法,即在 SLAM 过程的 VQ 步骤中嵌入一个半监督双曲图卷积神经网络(HGCN)。我们为此使用的 SLAM 平台是基于外观的快速映射(FABMAP)和定向快速旋转短映射(ORB),这两种方法都依赖于在其闭环检测(LCD)模块中提取捕获图像的特征。我们首次将这些 SURF 特征(稳健的图像描述符)形成的空间视为图形,从而在 VQ 部分应用了 HGCN,提高了 LCD 性能。HGCN 向量对 SURF 特征空间进行量化,从而构建出图像的词袋(BoW)表示法。该表示随后用于确定 LCD 的准确性和召回率。本研究中的方法被称为 HGCN-FABMAP 和 HGCN-ORB。在 LCD 部分使用 HGCN 的主要优势在于,当特征积累到一定程度时,它可以线性扩展。基准实验表明,我们的方法在小规模路径的轨迹生成准确性和大规模问题的 LCD 准确性和召回率方面都更胜一筹。
{"title":"Semisupervised Vector Quantization in Visual SLAM Using HGCN","authors":"Amir Zarringhalam,&nbsp;Saeid Shiry Ghidary,&nbsp;Ali Mohades,&nbsp;Seyed-Ali Sadegh-Zadeh","doi":"10.1155/2024/9992159","DOIUrl":"https://doi.org/10.1155/2024/9992159","url":null,"abstract":"<p>We present a novel vector quantization (VQ) module for the two state-of-the-art long-range simultaneous localization and mapping (SLAM) algorithms. The VQ task in SLAM is generally performed using unsupervised methods. We provide an alternative approach trough embedding a semisupervised hyperbolic graph convolutional neural network (HGCN) in the VQ step of the SLAM processes. The SLAM platforms we have utilized for this purpose are fast appearance-based mapping (FABMAP) and oriented fast and rotated short (ORB), both of which rely on extracting the features of the captured images in their loop closure detection (LCD) module. For the first time, we have considered the space formed by these SURF features, robust image descriptors, as a graph, enabling us to apply an HGCN in the VQ section which results in an improved LCD performance. The HGCN vector quantizes the SURF feature space, leading to a bag-of-word (BoW) representation construction of the images. This representation is subsequently used to determine LCD accuracy and recall. Our approaches in this study are referred to as HGCN-FABMAP and HGCN-ORB. The main advantage of using HGCN in the LCD section is that it scales linearly when the features are accumulated. The benchmarking experiments show the superiority of our methods in terms of both trajectory generation accuracy in small-scale paths and LCD accuracy and recall for large-scale problems.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141165002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
T-SPP: Improving GNSS Single-Point Positioning Performance Using Transformer-Based Correction T-SPP:利用变压器校正提高全球导航卫星系统单点定位性能
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-29 DOI: 10.1155/2024/6643723
Fan Wu, Liangrui Wei, Haiyong Luo, Fang Zhao, Xin Ma, Bokun Ning

GNSS (global navigation satellite systems) technology enables high-precision single-point positioning (SPP) in open environments. However, the accuracy of GNSS positioning is significantly compromised in complex urban canyons due to signal obstructions and non-line-of-sight propagation errors. To address this challenge, we propose a GNSS displacement estimation algorithm. This method learns nonlinear dependencies between GNSS raw measurements and corresponding position changes, capturing dynamic and layered features in GNSS measurement data for displacement estimation. We introduce a denoising auto-encoder (DAE) to preprocess raw GNSS observations, reducing the impact of noise. The model simultaneously outputs estimated displacement and model confidence. The fusion process dynamically combines positioning results from the SPP algorithm and the D-Tran model, adaptively blending them to achieve accurate and optimal positioning estimation. This approach optimizes the accuracy of estimated positioning results while maintaining confidence in the estimation. Experimental results show a 61% reduction in root mean square error (RMSE) and 100% availability in urban canyon environments compared to traditional single-point positioning techniques.

全球导航卫星系统(GNSS)技术可在开放环境中实现高精度单点定位(SPP)。然而,在复杂的城市峡谷中,由于信号障碍和非视线传播误差,GNSS 定位的精确度大打折扣。为了应对这一挑战,我们提出了一种 GNSS 位移估计算法。该方法学习 GNSS 原始测量数据与相应位置变化之间的非线性依赖关系,捕捉 GNSS 测量数据中的动态和分层特征,从而进行位移估计。我们引入了一个去噪自动编码器(DAE)来预处理原始 GNSS 观测数据,以减少噪声的影响。模型同时输出估计位移和模型置信度。融合过程动态结合 SPP 算法和 D-Tran 模型的定位结果,自适应地将它们融合在一起,以实现精确和最优的定位估算。这种方法既能优化定位估算结果的准确性,又能保持估算结果的可信度。实验结果表明,与传统的单点定位技术相比,该技术在城市峡谷环境中的均方根误差(RMSE)降低了 61%,可用性提高了 100%。
{"title":"T-SPP: Improving GNSS Single-Point Positioning Performance Using Transformer-Based Correction","authors":"Fan Wu,&nbsp;Liangrui Wei,&nbsp;Haiyong Luo,&nbsp;Fang Zhao,&nbsp;Xin Ma,&nbsp;Bokun Ning","doi":"10.1155/2024/6643723","DOIUrl":"10.1155/2024/6643723","url":null,"abstract":"<p>GNSS (global navigation satellite systems) technology enables high-precision single-point positioning (SPP) in open environments. However, the accuracy of GNSS positioning is significantly compromised in complex urban canyons due to signal obstructions and non-line-of-sight propagation errors. To address this challenge, we propose a GNSS displacement estimation algorithm. This method learns nonlinear dependencies between GNSS raw measurements and corresponding position changes, capturing dynamic and layered features in GNSS measurement data for displacement estimation. We introduce a denoising auto-encoder (DAE) to preprocess raw GNSS observations, reducing the impact of noise. The model simultaneously outputs estimated displacement and model confidence. The fusion process dynamically combines positioning results from the SPP algorithm and the D-Tran model, adaptively blending them to achieve accurate and optimal positioning estimation. This approach optimizes the accuracy of estimated positioning results while maintaining confidence in the estimation. Experimental results show a 61% reduction in root mean square error (RMSE) and 100% availability in urban canyon environments compared to traditional single-point positioning techniques.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140416063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Data-Driven Method and Hybrid Deep Learning Model for Flood Risk Prediction 洪水风险预测的数据驱动方法和混合深度学习模型
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-27 DOI: 10.1155/2024/3562709
Chenmin Ni, Pei Shan Fam, Muhammad Fadhil Marsani

Flood disasters occur worldwide, and flood risk prediction is conducive to protecting human life and property safety. Influenced by topographic changes and rainfall, the water level fluctuates randomly and violently during the flood, introducing many noises and directly increasing the difficulty of flood prediction. A data-driven flood forecasting method is proposed based on data preprocessing and a two-layer BiLSTM-Attention network to improve forecast accuracy. First, the Variational Mode Decomposition (VMD) is used to decompose the data for reducing noise and produce suitable Intrinsic Mode Functions (IMFs); Then, an optimized two-layer attention-based Bidirectional Long Sshort-Term memory (BiLSTM-Attention) network is constructed to predict each IMF. Finally, two optimization algorithms are used to obtain the optimized parameters of VMD and BiLSTM intelligently, increasing the self-adaptability. The inertia factor of particle swarm optimization is improved and then used to optimize the five hyperparameters of BiLSTM. The proposed model reduces storage errors for smaller training sets and can achieve good performance. Three water level data sets from the Yangtze River in China are used for comparative experiments. Numerical results show that the peak height absolute error is within 2 cm, and the relative error of peak time arrival is within 30%. Compared with LSTM, BiLSTM, CNN-BiLSTM-attention, etc., the proposed model reduces the root mean square error by at least 50% and has advantages for high-risk forecasting when the water level exceeds the defense line and fluctuates prominently.

洪水灾害在全球范围内时有发生,洪水风险预测有利于保护人类生命和财产安全。受地形变化和降雨的影响,洪水过程中水位随机剧烈波动,引入了许多噪声,直接增加了洪水预报的难度。为了提高预报精度,本文提出了一种基于数据预处理和双层 BiLSTM-Attention 网络的数据驱动洪水预报方法。首先,利用变异模态分解(VMD)对数据进行分解以减少噪声,并产生合适的本征模态函数(IMF);然后,构建一个优化的基于注意力的双向长短时记忆(BiLSTM-Attention)双层网络来预测每个本征模态函数。最后,利用两种优化算法智能地获得 VMD 和 BiLSTM 的优化参数,提高自适应能力。改进粒子群优化的惯性因子后,用于优化 BiLSTM 的五个超参数。所提出的模型减少了较小训练集的存储误差,并能获得良好的性能。对比实验使用了中国长江的三个水位数据集。数值结果表明,峰值高度绝对误差在 2 厘米以内,峰值到达时间相对误差在 30% 以内。与 LSTM、BiLSTM、CNN-BiLSTM-注意力等相比,所提出的模型至少减少了 50%的均方根误差,在水位超过防线且波动剧烈的高风险预报中具有优势。
{"title":"A Data-Driven Method and Hybrid Deep Learning Model for Flood Risk Prediction","authors":"Chenmin Ni,&nbsp;Pei Shan Fam,&nbsp;Muhammad Fadhil Marsani","doi":"10.1155/2024/3562709","DOIUrl":"10.1155/2024/3562709","url":null,"abstract":"<p>Flood disasters occur worldwide, and flood risk prediction is conducive to protecting human life and property safety. Influenced by topographic changes and rainfall, the water level fluctuates randomly and violently during the flood, introducing many noises and directly increasing the difficulty of flood prediction. A data-driven flood forecasting method is proposed based on data preprocessing and a two-layer BiLSTM-Attention network to improve forecast accuracy. First, the Variational Mode Decomposition (VMD) is used to decompose the data for reducing noise and produce suitable Intrinsic Mode Functions (IMFs); Then, an optimized two-layer attention-based Bidirectional Long Sshort-Term memory (BiLSTM-Attention) network is constructed to predict each IMF. Finally, two optimization algorithms are used to obtain the optimized parameters of VMD and BiLSTM intelligently, increasing the self-adaptability. The inertia factor of particle swarm optimization is improved and then used to optimize the five hyperparameters of BiLSTM. The proposed model reduces storage errors for smaller training sets and can achieve good performance. Three water level data sets from the Yangtze River in China are used for comparative experiments. Numerical results show that the peak height absolute error is within 2 cm, and the relative error of peak time arrival is within 30%. Compared with LSTM, BiLSTM, CNN-BiLSTM-attention, etc., the proposed model reduces the root mean square error by at least 50% and has advantages for high-risk forecasting when the water level exceeds the defense line and fluctuates prominently.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140425827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subseasonal Prediction of Summer Temperature in West Africa Using Artificial Intelligence: A Case Study of Senegal 利用人工智能对西非夏季气温进行分季节预测:塞内加尔案例研究
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-27 DOI: 10.1155/2024/8869267
Annine Duclaire Kenne, Mory Toure, Lema Logamou Seknewna, Herve Landry Ketsemen

Despite the rapid growth of machine learning (ML) and its far-reaching applications in various fields such as healthcare, finance, and urban heat management, there are still some unresolved challenges in the field of climate change. Reliable subseasonal forecasts of summer temperatures would be a great benefit to society. Although numerical weather prediction (NWP) models are better at capturing relevant sources of predictability, such as temperatures, land, and sea surface conditions, the subseasonal potential is not fully exploited. One such challenge is accurate subseasonal temperature forecasting using cutting-edge ML technology. This study aims to assess and predict the changes in subseasonal temperature during the summer season (from March to June) in Senegal on 2-weeks time scales. Six ML techniques, including linear regression (LR), decision tree (DT), support vector machine (SVM), artificial neural network (ANN), long short-term memory (LSTM), and gated recurrent units (GRU), are used. The experiments utilize a multivariate approach by incorporating variables of the ERA-5 dataset from 1981 to 2022. The results compared all the performances of the methods to assess their overall effectiveness in forecasting air temperature (t2m) values over 2 weeks. Our analysis demonstrates that the GRU model outperforms the other ML models, achieving a Nash–Sutcliffe efficiency (NSE) score of 74.68% and a mean absolute percentage error (MAPE) of 2.51%. The GRU model effectively captures long-term dependencies and exhibits superior performance in temperature forecasting. Furthermore, a comparison between the observed and predicted values confirms the accuracy of the GRU model in aligning with actual temperature trends. Overall, this study contributes an impactful deep learning model to the field of subseasonal temperature forecasting in West Africa (Senegal), which offers local authorities the capability to anticipate climatic events and enact preventive measures accordingly.

尽管机器学习(ML)发展迅速,并在医疗保健、金融和城市供热管理等多个领域得到了广泛应用,但在气候变化领域仍存在一些尚未解决的难题。对夏季气温进行可靠的分季节预报将给社会带来极大的好处。尽管数值天气预报(NWP)模式能更好地捕捉相关的可预测性来源,如气温、陆地和海洋表面条件,但其亚季节潜力并未得到充分利用。其中一个挑战就是利用最先进的 ML 技术准确预报分季节气温。本研究旨在评估和预测塞内加尔夏季(3 月至 6 月)2 周时间尺度上的分季节气温变化。研究采用了六种 ML 技术,包括线性回归 (LR)、决策树 (DT)、支持向量机 (SVM)、人工神经网络 (ANN)、长短期记忆 (LSTM) 和门控递归单元 (GRU)。实验采用多元方法,纳入了 1981 年至 2022 年 ERA-5 数据集的变量。结果比较了所有方法的性能,以评估它们在预报两周内气温(t2m)值方面的总体效果。我们的分析表明,GRU 模型优于其他 ML 模型,其纳什-苏特克利夫效率(NSE)为 74.68%,平均绝对百分比误差(MAPE)为 2.51%。GRU 模型有效地捕捉了长期依赖关系,在气温预测方面表现出卓越的性能。此外,观测值和预测值之间的比较也证实了 GRU 模型在与实际气温趋势保持一致方面的准确性。总之,本研究为西非(塞内加尔)的分季节气温预报领域贡献了一个有影响力的深度学习模型,为当地政府提供了预测气候事件并制定相应预防措施的能力。
{"title":"Subseasonal Prediction of Summer Temperature in West Africa Using Artificial Intelligence: A Case Study of Senegal","authors":"Annine Duclaire Kenne,&nbsp;Mory Toure,&nbsp;Lema Logamou Seknewna,&nbsp;Herve Landry Ketsemen","doi":"10.1155/2024/8869267","DOIUrl":"10.1155/2024/8869267","url":null,"abstract":"<p>Despite the rapid growth of machine learning (ML) and its far-reaching applications in various fields such as healthcare, finance, and urban heat management, there are still some unresolved challenges in the field of climate change. Reliable subseasonal forecasts of summer temperatures would be a great benefit to society. Although numerical weather prediction (NWP) models are better at capturing relevant sources of predictability, such as temperatures, land, and sea surface conditions, the subseasonal potential is not fully exploited. One such challenge is accurate subseasonal temperature forecasting using cutting-edge ML technology. This study aims to assess and predict the changes in subseasonal temperature during the summer season (from March to June) in Senegal on 2-weeks time scales. Six ML techniques, including linear regression (LR), decision tree (DT), support vector machine (SVM), artificial neural network (ANN), long short-term memory (LSTM), and gated recurrent units (GRU), are used. The experiments utilize a multivariate approach by incorporating variables of the ERA-5 dataset from 1981 to 2022. The results compared all the performances of the methods to assess their overall effectiveness in forecasting air temperature (t2m) values over 2 weeks. Our analysis demonstrates that the GRU model outperforms the other ML models, achieving a Nash–Sutcliffe efficiency (NSE) score of 74.68% and a mean absolute percentage error (MAPE) of 2.51%. The GRU model effectively captures long-term dependencies and exhibits superior performance in temperature forecasting. Furthermore, a comparison between the observed and predicted values confirms the accuracy of the GRU model in aligning with actual temperature trends. Overall, this study contributes an impactful deep learning model to the field of subseasonal temperature forecasting in West Africa (Senegal), which offers local authorities the capability to anticipate climatic events and enact preventive measures accordingly.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140427583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kriging and Radial Basis Function Models for Optimized Design of UAV Wing Fences to Reduce Rolling Moment 用于优化设计无人机机翼围栏以减少滚动力矩的克里金法和径向基函数模型
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-27 DOI: 10.1155/2024/4108121
Mohammad Hossein Moghimi Esfand-Abadi, Mohammad Hassan Djavareshkian, Afshin Madani

In the present study, the effects of the wing fence on the wing tip vortices and control surfaces located at the tip of the wing in a flying wing aircraft have been investigated using a numerical method. For the size of the fences, the average dimensions extracted from the wing tip vortices at different angles of attack are used. The basic determining parameter is the rolling torque coefficient, which is tried to be shown by employing a parametric study of the flow behavior in different situations of fence placement. These effects on the rolling torque of the aircraft are measured due to the presence of the split drag rudder control system. In this study, the fences were installed at three different heights and three different positions along the length of the wing, which were investigated at angles of attack of 7 to 16 degrees. The next stage of the research is to design the dimensions of the fence using the single-objective optimization method (a method to find the best solution for a problem with a specific goal). The designing of the fences at three points based on the dimensions of the wing tip vortex is carried out with the computational fluid dynamics (CFD) method (CFD is a computational method that uses physical laws to predict the behavior of fluids.). The aim of this research is to achieve the best design that converges to an optimal solution with minimum time and cost (CFD solution is long). However, CFD analysis requires a lot of computational time. To address this challenge, we employed a hybrid learning model comprising the radial basis function (RBF), a type of artificial neural network, and Kriging, a Gaussian process-based interpolation technique. The dataset for training the hybrid model was obtained from numerical solutions of CFD simulations involving a fence placed at various locations on the wing. Additionally, a genetic algorithm was employed as the optimization method in all instances where it was required. Using the power of machine learning techniques helped us identify the optimal placement of the fence to prevent it from being engulfed by the vortex and to optimize the utilization of the split drag system, yielding significant improvements.

本研究采用数值方法研究了机翼栅栏对飞翼飞机翼尖涡流和翼尖控制面的影响。对于栅栏的尺寸,采用了从不同攻角下的翼尖涡流中提取的平均尺寸。基本决定参数是滚动扭矩系数,通过对不同栅栏放置情况下的流动行为进行参数化研究,试图显示出这一系数。这些对飞机滚动扭矩的影响是在分拖舵控制系统存在的情况下测量的。在这项研究中,栅栏被安装在机翼长度方向上的三个不同高度和三个不同位置,研究的攻角为 7 至 16 度。研究的下一阶段是使用单目标优化法(一种为具有特定目标的问题寻找最佳解决方案的方法)设计栅栏的尺寸。根据翼尖涡流的尺寸,利用计算流体动力学(CFD)方法(CFD 是一种利用物理规律预测流体行为的计算方法)设计三点栅栏。这项研究的目的是以最少的时间和成本(CFD 解决方案耗时较长)实现收敛到最优解的最佳设计。然而,CFD 分析需要大量的计算时间。为了应对这一挑战,我们采用了一种混合学习模型,包括径向基函数(一种人工神经网络)和克里金(一种基于高斯过程的插值技术)。用于训练混合模型的数据集来自 CFD 模拟的数值解,涉及机翼上不同位置的栅栏。此外,在所有需要优化的情况下,都采用了遗传算法作为优化方法。利用机器学习技术的强大功能,我们确定了栅栏的最佳位置,以防止其被涡流吞噬,并优化了分体式阻力系统的利用,取得了显著的改进。
{"title":"Kriging and Radial Basis Function Models for Optimized Design of UAV Wing Fences to Reduce Rolling Moment","authors":"Mohammad Hossein Moghimi Esfand-Abadi,&nbsp;Mohammad Hassan Djavareshkian,&nbsp;Afshin Madani","doi":"10.1155/2024/4108121","DOIUrl":"https://doi.org/10.1155/2024/4108121","url":null,"abstract":"<p>In the present study, the effects of the wing fence on the wing tip vortices and control surfaces located at the tip of the wing in a flying wing aircraft have been investigated using a numerical method. For the size of the fences, the average dimensions extracted from the wing tip vortices at different angles of attack are used. The basic determining parameter is the rolling torque coefficient, which is tried to be shown by employing a parametric study of the flow behavior in different situations of fence placement. These effects on the rolling torque of the aircraft are measured due to the presence of the split drag rudder control system. In this study, the fences were installed at three different heights and three different positions along the length of the wing, which were investigated at angles of attack of 7 to 16 degrees. The next stage of the research is to design the dimensions of the fence using the single-objective optimization method (a method to find the best solution for a problem with a specific goal). The designing of the fences at three points based on the dimensions of the wing tip vortex is carried out with the computational fluid dynamics (CFD) method (CFD is a computational method that uses physical laws to predict the behavior of fluids.). The aim of this research is to achieve the best design that converges to an optimal solution with minimum time and cost (CFD solution is long). However, CFD analysis requires a lot of computational time. To address this challenge, we employed a hybrid learning model comprising the radial basis function (RBF), a type of artificial neural network, and Kriging, a Gaussian process-based interpolation technique. The dataset for training the hybrid model was obtained from numerical solutions of CFD simulations involving a fence placed at various locations on the wing. Additionally, a genetic algorithm was employed as the optimization method in all instances where it was required. Using the power of machine learning techniques helped us identify the optimal placement of the fence to prevent it from being engulfed by the vortex and to optimize the utilization of the split drag system, yielding significant improvements.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141164964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Deep Learning System for Detecting Cardiomegaly Disease Based on CXR Image 基于 CXR 图像检测心脏肿大疾病的深度学习系统
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-23 DOI: 10.1155/2024/8997093
Shaymaa E. Sorour, Abeer A. Wafa, Amr A. Abohany, Reda M. Hussien

The potential of technology to revolutionize healthcare is exemplified by the synergy between artificial intelligence (AI) and early detection of cardiomegaly, demonstrating the power of proactive intervention in cardiovascular health. This paper presents an innovative approach that leverages advanced AI algorithms, specifically deep learning (DL) technology, for the early detection of cardiomegaly. The methodology consists of five key steps, including data collection, image preprocessing, data augmentation, feature extraction, and classification. Utilizing chest X-ray (CXR) images from the National Institutes of Health (NIH), the study applies rigorous image preprocessing operations, including color transformation and normalization. To enhance model generalization, data augmentation is employed, paving the way for two distinct DL models, a convolutional neural network (CNN) developed from scratch and a pretrained residual network with 50 layers (ResNet50), and adapted to the problem domain. Both models are systematically evaluated with five optimizers, revealing the AdaMax optimizer’s superiority for the CNN model and AdaGrad’s efficacy for the modified ResNet50. The proposed CNN with AdaMax achieves an impressive 99.91% accuracy, outperforming recent techniques in precision, recall, and F1 − score. This research underscores the transformative potential of AI in cardiovascular health diagnostics, emphasizing the significance of timely intervention.

人工智能(AI)与心肌肥大早期检测之间的协同作用体现了技术彻底改变医疗保健的潜力,展示了主动干预心血管健康的力量。本文介绍了一种利用先进的人工智能算法(特别是深度学习(DL)技术)进行心肌肥大早期检测的创新方法。该方法由五个关键步骤组成,包括数据收集、图像预处理、数据增强、特征提取和分类。研究利用美国国立卫生研究院(NIH)的胸部 X 光(CXR)图像,进行了严格的图像预处理操作,包括颜色转换和归一化。为了增强模型的泛化能力,研究人员采用了数据增强技术,为两个不同的 DL 模型铺平了道路,一个是从头开始开发的卷积神经网络 (CNN),另一个是经过预训练的 50 层残差网络 (ResNet50),并根据问题领域进行了调整。这两种模型都通过五种优化器进行了系统评估,结果显示,AdaMax 优化器对 CNN 模型具有优势,而 AdaGrad 对修改后的 ResNet50 具有功效。使用 AdaMax 的拟议 CNN 获得了令人印象深刻的 99.91% 的准确率,在精确度、召回率和 F1 分数方面均优于最近的技术。这项研究凸显了人工智能在心血管健康诊断方面的变革潜力,强调了及时干预的重要性。
{"title":"A Deep Learning System for Detecting Cardiomegaly Disease Based on CXR Image","authors":"Shaymaa E. Sorour,&nbsp;Abeer A. Wafa,&nbsp;Amr A. Abohany,&nbsp;Reda M. Hussien","doi":"10.1155/2024/8997093","DOIUrl":"10.1155/2024/8997093","url":null,"abstract":"<p>The potential of technology to revolutionize healthcare is exemplified by the synergy between artificial intelligence (AI) and early detection of cardiomegaly, demonstrating the power of proactive intervention in cardiovascular health. This paper presents an innovative approach that leverages advanced AI algorithms, specifically deep learning (DL) technology, for the early detection of cardiomegaly. The methodology consists of five key steps, including data collection, image preprocessing, data augmentation, feature extraction, and classification. Utilizing chest X-ray (CXR) images from the National Institutes of Health (NIH), the study applies rigorous image preprocessing operations, including color transformation and normalization. To enhance model generalization, data augmentation is employed, paving the way for two distinct DL models, a convolutional neural network (CNN) developed from scratch and a pretrained residual network with 50 layers (ResNet50), and adapted to the problem domain. Both models are systematically evaluated with five optimizers, revealing the AdaMax optimizer’s superiority for the CNN model and AdaGrad’s efficacy for the modified ResNet50. The proposed CNN with AdaMax achieves an impressive 99.91% accuracy, outperforming recent techniques in precision, recall, and <i>F</i>1 − score. This research underscores the transformative potential of AI in cardiovascular health diagnostics, emphasizing the significance of timely intervention.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140435516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fast Optimal Coordination Method for Multiagent in Complex Environment 复杂环境中多机器人的快速优化协调方法
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-20 DOI: 10.1155/2024/5346187
Suyu Wang, Quan Yue, Mengyu Zhao, Huazhi Zhang, Yan Li

Facing the implementation problems such like low growth reward, long training time, and poor stability of the multiagent learning methods when dealing with complex environment and more agents, this paper proposes a fast optimal coordination method for multiagent in complex environment (FOC-MACE). Firstly, the environment exploration strategy is introduced into the policy network based on the MADDPG method for higher growth rewards. Then, the parallel computing technology is adopted in the critic network, in purpose to effectively reduce the training time. These tactics together are beneficial to enhance the stability of multiagent learning. Lastly, the optimal resource allocation is carried out to realize optimal coevolution of the multiagents and further improve the learning ability of the agents’ group. To verify the effectiveness of our proposal, the FOC-MACE is compared with several advanced methods at current stage in the MPE environment. Three different experiments prove that by using our method, the growth reward is increased by up to 37.1%, the training is speed up significantly, and the stability of the method, which represented by standardized variance, is also improved. In addition, this paper validated the fast optimal coordination method for multiagent systems in the context of UAV scenarios, demonstrating the practical performance of the approach. Through comprehensive experiments and scenario validations, the study successfully confirmed the effectiveness of the proposed fast optimal coordination method for multiagent systems in complex environments.

面对多代理学习方法在处理复杂环境和更多代理时存在的增长奖励低、训练时间长、稳定性差等实施问题,本文提出了一种复杂环境下多代理快速优化协调方法(FOC-MACE)。首先,基于 MADDPG 方法在策略网络中引入环境探索策略,以获得更高的增长奖励。然后,在批判网络中采用并行计算技术,以有效缩短训练时间。这些策略共同作用,有利于增强多代理学习的稳定性。最后,通过优化资源分配,实现多代理的最优协同进化,进一步提高代理群体的学习能力。为了验证我们建议的有效性,我们在 MPE 环境中将 FOC-MACE 与现阶段的几种先进方法进行了比较。三个不同的实验证明,通过使用我们的方法,增长奖励提高了 37.1%,训练速度明显加快,以标准化方差为代表的方法稳定性也得到了提高。此外,本文还在无人机场景中验证了多代理系统的快速最优协调方法,证明了该方法的实用性能。通过综合实验和场景验证,研究成功证实了所提出的复杂环境下多代理系统快速优化协调方法的有效性。
{"title":"A Fast Optimal Coordination Method for Multiagent in Complex Environment","authors":"Suyu Wang,&nbsp;Quan Yue,&nbsp;Mengyu Zhao,&nbsp;Huazhi Zhang,&nbsp;Yan Li","doi":"10.1155/2024/5346187","DOIUrl":"10.1155/2024/5346187","url":null,"abstract":"<p>Facing the implementation problems such like low growth reward, long training time, and poor stability of the multiagent learning methods when dealing with complex environment and more agents, this paper proposes a fast optimal coordination method for multiagent in complex environment (FOC-MACE). Firstly, the environment exploration strategy is introduced into the policy network based on the MADDPG method for higher growth rewards. Then, the parallel computing technology is adopted in the critic network, in purpose to effectively reduce the training time. These tactics together are beneficial to enhance the stability of multiagent learning. Lastly, the optimal resource allocation is carried out to realize optimal coevolution of the multiagents and further improve the learning ability of the agents’ group. To verify the effectiveness of our proposal, the FOC-MACE is compared with several advanced methods at current stage in the MPE environment. Three different experiments prove that by using our method, the growth reward is increased by up to 37.1%, the training is speed up significantly, and the stability of the method, which represented by standardized variance, is also improved. In addition, this paper validated the fast optimal coordination method for multiagent systems in the context of UAV scenarios, demonstrating the practical performance of the approach. Through comprehensive experiments and scenario validations, the study successfully confirmed the effectiveness of the proposed fast optimal coordination method for multiagent systems in complex environments.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140446959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
: Multilevel Breast Cancer Classification Framework Using Radiomic Features m B C C
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-17 DOI: 10.1155/2024/6631016
Lipismita Panigrahi, Tej Bahadur Chandra, Atul Kumar Srivastava, Neeraj Varshney, Kamred Udham Singh, Shambhu Mahato

Breast cancer characterization remains a significant and challenging issue in contemporary medicine. Accurately distinguishing between malignant and benign breast lesions is crucial for effective diagnosis and treatment. The anatomical structure of malignant breast ultrasound images is more chaotic than that of benign images due to disease pathologies. However, texture-based analysis alone often fails to identify the extent of chaoticness in malignant breast ultrasound images due to their vague appearance with normal echo patterns, leading to missed diagnoses and increased mortality rates. To address this issue, we proposed an angular feature-based multilevel breast cancer classification framework that aims to improve the accuracy and efficiency of classification. The proposed framework mimics the radiologist interpretation procedure by identifying the chaoticness on the periphery of the breast lesion in a breast ultrasound image (level-1). If the lesion contains an acute angle in any part of the periphery, it can be characterized as malignant or otherwise benign. However, solely relying on level-1 analysis may result in misclassification, especially when benign lesions exhibit echo patterns that resemble malignant ones. To overcome this limitation and to make the proposed system highly sensitive, advanced texture-based analysis (using combined shape, texture, and angular features) is performed (level-2). Finally, the performance of the proposed system is evaluated using a cross-dataset (consisting of 1293 breast ultrasound images) and compared with the different individual feature extraction techniques. Encouragingly, our system demonstrated an accuracy of 96.99% for classifying malignant and benign tumors, which is also validated using statistical analysis. The implications of our research lie in its potential to significantly improve breast cancer diagnosis by providing a reliable, efficient, and sensitive tool for radiologists.

在当代医学中,乳腺癌的特征描述仍然是一个重要而具有挑战性的问题。准确区分乳腺恶性和良性病变对有效诊断和治疗至关重要。由于疾病病理原因,恶性乳腺超声图像的解剖结构比良性图像更加混乱。然而,由于恶性乳腺超声图像外观模糊,回声模式正常,因此仅靠基于纹理的分析往往无法识别其混沌程度,从而导致漏诊和死亡率上升。针对这一问题,我们提出了基于角度特征的多层次乳腺癌分类框架 mBCCf,旨在提高分类的准确性和效率。所提出的框架模仿放射科医生的判读过程,通过识别乳腺超声图像中乳腺病变外围的混沌度(第一级)来进行判读。如果病变外围的任何部位出现锐角,则可将其定性为恶性或良性。然而,仅仅依靠第一级分析可能会导致分类错误,尤其是当良性病变表现出与恶性病变相似的回波模式时。为了克服这一局限性,并使所提出的系统具有高灵敏度,需要进行基于纹理的高级分析(使用形状、纹理和角度组合特征)(二级)。最后,我们使用交叉数据集(由 1293 幅乳腺超声图像组成)对所提系统的性能进行了评估,并与不同的单独特征提取技术进行了比较。令人鼓舞的是,我们的系统对恶性和良性肿瘤的分类准确率达到了 96.99%,这也通过统计分析得到了验证。我们研究的意义在于,它为放射科医生提供了一种可靠、高效和灵敏的工具,从而有可能显著改善乳腺癌的诊断。
{"title":": Multilevel Breast Cancer Classification Framework Using Radiomic Features","authors":"Lipismita Panigrahi,&nbsp;Tej Bahadur Chandra,&nbsp;Atul Kumar Srivastava,&nbsp;Neeraj Varshney,&nbsp;Kamred Udham Singh,&nbsp;Shambhu Mahato","doi":"10.1155/2024/6631016","DOIUrl":"10.1155/2024/6631016","url":null,"abstract":"<p>Breast cancer characterization remains a significant and challenging issue in contemporary medicine. Accurately distinguishing between malignant and benign breast lesions is crucial for effective diagnosis and treatment. The anatomical structure of malignant breast ultrasound images is more chaotic than that of benign images due to disease pathologies. However, texture-based analysis alone often fails to identify the extent of chaoticness in malignant breast ultrasound images due to their vague appearance with normal echo patterns, leading to missed diagnoses and increased mortality rates. To address this issue, we proposed an angular feature-based multilevel breast cancer classification framework <span></span><math></math> that aims to improve the accuracy and efficiency of classification. The proposed framework mimics the radiologist interpretation procedure by identifying the chaoticness on the periphery of the breast lesion in a breast ultrasound image (level-1). If the lesion contains an acute angle in any part of the periphery, it can be characterized as malignant or otherwise benign. However, solely relying on level-1 analysis may result in misclassification, especially when benign lesions exhibit echo patterns that resemble malignant ones. To overcome this limitation and to make the proposed system highly sensitive, advanced texture-based analysis (using combined shape, texture, and angular features) is performed (level-2). Finally, the performance of the proposed system is evaluated using a cross-dataset (consisting of 1293 breast ultrasound images) and compared with the different individual feature extraction techniques. Encouragingly, our system demonstrated an accuracy of 96.99% for classifying malignant and benign tumors, which is also validated using statistical analysis. The implications of our research lie in its potential to significantly improve breast cancer diagnosis by providing a reliable, efficient, and sensitive tool for radiologists.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139960044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-Resolution 用于轻量级单张图像超分辨率的多注意力特征蒸馏神经网络
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-15 DOI: 10.1155/2024/3255233
Yongfei Zhang, Xinying Lin, Hong Yang, Jie He, Linbo Qing, Xiaohai He, Yi Li, Honggang Chen

In recent years, remarkable performance improvements have been produced by deep convolutional neural networks (CNN) for single image super-resolution (SISR). Nevertheless, a high proportion of CNN-based SISR models are with quite a few network parameters and high computational complexity for deep or wide architectures. How to more fully utilize deep features to make a balance between model complexity and reconstruction performance is one of the main challenges in this field. To address this problem, on the basis of the well-known information multi-distillation model, a multi-attention feature distillation network termed as MAFDN is developed for lightweight and accurate SISR. Specifically, an effective multi-attention feature distillation block (MAFDB) is designed and used as the basic feature extraction unit in MAFDN. With the help of multi-attention layers including pixel attention, spatial attention, and channel attention, MAFDB uses multiple information distillation branches to learn more discriminative and representative features. Furthermore, MAFDB introduces the depthwise over-parameterized convolutional layer (DO-Conv)-based residual block (OPCRB) to enhance its ability without incurring any parameter and computation increase in the inference stage. The results on commonly used datasets demonstrate that our MAFDN outperforms existing representative lightweight SISR models when taking both reconstruction performance and model complexity into consideration. For example, for ×4 SR on Set5, MAFDN (597K/33.79G) obtains 0.21 dB/0.0037 and 0.10 dB/0.0015 PSNR/SSIM gains over the attention-based SR model AFAN (692K/50.90G) and the feature distillation-based SR model DDistill-SR (675K/32.83G), respectively.

近年来,深度卷积神经网络(CNN)在单图像超分辨率(SISR)方面的性能有了显著提高。尽管如此,基于 CNN 的 SISR 模型仍有很大一部分存在网络参数过多、计算复杂度过高等问题。如何更充分地利用深度特征,在模型复杂度和重建性能之间取得平衡,是该领域面临的主要挑战之一。为解决这一问题,在著名的信息多重蒸馏模型的基础上,开发了一种多注意特征蒸馏网络,称为 MAFDN,用于轻量级和精确的 SISR。具体来说,设计了一个有效的多注意特征蒸馏块(MAFDB),并将其作为 MAFDN 的基本特征提取单元。借助多注意层(包括像素注意、空间注意和通道注意),MAFDB 利用多个信息提取分支来学习更多具有区分性和代表性的特征。此外,MAFDB 还引入了基于深度过参数化卷积层(DO-Conv)的残差块(OPCRB),在不增加推理阶段参数和计算量的情况下提高了推理能力。对常用数据集的研究结果表明,考虑到重建性能和模型复杂度,我们的 MAFDN 优于现有的代表性轻量级 SISR 模型。例如,对于 Set5 上的 ×4 SR,MAFDN(597K/33.79G)比基于注意力的 SR 模型 AFAN(692K/50.90G)和基于特征蒸馏的 SR 模型 DDistill-SR(675K/32.83G)分别提高了 0.21 dB/0.0037 和 0.10 dB/0.0015 PSNR/SSIM。
{"title":"A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-Resolution","authors":"Yongfei Zhang,&nbsp;Xinying Lin,&nbsp;Hong Yang,&nbsp;Jie He,&nbsp;Linbo Qing,&nbsp;Xiaohai He,&nbsp;Yi Li,&nbsp;Honggang Chen","doi":"10.1155/2024/3255233","DOIUrl":"10.1155/2024/3255233","url":null,"abstract":"<p>In recent years, remarkable performance improvements have been produced by deep convolutional neural networks (CNN) for single image super-resolution (SISR). Nevertheless, a high proportion of CNN-based SISR models are with quite a few network parameters and high computational complexity for deep or wide architectures. How to more fully utilize deep features to make a balance between model complexity and reconstruction performance is one of the main challenges in this field. To address this problem, on the basis of the well-known information multi-distillation model, a multi-attention feature distillation network termed as MAFDN is developed for lightweight and accurate SISR. Specifically, an effective multi-attention feature distillation block (MAFDB) is designed and used as the basic feature extraction unit in MAFDN. With the help of multi-attention layers including pixel attention, spatial attention, and channel attention, MAFDB uses multiple information distillation branches to learn more discriminative and representative features. Furthermore, MAFDB introduces the depthwise over-parameterized convolutional layer (DO-Conv)-based residual block (OPCRB) to enhance its ability without incurring any parameter and computation increase in the inference stage. The results on commonly used datasets demonstrate that our MAFDN outperforms existing representative lightweight SISR models when taking both reconstruction performance and model complexity into consideration. For example, for ×4 SR on Set5, MAFDN (597K/33.79G) obtains 0.21 dB/0.0037 and 0.10 dB/0.0015 PSNR/SSIM gains over the attention-based SR model AFAN (692K/50.90G) and the feature distillation-based SR model DDistill-SR (675K/32.83G), respectively.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139776257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-Resolution 用于轻量级单张图像超分辨率的多注意力特征蒸馏神经网络
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-15 DOI: 10.1155/2024/3255233
Yongfei Zhang, Xinying Lin, Hong Yang, Jie He, Linbo Qing, Xiaohai He, Yi Li, Honggang Chen
In recent years, remarkable performance improvements have been produced by deep convolutional neural networks (CNN) for single image super-resolution (SISR). Nevertheless, a high proportion of CNN-based SISR models are with quite a few network parameters and high computational complexity for deep or wide architectures. How to more fully utilize deep features to make a balance between model complexity and reconstruction performance is one of the main challenges in this field. To address this problem, on the basis of the well-known information multi-distillation model, a multi-attention feature distillation network termed as MAFDN is developed for lightweight and accurate SISR. Specifically, an effective multi-attention feature distillation block (MAFDB) is designed and used as the basic feature extraction unit in MAFDN. With the help of multi-attention layers including pixel attention, spatial attention, and channel attention, MAFDB uses multiple information distillation branches to learn more discriminative and representative features. Furthermore, MAFDB introduces the depthwise over-parameterized convolutional layer (DO-Conv)-based residual block (OPCRB) to enhance its ability without incurring any parameter and computation increase in the inference stage. The results on commonly used datasets demonstrate that our MAFDN outperforms existing representative lightweight SISR models when taking both reconstruction performance and model complexity into consideration. For example, for ×4 SR on Set5, MAFDN (597K/33.79G) obtains 0.21 dB/0.0037 and 0.10 dB/0.0015 PSNR/SSIM gains over the attention-based SR model AFAN (692K/50.90G) and the feature distillation-based SR model DDistill-SR (675K/32.83G), respectively.
近年来,深度卷积神经网络(CNN)在单图像超分辨率(SISR)方面的性能有了显著提高。尽管如此,基于 CNN 的 SISR 模型仍有很大一部分存在网络参数过多、计算复杂度过高等问题。如何更充分地利用深度特征,在模型复杂度和重建性能之间取得平衡,是该领域面临的主要挑战之一。为解决这一问题,在著名的信息多重蒸馏模型的基础上,开发了一种多注意特征蒸馏网络,称为 MAFDN,用于轻量级和精确的 SISR。具体来说,设计了一个有效的多注意特征蒸馏块(MAFDB),并将其作为 MAFDN 的基本特征提取单元。借助多注意层(包括像素注意、空间注意和通道注意),MAFDB 利用多个信息提取分支来学习更多具有区分性和代表性的特征。此外,MAFDB 还引入了基于深度过参数化卷积层(DO-Conv)的残差块(OPCRB),在不增加推理阶段参数和计算量的情况下提高了推理能力。对常用数据集的研究结果表明,考虑到重建性能和模型复杂度,我们的 MAFDN 优于现有的代表性轻量级 SISR 模型。例如,对于 Set5 上的 ×4 SR,MAFDN(597K/33.79G)比基于注意力的 SR 模型 AFAN(692K/50.90G)和基于特征蒸馏的 SR 模型 DDistill-SR(675K/32.83G)分别提高了 0.21 dB/0.0037 和 0.10 dB/0.0015 PSNR/SSIM。
{"title":"A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-Resolution","authors":"Yongfei Zhang, Xinying Lin, Hong Yang, Jie He, Linbo Qing, Xiaohai He, Yi Li, Honggang Chen","doi":"10.1155/2024/3255233","DOIUrl":"https://doi.org/10.1155/2024/3255233","url":null,"abstract":"In recent years, remarkable performance improvements have been produced by deep convolutional neural networks (CNN) for single image super-resolution (SISR). Nevertheless, a high proportion of CNN-based SISR models are with quite a few network parameters and high computational complexity for deep or wide architectures. How to more fully utilize deep features to make a balance between model complexity and reconstruction performance is one of the main challenges in this field. To address this problem, on the basis of the well-known information multi-distillation model, a multi-attention feature distillation network termed as MAFDN is developed for lightweight and accurate SISR. Specifically, an effective multi-attention feature distillation block (MAFDB) is designed and used as the basic feature extraction unit in MAFDN. With the help of multi-attention layers including pixel attention, spatial attention, and channel attention, MAFDB uses multiple information distillation branches to learn more discriminative and representative features. Furthermore, MAFDB introduces the depthwise over-parameterized convolutional layer (DO-Conv)-based residual block (OPCRB) to enhance its ability without incurring any parameter and computation increase in the inference stage. The results on commonly used datasets demonstrate that our MAFDN outperforms existing representative lightweight SISR models when taking both reconstruction performance and model complexity into consideration. For example, for ×4 SR on Set5, MAFDN (597K/33.79G) obtains 0.21 dB/0.0037 and 0.10 dB/0.0015 PSNR/SSIM gains over the attention-based SR model AFAN (692K/50.90G) and the feature distillation-based SR model DDistill-SR (675K/32.83G), respectively.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139836013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1