International Journal of Intelligent Systems最新文献_第10页

Optimization of Biped Robot Walking Based on the Improved Particle Swarm Algorithm 基于改进型粒子群算法的双足机器人行走优化

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-23 DOI: 10.1155/2024/6689071

Chao Zhang, Mei Liu, Peisi Zhong, Shihao Yang, Zhongyuan Liang, Qingjun Song

The central pattern generator (CPG) is widely applied in biped gait generation, and the particle swarm optimization (PSO) algorithm is commonly used to solve optimization problems for CPG network controllers. However, the canonical PSO algorithms fail to balance exploration and exploitation, resulting in reduced optimization accuracy and stability, decreasing the control effectiveness of CPG controllers. In order to address this issue, a balanced PSO (BPSO) algorithm is proposed, which achieves better performance by balancing the algorithm’s exploration and exploitation capabilities. The BPSO algorithm’s solving process consists of two phases: the free exploration phase (FEP), which emphasizes exploration, and the attention exploration phase (AEP), which emphasizes exploitation. The proportion of each phase during optimization is controlled by an adjustable parameter. The BPSO algorithm is subjected to qualitative, numerical, convergence, and statistical analyses based on 13 benchmark functions. The experimental results from the benchmark functions demonstrate that the BPSO algorithm outperforms other comparison algorithms. Finally, a linear walking optimization method for humanoid robots based on the BPSO algorithm is established and tested in the Webots simulator. Comparative results with two other optimization methods show that the BPSO-based optimization method enables the robot to achieve greater walking distance and smaller lateral deviation within a fixed number of iterations. Compared to the other two methods, walking distance increases by at least 60.98% and lateral deviation decreases by at least 1.96%. This research contributes to enhancing the locomotion capabilities of CPG-controlled humanoid robots, enriching biped gait optimization theory and promoting the application of CPG gait control methods in humanoid robots.

中央模式发生器（CPG）广泛应用于双足步态生成，粒子群优化（PSO）算法常用于解决CPG网络控制器的优化问题。然而，典型的 PSO 算法无法兼顾探索和利用，导致优化精度和稳定性下降，降低了 CPG 控制器的控制效果。为了解决这个问题，我们提出了一种平衡 PSO（BPSO）算法，通过平衡算法的探索和利用能力来获得更好的性能。BPSO 算法的求解过程包括两个阶段：强调探索的自由探索阶段（FEP）和强调利用的注意力探索阶段（AEP）。优化过程中每个阶段的比例由一个可调参数控制。基于 13 个基准函数，对 BPSO 算法进行了定性、数值、收敛和统计分析。基准函数的实验结果表明，BPSO 算法优于其他比较算法。最后，建立了基于 BPSO 算法的仿人机器人线性行走优化方法，并在 Webots 模拟器中进行了测试。与其他两种优化方法的比较结果表明，基于 BPSO 的优化方法能使机器人在固定的迭代次数内实现更大的行走距离和更小的横向偏差。与其他两种方法相比，行走距离至少增加了 60.98%，横向偏差至少减少了 1.96%。该研究有助于提高CPG控制仿人机器人的运动能力，丰富双足步态优化理论，促进CPG步态控制方法在仿人机器人中的应用。

{"title":"Optimization of Biped Robot Walking Based on the Improved Particle Swarm Algorithm","authors":"Chao Zhang, Mei Liu, Peisi Zhong, Shihao Yang, Zhongyuan Liang, Qingjun Song","doi":"10.1155/2024/6689071","DOIUrl":"https://doi.org/10.1155/2024/6689071","url":null,"abstract":"<div>\u0000 <p>The central pattern generator (CPG) is widely applied in biped gait generation, and the particle swarm optimization (PSO) algorithm is commonly used to solve optimization problems for CPG network controllers. However, the canonical PSO algorithms fail to balance exploration and exploitation, resulting in reduced optimization accuracy and stability, decreasing the control effectiveness of CPG controllers. In order to address this issue, a balanced PSO (BPSO) algorithm is proposed, which achieves better performance by balancing the algorithm’s exploration and exploitation capabilities. The BPSO algorithm’s solving process consists of two phases: the free exploration phase (FEP), which emphasizes exploration, and the attention exploration phase (AEP), which emphasizes exploitation. The proportion of each phase during optimization is controlled by an adjustable parameter. The BPSO algorithm is subjected to qualitative, numerical, convergence, and statistical analyses based on 13 benchmark functions. The experimental results from the benchmark functions demonstrate that the BPSO algorithm outperforms other comparison algorithms. Finally, a linear walking optimization method for humanoid robots based on the BPSO algorithm is established and tested in the Webots simulator. Comparative results with two other optimization methods show that the BPSO-based optimization method enables the robot to achieve greater walking distance and smaller lateral deviation within a fixed number of iterations. Compared to the other two methods, walking distance increases by at least 60.98% and lateral deviation decreases by at least 1.96%. This research contributes to enhancing the locomotion capabilities of CPG-controlled humanoid robots, enriching biped gait optimization theory and promoting the application of CPG gait control methods in humanoid robots.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/6689071","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142525013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimizing Breast Cancer Detection With an Ensemble Deep Learning Approach 利用集合深度学习方法优化乳腺癌检测

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-20 DOI: 10.1155/2024/5564649

Dilawar Shah, Mohammad Asmat Ullah Khan, Mohammad Abrar, Muhammad Tahir

In the global fight against breast cancer, the importance of early diagnosis is unparalleled. Early identification not only improves treatment options but also significantly improves survival rates. Our research introduces an innovative ensemble method that synergistically combines the strengths of four state-of-the-art convolutional neural networks (CNNs): EfficientNet, AlexNet, ResNet, and DenseNet. These networks were chosen for their architectural advances and proven efficacy in image classification tasks, particularly in medical imaging. Each network within our ensemble is uniquely optimized: EfficientNet is fine-tuned with customized scaling to address dataset specifics; AlexNet employs a variable dropout mechanism to reduce overfitting; ResNet benefits from learnable weighted skip connections for better gradient flow; and DenseNet uses selective connectivity to balance computational efficiency and feature extraction. This ensembling strategy combines the predictive output of multiple CNNs, each trained with an individually optimized network, to enhance the ensemble’s overall diagnostic performance. This provides higher precision and stability than any model and shows outstanding performance in the early stage of breast cancer with a precision of up to 94.6%, sensitivity of 92.4%, specificity of 96.1%, and area under the curve (AUC) of 98.0%. This ensemble framework indicated a leap in the early diagnosis of breast cancer as it is a powerful tool that combines several state-of-the-art techniques, hence providing better results.

在全球抗击乳腺癌的斗争中，早期诊断的重要性无与伦比。早期识别不仅能改善治疗方案，还能显著提高生存率。我们的研究引入了一种创新的集合方法，它协同结合了四种最先进的卷积神经网络（CNN）的优势：EfficientNet、AlexNet、ResNet 和 DenseNet。之所以选择这些网络，是因为它们在架构上的先进性，以及在图像分类任务（尤其是医学成像）中久经考验的功效。我们组合中的每个网络都经过了独特的优化：EfficientNet 通过定制缩放进行了微调，以解决数据集的具体问题；AlexNet 采用了可变 dropout 机制，以减少过拟合；ResNet 受益于可学习的加权跳转连接，以获得更好的梯度流；DenseNet 采用了选择性连接，以平衡计算效率和特征提取。这种集合策略结合了多个 CNN 的预测输出，每个 CNN 都使用单独优化的网络进行训练，以提高集合的整体诊断性能。它比任何模型都具有更高的精确度和稳定性，在乳腺癌早期阶段表现突出，精确度高达 94.6%，灵敏度为 92.4%，特异性为 96.1%，曲线下面积（AUC）为 98.0%。这种集合框架是乳腺癌早期诊断的一次飞跃，因为它是一种结合了多种最先进技术的强大工具，因此能提供更好的结果。

{"title":"Optimizing Breast Cancer Detection With an Ensemble Deep Learning Approach","authors":"Dilawar Shah, Mohammad Asmat Ullah Khan, Mohammad Abrar, Muhammad Tahir","doi":"10.1155/2024/5564649","DOIUrl":"https://doi.org/10.1155/2024/5564649","url":null,"abstract":"<div>\u0000 <p>In the global fight against breast cancer, the importance of early diagnosis is unparalleled. Early identification not only improves treatment options but also significantly improves survival rates. Our research introduces an innovative ensemble method that synergistically combines the strengths of four state-of-the-art convolutional neural networks (CNNs): EfficientNet, AlexNet, ResNet, and DenseNet. These networks were chosen for their architectural advances and proven efficacy in image classification tasks, particularly in medical imaging. Each network within our ensemble is uniquely optimized: EfficientNet is fine-tuned with customized scaling to address dataset specifics; AlexNet employs a variable dropout mechanism to reduce overfitting; ResNet benefits from learnable weighted skip connections for better gradient flow; and DenseNet uses selective connectivity to balance computational efficiency and feature extraction. This ensembling strategy combines the predictive output of multiple CNNs, each trained with an individually optimized network, to enhance the ensemble’s overall diagnostic performance. This provides higher precision and stability than any model and shows outstanding performance in the early stage of breast cancer with a precision of up to 94.6%, sensitivity of 92.4%, specificity of 96.1%, and area under the curve (AUC) of 98.0%. This ensemble framework indicated a leap in the early diagnosis of breast cancer as it is a powerful tool that combines several state-of-the-art techniques, hence providing better results.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/5564649","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

EWRD: Entropy-Weighted Low-Light Image Enhancement via Reverse Diffusion Model EWRD：通过反向扩散模型进行熵加权弱光图像增强

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-19 DOI: 10.1155/2024/4650233

Yuheng Wu, Guangyuan Wu, Ronghao Liao

Low-light enhancement significantly aids vision tasks under poor illumination conditions. Existing methods primarily focus on enhancing severely degraded low-light areas, improving illumination accuracy, or noise suppression, yet they often overlook the loss of additional details and color distortion during calculation. In this paper, we propose an innovative entropy-weighted low-light image enhancement method via the reverse diffusion model, aiming at addressing the limitations of the traditional Retinex decomposition model in preserving local pixel details and handling excessive smoothing issues. This method integrates an entropy-weighting mechanism for improved image quality and entropy, along with a reverse diffusion model to address the detail loss in total variation regularization and refine the enhancement process. Furthermore, we utilize long short-term memory networks for the learning reverse process and the simulation of image degradation, based on a thermodynamics-based nonlinear anisotropic diffusion model. Comparative experiments reveal the superiority of our method over conventional Retinex-based approaches in terms of detail preservation and visual quality. Extensive tests across diverse datasets demonstrate the exceptional performance of our method, evidencing its potential as a robust solution for low-light image enhancement.

低照度增强技术对低照度条件下的视觉任务有很大帮助。现有的方法主要集中在增强严重劣化的低照度区域、提高照度精度或抑制噪声等方面，但它们往往忽略了计算过程中额外细节的损失和色彩失真。本文通过反向扩散模型提出了一种创新的熵加权弱光图像增强方法，旨在解决传统 Retinex 分解模型在保留局部像素细节和处理过度平滑问题上的局限性。该方法整合了熵加权机制，以提高图像质量和熵，同时还整合了反向扩散模型，以解决总变异正则化中的细节损失问题，并完善增强过程。此外，我们还基于热力学非线性各向异性扩散模型，利用长短期记忆网络进行反向学习过程和图像退化模拟。对比实验表明，在细节保留和视觉质量方面，我们的方法优于传统的基于 Retinex 的方法。在各种数据集上进行的广泛测试证明了我们的方法具有卓越的性能，证明了它作为低照度图像增强的稳健解决方案的潜力。

{"title":"EWRD: Entropy-Weighted Low-Light Image Enhancement via Reverse Diffusion Model","authors":"Yuheng Wu, Guangyuan Wu, Ronghao Liao","doi":"10.1155/2024/4650233","DOIUrl":"https://doi.org/10.1155/2024/4650233","url":null,"abstract":"<div>\u0000 <p>Low-light enhancement significantly aids vision tasks under poor illumination conditions. Existing methods primarily focus on enhancing severely degraded low-light areas, improving illumination accuracy, or noise suppression, yet they often overlook the loss of additional details and color distortion during calculation. In this paper, we propose an innovative entropy-weighted low-light image enhancement method via the reverse diffusion model, aiming at addressing the limitations of the traditional Retinex decomposition model in preserving local pixel details and handling excessive smoothing issues. This method integrates an entropy-weighting mechanism for improved image quality and entropy, along with a reverse diffusion model to address the detail loss in total variation regularization and refine the enhancement process. Furthermore, we utilize long short-term memory networks for the learning reverse process and the simulation of image degradation, based on a thermodynamics-based nonlinear anisotropic diffusion model. Comparative experiments reveal the superiority of our method over conventional Retinex-based approaches in terms of detail preservation and visual quality. Extensive tests across diverse datasets demonstrate the exceptional performance of our method, evidencing its potential as a robust solution for low-light image enhancement.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/4650233","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

E-Speech: Development of a Dataset for Speech Emotion Recognition and Analysis 电子语音：开发用于语音情感识别和分析的数据集

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-18 DOI: 10.1155/2024/5410080

Wenjin Liu, Jiaqi Shi, Shudong Zhang, Lijuan Zhou, Haoming Liu

Speech emotion recognition plays a crucial role in analyzing psychological disorders, behavioral decision-making, and human-machine interaction applications. However, the majority of current methods for speech emotion recognition heavily rely on data-driven approaches, and the scarcity of emotion speech datasets limits the progress in research and development of emotion analysis and recognition. To address this issue, this study introduces a new English speech dataset specifically designed for emotion analysis and recognition. This dataset consists of 5503 voices from over 60 English speakers in different emotional states. Furthermore, to enhance emotion analysis and recognition, fast Fourier transform (FFT), short-time Fourier transform (STFT), mel-frequency cepstral coefficients (MFCCs), and continuous wavelet transform (CWT) are employed for feature extraction from the speech data. Utilizing these algorithms, the spectrum images of the speeches are obtained, forming four datasets consisting of different speech feature images. Furthermore, to evaluate the dataset, 16 classification models and 19 detection algorithms are selected. The experimental results demonstrate that the majority of classification and detection models achieve exceptionally high recognition accuracy on this dataset, confirming its effectiveness and utility. The dataset proves to be valuable in advancing research and development in the field of emotion recognition.

语音情感识别在心理障碍分析、行为决策和人机交互应用中发挥着至关重要的作用。然而，目前大多数语音情感识别方法严重依赖于数据驱动方法，情感语音数据集的稀缺限制了情感分析和识别的研究与发展。为解决这一问题，本研究引入了一个新的英语语音数据集，专门用于情感分析和识别。该数据集由来自 60 多位英语发言人的 5503 个不同情绪状态的语音组成。此外，为了增强情感分析和识别能力，还采用了快速傅立叶变换（FFT）、短时傅立叶变换（STFT）、梅尔频率倒频谱系数（MFCC）和连续小波变换（CWT）等算法从语音数据中提取特征。利用这些算法，可以获得语音的频谱图像，形成由不同语音特征图像组成的四个数据集。此外，为了对数据集进行评估，还选择了 16 种分类模型和 19 种检测算法。实验结果表明，大多数分类和检测模型在该数据集上都达到了极高的识别准确率，证实了该数据集的有效性和实用性。事实证明，该数据集对推动情感识别领域的研究和开发具有重要价值。

{"title":"E-Speech: Development of a Dataset for Speech Emotion Recognition and Analysis","authors":"Wenjin Liu, Jiaqi Shi, Shudong Zhang, Lijuan Zhou, Haoming Liu","doi":"10.1155/2024/5410080","DOIUrl":"https://doi.org/10.1155/2024/5410080","url":null,"abstract":"<div>\u0000 <p>Speech emotion recognition plays a crucial role in analyzing psychological disorders, behavioral decision-making, and human-machine interaction applications. However, the majority of current methods for speech emotion recognition heavily rely on data-driven approaches, and the scarcity of emotion speech datasets limits the progress in research and development of emotion analysis and recognition. To address this issue, this study introduces a new English speech dataset specifically designed for emotion analysis and recognition. This dataset consists of 5503 voices from over 60 English speakers in different emotional states. Furthermore, to enhance emotion analysis and recognition, fast Fourier transform (FFT), short-time Fourier transform (STFT), mel-frequency cepstral coefficients (MFCCs), and continuous wavelet transform (CWT) are employed for feature extraction from the speech data. Utilizing these algorithms, the spectrum images of the speeches are obtained, forming four datasets consisting of different speech feature images. Furthermore, to evaluate the dataset, 16 classification models and 19 detection algorithms are selected. The experimental results demonstrate that the majority of classification and detection models achieve exceptionally high recognition accuracy on this dataset, confirming its effectiveness and utility. The dataset proves to be valuable in advancing research and development in the field of emotion recognition.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/5410080","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Collaborative Attack Sequence Generation Model Based on Multiagent Reinforcement Learning for Intelligent Traffic Signal System 基于多代理强化学习的智能交通信号系统协同攻击序列生成模型

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-18 DOI: 10.1155/2024/4734030

Yalun Wu, Yingxiao Xiang, Thar Baker, Endong Tong, Ye Zhu, Xiaoshu Cui, Zhenguo Zhang, Zhen Han, Jiqiang Liu, Wenjia Niu

Intelligent traffic signal systems, crucial for intelligent transportation systems, have been widely studied and deployed to enhance vehicle traffic efficiency and reduce air pollution. Unfortunately, intelligent traffic signal systems are at risk of data spoofing attack, causing traffic delays, congestion, and even paralysis. In this paper, we reveal a multivehicle collaborative data spoofing attack to intelligent traffic signal systems and propose a collaborative attack sequence generation model based on multiagent reinforcement learning (RL), aiming to explore efficient and stealthy attacks. Specifically, we first model the spoofing attack based on Partially Observable Markov Decision Process (POMDP) at single and multiple intersections. This involves constructing the state space, action space, and defining a reward function for the attack. Then, based on the attack modeling, we propose an automated approach for generating collaborative attack sequences using the Multi-Actor-Attention-Critic (MAAC) algorithm, a mainstream multiagent RL algorithm. Experiments conducted on the multimodal traffic simulation (VISSIM) platform demonstrate a 15% increase in delay time (DT) and a 40% reduction in attack ratio (AR) compared to the single-vehicle attack, confirming the effectiveness and stealthiness of our collaborative attack.

智能交通信号系统是智能交通系统的关键，已被广泛研究和部署，以提高车辆通行效率和减少空气污染。遗憾的是，智能交通信号系统存在数据欺骗攻击的风险，导致交通延误、拥堵甚至瘫痪。本文揭示了一种针对智能交通信号系统的多车协同数据欺骗攻击，并提出了一种基于多代理强化学习（RL）的协同攻击序列生成模型，旨在探索高效、隐蔽的攻击方式。具体来说，我们首先基于部分可观测马尔可夫决策过程（POMDP）对单路口和多路口的欺骗攻击进行建模。这包括构建攻击的状态空间、行动空间和定义奖励函数。然后，在攻击建模的基础上，我们提出了一种自动方法，利用主流多代理 RL 算法--多代理-注意-批判（MAAC）算法生成协同攻击序列。在多模式交通仿真（VISSIM）平台上进行的实验表明，与单车攻击相比，延迟时间（DT）增加了 15%，攻击比率（AR）降低了 40%，这证实了我们的协同攻击的有效性和隐蔽性。

{"title":"Collaborative Attack Sequence Generation Model Based on Multiagent Reinforcement Learning for Intelligent Traffic Signal System","authors":"Yalun Wu, Yingxiao Xiang, Thar Baker, Endong Tong, Ye Zhu, Xiaoshu Cui, Zhenguo Zhang, Zhen Han, Jiqiang Liu, Wenjia Niu","doi":"10.1155/2024/4734030","DOIUrl":"https://doi.org/10.1155/2024/4734030","url":null,"abstract":"<div>\u0000 <p>Intelligent traffic signal systems, crucial for intelligent transportation systems, have been widely studied and deployed to enhance vehicle traffic efficiency and reduce air pollution. Unfortunately, intelligent traffic signal systems are at risk of data spoofing attack, causing traffic delays, congestion, and even paralysis. In this paper, we reveal a multivehicle collaborative data spoofing attack to intelligent traffic signal systems and propose a collaborative attack sequence generation model based on multiagent reinforcement learning (RL), aiming to explore efficient and stealthy attacks. Specifically, we first model the spoofing attack based on Partially Observable Markov Decision Process (POMDP) at single and multiple intersections. This involves constructing the state space, action space, and defining a reward function for the attack. Then, based on the attack modeling, we propose an automated approach for generating collaborative attack sequences using the Multi-Actor-Attention-Critic (MAAC) algorithm, a mainstream multiagent RL algorithm. Experiments conducted on the multimodal traffic simulation (VISSIM) platform demonstrate a 15% increase in delay time (DT) and a 40% reduction in attack ratio (AR) compared to the single-vehicle attack, confirming the effectiveness and stealthiness of our collaborative attack.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/4734030","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fast Subpixel Motion Estimation Based on Human Visual System 基于人类视觉系统的快速子像素运动估计

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-18 DOI: 10.1155/2024/6168548

Dadvar Hosseini Avashanagh, Mehdi Nooshyar, Saeed Barghandan, Majid Ghandchi

More than 80% of video coding times are consumed by motion estimation calculations, which are the most complex aspect of the process. This method eliminates temporal redundancies in a video sequence to achieve maximum compression. Numerous efforts have been made to bring calculations closer to real time, yielding fruitful results. This study proposes a fast subpixel motion estimation algorithm for video encoding with fewer search points. This method employs the capabilities of human visual systems (HVSs), physical motion characteristics of real-world objects, and special image information from successive frames. The number of search points (NSP) using the statistical data of the movement of the blocks in the frames of video sequences is reduced to apply fewer calculations to the system while maintaining the quality of images. Therefore, it is possible to approach fast and real-time calculations instead of time-consuming algorithms by accurately modeling this algorithm.

超过 80% 的视频编码时间消耗在运动估计计算上，而运动估计计算是整个过程中最复杂的环节。这种方法可以消除视频序列中的时间冗余，从而实现最大程度的压缩。为了使计算更接近实时，人们做出了许多努力，并取得了丰硕的成果。本研究为视频编码提出了一种搜索点较少的快速子像素运动估计算法。该方法利用了人类视觉系统（HVS）的能力、真实世界物体的物理运动特征以及连续帧的特殊图像信息。在保持图像质量的前提下，利用视频序列帧块运动的统计数据来减少搜索点（NSP）的数量，从而减少系统的计算量。因此，通过对该算法进行精确建模，有可能接近快速和实时计算，而不是耗时的算法。

引用次数: 0

TAE-RWP: Traceable Adversarial Examples With Recoverable Warping Perturbation TAE-RWP：具有可恢复翘曲扰动的可追溯对抗示例

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-17 DOI: 10.1155/2024/6054172

Fan Xing, Xiaoyi Zhou, Hongli Peng, Xuefeng Fan, Wenbao Han, Yuqing Zhang

Reversible adversarial example (RAE) is an effective cutting-edge technology for protecting the intellectual property (IP) of datasets. However, existing RAE schemes primarily focus on the adversarial and restoration capabilities of adversarial examples (AE), with little attention paid to traceability, which is crucial for IP protection. This oversight leads to the inability to prevent authorized users from redistributing data, thereby posing significant IP security risks. To address this issue, we propose a novel approach named TAE-RWP, wherein adversarial perturbations in AEs are treated as tools for IP verification. To enable the traceability of AEs, we introduce varying degrees of warping to the adversarial perturbations within the AEs of authorized users, utilizing the warping degree as a traceable feature. To further strengthen traceability, we adopt a technique named “random warping” to maintain the resilience of adversarial perturbations against distortions, and employ a strategy named “noise mode” to improve the verification model’s capacity to recognize distortion features. Experimental results indicate that AEs generated by TAE-RWP exhibit remarkable adversarial strength and restoration abilities, while the verification model demonstrates excellence in recognizing distortion features.

可逆对抗范例（RAE）是保护数据集知识产权（IP）的有效前沿技术。然而，现有的可逆对抗示例（RAE）方案主要关注对抗性和对抗性示例（AE）的还原能力，却很少关注对知识产权保护至关重要的可追溯性。这种疏忽导致无法阻止授权用户重新分发数据，从而带来了巨大的知识产权安全风险。为了解决这个问题，我们提出了一种名为 TAE-RWP 的新方法，将 AE 中的对抗性扰动视为知识产权验证的工具。为了实现 AE 的可追溯性，我们在授权用户的 AE 中引入了不同程度的翘曲对抗扰动，利用翘曲程度作为可追溯特征。为了进一步加强可追溯性，我们采用了一种名为 "随机翘曲 "的技术来保持对抗性扰动对扭曲的弹性，并采用了一种名为 "噪声模式 "的策略来提高验证模型识别扭曲特征的能力。实验结果表明，TAE-RWP 生成的 AE 具有出色的对抗强度和修复能力，而验证模型在识别失真特征方面表现出色。

{"title":"TAE-RWP: Traceable Adversarial Examples With Recoverable Warping Perturbation","authors":"Fan Xing, Xiaoyi Zhou, Hongli Peng, Xuefeng Fan, Wenbao Han, Yuqing Zhang","doi":"10.1155/2024/6054172","DOIUrl":"https://doi.org/10.1155/2024/6054172","url":null,"abstract":"<div>\u0000 <p>Reversible adversarial example (RAE) is an effective cutting-edge technology for protecting the intellectual property (IP) of datasets. However, existing RAE schemes primarily focus on the adversarial and restoration capabilities of adversarial examples (AE), with little attention paid to traceability, which is crucial for IP protection. This oversight leads to the inability to prevent authorized users from redistributing data, thereby posing significant IP security risks. To address this issue, we propose a novel approach named TAE-RWP, wherein adversarial perturbations in AEs are treated as tools for IP verification. To enable the traceability of AEs, we introduce varying degrees of warping to the adversarial perturbations within the AEs of authorized users, utilizing the warping degree as a traceable feature. To further strengthen traceability, we adopt a technique named “random warping” to maintain the resilience of adversarial perturbations against distortions, and employ a strategy named “noise mode” to improve the verification model’s capacity to recognize distortion features. Experimental results indicate that AEs generated by TAE-RWP exhibit remarkable adversarial strength and restoration abilities, while the verification model demonstrates excellence in recognizing distortion features.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/6054172","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142447783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Comprehensive Survey of Animal Identification: Exploring Data Sources, AI Advances, Classification Obstacles and the Role of Taxonomy 动物识别综合调查：探索数据来源、人工智能进展、分类障碍和分类学的作用

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-11 DOI: 10.1155/2024/7033535

Qianqian Zhang, Khandakar Ahmed, Nalin Sharda, Hua Wang

With the rapid development of entity recognition technology, animal recognition has gradually become essential in modern society, supporting labour-intensive agriculture and animal husbandry tasks. Severe problems such as maintaining biodiversity can also benefit from animal identification technology. However, certain invasive recognition systems have resulted in permanent harm to animals, while noninvasive identification methods also exhibit certain drawbacks. This paper conducts a systematic literature review (SLR), presenting a comprehensive overview of various animal recognition technologies and their applications. Specifically, it examines methodologies such as deep learning, image processing and acoustic analysis used for different animal characteristics and identification purposes. The contribution of machine learning to animal feature extraction is highlighted, emphasising its significance for animal taxonomy and wild species monitoring. Additionally, this review addresses the challenges and limitations of current technologies, including data scarcity, model accuracy and computational requirements, and suggests opportunities for future research to overcome these obstacles.

随着实体识别技术的飞速发展，动物识别已逐渐成为现代社会的必需品，为劳动密集型的农业和畜牧业提供支持。维护生物多样性等严峻问题也可以从动物识别技术中受益。然而，某些侵入式识别系统会对动物造成永久性伤害，而非侵入式识别方法也表现出一定的弊端。本文通过系统的文献综述（SLR），全面介绍了各种动物识别技术及其应用。具体而言，它研究了用于不同动物特征和识别目的的深度学习、图像处理和声学分析等方法。本综述突出了机器学习对动物特征提取的贡献，强调了机器学习对动物分类和野生物种监测的重要意义。此外，本综述还讨论了当前技术面临的挑战和局限性，包括数据稀缺、模型准确性和计算要求，并提出了未来研究克服这些障碍的机会。

{"title":"A Comprehensive Survey of Animal Identification: Exploring Data Sources, AI Advances, Classification Obstacles and the Role of Taxonomy","authors":"Qianqian Zhang, Khandakar Ahmed, Nalin Sharda, Hua Wang","doi":"10.1155/2024/7033535","DOIUrl":"https://doi.org/10.1155/2024/7033535","url":null,"abstract":"<div>\u0000 <p>With the rapid development of entity recognition technology, animal recognition has gradually become essential in modern society, supporting labour-intensive agriculture and animal husbandry tasks. Severe problems such as maintaining biodiversity can also benefit from animal identification technology. However, certain invasive recognition systems have resulted in permanent harm to animals, while noninvasive identification methods also exhibit certain drawbacks. This paper conducts a systematic literature review (SLR), presenting a comprehensive overview of various animal recognition technologies and their applications. Specifically, it examines methodologies such as deep learning, image processing and acoustic analysis used for different animal characteristics and identification purposes. The contribution of machine learning to animal feature extraction is highlighted, emphasising its significance for animal taxonomy and wild species monitoring. Additionally, this review addresses the challenges and limitations of current technologies, including data scarcity, model accuracy and computational requirements, and suggests opportunities for future research to overcome these obstacles.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/7033535","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142430019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Road Ahead: Emerging Trends, Unresolved Issues, and Concluding Remarks in Generative AI—A Comprehensive Review 未来之路：生成式人工智能的新趋势、未决问题和结语--全面回顾

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-10-08 DOI: 10.1155/2024/4013195

Balasubramaniam S., Vanajaroselin Chirchi, Seifedine Kadry, Moorthy Agoramoorthy, Gururama Senthilvel P., Satheesh Kumar K., Sivakumar T. A.

The field of generative artificial intelligence (AI) is experiencing rapid advancements, impacting a multitude of sectors, from computer vision to healthcare. This paper provides a comprehensive review of generative AI’s evolution, significance, and applications, including the foundational architectures such as generative adversarial networks (GANs), variational autoencoders (VAEs), autoregressive models, flow-based models, and diffusion models. We delve into the impact of generative algorithms on computer vision, natural language processing, artistic creation, and healthcare, demonstrating their revolutionary potential in data augmentation, text and speech synthesis, and medical image interpretation. While the transformative capabilities of generative AI are acknowledged, the paper also examines ethical concerns, most notably the advent of deepfakes, calling for the development of robust detection frameworks and responsible use guidelines. As generative AI continues to evolve, driven by advances in neural network architectures and deep learning methodologies, this paper provides a holistic overview of the current landscape and a roadmap for future research and ethical considerations in generative AI.

生成式人工智能（AI）领域正经历着快速发展，影响着从计算机视觉到医疗保健等众多领域。本文全面回顾了生成式人工智能的演变、意义和应用，包括生成对抗网络（GAN）、变异自动编码器（VAE）、自回归模型、基于流的模型和扩散模型等基础架构。我们深入探讨了生成算法对计算机视觉、自然语言处理、艺术创作和医疗保健的影响，展示了它们在数据增强、文本和语音合成以及医学图像解读方面的革命性潜力。在承认生成式人工智能的变革能力的同时，本文还探讨了伦理问题，其中最值得关注的是深度伪造的出现，呼吁开发强大的检测框架和负责任的使用指南。在神经网络架构和深度学习方法进步的推动下，生成式人工智能不断发展，本文全面概述了当前的形势，并为生成式人工智能的未来研究和伦理考虑提供了路线图。

{"title":"The Road Ahead: Emerging Trends, Unresolved Issues, and Concluding Remarks in Generative AI—A Comprehensive Review","authors":"Balasubramaniam S., Vanajaroselin Chirchi, Seifedine Kadry, Moorthy Agoramoorthy, Gururama Senthilvel P., Satheesh Kumar K., Sivakumar T. A.","doi":"10.1155/2024/4013195","DOIUrl":"https://doi.org/10.1155/2024/4013195","url":null,"abstract":"<div>\u0000 <p>The field of generative artificial intelligence (AI) is experiencing rapid advancements, impacting a multitude of sectors, from computer vision to healthcare. This paper provides a comprehensive review of generative AI’s evolution, significance, and applications, including the foundational architectures such as generative adversarial networks (GANs), variational autoencoders (VAEs), autoregressive models, flow-based models, and diffusion models. We delve into the impact of generative algorithms on computer vision, natural language processing, artistic creation, and healthcare, demonstrating their revolutionary potential in data augmentation, text and speech synthesis, and medical image interpretation. While the transformative capabilities of generative AI are acknowledged, the paper also examines ethical concerns, most notably the advent of deepfakes, calling for the development of robust detection frameworks and responsible use guidelines. As generative AI continues to evolve, driven by advances in neural network architectures and deep learning methodologies, this paper provides a holistic overview of the current landscape and a roadmap for future research and ethical considerations in generative AI.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/4013195","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142429383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Consistency and Complementarity Jointly Regularized Subspace Support Vector Data Description for Multimodal Data 多模态数据的一致性与互补性联合正则化子空间支持向量数据描述

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems

Pub Date : 2024-09-30 DOI: 10.1155/2024/1989706

Chuang Wang, Wenjun Hu, Juan Wang, Pengjiang Qian, Shitong Wang

The one-class classification (OCC) problem has always been a popular topic because it is difficult or expensive to obtain abnormal data in many practical applications. Most of OCC methods focused on monomodal data, such as support vector data description (SVDD) and its variants, while we often face multimodal data in reality. The data come from the same task in multimodal learning, and thus, the inherent structures among all modalities should be hold, which is called the consistency principle. However, each modality contains unique information that can be used to repair the incompleteness of other modalities. It is called the complementarity principle. To follow the above two principles, we designed a multimodal graph–regularized term and a sparse projection matrix–regularized term. The former aims to preserve the within-modal structural and between-modal relationships, while the latter aims to richly use the complementarity information hidden in multimodal data. Further, we follow the multimodal subspace (MS) SVDD architecture and use two regularized terms to regularize SVDD. Consequently, a novel OCC method for multimodal data is proposed, called the consistency and complementarity jointly regularized subspace SVDD (CCS-SVDD). Extensive experimental results demonstrate that our approach is more effective and competitive than other algorithms. The source codes are available at https://github.com/wongchuang/CCS_SVDD.

单类分类（OCC）问题一直是一个热门话题，因为在许多实际应用中，获取异常数据非常困难或昂贵。大多数 OCC 方法都侧重于单模态数据，如支持向量数据描述（SVDD）及其变体，而我们在现实中经常面对的是多模态数据。在多模态学习中，数据来自于同一个任务，因此，所有模态之间的固有结构应保持不变，这就是所谓的一致性原则。然而，每种模态都包含独特的信息，可以用来修复其他模态的不完整性。这就是互补性原则。为了遵循上述两个原则，我们设计了多模态图规则化术语和稀疏投影矩阵规则化术语。前者旨在保留模态内结构关系和模态间关系，后者旨在丰富利用隐藏在多模态数据中的互补性信息。此外，我们遵循多模态子空间（MS）SVDD 架构，使用两个正则化项对 SVDD 进行正则化。因此，我们提出了一种用于多模态数据的新型 OCC 方法，即一致性和互补性联合正则化子空间 SVDD（CCS-SVDD）。广泛的实验结果表明，我们的方法比其他算法更有效、更有竞争力。源代码见 https://github.com/wongchuang/CCS_SVDD。

{"title":"Consistency and Complementarity Jointly Regularized Subspace Support Vector Data Description for Multimodal Data","authors":"Chuang Wang, Wenjun Hu, Juan Wang, Pengjiang Qian, Shitong Wang","doi":"10.1155/2024/1989706","DOIUrl":"https://doi.org/10.1155/2024/1989706","url":null,"abstract":"<div>\u0000 <p>The one-class classification (OCC) problem has always been a popular topic because it is difficult or expensive to obtain abnormal data in many practical applications. Most of OCC methods focused on monomodal data, such as support vector data description (SVDD) and its variants, while we often face multimodal data in reality. The data come from the same task in multimodal learning, and thus, the inherent structures among all modalities should be hold, which is called the consistency principle. However, each modality contains unique information that can be used to repair the incompleteness of other modalities. It is called the complementarity principle. To follow the above two principles, we designed a multimodal graph–regularized term and a sparse projection matrix–regularized term. The former aims to preserve the within-modal structural and between-modal relationships, while the latter aims to richly use the complementarity information hidden in multimodal data. Further, we follow the multimodal subspace (MS) SVDD architecture and use two regularized terms to regularize SVDD. Consequently, a novel OCC method for multimodal data is proposed, called the consistency and complementarity jointly regularized subspace SVDD (CCS-SVDD). Extensive experimental results demonstrate that our approach is more effective and competitive than other algorithms. The source codes are available at https://github.com/wongchuang/CCS_SVDD.</p>\u0000 </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/1989706","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142359988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0