首页 > 最新文献

Third International Seminar on Artificial Intelligence, Networking, and Information Technology最新文献

英文 中文
Design of gesture recognition system based on machine vision 基于机器视觉的手势识别系统设计
Weiquan Chen, Jichao Yan, Shufen Huang, L.G. Tan
The project is based on Window10+Python3.6 environment, and uses Python libraries such as OpenCV, Sklearn and PyQt5 to construct a relatively complete gesture recognition and translation system, which can recognize all kinds of static gesture signals in life, and translate them into Chinese or Arabic numerals through image processing.Due to the limitation of the training amount of Support Vector Machines (SVM), this design is only used to recognize gesture actions 1-10, and the interface is designed using PyQt5 for real-time display of the results of gesture recognition translation.The paper focuses on the noise elimination, contour extraction, RGB colorspace and YCrCb feasibility comparison of still images of 1-10 gesture features extracted by computer camera, GUI page design, Fourier operator extraction of gestures, training SVM model, and debugging of the system. The system is also able to be integrated on different development boards and can be embedded in device carriers to meet multi-scene adaptability.
本项目基于Window10+Python3.6环境,使用OpenCV、Sklearn、PyQt5等Python库构建了一个比较完整的手势识别和翻译系统,可以识别生活中各种静态手势信号,并通过图像处理将其翻译成中文或阿拉伯数字。由于支持向量机(SVM)训练量的限制,本设计仅用于识别手势动作1-10,并使用PyQt5设计界面,用于实时显示手势识别翻译结果。本文重点研究了计算机摄像机提取的1-10个手势特征的静止图像的去噪、轮廓提取、RGB色彩空间和YCrCb可行性比较、GUI页面设计、手势的傅立叶算子提取、SVM模型的训练以及系统的调试。该系统还可以集成在不同的开发板上,并可以嵌入到设备载体中,以满足多场景的适应性。
{"title":"Design of gesture recognition system based on machine vision","authors":"Weiquan Chen, Jichao Yan, Shufen Huang, L.G. Tan","doi":"10.1117/12.2667314","DOIUrl":"https://doi.org/10.1117/12.2667314","url":null,"abstract":"The project is based on Window10+Python3.6 environment, and uses Python libraries such as OpenCV, Sklearn and PyQt5 to construct a relatively complete gesture recognition and translation system, which can recognize all kinds of static gesture signals in life, and translate them into Chinese or Arabic numerals through image processing.Due to the limitation of the training amount of Support Vector Machines (SVM), this design is only used to recognize gesture actions 1-10, and the interface is designed using PyQt5 for real-time display of the results of gesture recognition translation.The paper focuses on the noise elimination, contour extraction, RGB colorspace and YCrCb feasibility comparison of still images of 1-10 gesture features extracted by computer camera, GUI page design, Fourier operator extraction of gestures, training SVM model, and debugging of the system. The system is also able to be integrated on different development boards and can be embedded in device carriers to meet multi-scene adaptability.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126328988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emotion recognition of multimodal face images based on convolutional neural network 基于卷积神经网络的多模态人脸图像情感识别
Minli Wen
In the background of changing angle and multi-dimensional posture overlapping, it is necessary to extract and recognize the emotion of multi-modal face images, so as to improve the expression ability of facial emotion. A method of emotion recognition of multi-modal face images under changing angle and multi-dimensional posture overlapping background based on convolution neural network and morphological feature parameter recognition is proposed. Constructing a face feature collection model with variable angle and multi-dimensional posture overlapping background, performing fusion filtering on the collected face images with variable angle and multi-dimensional posture overlapping background, extracting the edge contour feature quantity of the face images with variable angle and multi-dimensional posture overlapping background, and filtering and denoising the original image by using morphological convolution neural network transformation method, Multi-modal wavelet scale decomposition method is used to decompose the emotion features of multi-modal face images under the background of changing angles and multi-dimensional postures, and the detection model of pixel points and similarity features of multi-modal face images under the background of changing angles and multi-dimensional postures is constructed. Morphological convolution neural network transformation method is used to transform the emotion features of multi-modal face images, and edge corner detection and expression feature point clustering analysis are combined to realize the emotion recognition of multi-modal face images. The simulation results show that this method has good performance in feature extraction and clustering of multimodal facial image emotion recognition, good expression ability of facial emotion and high image quality.
在角度变化和多维姿态重叠的背景下,有必要对多模态人脸图像的情感进行提取和识别,以提高面部情感的表达能力。提出了一种基于卷积神经网络和形态特征参数识别的多模态变角度和多维姿态重叠背景下人脸图像情绪识别方法。构建变角度、多维姿态背景叠加的人脸特征采集模型,对采集到的变角度、多维姿态背景叠加的人脸图像进行融合滤波,提取变角度、多维姿态背景叠加的人脸图像边缘轮廓特征量;利用形态学卷积神经网络变换方法对原始图像进行滤波去噪,利用多模态小波尺度分解方法对变换角度和多维姿态背景下多模态人脸图像的情感特征进行分解,构建变换角度和多维姿态背景下多模态人脸图像像素点和相似特征检测模型。采用形态学卷积神经网络变换方法对多模态人脸图像的情感特征进行变换,并结合边缘角检测和表情特征点聚类分析实现多模态人脸图像的情感识别。仿真结果表明,该方法在多模态人脸图像情感识别的特征提取和聚类方面具有良好的性能,具有良好的面部情感表达能力和较高的图像质量。
{"title":"Emotion recognition of multimodal face images based on convolutional neural network","authors":"Minli Wen","doi":"10.1117/12.2667333","DOIUrl":"https://doi.org/10.1117/12.2667333","url":null,"abstract":"In the background of changing angle and multi-dimensional posture overlapping, it is necessary to extract and recognize the emotion of multi-modal face images, so as to improve the expression ability of facial emotion. A method of emotion recognition of multi-modal face images under changing angle and multi-dimensional posture overlapping background based on convolution neural network and morphological feature parameter recognition is proposed. Constructing a face feature collection model with variable angle and multi-dimensional posture overlapping background, performing fusion filtering on the collected face images with variable angle and multi-dimensional posture overlapping background, extracting the edge contour feature quantity of the face images with variable angle and multi-dimensional posture overlapping background, and filtering and denoising the original image by using morphological convolution neural network transformation method, Multi-modal wavelet scale decomposition method is used to decompose the emotion features of multi-modal face images under the background of changing angles and multi-dimensional postures, and the detection model of pixel points and similarity features of multi-modal face images under the background of changing angles and multi-dimensional postures is constructed. Morphological convolution neural network transformation method is used to transform the emotion features of multi-modal face images, and edge corner detection and expression feature point clustering analysis are combined to realize the emotion recognition of multi-modal face images. The simulation results show that this method has good performance in feature extraction and clustering of multimodal facial image emotion recognition, good expression ability of facial emotion and high image quality.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126801429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wind power prediction method based on multi-loop improved gradient boosting decision tree 基于多回路改进梯度增强决策树的风电功率预测方法
Zheng He, Lin Xu, Yufei Ai, Wei Li, Huanhuan Dong
With the increase in energy demand, carbon emissions, environmental pollution, climate change and other issues have become increasingly prominent, China has accelerated the construction of new energy sources. Especially in the field of wind power generation, it is the most potential type of large-scale development of non-hydroelectric renewable energy. Due to the volatility, intermittency and low energy density of wind power, the power of wind power also fluctuates. However, with the early digital transformation of my country's energy industry, a large number of meteorological environments and equipment measurement points have been accumulated in wind power production sites. Power generation related data, using artificial intelligence, deep learning and other technologies can effectively predict the power generation of the station with high precision. A model algorithm of multi-loop gradient boosting decision tree is used in this paper, considering the stationarity test of time series and wind power fluctuation attribute, the accuracy of wind power prediction is effectively improved. Help the power dispatching department to pre-arrange dispatching plans according to wind power changes. Ensure the smooth and safe operation of the power grid.
随着能源需求的增加,碳排放、环境污染、气候变化等问题日益突出,中国加快了新能源的建设。特别是在风力发电领域,它是最具大规模开发潜力的非水电可再生能源类型。由于风电的波动性、间歇性和低能量密度,风电的功率也会出现波动。但随着我国能源行业数字化转型的早期,风电生产现场积累了大量气象环境和设备测量点。发电相关数据,利用人工智能、深度学习等技术,可以有效、高精度地预测电站的发电量。本文采用多环梯度增强决策树模型算法,考虑了时间序列的平稳性检验和风电功率波动属性,有效提高了风电功率预测的精度。根据风电变化情况,协助电力调度部门提前安排调度方案。确保电网平稳、安全运行。
{"title":"Wind power prediction method based on multi-loop improved gradient boosting decision tree","authors":"Zheng He, Lin Xu, Yufei Ai, Wei Li, Huanhuan Dong","doi":"10.1117/12.2667668","DOIUrl":"https://doi.org/10.1117/12.2667668","url":null,"abstract":"With the increase in energy demand, carbon emissions, environmental pollution, climate change and other issues have become increasingly prominent, China has accelerated the construction of new energy sources. Especially in the field of wind power generation, it is the most potential type of large-scale development of non-hydroelectric renewable energy. Due to the volatility, intermittency and low energy density of wind power, the power of wind power also fluctuates. However, with the early digital transformation of my country's energy industry, a large number of meteorological environments and equipment measurement points have been accumulated in wind power production sites. Power generation related data, using artificial intelligence, deep learning and other technologies can effectively predict the power generation of the station with high precision. A model algorithm of multi-loop gradient boosting decision tree is used in this paper, considering the stationarity test of time series and wind power fluctuation attribute, the accuracy of wind power prediction is effectively improved. Help the power dispatching department to pre-arrange dispatching plans according to wind power changes. Ensure the smooth and safe operation of the power grid.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131574864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerating real-time music beat tracking based on hidden Markov model by using genre information 利用类型信息加速基于隐马尔可夫模型的实时音乐节拍跟踪
Liangfeng Zhou, Guangxiao Song, Zhi-jian Wang, Meng Xia
The essence of enjoying music is that people can track music beat anytime and be brought into the scene expressed by music. Music beat tracking is a common task in Music Information Retrieve (MIR). While numerous studies have been done in this field, most works focus on the offline beat tracking. However, tracking music beat in real time is a challenging task for computers. In the past few years, people attach more importance to this field. Researchers care about the music beat without taking music style or context into consideration. In this paper, we propose a method for tracking music beats in real time in conjunction with music genre. Specifically, the proposed model is based on a widely-used framework of Hidden Markov Model (HMM). By recognizing the genre of input music, we narrow the range of beats per minute (BPM), which significantly reduces the number of hidden states in HMM. Consequently, the inference time of beat tracking decreases. We experimentally verify the model on the open-source Ballroom dataset, and its accuracy remains at a competitive level while having a much shorter inference time.
欣赏音乐的本质是人们可以随时跟随音乐的节拍,并被带入音乐所表达的场景。音乐节拍跟踪是音乐信息检索(MIR)中的一项常见任务。虽然这方面的研究很多,但大多数都集中在离线的节拍跟踪上。然而,实时跟踪音乐节拍对计算机来说是一项具有挑战性的任务。在过去的几年里,人们越来越重视这个领域。研究人员只关心音乐节拍,而不考虑音乐风格或背景。在本文中,我们提出了一种结合音乐类型实时跟踪音乐节拍的方法。具体来说,所提出的模型是基于一个广泛使用的隐马尔可夫模型(HMM)框架。通过识别输入音乐的类型,我们缩小了每分钟节拍(BPM)的范围,这大大减少了HMM中隐藏状态的数量。从而减少了拍跟踪的推理时间。我们在开源的舞厅数据集上实验验证了该模型,其准确性保持在竞争水平,同时具有更短的推理时间。
{"title":"Accelerating real-time music beat tracking based on hidden Markov model by using genre information","authors":"Liangfeng Zhou, Guangxiao Song, Zhi-jian Wang, Meng Xia","doi":"10.1117/12.2667472","DOIUrl":"https://doi.org/10.1117/12.2667472","url":null,"abstract":"The essence of enjoying music is that people can track music beat anytime and be brought into the scene expressed by music. Music beat tracking is a common task in Music Information Retrieve (MIR). While numerous studies have been done in this field, most works focus on the offline beat tracking. However, tracking music beat in real time is a challenging task for computers. In the past few years, people attach more importance to this field. Researchers care about the music beat without taking music style or context into consideration. In this paper, we propose a method for tracking music beats in real time in conjunction with music genre. Specifically, the proposed model is based on a widely-used framework of Hidden Markov Model (HMM). By recognizing the genre of input music, we narrow the range of beats per minute (BPM), which significantly reduces the number of hidden states in HMM. Consequently, the inference time of beat tracking decreases. We experimentally verify the model on the open-source Ballroom dataset, and its accuracy remains at a competitive level while having a much shorter inference time.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134433792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advantage policy update based on proximal policy optimization 基于近端策略优化的优势策略更新
Zilin Zeng, Junwei Wang, Zhigang Hu, Dongnan Su, Peng Shang
In this paper, a novel policy network update approach based on Proximal Policy Optimization (PPO), Advantageous Update Policy Proximal Policy Optimization (AUP-PPO), is proposed to alleviate the problem of over-fitting caused by the use of shared layers for policy and value functions. Extended from the previous sample-efficient reinforcement learning method PPO that uses separate networks to learn policy and value functions to make them decouple optimization, AUP-PPO uses the value function to calculate the advantage and updates the policy with the loss between the current and target advantage function as a penalty term instead of the value function. Evaluated by multiple benchmark control tasks in Open-AI gym, AUP-PPO exhibits better generalization to the environment and achieves faster convergence and better robustness compared with the original PPO.
本文提出了一种基于近端策略优化(PPO)的策略网络更新方法——优势更新策略近端策略优化(advantage update policy Proximal policy Optimization, upp -PPO),以缓解由于策略函数和价值函数使用共享层而导致的过拟合问题。upp -PPO从之前的样本高效强化学习方法PPO(使用单独的网络学习策略和价值函数,使其解耦优化)扩展而来,使用价值函数计算优势,并以当前和目标优势函数之间的损失作为惩罚项而不是价值函数来更新策略。通过Open-AI gym中多个基准控制任务的评估,与原来的PPO相比,upp -PPO对环境具有更好的泛化能力,收敛速度更快,鲁棒性更好。
{"title":"Advantage policy update based on proximal policy optimization","authors":"Zilin Zeng, Junwei Wang, Zhigang Hu, Dongnan Su, Peng Shang","doi":"10.1117/12.2667235","DOIUrl":"https://doi.org/10.1117/12.2667235","url":null,"abstract":"In this paper, a novel policy network update approach based on Proximal Policy Optimization (PPO), Advantageous Update Policy Proximal Policy Optimization (AUP-PPO), is proposed to alleviate the problem of over-fitting caused by the use of shared layers for policy and value functions. Extended from the previous sample-efficient reinforcement learning method PPO that uses separate networks to learn policy and value functions to make them decouple optimization, AUP-PPO uses the value function to calculate the advantage and updates the policy with the loss between the current and target advantage function as a penalty term instead of the value function. Evaluated by multiple benchmark control tasks in Open-AI gym, AUP-PPO exhibits better generalization to the environment and achieves faster convergence and better robustness compared with the original PPO.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131033673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target scale information detection based on improved Faster R-CNN 基于改进Faster R-CNN的目标尺度信息检测
Yu Liu, Zhiqiang Wang, Fengjing Zhang, Jun Xie, Zhaohong Xu
In order to obtain the scale information of ship targets effectively, we proposed an improved Faster R-CNN algorithm which integrated multi-scale region proposal and pooling of ROI, visual attention mechanism and rotation region regression and suppression, and ship targets can be positioned by the rotate quadrangle bounding boxes to obtain the scale information of them. Our improved model is based on the standard Faster R-CNN and is maintained through end-to-end training.
为了有效地获取舰船目标的尺度信息,我们提出了一种改进的Faster R-CNN算法,该算法将多尺度区域建议与ROI池化、视觉注意机制和旋转区域回归抑制相结合,通过旋转四边形包围框定位舰船目标,获取舰船目标的尺度信息。我们改进的模型基于标准的Faster R-CNN,并通过端到端训练来维护。
{"title":"Target scale information detection based on improved Faster R-CNN","authors":"Yu Liu, Zhiqiang Wang, Fengjing Zhang, Jun Xie, Zhaohong Xu","doi":"10.1117/12.2667260","DOIUrl":"https://doi.org/10.1117/12.2667260","url":null,"abstract":"In order to obtain the scale information of ship targets effectively, we proposed an improved Faster R-CNN algorithm which integrated multi-scale region proposal and pooling of ROI, visual attention mechanism and rotation region regression and suppression, and ship targets can be positioned by the rotate quadrangle bounding boxes to obtain the scale information of them. Our improved model is based on the standard Faster R-CNN and is maintained through end-to-end training.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132126119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DOA estimation of rail transit tunnel far field signals based on reconstructive subspace MUSIC 基于重构子空间MUSIC的轨道交通隧道远场信号DOA估计
Yanliang Jin, Rukun Lyu, Yuan Gao, Guoxing Zheng
For the rail traffic, feasibility of 5th Generation (5G) mobile communication massive Multiple Input Multiple Output (MIMO) used in the tunnel is studying. In order to effectively set up base station, antenna and intelligent reflecting surface (IRS), the direction of arrival (DOA) must be captured. Aiming at the problem of poor performance of two-dimensional MUltiple SIgnal Classification (2D-MUSIC) algorithm in the environment of low signal-to-noise ratio (SNR), small snapshots and small incident angle interval signals, an improved MUSIC algorithm with uniform rectangular array (URA) based on reconstructive subspace is proposed. By reconstructing subspace, a new spatial spectrum is obtained in terms of subspace eigenvectors. Then, the DOAs are obtained by searching the maximum of the new spatial spectrum. High estimation accuracy and angular resolution are often required in practical applications of rail traffic 5G system, and there exist tunnel scenarios that we need the improved MUSIC algorithm has the ability to conduct two-dimensional DOA estimation. Simulations and the measured data of straight tunnel scenario are used to verify the effectiveness of the proposed algorithm and its higher searching precision in complex signal environments such as low SNR, strong-to-weak proximity, and coherent interference.
针对轨道交通,正在研究第5代(5G)移动通信海量多输入多输出(MIMO)在隧道中应用的可行性。为了有效地设置基站、天线和智能反射面,必须捕获到达方向(DOA)。针对二维多信号分类(2D-MUSIC)算法在低信噪比、小快照和小入射角间隔信号环境下性能不佳的问题,提出了一种基于重构子空间的均匀矩形阵列(URA)改进MUSIC算法。通过重构子空间,得到了由子空间特征向量表示的新的空间谱。然后,通过搜索新空间谱的最大值得到doa;在轨道交通5G系统的实际应用中,往往需要较高的估计精度和角度分辨率,并且存在隧道场景,我们需要改进的MUSIC算法具有二维DOA估计的能力。通过仿真和直隧道场景的实测数据,验证了该算法在低信噪比、强弱接近、相干干扰等复杂信号环境下的有效性和较高的搜索精度。
{"title":"DOA estimation of rail transit tunnel far field signals based on reconstructive subspace MUSIC","authors":"Yanliang Jin, Rukun Lyu, Yuan Gao, Guoxing Zheng","doi":"10.1117/12.2667279","DOIUrl":"https://doi.org/10.1117/12.2667279","url":null,"abstract":"For the rail traffic, feasibility of 5th Generation (5G) mobile communication massive Multiple Input Multiple Output (MIMO) used in the tunnel is studying. In order to effectively set up base station, antenna and intelligent reflecting surface (IRS), the direction of arrival (DOA) must be captured. Aiming at the problem of poor performance of two-dimensional MUltiple SIgnal Classification (2D-MUSIC) algorithm in the environment of low signal-to-noise ratio (SNR), small snapshots and small incident angle interval signals, an improved MUSIC algorithm with uniform rectangular array (URA) based on reconstructive subspace is proposed. By reconstructing subspace, a new spatial spectrum is obtained in terms of subspace eigenvectors. Then, the DOAs are obtained by searching the maximum of the new spatial spectrum. High estimation accuracy and angular resolution are often required in practical applications of rail traffic 5G system, and there exist tunnel scenarios that we need the improved MUSIC algorithm has the ability to conduct two-dimensional DOA estimation. Simulations and the measured data of straight tunnel scenario are used to verify the effectiveness of the proposed algorithm and its higher searching precision in complex signal environments such as low SNR, strong-to-weak proximity, and coherent interference.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132144386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on feature extraction algorithm based on meta-learning 基于元学习的特征提取算法研究
Yanliang Jin, Baorong Fan, Yuan Gao
Feature extraction is an important research topic in the field of image processing.In autonomous driving, it is of great importance to extract the feature information of the picture obtained by the vehicle camera for the agent to better understand the environment information. In order to improve the quality of feature extraction, this paper combines meta-learning and deep learning-based feature extraction methods, and proposes a Meta-VAE-WGAN-GP (MVWP) feature extraction algorithm, and applies it to automatic driving. Firstly, aiming at the problem of parameter centralization in Wasserstein generative adversarial network (WGAN) and the problem of gradient explosion and gradient disappearance caused by improper manual parameter adjustment, a generative adversarial network based on gradient penalty and Wasserstein distance (WGAN-GP) was proposed, and it was combined with VAE. The VAE-WGAN-GP model is formed. Secondly, aiming at the problem that the feature extraction model needs to be trained from scratch every time it is faced with a new task, and the training time is too long, the MVWP model is formed by combining meta-learning with VAE-WGAN-GP (VWP) mentioned above. Finally, the experimental results show that compared with VAE, VAE-WGAN and VWP, the training speed of MVWP model is increased by about 6 times, the reconstruction loss is reduced by 55.9%, 37.8% and 20.2%, respectively, and the reconstructed images are clearer.
特征提取是图像处理领域的一个重要研究课题。在自动驾驶中,从车载摄像头获取的图像中提取特征信息对于智能体更好地理解环境信息非常重要。为了提高特征提取的质量,本文将元学习和基于深度学习的特征提取方法相结合,提出了一种Meta-VAE-WGAN-GP (MVWP)特征提取算法,并将其应用于自动驾驶。首先,针对Wasserstein生成式对抗网络(WGAN)中的参数集中化问题以及人工参数调整不当导致的梯度爆炸和梯度消失问题,提出了一种基于梯度惩罚和Wasserstein距离的生成式对抗网络(WGAN- gp),并将其与VAE相结合。形成VAE-WGAN-GP模型。其次,针对特征提取模型每次面对新任务都需要从头开始训练,且训练时间过长的问题,将元学习与上述VAE-WGAN-GP (VWP)相结合,形成MVWP模型。最后,实验结果表明,与VAE、VAE- wgan和VWP相比,MVWP模型的训练速度提高了约6倍,重构损失分别降低了55.9%、37.8%和20.2%,重构图像更加清晰。
{"title":"Research on feature extraction algorithm based on meta-learning","authors":"Yanliang Jin, Baorong Fan, Yuan Gao","doi":"10.1117/12.2667413","DOIUrl":"https://doi.org/10.1117/12.2667413","url":null,"abstract":"Feature extraction is an important research topic in the field of image processing.In autonomous driving, it is of great importance to extract the feature information of the picture obtained by the vehicle camera for the agent to better understand the environment information. In order to improve the quality of feature extraction, this paper combines meta-learning and deep learning-based feature extraction methods, and proposes a Meta-VAE-WGAN-GP (MVWP) feature extraction algorithm, and applies it to automatic driving. Firstly, aiming at the problem of parameter centralization in Wasserstein generative adversarial network (WGAN) and the problem of gradient explosion and gradient disappearance caused by improper manual parameter adjustment, a generative adversarial network based on gradient penalty and Wasserstein distance (WGAN-GP) was proposed, and it was combined with VAE. The VAE-WGAN-GP model is formed. Secondly, aiming at the problem that the feature extraction model needs to be trained from scratch every time it is faced with a new task, and the training time is too long, the MVWP model is formed by combining meta-learning with VAE-WGAN-GP (VWP) mentioned above. Finally, the experimental results show that compared with VAE, VAE-WGAN and VWP, the training speed of MVWP model is increased by about 6 times, the reconstruction loss is reduced by 55.9%, 37.8% and 20.2%, respectively, and the reconstructed images are clearer.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124890456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of maritime unmanned combat cluster employment in the context of anti-intervention/area denial 反干涉/区域拒止背景下海上无人作战集群部署分析
Jiangshan Liu, P. Pen
With the new operational styles such as mosaic warfare and multi-domain warfare of the US military moving from concept to application, the winning elements and operational modes of future maritime operations have been profoundly changed. Firstly, the military capabilities of the US military to break through anti-access/area denial are analyzed, and then the characteristics and effects of the implementation of multi-domain warfare under the mosaic warfare concept in breaking through anti-access/area denial are extracted from the military capabilities, operational styles, and practical applications. On this basis, countermeasure suggestions are proposed to deal with the implementation of multi-domain warfare under the mosaic warfare concept. The practical significance of building unmanned maritime combat clusters is explained from the perspective of military needs, and the force design, equipment development, and combat style of unmanned maritime combat clusters are described in detail. A typical combat scenario is used to show the combat process of unmanned maritime combat clusters. Finally, the key problem model of intelligent command and control decision of maritime unmanned cluster is established for this typical combat scenario, and the corresponding algorithmic framework is constructed for the input and output of the typical combat scenario, which provides research ideas for further empirical research.
随着美军马赛克战、多域战等新型作战方式从概念走向应用,未来海上作战的制胜要素和作战方式发生深刻变化。首先分析了美军突破反介入/区域拒止的军事能力,然后从军事能力、作战风格和实际应用等方面,提取了马赛克战争理念下实施多域战突破反介入/区域拒止的特点和效果。在此基础上,提出了应对马赛克作战理念下多域作战实施的对策建议。从军事需求的角度阐述了建设海上无人作战集群的现实意义,详细阐述了海上无人作战集群的力量设计、装备研制、作战风格等。通过典型作战场景,展示了海上无人作战集群的作战过程。最后,针对该典型作战场景建立了海上无人集群智能指挥控制决策关键问题模型,并针对典型作战场景的输入和输出构建了相应的算法框架,为进一步的实证研究提供了研究思路。
{"title":"Analysis of maritime unmanned combat cluster employment in the context of anti-intervention/area denial","authors":"Jiangshan Liu, P. Pen","doi":"10.1117/12.2667516","DOIUrl":"https://doi.org/10.1117/12.2667516","url":null,"abstract":"With the new operational styles such as mosaic warfare and multi-domain warfare of the US military moving from concept to application, the winning elements and operational modes of future maritime operations have been profoundly changed. Firstly, the military capabilities of the US military to break through anti-access/area denial are analyzed, and then the characteristics and effects of the implementation of multi-domain warfare under the mosaic warfare concept in breaking through anti-access/area denial are extracted from the military capabilities, operational styles, and practical applications. On this basis, countermeasure suggestions are proposed to deal with the implementation of multi-domain warfare under the mosaic warfare concept. The practical significance of building unmanned maritime combat clusters is explained from the perspective of military needs, and the force design, equipment development, and combat style of unmanned maritime combat clusters are described in detail. A typical combat scenario is used to show the combat process of unmanned maritime combat clusters. Finally, the key problem model of intelligent command and control decision of maritime unmanned cluster is established for this typical combat scenario, and the corresponding algorithmic framework is constructed for the input and output of the typical combat scenario, which provides research ideas for further empirical research.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127893662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Technology research of micro air-launched decoy 微型空射诱饵技术研究
Fang-zheng Zhao, Xiao Zhang, Xin Zhang
Micro air-launched decoy (MALD) is electronic weapon aiming to interfere with enemy air defense systems. MALD uses signal enhancement subsystems and active radar jammers as its loads. This paper discusses the basic situation of MALD in detail, and analyzes the technical advantages and development trends.
微型空射诱饵(MALD)是一种以干扰敌方防空系统为目标的电子武器。MALD采用信号增强子系统和有源雷达干扰机作为负载。本文详细论述了MALD的基本情况,分析了其技术优势和发展趋势。
{"title":"Technology research of micro air-launched decoy","authors":"Fang-zheng Zhao, Xiao Zhang, Xin Zhang","doi":"10.1117/12.2667665","DOIUrl":"https://doi.org/10.1117/12.2667665","url":null,"abstract":"Micro air-launched decoy (MALD) is electronic weapon aiming to interfere with enemy air defense systems. MALD uses signal enhancement subsystems and active radar jammers as its loads. This paper discusses the basic situation of MALD in detail, and analyzes the technical advantages and development trends.","PeriodicalId":128051,"journal":{"name":"Third International Seminar on Artificial Intelligence, Networking, and Information Technology","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121179015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Third International Seminar on Artificial Intelligence, Networking, and Information Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1