首页 > 最新文献

Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition最新文献

英文 中文
Study on Variable Step Size Blind Equalization Algorithm Based on CMA 基于CMA的变步长盲均衡算法研究
Mingyu Yang, Dongming Xu
The inter-symbol interference caused by channel distortion in the communication process seriously affects the communication quality, and this problem is often solved by equalization technology. The principle of the traditional blind equalization constant modulus algorithm (CMA) with fixed step is introduced, and the problem of fast convergence speed and small steady-state error is analyzed by simulation. In order to solve this problem, a blind equalization algorithm with variable step size based on CMA is proposed. In order to solve the problem of fast convergence speed and small steady-state error, the CMA algorithm is improved and the principle of the improved algorithm is described, and the influence of the step on the performance of the algorithm is analyzed. Finally, the simulation experiment proves that the improved algorithm can speed up the convergence speed and keep a small steady-state error at the same time.
在通信过程中,由于信道失真引起的码间干扰严重影响通信质量,这一问题往往通过均衡技术来解决。介绍了传统固定步长盲均衡恒模算法的原理,并通过仿真分析了该算法收敛速度快、稳态误差小的问题。为了解决这一问题,提出了一种基于CMA的变步长盲均衡算法。为了解决CMA算法收敛速度快、稳态误差小的问题,对CMA算法进行了改进,阐述了改进算法的原理,并分析了步长对算法性能的影响。最后通过仿真实验证明,改进算法在加快收敛速度的同时保持较小的稳态误差。
{"title":"Study on Variable Step Size Blind Equalization Algorithm Based on CMA","authors":"Mingyu Yang, Dongming Xu","doi":"10.1145/3573942.3573997","DOIUrl":"https://doi.org/10.1145/3573942.3573997","url":null,"abstract":"The inter-symbol interference caused by channel distortion in the communication process seriously affects the communication quality, and this problem is often solved by equalization technology. The principle of the traditional blind equalization constant modulus algorithm (CMA) with fixed step is introduced, and the problem of fast convergence speed and small steady-state error is analyzed by simulation. In order to solve this problem, a blind equalization algorithm with variable step size based on CMA is proposed. In order to solve the problem of fast convergence speed and small steady-state error, the CMA algorithm is improved and the principle of the improved algorithm is described, and the influence of the step on the performance of the algorithm is analyzed. Finally, the simulation experiment proves that the improved algorithm can speed up the convergence speed and keep a small steady-state error at the same time.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115643709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SN-YOLO: Improved YOLOv5 with Softer-NMS and SIOU for Object Detection SN-YOLO:改进的YOLOv5与soft - nms和SIOU的目标检测
Wanyu Deng, Zhen Wang
As a lightweight target detection network, YOLOv5 is popular in the industry for its advantages of fast speed and small model, but the detection accuracy is not very high. In response to this problem, we propose an improved model SN-YOLO based on YOLOv5. First, we introduce Softer-NMS as the post-processing method of the model, which will make the prediction box more accurate. Secondly, we improved the loss function of the original algorithm and introduced the SIOU loss function to optimize the model and improve the accuracy of the algorithm. Finally, in order to improve the feature extraction ability of the backbone, we implanted the CBAM (Convolutional block attention module) module into the algorithm. We validate the model using the 2007 and 2012 datasets of PASCAL VOC. The experimental results show that SN-YOLO has a great improvement over the original model in all aspects. The effectiveness of the algorithm is verified.
YOLOv5作为一种轻量级的目标检测网络,以其速度快、模型小等优点受到业界的青睐,但检测精度不是很高。针对这一问题,我们提出了基于YOLOv5的改进模型SN-YOLO。首先,我们引入了soft - nms作为模型的后处理方法,使预测框更加准确。其次,对原算法的损失函数进行改进,引入SIOU损失函数对模型进行优化,提高算法的精度。最后,为了提高主干的特征提取能力,我们在算法中植入了CBAM (Convolutional block attention module)模块。我们使用PASCAL VOC的2007年和2012年数据集验证了该模型。实验结果表明,SN-YOLO在各方面都比原模型有很大的改进。验证了该算法的有效性。
{"title":"SN-YOLO: Improved YOLOv5 with Softer-NMS and SIOU for Object Detection","authors":"Wanyu Deng, Zhen Wang","doi":"10.1145/3573942.3574029","DOIUrl":"https://doi.org/10.1145/3573942.3574029","url":null,"abstract":"As a lightweight target detection network, YOLOv5 is popular in the industry for its advantages of fast speed and small model, but the detection accuracy is not very high. In response to this problem, we propose an improved model SN-YOLO based on YOLOv5. First, we introduce Softer-NMS as the post-processing method of the model, which will make the prediction box more accurate. Secondly, we improved the loss function of the original algorithm and introduced the SIOU loss function to optimize the model and improve the accuracy of the algorithm. Finally, in order to improve the feature extraction ability of the backbone, we implanted the CBAM (Convolutional block attention module) module into the algorithm. We validate the model using the 2007 and 2012 datasets of PASCAL VOC. The experimental results show that SN-YOLO has a great improvement over the original model in all aspects. The effectiveness of the algorithm is verified.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134236159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint Multi-Scale Residual and Motion Feature Learning for Action Recognition 联合多尺度残差和运动特征学习用于动作识别
Linfeng Yang, Zhixiang Zhu, Chenwu Wang, Pei Wang, Shaobo Hei
For action recognition, two-stream networks consisting of RGB and optical flow has been widely used, showing high recognition accuracy. However, optical flow computation is time-consuming and requires a large amount of storage space, and the recognition efficiency is very low. To alleviate this problem, we propose an Adaptive Multi-Scale Residual (AMSR) module and a Long Short Term Motion Squeeze (LSMS) module, which are inserted into the 2D convolutional neural network to improve the accuracy of action recognition and achieve a balance of accuracy and speed. The AMSR module adaptively fuses multi-scale feature maps to fully utilize the semantic information provided by deep feature maps and the detailed information provided by shallow feature maps. The LSMS module is a learnable lightweight motion feature extractor for learning long-term motion features of adjacent and non-adjacent frames, thus replacing the traditional optical flow and improving the accuracy of action recognition. Experimental results on UCF-101 and HMDB-51 datasets demonstrate that the method proposed in this paper achieves competitive performance compared to state-of-the-art methods with only a small increase in parameters and computational cost.
在动作识别方面,由RGB和光流组成的两流网络得到了广泛的应用,具有较高的识别精度。但是,光流计算耗时长,需要大量的存储空间,识别效率很低。为了解决这一问题,我们提出了一个自适应多尺度残差(AMSR)模块和一个长短期运动挤压(LSMS)模块,将其插入到二维卷积神经网络中,以提高动作识别的精度,实现精度和速度的平衡。AMSR模块自适应融合多尺度特征图,充分利用深层特征图提供的语义信息和浅层特征图提供的详细信息。LSMS模块是一种可学习的轻量级运动特征提取器,用于学习相邻帧和非相邻帧的长期运动特征,从而取代传统的光流,提高动作识别的准确性。在UCF-101和HMDB-51数据集上的实验结果表明,本文提出的方法与现有方法相比具有竞争力,且参数和计算成本仅略有增加。
{"title":"Joint Multi-Scale Residual and Motion Feature Learning for Action Recognition","authors":"Linfeng Yang, Zhixiang Zhu, Chenwu Wang, Pei Wang, Shaobo Hei","doi":"10.1145/3573942.3574082","DOIUrl":"https://doi.org/10.1145/3573942.3574082","url":null,"abstract":"For action recognition, two-stream networks consisting of RGB and optical flow has been widely used, showing high recognition accuracy. However, optical flow computation is time-consuming and requires a large amount of storage space, and the recognition efficiency is very low. To alleviate this problem, we propose an Adaptive Multi-Scale Residual (AMSR) module and a Long Short Term Motion Squeeze (LSMS) module, which are inserted into the 2D convolutional neural network to improve the accuracy of action recognition and achieve a balance of accuracy and speed. The AMSR module adaptively fuses multi-scale feature maps to fully utilize the semantic information provided by deep feature maps and the detailed information provided by shallow feature maps. The LSMS module is a learnable lightweight motion feature extractor for learning long-term motion features of adjacent and non-adjacent frames, thus replacing the traditional optical flow and improving the accuracy of action recognition. Experimental results on UCF-101 and HMDB-51 datasets demonstrate that the method proposed in this paper achieves competitive performance compared to state-of-the-art methods with only a small increase in parameters and computational cost.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134520985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Face Forgery Detection Method Based on Augmented Dual-Stream Networks 一种基于增强双流网络的人脸伪造检测方法
Yumei Liu, Yong Zhang, Weiran Liu
The current face forgery methods based on deep learning are becoming more mature and abundant, and existing detection techniques have some limitations and applicability issues that make it difficult to effectively detect such behaviour. In this paper, we propose an enhanced dual-stream FC_2_stream network model based on dual-stream networks to detect forged regions in manipulated face images through end-to-end training of the images. The RGB stream is used to extract features from the RGB image to find the forged traces; the noise stream uses the filtering layer of the SRM (Steganalysis Rich Model) model to extract the noise features and find the inconsistency between the noise in the real region and the forged region in the fake face, then the features of the two streams are fused with a bilinear pooling layer to predict the forged region, and finally the forged region is determined by whether the blending boundary of the forged image is displayed to determine the image authenticity. Experiments conducted on four benchmark datasets show that our model is still effective against forgeries generated by unknown face manipulation methods, and also demonstrate the superior generalisation capability of our model.
目前基于深度学习的人脸伪造方法日趋成熟和丰富,现有的检测技术存在一定的局限性和适用性问题,难以有效检测出人脸伪造行为。在本文中,我们提出了一种基于双流网络的增强双流FC_2_stream网络模型,通过对被操纵的人脸图像进行端到端训练来检测伪造区域。利用RGB流从RGB图像中提取特征,查找伪造痕迹;噪声流使用SRM (Steganalysis Rich Model)模型的滤波层提取噪声特征,发现假人脸真实区域和伪造区域的噪声不一致,然后用双线性池化层将两流特征融合预测伪造区域,最后通过伪造图像的混合边界是否显示来确定伪造区域,以确定图像的真实性。在四个基准数据集上进行的实验表明,我们的模型仍然有效地对抗未知人脸操纵方法产生的伪造,并且也证明了我们的模型具有优越的泛化能力。
{"title":"A Novel Face Forgery Detection Method Based on Augmented Dual-Stream Networks","authors":"Yumei Liu, Yong Zhang, Weiran Liu","doi":"10.1145/3573942.3574030","DOIUrl":"https://doi.org/10.1145/3573942.3574030","url":null,"abstract":"The current face forgery methods based on deep learning are becoming more mature and abundant, and existing detection techniques have some limitations and applicability issues that make it difficult to effectively detect such behaviour. In this paper, we propose an enhanced dual-stream FC_2_stream network model based on dual-stream networks to detect forged regions in manipulated face images through end-to-end training of the images. The RGB stream is used to extract features from the RGB image to find the forged traces; the noise stream uses the filtering layer of the SRM (Steganalysis Rich Model) model to extract the noise features and find the inconsistency between the noise in the real region and the forged region in the fake face, then the features of the two streams are fused with a bilinear pooling layer to predict the forged region, and finally the forged region is determined by whether the blending boundary of the forged image is displayed to determine the image authenticity. Experiments conducted on four benchmark datasets show that our model is still effective against forgeries generated by unknown face manipulation methods, and also demonstrate the superior generalisation capability of our model.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130377352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Radar micro moving gesture recognition method based on multi-scale fusion deep network 基于多尺度融合深度网络的雷达微动手势识别方法
Zhiqiang Bao, Tiantian Liu
In order to solve the problem that the micro moving gesture features are not obvious and difficult to be identified, a micro moving gesture recognition method based on multi-scale fusion deep network for millimeter wave radar is proposed in this paper. The method is mainly composed of 2D convolution module, multi-scale fusion module and attention mechanism module. The multi-scale fusion module is composed of three residual blocks of different scales, which can obtain receptive fields of different sizes and obtain multi-scale features. Meanwhile, residual blocks of different scales are fused to increase the diversity of the network and better extract the deep features of the data. The Squeeze-and-congestion (SE) attention mechanism module is added to suppress the channel characteristics with little information. This improves the network identification accuracy and reduces the number of parameters and computation. The experimental results show that this method is simple to implement, doesn't need to do complex data preprocessing. The convergence speed of the network is fast, which can realize the effective recognition of the micro moving gesture.
为了解决微动手势特征不明显、难以识别的问题,本文提出了一种基于多尺度融合深度网络的毫米波雷达微动手势识别方法。该方法主要由二维卷积模块、多尺度融合模块和注意机制模块组成。多尺度融合模块由三个不同尺度的残差块组成,可以获得不同大小的感受场,获得多尺度特征。同时,对不同尺度的残差块进行融合,增加网络的多样性,更好地提取数据的深层特征。增加了SE (squeeze -and-拥塞)注意机制模块来抑制信息较少的信道特征。这提高了网络识别的精度,减少了参数的数量和计算量。实验结果表明,该方法实现简单,不需要进行复杂的数据预处理。该网络收敛速度快,能够实现对微动手势的有效识别。
{"title":"Radar micro moving gesture recognition method based on multi-scale fusion deep network","authors":"Zhiqiang Bao, Tiantian Liu","doi":"10.1145/3573942.3574076","DOIUrl":"https://doi.org/10.1145/3573942.3574076","url":null,"abstract":"In order to solve the problem that the micro moving gesture features are not obvious and difficult to be identified, a micro moving gesture recognition method based on multi-scale fusion deep network for millimeter wave radar is proposed in this paper. The method is mainly composed of 2D convolution module, multi-scale fusion module and attention mechanism module. The multi-scale fusion module is composed of three residual blocks of different scales, which can obtain receptive fields of different sizes and obtain multi-scale features. Meanwhile, residual blocks of different scales are fused to increase the diversity of the network and better extract the deep features of the data. The Squeeze-and-congestion (SE) attention mechanism module is added to suppress the channel characteristics with little information. This improves the network identification accuracy and reduces the number of parameters and computation. The experimental results show that this method is simple to implement, doesn't need to do complex data preprocessing. The convergence speed of the network is fast, which can realize the effective recognition of the micro moving gesture.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134167521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Recognition Model of Crop Diseases and Insect Pests Based on Convolutional Neural Network 基于卷积神经网络的农作物病虫害识别模型研究
Pi Qiao, Zilu Wang
Most of the traditional detection methods for crop diseases and insect pests are manually operated in the field according to the experience and technology of the staff, which have the disadvantages of long time and low efficiency. With the development of deep learning technology, the application of complex deep neural network algorithm models in the field of crop diseases and insect pests can effectively solve the above problems, however, the current research on the identification method of crop diseases and insect pests only focuses on the identification and analysis of single crop diseases and insect pests, and does not analyze and improve the analysis and improvement of various crops. Therefore, this paper proposes a recognition model of crop pests and diseases based on convolutional neural network. First, on the bilinear network model, the ResNet50 network is used as the feature extractor, that is, the backbone network of the network, instead of the original VGG-D and VGG-M backbone networks. Secondly, a connect module is added to design the bilinear network model and the extractor to do mutual outer product with the previous features of different levels, so that it is connected with the outer product of the feature vector. Finally, the loss function is used to conduct experiments on the AI Challenger 2018 crop pest and disease dataset. The experimental results show that the average recognition rate of the improved B-CNN-ResNet50-connect network model reaches 89.62%.
传统的农作物病虫害检测方法大多是根据工作人员的经验和技术在田间进行人工操作,存在时间长、效率低的缺点。随着深度学习技术的发展,复杂的深度神经网络算法模型在作物病虫害领域的应用可以有效地解决上述问题,然而,目前对作物病虫害识别方法的研究只侧重于对单一作物病虫害的识别和分析,并没有对各种作物的分析和改进进行分析和改进。为此,本文提出了一种基于卷积神经网络的农作物病虫害识别模型。首先,在双线性网络模型上,使用ResNet50网络作为特征提取器,即网络的骨干网,而不是原来的VGG-D和VGG-M骨干网。其次,增加连接模块设计双线性网络模型,提取器与之前不同层次的特征相互外积,使其与特征向量的外积相连接;最后,利用损失函数在AI Challenger 2018作物病虫害数据集上进行实验。实验结果表明,改进的B-CNN-ResNet50-connect网络模型的平均识别率达到89.62%。
{"title":"Research on Recognition Model of Crop Diseases and Insect Pests Based on Convolutional Neural Network","authors":"Pi Qiao, Zilu Wang","doi":"10.1145/3573942.3574087","DOIUrl":"https://doi.org/10.1145/3573942.3574087","url":null,"abstract":"Most of the traditional detection methods for crop diseases and insect pests are manually operated in the field according to the experience and technology of the staff, which have the disadvantages of long time and low efficiency. With the development of deep learning technology, the application of complex deep neural network algorithm models in the field of crop diseases and insect pests can effectively solve the above problems, however, the current research on the identification method of crop diseases and insect pests only focuses on the identification and analysis of single crop diseases and insect pests, and does not analyze and improve the analysis and improvement of various crops. Therefore, this paper proposes a recognition model of crop pests and diseases based on convolutional neural network. First, on the bilinear network model, the ResNet50 network is used as the feature extractor, that is, the backbone network of the network, instead of the original VGG-D and VGG-M backbone networks. Secondly, a connect module is added to design the bilinear network model and the extractor to do mutual outer product with the previous features of different levels, so that it is connected with the outer product of the feature vector. Finally, the loss function is used to conduct experiments on the AI Challenger 2018 crop pest and disease dataset. The experimental results show that the average recognition rate of the improved B-CNN-ResNet50-connect network model reaches 89.62%.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133422484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Image Encryption Algorithm Based on Latin Squares and Adaptive Z-Diffusion 基于拉丁平方和自适应z扩散的图像加密算法
Yangguang Lou, Shu-cui Xie, Jianzhong Zhang
This paper proposes a chaotic encryption algorithm based on Latin squares and adaptive Z-diffusion. First, in order to improve the defects of the traditional Sine system, two-dimensional enhance Sine chaotic system (2D-ESCS) is designed. In terms of bifurcation diagram, Lyapunov exponent and NIST, we can observe that 2D-ESCS have continuous and large chaotic ranges. Second, the generation of Latin squares through pseudorandom sequences generated by 2D-ESCS and then perform scrambling operation with the image. Third, adaptive Z-diffusion depends on the location of the pixels. the cipher image is calculated by different combinations of pseudorandom numbers, plain images pixel values and intermediate cipher image pixel values. Finally, simulation experiments and security analysis show that the proposed algorithm has a high security level to resist various cryptanalytic attacks and a high execution efficiency.
提出了一种基于拉丁平方和自适应z扩散的混沌加密算法。首先,为了改进传统正弦混沌系统的缺陷,设计了二维增强正弦混沌系统(2D-ESCS)。从分岔图、Lyapunov指数和NIST可以看出,2D-ESCS具有连续和大的混沌范围。其次,利用2D-ESCS生成的伪随机序列生成拉丁平方,然后对图像进行置乱操作;第三,自适应z扩散取决于像素的位置。通过伪随机数、普通图像像素值和中间密码图像像素值的不同组合来计算密码图像。最后,仿真实验和安全性分析表明,该算法具有较高的安全等级,能够抵抗各种密码分析攻击,并且具有较高的执行效率。
{"title":"Image Encryption Algorithm Based on Latin Squares and Adaptive Z-Diffusion","authors":"Yangguang Lou, Shu-cui Xie, Jianzhong Zhang","doi":"10.1145/3573942.3574062","DOIUrl":"https://doi.org/10.1145/3573942.3574062","url":null,"abstract":"This paper proposes a chaotic encryption algorithm based on Latin squares and adaptive Z-diffusion. First, in order to improve the defects of the traditional Sine system, two-dimensional enhance Sine chaotic system (2D-ESCS) is designed. In terms of bifurcation diagram, Lyapunov exponent and NIST, we can observe that 2D-ESCS have continuous and large chaotic ranges. Second, the generation of Latin squares through pseudorandom sequences generated by 2D-ESCS and then perform scrambling operation with the image. Third, adaptive Z-diffusion depends on the location of the pixels. the cipher image is calculated by different combinations of pseudorandom numbers, plain images pixel values and intermediate cipher image pixel values. Finally, simulation experiments and security analysis show that the proposed algorithm has a high security level to resist various cryptanalytic attacks and a high execution efficiency.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"241 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133683634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Industrial Internet Network Slice Prediction Algorithm Based on Multidimensional and Deep Neural Networks 基于多维深度神经网络的工业互联网网络切片预测算法
Jihong Zhao, Gao-Jing Peng
In the industrial Internet environment, the introduction of network slicing supports the connection of a large number of devices with different service requirements (QoS) sharing the same physical resources. Aiming at the problem of the adaptability of massive terminal devices and networks in industrial heterogeneous scenarios, this paper proposes a network slice prediction algorithm based on multi-dimensional and deep neural network (MDNN) based on the multi-dimensional resource network requirements of different terminal devices in specific industrial scenarios. The network slice prediction algorithm predicts the network resources required by the device at the next moment according to the historical network requirements and historical slice selection of the device, and selects the appropriate network slice for the device according to the prediction result. The simulation results show that the prediction accuracy of the proposed algorithm can reach 98.70%, which greatly improves the adaptability of the device and the network.
在工业互联网环境下,网络切片的引入支持大量具有不同服务需求(QoS)的设备连接,共享相同的物理资源。针对工业异构场景下海量终端设备和网络的适应性问题,本文基于特定工业场景下不同终端设备的多维资源网络需求,提出了一种基于多维深度神经网络(mmdnn)的网络切片预测算法。网络切片预测算法根据设备的历史网络需求和历史切片选择,预测设备下一时刻所需的网络资源,并根据预测结果为设备选择合适的网络切片。仿真结果表明,该算法的预测准确率可达98.70%,大大提高了设备和网络的自适应能力。
{"title":"Industrial Internet Network Slice Prediction Algorithm Based on Multidimensional and Deep Neural Networks","authors":"Jihong Zhao, Gao-Jing Peng","doi":"10.1145/3573942.3573989","DOIUrl":"https://doi.org/10.1145/3573942.3573989","url":null,"abstract":"In the industrial Internet environment, the introduction of network slicing supports the connection of a large number of devices with different service requirements (QoS) sharing the same physical resources. Aiming at the problem of the adaptability of massive terminal devices and networks in industrial heterogeneous scenarios, this paper proposes a network slice prediction algorithm based on multi-dimensional and deep neural network (MDNN) based on the multi-dimensional resource network requirements of different terminal devices in specific industrial scenarios. The network slice prediction algorithm predicts the network resources required by the device at the next moment according to the historical network requirements and historical slice selection of the device, and selects the appropriate network slice for the device according to the prediction result. The simulation results show that the prediction accuracy of the proposed algorithm can reach 98.70%, which greatly improves the adaptability of the device and the network.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133822980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Spatiotemporal Attention Convolutional Neural Network for Video Crowd Counting 一种用于视频人群统计的时空注意卷积神经网络
Shangjie Zhang, Yuelei Xiao
For most existing crowd counting methods, image-based methods are still used for crowd counting in the presence of video datasets, ignoring powerful time information. Thus, a novel spatiotemporal attention convolutional neural network is proposed to solve the video-based crowd counting problem. Firstly, the first ten layers of VGG-16 are used as the backbone network to extract features, and a single layer of ConvLSTM captures the time correlation of adjacent frames. Then, stacked dilated convolutional layers are used to enlarge the receptive field without increasing the computational load. Finally, a convolutional block attention module is introduced with the adaptive refinement of feature mapping. Its ability to emphasize or suppress information in the channel and spatial dimensions aids information dissemination. Experimental results on the two reference datasets (i.e., Mall and WorldExpo'10) show that the proposed method further improves the accuracy of crowd counting and is superior to the other existing crowd counting methods.
对于大多数现有的人群计数方法,仍然使用基于图像的方法在视频数据集存在的情况下进行人群计数,忽略了强大的时间信息。为此,提出了一种新的时空注意卷积神经网络来解决基于视频的人群计数问题。首先,利用VGG-16的前10层作为主干网提取特征,利用单层ConvLSTM捕获相邻帧的时间相关性;然后,在不增加计算负荷的情况下,使用堆叠的扩展卷积层来扩大接收场。最后,引入了一种基于特征映射自适应细化的卷积块注意力模块。它在渠道和空间维度上强调或抑制信息的能力有助于信息的传播。在Mall和WorldExpo’10两个参考数据集上的实验结果表明,本文方法进一步提高了人群计数的准确性,优于现有的其他人群计数方法。
{"title":"A Novel Spatiotemporal Attention Convolutional Neural Network for Video Crowd Counting","authors":"Shangjie Zhang, Yuelei Xiao","doi":"10.1145/3573942.3574069","DOIUrl":"https://doi.org/10.1145/3573942.3574069","url":null,"abstract":"For most existing crowd counting methods, image-based methods are still used for crowd counting in the presence of video datasets, ignoring powerful time information. Thus, a novel spatiotemporal attention convolutional neural network is proposed to solve the video-based crowd counting problem. Firstly, the first ten layers of VGG-16 are used as the backbone network to extract features, and a single layer of ConvLSTM captures the time correlation of adjacent frames. Then, stacked dilated convolutional layers are used to enlarge the receptive field without increasing the computational load. Finally, a convolutional block attention module is introduced with the adaptive refinement of feature mapping. Its ability to emphasize or suppress information in the channel and spatial dimensions aids information dissemination. Experimental results on the two reference datasets (i.e., Mall and WorldExpo'10) show that the proposed method further improves the accuracy of crowd counting and is superior to the other existing crowd counting methods.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115537956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Method with Universal Transformer for Multimodal Sentiment Analysis 基于通用变压器的多模态情感分析方法
Hao Ai, Ying Liu, Jie Fang, Sheikh Faisal Rashid
Multimodal sentiment analysis refers to the use of computers to analyze and identify the emotions that people want to express through the extracted multimodal sentiment features, and it plays a significant role in human-computer interaction and financial market prediction. Most existing approaches to multimodal sentiment analysis use contextual information for modeling, and while this modeling approach can effectively capture the contextual connections within modalities, the correlations between modalities are often overlooked, and the correlations between modalities are also critical to the final recognition results in multimodal sentiment analysis. Therefore, this paper proposes a multimodal sentiment analysis approach based on the universal transformer, a framework that uses the universal transformer to model the connections between multiple modalities while employing effective feature extraction methods to capture the contextual connections of individual modalities. We evaluated our proposed method on two benchmark datasets for multimodal sentiment analysis, CMU-MOSI and CMU-MOSEI, and the results outperformed other methods of the same type.
多模态情感分析是指利用计算机通过提取的多模态情感特征来分析和识别人们想要表达的情感,在人机交互和金融市场预测中具有重要作用。现有的多模态情感分析方法大多使用上下文信息进行建模,虽然这种建模方法可以有效地捕捉模态内部的上下文联系,但模态之间的相关性往往被忽视,而模态之间的相关性对多模态情感分析的最终识别结果也至关重要。因此,本文提出了一种基于通用变压器的多模态情感分析方法,该框架使用通用变压器对多个模态之间的连接进行建模,同时采用有效的特征提取方法捕获单个模态的上下文连接。我们在两个多模态情感分析基准数据集(CMU-MOSI和CMU-MOSEI)上对所提出的方法进行了评估,结果优于其他同类型的方法。
{"title":"A Method with Universal Transformer for Multimodal Sentiment Analysis","authors":"Hao Ai, Ying Liu, Jie Fang, Sheikh Faisal Rashid","doi":"10.1145/3573942.3573968","DOIUrl":"https://doi.org/10.1145/3573942.3573968","url":null,"abstract":"Multimodal sentiment analysis refers to the use of computers to analyze and identify the emotions that people want to express through the extracted multimodal sentiment features, and it plays a significant role in human-computer interaction and financial market prediction. Most existing approaches to multimodal sentiment analysis use contextual information for modeling, and while this modeling approach can effectively capture the contextual connections within modalities, the correlations between modalities are often overlooked, and the correlations between modalities are also critical to the final recognition results in multimodal sentiment analysis. Therefore, this paper proposes a multimodal sentiment analysis approach based on the universal transformer, a framework that uses the universal transformer to model the connections between multiple modalities while employing effective feature extraction methods to capture the contextual connections of individual modalities. We evaluated our proposed method on two benchmark datasets for multimodal sentiment analysis, CMU-MOSI and CMU-MOSEI, and the results outperformed other methods of the same type.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121151314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1