首页 > 最新文献

International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)最新文献

英文 中文
Visual recognition and comparison system and method of intelligent watt hour meter chip based on convolutional neural network 基于卷积神经网络的智能电能表芯片视觉识别和比较系统及方法
Zhengang Shi, C. Wu, W. Fu, Peng Tao, Linhao Zhang, Bo Gao
To enhance the performance of intelligent watt hour meters, a visual recognition and comparison system based on convolutional neural networks is proposed for intelligent watt hour meter chips. Firstly, the overall framework of the chip visual recognition comparison system is designed. Secondly, the hardware part of the system comprises the image acquisition module and image data transmission module of intelligent watt hour meter chips. In the software part, the classification function is selected based on the structural characteristics and operational principle of convolutional neural networks, and iterative training is used to complete the identification and comparison of smart meter chips. The experimental results demonstrate that this proposed system can significantly improve the accuracy of visual recognition and comparison, while also reducing the time consumption when compared to traditional recognition and comparison systems.
为提高智能电能表的性能,提出了一种基于卷积神经网络的智能电能表芯片视觉识别比对系统。首先,设计了芯片视觉识别比对系统的整体框架。其次,系统的硬件部分包括智能电能表芯片的图像采集模块和图像数据传输模块。在软件部分,根据卷积神经网络的结构特点和工作原理,选择分类函数,并采用迭代训练的方法完成智能电表芯片的识别比对。实验结果表明,与传统的识别和比对系统相比,本系统能显著提高视觉识别和比对的准确性,同时还能减少时间消耗。
{"title":"Visual recognition and comparison system and method of intelligent watt hour meter chip based on convolutional neural network","authors":"Zhengang Shi, C. Wu, W. Fu, Peng Tao, Linhao Zhang, Bo Gao","doi":"10.1117/12.3014476","DOIUrl":"https://doi.org/10.1117/12.3014476","url":null,"abstract":"To enhance the performance of intelligent watt hour meters, a visual recognition and comparison system based on convolutional neural networks is proposed for intelligent watt hour meter chips. Firstly, the overall framework of the chip visual recognition comparison system is designed. Secondly, the hardware part of the system comprises the image acquisition module and image data transmission module of intelligent watt hour meter chips. In the software part, the classification function is selected based on the structural characteristics and operational principle of convolutional neural networks, and iterative training is used to complete the identification and comparison of smart meter chips. The experimental results demonstrate that this proposed system can significantly improve the accuracy of visual recognition and comparison, while also reducing the time consumption when compared to traditional recognition and comparison systems.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on collaborative and integrated resource schedulingalgorithm in heterogeneous cloud environment 异构云环境中的协作与集成资源调度算法研究
Jiacheng Fu, Dengbin Liao, Chunzhi Meng, Anni Huang, Junbing Pan
The current conventional collaborative resource scheduling algorithms in heterogeneous cloud environments mainly process the allocation through the quantified results of data characteristics of heterogeneous cloud resources, which leads to low integrated scheduling efficiency due to the differences in the attributes of resources. In this regard, a collaborative and comprehensive resource scheduling algorithm in heterogeneous cloud environment is proposed. Firstly, the heterogeneous cloud resource information data is sampled and processed, and the resource quality is graded. The scheduling task model is constructed by constructing the mapping function of scheduling task assignment sub-nodes, and the hierarchical scheduling strategy is proposed by combining with ant colony algorithm. In the experiments, the designed collaborative integrated scheduling algorithm is tested for the scheduling efficiency. The final results can prove that the algorithm has a lower average delay and a more desirable integrated scheduling efficiency when the proposed method is used for scheduling heterogeneous cloud resources.
目前传统的异构云环境下协同资源调度算法主要通过对异构云资源数据特征的量化结果进行分配处理,由于资源属性的差异,导致综合调度效率较低。为此,提出了一种异构云环境下的协同综合资源调度算法。首先,对异构云资源信息数据进行采样处理,并对资源质量进行分级。通过构建调度任务分配子节点的映射函数,构建调度任务模型,并结合蚁群算法提出分层调度策略。在实验中,对所设计的协同综合调度算法进行了调度效率测试。最终结果可以证明,将所提出的方法用于异构云资源调度时,该算法具有更低的平均时延和更理想的综合调度效率。
{"title":"Research on collaborative and integrated resource schedulingalgorithm in heterogeneous cloud environment","authors":"Jiacheng Fu, Dengbin Liao, Chunzhi Meng, Anni Huang, Junbing Pan","doi":"10.1117/12.3014391","DOIUrl":"https://doi.org/10.1117/12.3014391","url":null,"abstract":"The current conventional collaborative resource scheduling algorithms in heterogeneous cloud environments mainly process the allocation through the quantified results of data characteristics of heterogeneous cloud resources, which leads to low integrated scheduling efficiency due to the differences in the attributes of resources. In this regard, a collaborative and comprehensive resource scheduling algorithm in heterogeneous cloud environment is proposed. Firstly, the heterogeneous cloud resource information data is sampled and processed, and the resource quality is graded. The scheduling task model is constructed by constructing the mapping function of scheduling task assignment sub-nodes, and the hierarchical scheduling strategy is proposed by combining with ant colony algorithm. In the experiments, the designed collaborative integrated scheduling algorithm is tested for the scheduling efficiency. The final results can prove that the algorithm has a lower average delay and a more desirable integrated scheduling efficiency when the proposed method is used for scheduling heterogeneous cloud resources.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Naming conventions-based multi-label and multi-task learning for fine-grained classification 基于命名规则的多标签和多任务学习,实现精细分类
Qinbang Zhou, Kezhi Zhang, Feng Yue, Zhaoliang Zhang, Hui Yu
This paper proposes a fine-grained image classification architecture using multi-task learning. The structure of the fine-grained classification network uses ResNest as the feature extraction layer of the multi-task hard parameter sharing mode with the fine-grained category label regression branch based on multi-hot naming conventions and classification branch based on cross-entropy loss with one-hot encoding. The coupling between the two branches enables multi-task classification through hyperparameter weighting. Subsequently, comparison and ablation experiments were performed on the public datasets of Stanford Cars, CUB-200-2011 and FGVC-Aircraft. The experimental result shows multi-label regression, multi-task learning and label smoothing can effectively improve the generalization of the model and increase the inter-class distance of the previous layer at the network output terminal, and reduces the intra-class distance.
本文提出了一种使用多任务学习的细粒度图像分类架构。细粒度分类网络的结构采用 ResNest 作为多任务硬参数共享模式的特征提取层,其细粒度类别标签回归分支基于多热命名约定,分类分支基于交叉熵损失与单热编码。两个分支之间的耦合可通过超参数加权实现多任务分类。随后,在斯坦福汽车、CUB-200-2011 和 FGVC-Aircraft 公开数据集上进行了对比和消融实验。实验结果表明,多标签回归、多任务学习和标签平滑可以有效提高模型的泛化能力,增加网络输出端的前一层的类间距离,并减小类内距离。
{"title":"Naming conventions-based multi-label and multi-task learning for fine-grained classification","authors":"Qinbang Zhou, Kezhi Zhang, Feng Yue, Zhaoliang Zhang, Hui Yu","doi":"10.1117/12.3014589","DOIUrl":"https://doi.org/10.1117/12.3014589","url":null,"abstract":"This paper proposes a fine-grained image classification architecture using multi-task learning. The structure of the fine-grained classification network uses ResNest as the feature extraction layer of the multi-task hard parameter sharing mode with the fine-grained category label regression branch based on multi-hot naming conventions and classification branch based on cross-entropy loss with one-hot encoding. The coupling between the two branches enables multi-task classification through hyperparameter weighting. Subsequently, comparison and ablation experiments were performed on the public datasets of Stanford Cars, CUB-200-2011 and FGVC-Aircraft. The experimental result shows multi-label regression, multi-task learning and label smoothing can effectively improve the generalization of the model and increase the inter-class distance of the previous layer at the network output terminal, and reduces the intra-class distance.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Embroidery style generation with machine learning 利用机器学习生成刺绣样式
Luojia Wang, Fei Guo
Embroidery is an important intangible cultural heritage in China. The development of digital technology has changed the way of transmission and inheritance of traditional culture. At present, the research on digital simulation of embroidery is still relatively small, and there are some problems such as weak generalization ability and weak three-dimensional sense. According to the characteristics of embroidery art works, this paper proposes an embroidery style generation method combining attention mechanism and cycle-consistent adversarial networks. The attention mechanism module is used to guide the generator and discriminator to control the target area migration of embroidery style images, so as to digitally simulate the embroidery art style. The results show that the proposed method has stronger generalization ability than the traditional embroidery digital simulation method, and has greater optimization in embroidery reality compared with the existing deep learning model.
刺绣是中国重要的非物质文化遗产。数字技术的发展改变了传统文化的传播和传承方式。目前,对刺绣数字化仿真的研究还比较少,存在概括能力弱、立体感不强等问题。根据刺绣艺术作品的特点,本文提出了一种结合注意力机制和循环一致性对抗网络的刺绣风格生成方法。注意机制模块用于引导生成器和判别器控制刺绣风格图像的目标区域迁移,从而对刺绣艺术风格进行数字化模拟。结果表明,与传统的刺绣数字模拟方法相比,所提出的方法具有更强的泛化能力,与现有的深度学习模型相比,在刺绣现实中具有更大的优化性。
{"title":"Embroidery style generation with machine learning","authors":"Luojia Wang, Fei Guo","doi":"10.1117/12.3014501","DOIUrl":"https://doi.org/10.1117/12.3014501","url":null,"abstract":"Embroidery is an important intangible cultural heritage in China. The development of digital technology has changed the way of transmission and inheritance of traditional culture. At present, the research on digital simulation of embroidery is still relatively small, and there are some problems such as weak generalization ability and weak three-dimensional sense. According to the characteristics of embroidery art works, this paper proposes an embroidery style generation method combining attention mechanism and cycle-consistent adversarial networks. The attention mechanism module is used to guide the generator and discriminator to control the target area migration of embroidery style images, so as to digitally simulate the embroidery art style. The results show that the proposed method has stronger generalization ability than the traditional embroidery digital simulation method, and has greater optimization in embroidery reality compared with the existing deep learning model.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D pipeline reconstruction and diameter measurement method based on target segmentation 基于目标细分的三维管道重建和直径测量方法
Guanghai Wu, Hao Zhang, Zhiqi Yan, Haoyu Wang, Zhihao Zhong, Ziao Yin
3D reconstruction technology utilizes 3D data to create models of physical objects. Cameras, laser scanners, and other sensors can be used to gather 3D data of objects, which can be processed using computer graphics technology for creating 3D models through 3D reconstruction technology. In engineering, high-precision 3D reconstruction models can substitute physical pipes for automatic measuring of pipe diameters. This paper proposes a target segmentation-based optimization method for single-frame reconstruction, which enables precise diameter measurement of pipes. Experimental results show that single-frame reconstruction, based on target segmentation technology, produces excellent results in the current application scenario. The proposed method is better adapted to complex construction conditions than the complex reconstruction methods. Complex backgrounds include excessive and uneven distributed light and interfering objects. Using target segmentation technology based on image processing, the MIVOS user-interactive video can produce and distribute the target object mask based on the user's interaction with the video frame. Complex background removal can improve the quality of reconstructed sample images. MIVOS is used to segment the pipe area in the image and remove most of the background noise. Consequently, the process lessens the interference of background noise in the reconstruction results. The proposed method exhibits significant progress in measuring both the inner and outer diameters of pipes when compared to both multi-frame and single-frame reconstruction methods. Their measurements have an average error of no more than 1 mm. The proposed method provides technical guidance for measuring the inner and outer diameters of pipes under complex conditions.
三维重建技术利用三维数据创建实物模型。照相机、激光扫描仪和其他传感器可用于收集物体的三维数据,然后利用计算机图形技术对这些数据进行处理,从而通过三维重建技术创建三维模型。在工程领域,高精度的三维重建模型可以替代实物管道,用于自动测量管道直径。本文提出了一种基于目标分割的单帧重建优化方法,可实现管道直径的精确测量。实验结果表明,基于目标分割技术的单帧重建在当前的应用场景中取得了优异的效果。与复杂的重建方法相比,所提出的方法能更好地适应复杂的施工条件。复杂背景包括光线过强、分布不均以及干扰物体。利用基于图像处理的目标分割技术,MIVOS 用户交互式视频可根据用户与视频帧的交互,生成并分发目标对象遮罩。复杂的背景去除可以提高重建样本图像的质量。MIVOS 用于分割图像中的管道区域并去除大部分背景噪声。因此,这一过程减少了背景噪声对重建结果的干扰。与多帧和单帧重建方法相比,所提出的方法在测量管道内径和外径方面都有显著进步。其测量结果的平均误差不超过 1 毫米。所提出的方法为在复杂条件下测量管道内径和外径提供了技术指导。
{"title":"3D pipeline reconstruction and diameter measurement method based on target segmentation","authors":"Guanghai Wu, Hao Zhang, Zhiqi Yan, Haoyu Wang, Zhihao Zhong, Ziao Yin","doi":"10.1117/12.3014403","DOIUrl":"https://doi.org/10.1117/12.3014403","url":null,"abstract":"3D reconstruction technology utilizes 3D data to create models of physical objects. Cameras, laser scanners, and other sensors can be used to gather 3D data of objects, which can be processed using computer graphics technology for creating 3D models through 3D reconstruction technology. In engineering, high-precision 3D reconstruction models can substitute physical pipes for automatic measuring of pipe diameters. This paper proposes a target segmentation-based optimization method for single-frame reconstruction, which enables precise diameter measurement of pipes. Experimental results show that single-frame reconstruction, based on target segmentation technology, produces excellent results in the current application scenario. The proposed method is better adapted to complex construction conditions than the complex reconstruction methods. Complex backgrounds include excessive and uneven distributed light and interfering objects. Using target segmentation technology based on image processing, the MIVOS user-interactive video can produce and distribute the target object mask based on the user's interaction with the video frame. Complex background removal can improve the quality of reconstructed sample images. MIVOS is used to segment the pipe area in the image and remove most of the background noise. Consequently, the process lessens the interference of background noise in the reconstruction results. The proposed method exhibits significant progress in measuring both the inner and outer diameters of pipes when compared to both multi-frame and single-frame reconstruction methods. Their measurements have an average error of no more than 1 mm. The proposed method provides technical guidance for measuring the inner and outer diameters of pipes under complex conditions.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-dimensional situation prediction of digital twin active power grid based on LSTM algorithm 基于 LSTM 算法的数字孪生有功电网多维形势预测
Xun Wang, Yinghui Tan, Tao Li, Chuang Liu, Guanghao Yang, Qian Wang
In order to understand the multidimensional situation prediction of digital twin active power grid, a research on multidimensional situation prediction of digital twin active power grid based on LSTM algorithm is proposed. In this paper, firstly, a multi-dimensional situation prediction algorithm of power grid key indicators based on LSTM is established to realize the change prediction of key indicators attributes of digital twin active power grid. Secondly, the data of several key indicators such as load characteristics are collected, and a multi-dimensional system prediction model is established, which can control the state of active power grid; The LSTM prediction algorithm is proposed to fit the characteristics of multi-dimensional data, and the next stage of multi-dimensional data prediction is mapped to the power digital twin, so as to realize the synchronous implementation and intelligent regulation of smart energy system operation planning. Finally, a simulation test model is established, and an example shows that the multi-dimensional situation prediction method of digital twin power grid based on deep learning can better predict and distinguish the power grid situation, and provide decision support for accurate planning of energy system in the future.
为了了解数字孪生有功电网的多维态势预测,提出了基于 LSTM 算法的数字孪生有功电网多维态势预测研究。本文首先建立了基于 LSTM 的电网关键指标多维态势预测算法,实现了数字孪生有功电网关键指标属性的变化预测。其次,采集负荷特性等多个关键指标数据,建立多维系统预测模型,控制有功电网状态;提出拟合多维数据特性的 LSTM 预测算法,将下一阶段的多维数据预测映射到电力数字孪生中,实现智能能源系统运行规划的同步实施和智能调节。最后,建立了仿真测试模型,实例表明基于深度学习的数字孪生电网多维态势预测方法能更好地预测和区分电网态势,为未来能源系统的精准规划提供决策支持。
{"title":"Multi-dimensional situation prediction of digital twin active power grid based on LSTM algorithm","authors":"Xun Wang, Yinghui Tan, Tao Li, Chuang Liu, Guanghao Yang, Qian Wang","doi":"10.1117/12.3014395","DOIUrl":"https://doi.org/10.1117/12.3014395","url":null,"abstract":"In order to understand the multidimensional situation prediction of digital twin active power grid, a research on multidimensional situation prediction of digital twin active power grid based on LSTM algorithm is proposed. In this paper, firstly, a multi-dimensional situation prediction algorithm of power grid key indicators based on LSTM is established to realize the change prediction of key indicators attributes of digital twin active power grid. Secondly, the data of several key indicators such as load characteristics are collected, and a multi-dimensional system prediction model is established, which can control the state of active power grid; The LSTM prediction algorithm is proposed to fit the characteristics of multi-dimensional data, and the next stage of multi-dimensional data prediction is mapped to the power digital twin, so as to realize the synchronous implementation and intelligent regulation of smart energy system operation planning. Finally, a simulation test model is established, and an example shows that the multi-dimensional situation prediction method of digital twin power grid based on deep learning can better predict and distinguish the power grid situation, and provide decision support for accurate planning of energy system in the future.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulation study of brushless DC motor speed control system based on GA-PID 基于 GA-PID 的无刷直流电机速度控制系统仿真研究
Yang Tang, Hao Chen, Faxin Zhu
This paper, based on the principles of brushless DC motors, constructs a mathematical model and combines genetic algorithms with traditional PID control. Genetic algorithms are used for parameter optimization to obtain the optimal solution for PID control, achieving higher control precision and stability. A simulation model of the motor and control system is developed using Simulink, and various operational conditions, including normal startup and sudden speed changes during operation, are simulated. The results show significant improvements in the motor's response speed and control precision. There is no overshoot during the startup phase, and the error during constant-speed operation is below 0.5%.
本文基于无刷直流电机的原理,构建了一个数学模型,并将遗传算法与传统的 PID 控制相结合。利用遗传算法进行参数优化,获得 PID 控制的最优解,实现更高的控制精度和稳定性。利用 Simulink 建立了电机和控制系统的仿真模型,并模拟了各种运行条件,包括正常启动和运行过程中的速度突变。结果表明,电机的响应速度和控制精度都有明显改善。启动阶段没有过冲,恒速运行时的误差低于 0.5%。
{"title":"Simulation study of brushless DC motor speed control system based on GA-PID","authors":"Yang Tang, Hao Chen, Faxin Zhu","doi":"10.1117/12.3014483","DOIUrl":"https://doi.org/10.1117/12.3014483","url":null,"abstract":"This paper, based on the principles of brushless DC motors, constructs a mathematical model and combines genetic algorithms with traditional PID control. Genetic algorithms are used for parameter optimization to obtain the optimal solution for PID control, achieving higher control precision and stability. A simulation model of the motor and control system is developed using Simulink, and various operational conditions, including normal startup and sudden speed changes during operation, are simulated. The results show significant improvements in the motor's response speed and control precision. There is no overshoot during the startup phase, and the error during constant-speed operation is below 0.5%.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on face recognition algorithm based on FaceNet and coordinate attention 基于 FaceNet 和协调注意力的人脸识别算法研究
Tao Zhang, zewu ke
The development of information technology has made the field of deep learning face recognition develop rapidly. The traditional face detection and recognition algorithm can perform well under constrained conditions, but under unconstrained conditions, its effect will be greatly discounted when low quality images and partial occlusion of faces are encountered. Based on MTCNN and FaceNet, this paper adopts two strategies to solve the above two problems respectively. On the one hand, by introducing the face image quality assessment function to solve the problem of low quality pictures, before face detection, a quality assessment of the face image is done, and only the image whose quality score reaches the threshold can be input into the model. On the other hand, the Coordinate attention mechanism is introduced to deal with the problem of partial occlusion of the face, which improves the recognition ability of the model by adaptively enhancing the weight of the unocclusion area of the face. Experimental results show that compared with existing algorithms, the accuracy of the proposed algorithm is significantly improved.
信息技术的发展使得深度学习人脸识别领域发展迅速。传统的人脸检测与识别算法在受限条件下可以有良好的表现,但在非受限条件下,如果遇到低质量图像和人脸部分遮挡,其效果就会大打折扣。本文基于 MTCNN 和 FaceNet,采用两种策略分别解决上述两个问题。一方面,通过引入人脸图像质量评估功能来解决低质量图片的问题,在进行人脸检测之前,先对人脸图像进行质量评估,只有质量得分达到阈值的图像才能输入模型。另一方面,针对人脸部分遮挡的问题,引入了坐标关注机制,通过自适应地增强人脸未遮挡区域的权重来提高模型的识别能力。实验结果表明,与现有算法相比,所提算法的准确率有了显著提高。
{"title":"Research on face recognition algorithm based on FaceNet and coordinate attention","authors":"Tao Zhang, zewu ke","doi":"10.1117/12.3014505","DOIUrl":"https://doi.org/10.1117/12.3014505","url":null,"abstract":"The development of information technology has made the field of deep learning face recognition develop rapidly. The traditional face detection and recognition algorithm can perform well under constrained conditions, but under unconstrained conditions, its effect will be greatly discounted when low quality images and partial occlusion of faces are encountered. Based on MTCNN and FaceNet, this paper adopts two strategies to solve the above two problems respectively. On the one hand, by introducing the face image quality assessment function to solve the problem of low quality pictures, before face detection, a quality assessment of the face image is done, and only the image whose quality score reaches the threshold can be input into the model. On the other hand, the Coordinate attention mechanism is introduced to deal with the problem of partial occlusion of the face, which improves the recognition ability of the model by adaptively enhancing the weight of the unocclusion area of the face. Experimental results show that compared with existing algorithms, the accuracy of the proposed algorithm is significantly improved.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dual population constrained multiobjective evolutionary algorithm with a feasible archive set 具有可行档案集的双群体约束多目标进化算法
Xinchang Yu, Yumeng Wang, Tong Zhang, Huaqing Xu
Continuous updating and maintenance of feasible solutions is crucial when solving constrained multi-objective optimization problems (CMOPs). However, most existing constrained multi-objective evolutionary algorithms (CMOEAs) are not efficient enough in updating and preserving competitive feasible solutions, thus reducing population diversity. To address this issue, this paper proposes a dual-population (i.e., mainPop and auxPop) constrained multi-objective evolutionary algorithm with a feasible archive set for CMOPs, named DPFAS. The two populations have different functions in the algorithm. Specifically, the ݉ܽ݅݊ܲmainPop considers both objectives and constraints for solving the original CMOPs, while the ܽauxPop is used only for the optimization of objectives without considering constraints. In addition, a feasible archive set is used to store feasible solutions that are competitive in the ܽauxPop and provide useful information for the ݉ܽ݅݊ܲmainPop. Moreover, a fitness assignment strategy is designed to speed up the algorithm’s convergence. Particularly, the population converges faster by selecting better-nondominated solutions into the matching pool. Finally, experimental studies on 23 benchmark functions show that the proposed algorithm was more competitive compared with five state-of-the-art CMOEAs.
在求解约束多目标优化问题(CMOP)时,持续更新和维护可行解至关重要。然而,现有的大多数约束多目标进化算法(CMOEAs)在更新和维护有竞争力的可行解方面不够有效,从而降低了种群多样性。为解决这一问题,本文提出了一种针对 CMOPs 的双种群(即 mainPop 和 auxPop)约束多目标进化算法,并将其命名为 DPFAS。两个种群在算法中具有不同的功能。具体来说,ܽ݉ܽ݅݊ܲmainPop 在求解原始 CMOPs 时既考虑目标又考虑约束,而 ܽauxPop 只用于优化目标而不考虑约束。此外,可行档案集用于存储在ܽauxPop 中具有竞争力的可行解,并为݉ܽ݅݊ܲmainPop 提供有用信息。此外,还设计了一种适合度分配策略,以加快算法的收敛速度。特别是,通过选择更好的非优势解进入匹配池,种群收敛得更快。最后,对 23 个基准函数的实验研究表明,与五种最先进的 CMOEA 相比,所提出的算法更具竞争力。
{"title":"A dual population constrained multiobjective evolutionary algorithm with a feasible archive set","authors":"Xinchang Yu, Yumeng Wang, Tong Zhang, Huaqing Xu","doi":"10.1117/12.3014412","DOIUrl":"https://doi.org/10.1117/12.3014412","url":null,"abstract":"Continuous updating and maintenance of feasible solutions is crucial when solving constrained multi-objective optimization problems (CMOPs). However, most existing constrained multi-objective evolutionary algorithms (CMOEAs) are not efficient enough in updating and preserving competitive feasible solutions, thus reducing population diversity. To address this issue, this paper proposes a dual-population (i.e., mainPop and auxPop) constrained multi-objective evolutionary algorithm with a feasible archive set for CMOPs, named DPFAS. The two populations have different functions in the algorithm. Specifically, the ݉ܽ݅݊ܲmainPop considers both objectives and constraints for solving the original CMOPs, while the ܽauxPop is used only for the optimization of objectives without considering constraints. In addition, a feasible archive set is used to store feasible solutions that are competitive in the ܽauxPop and provide useful information for the ݉ܽ݅݊ܲmainPop. Moreover, a fitness assignment strategy is designed to speed up the algorithm’s convergence. Particularly, the population converges faster by selecting better-nondominated solutions into the matching pool. Finally, experimental studies on 23 benchmark functions show that the proposed algorithm was more competitive compared with five state-of-the-art CMOEAs.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pedestrian object detection algorithm based on lightweight YOLOv7 in complex street scenarios 复杂街道场景中基于轻量级 YOLOv7 的行人物体检测算法
Shangqi Cheng, Hongxia Niu
In view of the problems of excessive parameter setting and large calculation of YOLOv7 in pedestrian object detection in complex street scenarios, this paper proposes a lightweight method to improve YOLOv7 algorithm. Under the YOLOv7 framework, Partial Convolution (PConv) is integrated into the convolution of the original algorithm, replacing part of the convolution in the original convolution layer, and the SEAttention attention module is introduced to ensure the detection accuracy of the lightweight algorithm. The experimental results on the home-made data set show that, compared with the original YOLOv7 algorithm, the number of model parameters decreased by 11.0% in the improved YOLOv7 algorithm, and the algorithm calculation volume decreased by 19.4%, while ensuring the high accuracy of the original YOLOv7 algorithm. In this paper, the algorithm reduces the number of parameters and calculations, and achieves the balance of lightweight and accuracy.
针对 YOLOv7 在复杂街道场景下行人物体检测中存在的参数设置过多、计算量过大等问题,本文提出了一种轻量级的方法来改进 YOLOv7 算法。在 YOLOv7 框架下,将部分卷积(PConv)集成到原算法的卷积中,替代原卷积层中的部分卷积,并引入 SEAttention 注意模块,保证轻量级算法的检测精度。在自制数据集上的实验结果表明,与原 YOLOv7 算法相比,改进后的 YOLOv7 算法的模型参数数减少了 11.0%,算法计算量减少了 19.4%,同时保证了原 YOLOv7 算法的高精度。本文的算法减少了参数和计算量,实现了轻量级和高精度的平衡。
{"title":"Pedestrian object detection algorithm based on lightweight YOLOv7 in complex street scenarios","authors":"Shangqi Cheng, Hongxia Niu","doi":"10.1117/12.3014518","DOIUrl":"https://doi.org/10.1117/12.3014518","url":null,"abstract":"In view of the problems of excessive parameter setting and large calculation of YOLOv7 in pedestrian object detection in complex street scenarios, this paper proposes a lightweight method to improve YOLOv7 algorithm. Under the YOLOv7 framework, Partial Convolution (PConv) is integrated into the convolution of the original algorithm, replacing part of the convolution in the original convolution layer, and the SEAttention attention module is introduced to ensure the detection accuracy of the lightweight algorithm. The experimental results on the home-made data set show that, compared with the original YOLOv7 algorithm, the number of model parameters decreased by 11.0% in the improved YOLOv7 algorithm, and the algorithm calculation volume decreased by 19.4%, while ensuring the high accuracy of the original YOLOv7 algorithm. In this paper, the algorithm reduces the number of parameters and calculations, and achieves the balance of lightweight and accuracy.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1