首页 > 最新文献

International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)最新文献

英文 中文
Jamming detection based on phase feature for SAR images 基于相位特征的合成孔径雷达图像干扰检测
Haoyu Zhang, Sinong Quan, Shiqi Xing, Yitao Liu
Synthetic Aperture Radar (SAR) is capable of producing high-resolution complex-valued pictures, which have extensive applications in both civil and military domains. Among these applications, SAR electronic countermeasures currently represent a prominent area of research interest. Presently, within the radar electronic countermeasures, there exists a diminishing disparity among the features of real and false targets, rendering the detection of jamming increasingly challenging. This paper examines the phase of SAR images and presents a method for identifying SAR jamming regions based on phase features. The initial step involves organizing the cluttered phase information into neighborhood phase differences. Subsequently, this information is coupled with the amplitude to obtain the weighted phase difference. This metric effectively captures the extent of phase distortion resulting from jamming. The findings from the simulation experiment demonstrate that the proposed feature and method are capable of accurately identifying and filtering out the jamming region in SAR pictures. Furthermore, it demonstrates the prospection of phase within the SAR image interpretation and electronic countermeasures.
合成孔径雷达(SAR)能够生成高分辨率的复值图像,在民用和军事领域都有广泛的应用。在这些应用中,合成孔径雷达电子对抗目前是一个突出的研究领域。目前,在雷达电子对抗中,真实目标和虚假目标的特征差距越来越小,使干扰探测变得越来越具有挑战性。本文研究了合成孔径雷达图像的相位,并提出了一种基于相位特征识别合成孔径雷达干扰区域的方法。第一步是将杂乱的相位信息整理为邻域相位差。随后,将这些信息与振幅结合起来,得到加权相位差。这一指标能有效捕捉干扰造成的相位失真程度。模拟实验的结果表明,所提出的特征和方法能够准确识别和滤除合成孔径雷达图像中的干扰区域。此外,它还证明了相位在合成孔径雷达图像判读和电子对抗中的应用前景。
{"title":"Jamming detection based on phase feature for SAR images","authors":"Haoyu Zhang, Sinong Quan, Shiqi Xing, Yitao Liu","doi":"10.1117/12.3014617","DOIUrl":"https://doi.org/10.1117/12.3014617","url":null,"abstract":"Synthetic Aperture Radar (SAR) is capable of producing high-resolution complex-valued pictures, which have extensive applications in both civil and military domains. Among these applications, SAR electronic countermeasures currently represent a prominent area of research interest. Presently, within the radar electronic countermeasures, there exists a diminishing disparity among the features of real and false targets, rendering the detection of jamming increasingly challenging. This paper examines the phase of SAR images and presents a method for identifying SAR jamming regions based on phase features. The initial step involves organizing the cluttered phase information into neighborhood phase differences. Subsequently, this information is coupled with the amplitude to obtain the weighted phase difference. This metric effectively captures the extent of phase distortion resulting from jamming. The findings from the simulation experiment demonstrate that the proposed feature and method are capable of accurately identifying and filtering out the jamming region in SAR pictures. Furthermore, it demonstrates the prospection of phase within the SAR image interpretation and electronic countermeasures.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Green glass of Cantonese colored windows based on color model 基于色彩模型的广东彩窗绿色玻璃研究
Xiaoqing Wang, Ying Du
Based on the color model, this paper investigates the green glass in Cantonese colored windows, aiming to establish the standard value of the green color of Cantonese colored windows, and to provide relevant data and suggestions to protect the design concept of Cantonese architectural decoration and regulate the use of color in colored windows. By collecting samples, with the help of image processing and color analysis software, the green glass was positioned and analyzed in both RGB and LAB color models. It is found that in the RGB color model, the threshold intervals of green glass are mainly concentrated in the Forest Green and Green regions; in the LAB color model, the threshold intervals of the two-color channels of green glass are mainly distributed in the range of medium and low saturation. This study provides digitalized standard values and reference data for the green of the Cantonese colored windows, which helps to maintain the design style of traditional architecture and promote the development and application of colored windows.
本文以色彩模型为基础,对广东彩窗中的绿色玻璃进行研究,旨在建立广东彩窗绿色的标准值,为保护广东建筑装饰的设计理念、规范彩窗的色彩使用提供相关数据和建议。通过采集样本,借助图像处理和色彩分析软件,对绿色玻璃进行了 RGB 和 LAB 两种色彩模型的定位和分析。结果发现,在 RGB 色彩模型中,绿色玻璃的阈值区间主要集中在森林绿和绿色区域;在 LAB 色彩模型中,绿色玻璃双色通道的阈值区间主要分布在中低饱和度范围内。本研究为广东彩窗的绿色提供了数字化的标准值和参考数据,有助于保持传统建筑的设计风格,促进彩窗的发展和应用。
{"title":"Research on Green glass of Cantonese colored windows based on color model","authors":"Xiaoqing Wang, Ying Du","doi":"10.1117/12.3014495","DOIUrl":"https://doi.org/10.1117/12.3014495","url":null,"abstract":"Based on the color model, this paper investigates the green glass in Cantonese colored windows, aiming to establish the standard value of the green color of Cantonese colored windows, and to provide relevant data and suggestions to protect the design concept of Cantonese architectural decoration and regulate the use of color in colored windows. By collecting samples, with the help of image processing and color analysis software, the green glass was positioned and analyzed in both RGB and LAB color models. It is found that in the RGB color model, the threshold intervals of green glass are mainly concentrated in the Forest Green and Green regions; in the LAB color model, the threshold intervals of the two-color channels of green glass are mainly distributed in the range of medium and low saturation. This study provides digitalized standard values and reference data for the green of the Cantonese colored windows, which helps to maintain the design style of traditional architecture and promote the development and application of colored windows.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The automated segmentation and enhancement of cracks on airport pavements using three-dimensional imaging techniques 利用三维成像技术自动分割和强化机场路面裂缝
Shanshan Zhai, Yanna Xu
Based on 3D images, this study aims to explore automatic segmentation and enhancement methods for airfield runway surface cracks. Firstly, a typical 2D Gaussian filter is used to remove noise from the road surface data. Then, Steerable Matched Filter (SMFB) is introduced to extract crack features. By constructing a set of 52 SMFB filters with different parameters, we are able to accurately capture cracks with different directions and sizes. After that, Tensor Voting (TV) technique is introduced to further enhance the continuity of the cracks. With this method, we are able to detect and segment the cracks in the airfield runway surface for a more accurate and comprehensive analysis. The experimental results show that the proposed method performs well in crack detection and segmentation, providing strong support for airport pavement maintenance and management.
基于三维图像,本研究旨在探索机场跑道表面裂缝的自动分割和增强方法。首先,使用典型的二维高斯滤波器去除路面数据中的噪声。然后,引入可转向匹配滤波器(SMFB)来提取裂缝特征。通过构建一组具有不同参数的 52 个 SMFB 滤波器,我们能够准确捕捉不同方向和尺寸的裂缝。之后,我们引入了张量投票(TV)技术,以进一步增强裂纹的连续性。有了这种方法,我们就能检测和分割机场跑道表面的裂缝,从而进行更准确、更全面的分析。实验结果表明,所提出的方法在裂缝检测和分割方面表现良好,为机场路面维护和管理提供了有力支持。
{"title":"The automated segmentation and enhancement of cracks on airport pavements using three-dimensional imaging techniques","authors":"Shanshan Zhai, Yanna Xu","doi":"10.1117/12.3014473","DOIUrl":"https://doi.org/10.1117/12.3014473","url":null,"abstract":"Based on 3D images, this study aims to explore automatic segmentation and enhancement methods for airfield runway surface cracks. Firstly, a typical 2D Gaussian filter is used to remove noise from the road surface data. Then, Steerable Matched Filter (SMFB) is introduced to extract crack features. By constructing a set of 52 SMFB filters with different parameters, we are able to accurately capture cracks with different directions and sizes. After that, Tensor Voting (TV) technique is introduced to further enhance the continuity of the cracks. With this method, we are able to detect and segment the cracks in the airfield runway surface for a more accurate and comprehensive analysis. The experimental results show that the proposed method performs well in crack detection and segmentation, providing strong support for airport pavement maintenance and management.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140512102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep learning approach for fruit detection: YOLO-GF 水果检测的深度学习方法YOLO-GF
J. Guo, Wei Wu
To achieve automatic fruit object recognition in complex backgrounds, this paper proposes a fruit object detection algorithm based on YOLO-GF. Addressing challenges such as complex backgrounds, significant variations in target shapes, and instances of occlusion in fruit images, we utilize the Global Attention Mechanism (GAM) to enhance the feature extraction capability for fruit targets, thereby improving fruit recognition accuracy. Additionally, the Focal-EIOU loss function is used instead of the CIOU loss function to expedite model convergence. Experimental results demonstrate a significant improvement in recognition accuracy under the same hardware conditions. On the same test dataset, the improved model achieves an mAP50 of 92.1% and mAP50:95 of 76.5%, representing increases of 5.8% and 11.9% compared to the original model, respectively.
为了实现复杂背景下的水果目标自动识别,本文提出了一种基于 YOLO-GF 的水果目标检测算法。针对水果图像中存在的复杂背景、目标形状的显著变化和遮挡等挑战,我们利用全局注意力机制(GAM)来增强水果目标的特征提取能力,从而提高水果识别的准确率。此外,我们还使用 Focal-EIOU 损失函数代替 CIOU 损失函数,以加快模型收敛速度。实验结果表明,在相同的硬件条件下,识别准确率有了显著提高。在相同的测试数据集上,改进后的模型的 mAP50 为 92.1%,mAP50:95 为 76.5%,与原始模型相比分别提高了 5.8%和 11.9%。
{"title":"A deep learning approach for fruit detection: YOLO-GF","authors":"J. Guo, Wei Wu","doi":"10.1117/12.3014430","DOIUrl":"https://doi.org/10.1117/12.3014430","url":null,"abstract":"To achieve automatic fruit object recognition in complex backgrounds, this paper proposes a fruit object detection algorithm based on YOLO-GF. Addressing challenges such as complex backgrounds, significant variations in target shapes, and instances of occlusion in fruit images, we utilize the Global Attention Mechanism (GAM) to enhance the feature extraction capability for fruit targets, thereby improving fruit recognition accuracy. Additionally, the Focal-EIOU loss function is used instead of the CIOU loss function to expedite model convergence. Experimental results demonstrate a significant improvement in recognition accuracy under the same hardware conditions. On the same test dataset, the improved model achieves an mAP50 of 92.1% and mAP50:95 of 76.5%, representing increases of 5.8% and 11.9% compared to the original model, respectively.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The application of target tracking algorithm in intelligent video system to flight support 智能视频系统中目标跟踪算法在飞行支持中的应用
Jianjun Peng, Jialei Zhai, Xiang Jin, Chengshuang Hu, Zaigang Li
As the global pandemic gradually eases and the aviation transport industry continues to experience steady growth, highdensity flight operations are becoming the new normal. The intelligentization of flight support processes is a crucial avenue for enhancing both the safety and efficiency of flight operations. With the advancement of computer vision technology, video-based object tracking has shown significant potential in the context of flight support processes. However, in real airport environments, object tracking often encounters challenges such as occlusion, scale variations, rotation, and changes in lighting conditions, leading to a decrease in tracking accuracy and even target loss. In this paper, our focus is on overcoming tracking failures caused by occlusion, deformation, and lighting variations. We have conducted the following work, taking into consideration the unique characteristics of airport environments and the specific requirements of flight support processes: (i) We utilized features at three levels, namely, Histogram of Oriented Gradient (HOG), Color Names, and Convolutional Neural Networks (CNN), to describe the texture, color, and high-level semantics of video images, respectively. (ii) We employed a multi-feature fusion approach using a trilinear interpolation function to integrate information from various sources. (iii) We implemented improved ECO algorithms for the tracking of moving objects in the airport environment. Finally, we validated this object tracking system using real surveillance videos from the airport. Experimental results have demonstrated the effectiveness and practicality of the method under challenging conditions.
随着全球疫情的逐渐缓解和航空运输业的持续稳定增长,高密度的飞行作业正在成为新常态。飞行支持流程的智能化是提高飞行安全和效率的重要途径。随着计算机视觉技术的发展,基于视频的物体跟踪技术在飞行支持流程中显示出了巨大的潜力。然而,在真实的机场环境中,物体跟踪经常会遇到遮挡、比例变化、旋转和光照条件变化等挑战,从而导致跟踪精度下降,甚至丢失目标。在本文中,我们的重点是克服由遮挡、变形和光照变化引起的跟踪失败。考虑到机场环境的特殊性和飞行保障流程的具体要求,我们开展了以下工作:(i) 我们利用三个层次的特征,即方向梯度直方图(HOG)、颜色名称和卷积神经网络(CNN),分别描述视频图像的纹理、颜色和高级语义。(ii) 我们采用了一种多特征融合方法,利用三线性插值函数来整合来自不同来源的信息。(iii) 我们改进了 ECO 算法,用于跟踪机场环境中的移动物体。最后,我们利用机场的真实监控视频验证了这一物体跟踪系统。实验结果证明了该方法在具有挑战性的条件下的有效性和实用性。
{"title":"The application of target tracking algorithm in intelligent video system to flight support","authors":"Jianjun Peng, Jialei Zhai, Xiang Jin, Chengshuang Hu, Zaigang Li","doi":"10.1117/12.3014375","DOIUrl":"https://doi.org/10.1117/12.3014375","url":null,"abstract":"As the global pandemic gradually eases and the aviation transport industry continues to experience steady growth, highdensity flight operations are becoming the new normal. The intelligentization of flight support processes is a crucial avenue for enhancing both the safety and efficiency of flight operations. With the advancement of computer vision technology, video-based object tracking has shown significant potential in the context of flight support processes. However, in real airport environments, object tracking often encounters challenges such as occlusion, scale variations, rotation, and changes in lighting conditions, leading to a decrease in tracking accuracy and even target loss. In this paper, our focus is on overcoming tracking failures caused by occlusion, deformation, and lighting variations. We have conducted the following work, taking into consideration the unique characteristics of airport environments and the specific requirements of flight support processes: (i) We utilized features at three levels, namely, Histogram of Oriented Gradient (HOG), Color Names, and Convolutional Neural Networks (CNN), to describe the texture, color, and high-level semantics of video images, respectively. (ii) We employed a multi-feature fusion approach using a trilinear interpolation function to integrate information from various sources. (iii) We implemented improved ECO algorithms for the tracking of moving objects in the airport environment. Finally, we validated this object tracking system using real surveillance videos from the airport. Experimental results have demonstrated the effectiveness and practicality of the method under challenging conditions.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on target detection algorithm based on vehicle detection 基于车辆检测的目标检测算法研究
Yanguo Huang, Zehao Rao, Luo Li
Aiming at the current problem of unsatisfactory vehicle detection in complex scenes, an improved vehicle target detection network model is proposed. First, Res2Net residual network is fused in SCP, and the CSP_R structure is proposed, so that the model can extract deeper feature information and strengthen the ability to characterize small-scale targets; the attention mechanism is introduced, and the C3_CBAM module is designed to strengthen the attention to the detection targets while avoiding the increase of the model's computational volume; the loss function of the MPDIoU regression optimization is introduced, and the loss function is optimized by combining the prediction frame with the real frame length, width and area loss, and quantitative indicators to improve the convergence speed and robustness of the model. Finally, the model is validated on the SODA10M dataset, and the experimental results show that the model detection speed reaches 32 frames per second. The average detection accuracy reaches 83.7%, which is an improvement of 7.8 percentage points compared with YOLOV5s.
针对目前复杂场景下车辆检测效果不理想的问题,提出了一种改进的车辆目标检测网络模型。首先,在 SCP 中融合 Res2Net 残差网络,提出 CSP_R 结构,使模型能够提取更深层次的特征信息,增强对小尺度目标的表征能力;引入关注机制,设计 C3_CBAM 模块,在避免增加模型计算量的同时,加强对检测目标的关注;引入 MPDIoU 回归优化的损失函数,结合预测帧的实际帧长、宽、面积损失和定量指标对损失函数进行优化,提高模型的收敛速度和鲁棒性。最后,在 SODA10M 数据集上对模型进行了验证,实验结果表明,模型的检测速度达到了每秒 32 帧。平均检测准确率达到 83.7%,比 YOLOV5s 提高了 7.8 个百分点。
{"title":"Research on target detection algorithm based on vehicle detection","authors":"Yanguo Huang, Zehao Rao, Luo Li","doi":"10.1117/12.3014382","DOIUrl":"https://doi.org/10.1117/12.3014382","url":null,"abstract":"Aiming at the current problem of unsatisfactory vehicle detection in complex scenes, an improved vehicle target detection network model is proposed. First, Res2Net residual network is fused in SCP, and the CSP_R structure is proposed, so that the model can extract deeper feature information and strengthen the ability to characterize small-scale targets; the attention mechanism is introduced, and the C3_CBAM module is designed to strengthen the attention to the detection targets while avoiding the increase of the model's computational volume; the loss function of the MPDIoU regression optimization is introduced, and the loss function is optimized by combining the prediction frame with the real frame length, width and area loss, and quantitative indicators to improve the convergence speed and robustness of the model. Finally, the model is validated on the SODA10M dataset, and the experimental results show that the model detection speed reaches 32 frames per second. The average detection accuracy reaches 83.7%, which is an improvement of 7.8 percentage points compared with YOLOV5s.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization research on pedestrian multiobjects tracking model based on TBD strategy 基于 TBD 策略的行人多目标跟踪模型优化研究
Shi Wang, Xiangju Liu, Xinshu Liu, JiaHui Chen, XiaoHong Wang
The main task of pedestrian multi objects tracking technology is to continuously track multiple pedestrian objects simultaneously in video sequences and maintain their unique ID numbers. However, current pedestrian multi objects tracking models still have many problems, such as false detection, missed detection, and frequent ID number switching when pedestrians are obstructed or have overly similar appearances, ultimately leading to tracking failure. Therefore, this paper proposes a pedestrian multi objects tracking model based on TBD strategy. It mainly consists of two parts: pedestrian detector and pedestrian tracker. In terms of pedestrian detectors, this paper uses ES-YOLO pedestrian detectors. In terms of pedestrian trackers, this paper draws on the Omni-scale feature learning module in OSNet to redesign the StrongSORT pedestrian appearance feature extraction network, and ultimately obtains the StrongSORT pedestrian tracker based on omni-scale feature fusion, further enhancing its pedestrian feature extraction ability. In terms of experimental results. The experimental results of the pedestrian multi objects tracking model based on the TBD strategy in this paper on the MOT16 dataset show that the proposed pedestrian multi-objective tracking model can effectively improve the accuracy of pedestrian multi objects tracking and reduce the problem of frequent pedestrian ID number switching.
行人多目标跟踪技术的主要任务是在视频序列中同时连续跟踪多个行人目标,并保持其唯一的 ID 编号。然而,目前的行人多目标跟踪模型仍然存在很多问题,例如误检、漏检,以及当行人受到遮挡或外观过于相似时频繁切换 ID 号,最终导致跟踪失败。因此,本文提出了一种基于 TBD 策略的行人多目标跟踪模型。它主要由两部分组成:行人检测器和行人跟踪器。在行人检测器方面,本文使用 ES-YOLO 行人检测器。在行人跟踪器方面,本文借鉴 OSNet 中的全尺度特征学习模块,重新设计了 StrongSORT 行人外观特征提取网络,最终得到了基于全尺度特征融合的 StrongSORT 行人跟踪器,进一步增强了其行人特征提取能力。在实验结果方面。基于本文 TBD 策略的行人多目标跟踪模型在 MOT16 数据集上的实验结果表明,本文提出的行人多目标跟踪模型能有效提高行人多目标跟踪的精度,减少行人 ID 号频繁切换的问题。
{"title":"Optimization research on pedestrian multiobjects tracking model based on TBD strategy","authors":"Shi Wang, Xiangju Liu, Xinshu Liu, JiaHui Chen, XiaoHong Wang","doi":"10.1117/12.3014360","DOIUrl":"https://doi.org/10.1117/12.3014360","url":null,"abstract":"The main task of pedestrian multi objects tracking technology is to continuously track multiple pedestrian objects simultaneously in video sequences and maintain their unique ID numbers. However, current pedestrian multi objects tracking models still have many problems, such as false detection, missed detection, and frequent ID number switching when pedestrians are obstructed or have overly similar appearances, ultimately leading to tracking failure. Therefore, this paper proposes a pedestrian multi objects tracking model based on TBD strategy. It mainly consists of two parts: pedestrian detector and pedestrian tracker. In terms of pedestrian detectors, this paper uses ES-YOLO pedestrian detectors. In terms of pedestrian trackers, this paper draws on the Omni-scale feature learning module in OSNet to redesign the StrongSORT pedestrian appearance feature extraction network, and ultimately obtains the StrongSORT pedestrian tracker based on omni-scale feature fusion, further enhancing its pedestrian feature extraction ability. In terms of experimental results. The experimental results of the pedestrian multi objects tracking model based on the TBD strategy in this paper on the MOT16 dataset show that the proposed pedestrian multi-objective tracking model can effectively improve the accuracy of pedestrian multi objects tracking and reduce the problem of frequent pedestrian ID number switching.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140512240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on the simplification of building complex model under multi-factor constraints 多因素约束下建筑复杂模型的简化研究
Haoyuan Bai, Kelong Yang, Shunhua Liao
With the wide application of 3D building cluster models in urban planning, visualization and other fields, how to improve the rendering efficiency and reduce the computational cost of building cluster models has become an important issue. To address this problem, this paper proposes a visual perception evaluation model used to assess the weights of buildings based on multi-factor considerations to determine the order of building simplification, and weights the vertex importance for the classical QEM algorithm to redefine the collapsing cost of the edges, which achieves the purpose of reducing the complexity of the model while maintaining the visual quality. Experimental results show that the algorithm can significantly reduce the model rendering time and computational cost while maintaining the visual quality.
随着三维建筑群模型在城市规划、可视化等领域的广泛应用,如何提高建筑群模型的渲染效率、降低计算成本已成为一个重要课题。针对这一问题,本文提出了一种基于多因素考虑的视觉感知评估模型,用于评估建筑物的权重,确定建筑物简化的顺序,并对经典 QEM 算法的顶点重要性进行加权,重新定义边缘的折叠成本,达到了在保持视觉质量的前提下降低模型复杂度的目的。实验结果表明,该算法可以在保证视觉质量的前提下显著减少模型渲染时间和计算成本。
{"title":"Research on the simplification of building complex model under multi-factor constraints","authors":"Haoyuan Bai, Kelong Yang, Shunhua Liao","doi":"10.1117/12.3014388","DOIUrl":"https://doi.org/10.1117/12.3014388","url":null,"abstract":"With the wide application of 3D building cluster models in urban planning, visualization and other fields, how to improve the rendering efficiency and reduce the computational cost of building cluster models has become an important issue. To address this problem, this paper proposes a visual perception evaluation model used to assess the weights of buildings based on multi-factor considerations to determine the order of building simplification, and weights the vertex importance for the classical QEM algorithm to redefine the collapsing cost of the edges, which achieves the purpose of reducing the complexity of the model while maintaining the visual quality. Experimental results show that the algorithm can significantly reduce the model rendering time and computational cost while maintaining the visual quality.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RGB-D visual SLAM for point association local edge features 针对点关联局部边缘特征的 RGB-D 视觉 SLAM
Hongtu Li, Fang Wang, Yunjiang Zhang
Aiming at the difficulty of point feature matching in 3D reconstruction to meet the tracking requirements of weakly textured scenes, this paper proposes a visual SLAM algorithm based on grid method combining points with edge features. In the tracking thread, a method based on grid method is proposed to evaluate the feature quality of points. The textures of external environment are judged according to ORB feature description, and the information of Canny edge features of weakly textured mesh is added to improve the positioning accuracy. In the local mapping thread, the joint feature points pose and map points are iteratively optimized to improve the convergence rate of the algorithm. The simulation results show that the proposed algorithm has a good location and tracking effects in the weak texture scene.
针对三维重建中点特征匹配难以满足弱纹理场景跟踪要求的问题,本文提出了一种基于网格法的视觉 SLAM 算法,将点与边缘特征相结合。在跟踪线程中,提出了一种基于网格法的点特征质量评估方法。根据 ORB 特征描述对外部环境的纹理进行判断,并加入弱纹理网格的 Canny 边缘特征信息,以提高定位精度。在局部映射线程中,对联合特征点姿态和映射点进行迭代优化,以提高算法的收敛速度。仿真结果表明,所提出的算法在弱纹理场景中具有良好的定位和跟踪效果。
{"title":"RGB-D visual SLAM for point association local edge features","authors":"Hongtu Li, Fang Wang, Yunjiang Zhang","doi":"10.1117/12.3014358","DOIUrl":"https://doi.org/10.1117/12.3014358","url":null,"abstract":"Aiming at the difficulty of point feature matching in 3D reconstruction to meet the tracking requirements of weakly textured scenes, this paper proposes a visual SLAM algorithm based on grid method combining points with edge features. In the tracking thread, a method based on grid method is proposed to evaluate the feature quality of points. The textures of external environment are judged according to ORB feature description, and the information of Canny edge features of weakly textured mesh is added to improve the positioning accuracy. In the local mapping thread, the joint feature points pose and map points are iteratively optimized to improve the convergence rate of the algorithm. The simulation results show that the proposed algorithm has a good location and tracking effects in the weak texture scene.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of customer electricity usage anomalies based on random matrix theory 基于随机矩阵理论识别用户用电异常情况
Shuo Zhou, Qihui Wang
A detection algorithm of maximum and minimum eigenvalues based on random matrix theory is proposed for the problem of abnormal detection of customer electricity consumption. Firstly, the data source matrix is constructed by time alignment and superimposed Gaussian white noise, and the sliding window method is used to obtain the window data indicating the operation status at each moment; secondly, the window data are standardized, feature extraction and other operations are performed, and the difference and the sum of the maximum and minimum eigenvalues are compared to construct the feature detection indexes and thresholds; finally, the algorithm is studied and verified by simulation. The results show that the algorithm does not depend on any model, can analyze the operation status of the system more comprehensively and adequately, and realizes the effective detection of abnormal data
针对用户用电异常检测问题,提出了一种基于随机矩阵理论的最大最小特征值检测算法。首先,通过时间对齐和叠加高斯白噪声构建数据源矩阵,并利用滑动窗口法得到表示各时刻运行状态的窗口数据;其次,对窗口数据进行标准化处理,并进行特征提取等操作,比较最大特征值和最小特征值之差和,构建特征检测指标和阈值;最后,对算法进行仿真研究和验证。结果表明,该算法不依赖于任何模型,能更全面、更充分地分析系统的运行状况,实现对异常数据的有效检测。
{"title":"Identification of customer electricity usage anomalies based on random matrix theory","authors":"Shuo Zhou, Qihui Wang","doi":"10.1117/12.3014405","DOIUrl":"https://doi.org/10.1117/12.3014405","url":null,"abstract":"A detection algorithm of maximum and minimum eigenvalues based on random matrix theory is proposed for the problem of abnormal detection of customer electricity consumption. Firstly, the data source matrix is constructed by time alignment and superimposed Gaussian white noise, and the sliding window method is used to obtain the window data indicating the operation status at each moment; secondly, the window data are standardized, feature extraction and other operations are performed, and the difference and the sum of the maximum and minimum eigenvalues are compared to construct the feature detection indexes and thresholds; finally, the algorithm is studied and verified by simulation. The results show that the algorithm does not depend on any model, can analyze the operation status of the system more comprehensively and adequately, and realizes the effective detection of abnormal data","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1