Video object matching based on SIFT algorithm

2008 International Conference on Neural Networks and Signal Processing Pub Date : 2008-06-07 DOI:10.1109/ICNNSP.2008.4590383

Xuelong Hu, Yingcheng Tang, Zheng-Ben Zhang

引用次数: 51

Abstract

SIFT (scale invariant feature transform) is used to solve visual tracking problem, where the appearances of the tracked object and scene background change during tracking. The implementation of this algorithm has five major stages: scale-space extrema detection; keypoint localization; orientation assignment; keypoint descriptor; keypoint matching. From the beginning frame, object is selected as the template, its SIFT features are computed. Then in the following frames, the SIFT features are computed. Euclidean distance between the object's SIFT features and the frames' SIFT features can be used to compute the accurate position of the matched object. The experimental results on real video sequences demonstrate the effectiveness of this approach and show this algorithm is of higher robustness and real-time performance. It can solve the matching problem with translation, rotation and affine distortion between images. It plays an important role in video object tracking and video object retrieval.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于SIFT算法的视频目标匹配

SIFT (scale invariant feature transform)用于解决视觉跟踪问题，在跟踪过程中被跟踪对象的外观和场景背景会发生变化。该算法的实现分为五个主要阶段:尺度空间极值检测;关键点定位;定向分配;关键点描述符;关键点匹配。从起始帧开始，选取目标作为模板，计算其SIFT特征。然后在接下来的帧中，计算SIFT特征。目标的SIFT特征与帧的SIFT特征之间的欧氏距离可以用来计算匹配目标的精确位置。在真实视频序列上的实验结果验证了该方法的有效性，表明该算法具有较高的鲁棒性和实时性。它可以解决图像间的平移、旋转和仿射畸变等匹配问题。它在视频目标跟踪和视频目标检索中起着重要的作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊