Automated 3D object identification using Bayesian networks

2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009) Pub Date : 2009-10-01 DOI:10.1109/AIPR.2009.5466289

Prudhvi K. Gurram, E. Saber, F. Sahin, H. Rhody

{"title":"Automated 3D object identification using Bayesian networks","authors":"Prudhvi K. Gurram, E. Saber, F. Sahin, H. Rhody","doi":"10.1109/AIPR.2009.5466289","DOIUrl":null,"url":null,"abstract":"3D object reconstruction from images involves two important parts: object identification and object modeling. Human beings are very adept at automatically identifying different objects in a scene due to the extensive training they receive over their lifetimes. Similarly, machines need to be trained to perform this task. At present, automated 3D object identification process from aerial video imagery encounters various problems due to uncertainties in data. The first problem is setting the input parameters of segmentation algorithm for accurate identification of the homogeneous surfaces in the scene. The second problem is deterministic inference used on the features extracted from these homogeneous surfaces or segments to identify different objects such as buildings, and trees. These problems would result in the 3D models being overfitted to a particular data set as a result of which they would fail when applied to other data sets. In this paper, an algorithm for using probabilistic inference to determine input segmentation parameters and to identify 3D objects from aerial video imagery is described. Bayesian networks are used to perform the probabilistic inference. In order to improve the accuracy of the identification process, information from Lidar data is fused with the visual imagery in a Bayesian network. The imagery is generated using the DIRSIG (Digital Imaging and Remote Sensing Image Generation) model at RIT. The parameters of the airborne sensor such as focal length, detector size, average flying height and the external parameters such as solar zenith angle can be simulated using this tool. The results show a significant improvement in the accuracy of object identification when Lidar data is fused with visual imagery compared to that when visual imagery is used alone.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2009.5466289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

3D object reconstruction from images involves two important parts: object identification and object modeling. Human beings are very adept at automatically identifying different objects in a scene due to the extensive training they receive over their lifetimes. Similarly, machines need to be trained to perform this task. At present, automated 3D object identification process from aerial video imagery encounters various problems due to uncertainties in data. The first problem is setting the input parameters of segmentation algorithm for accurate identification of the homogeneous surfaces in the scene. The second problem is deterministic inference used on the features extracted from these homogeneous surfaces or segments to identify different objects such as buildings, and trees. These problems would result in the 3D models being overfitted to a particular data set as a result of which they would fail when applied to other data sets. In this paper, an algorithm for using probabilistic inference to determine input segmentation parameters and to identify 3D objects from aerial video imagery is described. Bayesian networks are used to perform the probabilistic inference. In order to improve the accuracy of the identification process, information from Lidar data is fused with the visual imagery in a Bayesian network. The imagery is generated using the DIRSIG (Digital Imaging and Remote Sensing Image Generation) model at RIT. The parameters of the airborne sensor such as focal length, detector size, average flying height and the external parameters such as solar zenith angle can be simulated using this tool. The results show a significant improvement in the accuracy of object identification when Lidar data is fused with visual imagery compared to that when visual imagery is used alone.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用贝叶斯网络的自动三维物体识别

从图像中重建三维物体包括两个重要部分:物体识别和物体建模。人类非常擅长自动识别场景中的不同物体，这是由于他们一生中接受了广泛的训练。同样，机器也需要经过训练才能完成这项任务。目前，基于航拍视频图像的三维目标自动识别过程中，由于数据的不确定性存在各种问题。第一个问题是设置分割算法的输入参数，以准确识别场景中的均匀曲面。第二个问题是对从这些同质表面或片段中提取的特征进行确定性推理，以识别不同的物体，如建筑物和树木。这些问题将导致3D模型过度拟合到特定的数据集，从而导致它们在应用于其他数据集时失败。本文描述了一种利用概率推理确定输入分割参数并从航拍视频图像中识别三维目标的算法。采用贝叶斯网络进行概率推理。为了提高识别过程的准确性，将激光雷达数据信息与视觉图像融合在贝叶斯网络中。图像是使用RIT的DIRSIG(数字成像和遥感图像生成)模型生成的。利用该工具可以模拟机载传感器的焦距、探测器尺寸、平均飞行高度等参数以及太阳天顶角等外部参数。结果表明，与单独使用视觉图像相比，激光雷达数据与视觉图像融合后的目标识别精度有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)

自引率

0.00%

发文量

期刊最新文献

Image-based querying of urban photos and videos Large-scale functional models of visual cortex for remote sensing Overhead imagery research data set — an annotated data library & tools to aid in the development of computer vision algorithms 3D shape retrieval by visual parts similarity Kalman filter based video background estimation