Automated 3D object identification using Bayesian networks

Prudhvi K. Gurram, E. Saber, F. Sahin, H. Rhody
{"title":"Automated 3D object identification using Bayesian networks","authors":"Prudhvi K. Gurram, E. Saber, F. Sahin, H. Rhody","doi":"10.1109/AIPR.2009.5466289","DOIUrl":null,"url":null,"abstract":"3D object reconstruction from images involves two important parts: object identification and object modeling. Human beings are very adept at automatically identifying different objects in a scene due to the extensive training they receive over their lifetimes. Similarly, machines need to be trained to perform this task. At present, automated 3D object identification process from aerial video imagery encounters various problems due to uncertainties in data. The first problem is setting the input parameters of segmentation algorithm for accurate identification of the homogeneous surfaces in the scene. The second problem is deterministic inference used on the features extracted from these homogeneous surfaces or segments to identify different objects such as buildings, and trees. These problems would result in the 3D models being overfitted to a particular data set as a result of which they would fail when applied to other data sets. In this paper, an algorithm for using probabilistic inference to determine input segmentation parameters and to identify 3D objects from aerial video imagery is described. Bayesian networks are used to perform the probabilistic inference. In order to improve the accuracy of the identification process, information from Lidar data is fused with the visual imagery in a Bayesian network. The imagery is generated using the DIRSIG (Digital Imaging and Remote Sensing Image Generation) model at RIT. The parameters of the airborne sensor such as focal length, detector size, average flying height and the external parameters such as solar zenith angle can be simulated using this tool. The results show a significant improvement in the accuracy of object identification when Lidar data is fused with visual imagery compared to that when visual imagery is used alone.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2009.5466289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

3D object reconstruction from images involves two important parts: object identification and object modeling. Human beings are very adept at automatically identifying different objects in a scene due to the extensive training they receive over their lifetimes. Similarly, machines need to be trained to perform this task. At present, automated 3D object identification process from aerial video imagery encounters various problems due to uncertainties in data. The first problem is setting the input parameters of segmentation algorithm for accurate identification of the homogeneous surfaces in the scene. The second problem is deterministic inference used on the features extracted from these homogeneous surfaces or segments to identify different objects such as buildings, and trees. These problems would result in the 3D models being overfitted to a particular data set as a result of which they would fail when applied to other data sets. In this paper, an algorithm for using probabilistic inference to determine input segmentation parameters and to identify 3D objects from aerial video imagery is described. Bayesian networks are used to perform the probabilistic inference. In order to improve the accuracy of the identification process, information from Lidar data is fused with the visual imagery in a Bayesian network. The imagery is generated using the DIRSIG (Digital Imaging and Remote Sensing Image Generation) model at RIT. The parameters of the airborne sensor such as focal length, detector size, average flying height and the external parameters such as solar zenith angle can be simulated using this tool. The results show a significant improvement in the accuracy of object identification when Lidar data is fused with visual imagery compared to that when visual imagery is used alone.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用贝叶斯网络的自动三维物体识别
从图像中重建三维物体包括两个重要部分:物体识别和物体建模。人类非常擅长自动识别场景中的不同物体,这是由于他们一生中接受了广泛的训练。同样,机器也需要经过训练才能完成这项任务。目前,基于航拍视频图像的三维目标自动识别过程中,由于数据的不确定性存在各种问题。第一个问题是设置分割算法的输入参数,以准确识别场景中的均匀曲面。第二个问题是对从这些同质表面或片段中提取的特征进行确定性推理,以识别不同的物体,如建筑物和树木。这些问题将导致3D模型过度拟合到特定的数据集,从而导致它们在应用于其他数据集时失败。本文描述了一种利用概率推理确定输入分割参数并从航拍视频图像中识别三维目标的算法。采用贝叶斯网络进行概率推理。为了提高识别过程的准确性,将激光雷达数据信息与视觉图像融合在贝叶斯网络中。图像是使用RIT的DIRSIG(数字成像和遥感图像生成)模型生成的。利用该工具可以模拟机载传感器的焦距、探测器尺寸、平均飞行高度等参数以及太阳天顶角等外部参数。结果表明,与单独使用视觉图像相比,激光雷达数据与视觉图像融合后的目标识别精度有显著提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Image-based querying of urban photos and videos Large-scale functional models of visual cortex for remote sensing Overhead imagery research data set — an annotated data library & tools to aid in the development of computer vision algorithms 3D shape retrieval by visual parts similarity Kalman filter based video background estimation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1