Super pixels transmission map-based object detection using deep neural network in UAV video

J. Evangelin, Deva Sheela, P. Arockia, J. Rani, M. A. Paul
{"title":"Super pixels transmission map-based object detection using deep neural network in UAV video","authors":"J. Evangelin, Deva Sheela, P. Arockia, J. Rani, M. A. Paul","doi":"10.1080/13682199.2023.2195121","DOIUrl":null,"url":null,"abstract":"ABSTRACT Object detection has become a very prominent subject for research in recent times. This study's main goal is to suggest a technique for video saliency object detection. It seems to sense that using the depth information in photos to detect salient things. Since depth offers abundant information about scene structure, object forms, and other 3D cues. This information is very compatible to distinguish between objects in the foreground and background. As a result of the high object density, small object size, and cluttered background, aerial photos and movies provide results with low precision. In this paper, the proposed SPTM (Super Pixel Transmission Map)-YOLO model, the input RGB image has applied Dark Channel Prior (DCP) method for estimating the transmission map. From the transmission map only, the background probability is estimated with the help of SLIC (simple linear iterative clustering algorithm) superpixel segmentation. That foreground extracted image is further learned with YOLO architecture to detect the objects effectively. For object detection in aerial images, this proposed SPTM-YOLO approach outperforms classic YOLO by up to 6% accuracy. Accurate detection of things that are small in size, partially occluded, and out of view is possible.","PeriodicalId":22456,"journal":{"name":"The Imaging Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Imaging Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/13682199.2023.2195121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

ABSTRACT Object detection has become a very prominent subject for research in recent times. This study's main goal is to suggest a technique for video saliency object detection. It seems to sense that using the depth information in photos to detect salient things. Since depth offers abundant information about scene structure, object forms, and other 3D cues. This information is very compatible to distinguish between objects in the foreground and background. As a result of the high object density, small object size, and cluttered background, aerial photos and movies provide results with low precision. In this paper, the proposed SPTM (Super Pixel Transmission Map)-YOLO model, the input RGB image has applied Dark Channel Prior (DCP) method for estimating the transmission map. From the transmission map only, the background probability is estimated with the help of SLIC (simple linear iterative clustering algorithm) superpixel segmentation. That foreground extracted image is further learned with YOLO architecture to detect the objects effectively. For object detection in aerial images, this proposed SPTM-YOLO approach outperforms classic YOLO by up to 6% accuracy. Accurate detection of things that are small in size, partially occluded, and out of view is possible.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
无人机视频中基于超像素传输图的深度神经网络目标检测
摘要:目标检测是近年来研究的一个非常突出的课题。本研究的主要目的是提出一种视频显著性目标检测技术。利用照片中的深度信息来发现突出的东西似乎是有意义的。因为深度提供了关于场景结构、对象形式和其他3D线索的丰富信息。这个信息非常兼容,可以区分前景和背景中的物体。由于物体密度高,物体尺寸小,背景杂乱,航空照片和电影提供的结果精度较低。本文提出了SPTM (Super Pixel Transmission Map)-YOLO模型,输入RGB图像采用暗通道先验(Dark Channel Prior, DCP)方法估计传输图。仅从传输图出发,借助SLIC(简单线性迭代聚类算法)超像素分割估计背景概率。利用YOLO架构对提取的前景图像进行进一步学习,有效检测目标。对于航空图像中的目标检测,本文提出的SPTM-YOLO方法比经典的YOLO方法准确率高出6%。精确地探测小的、部分遮挡的、在视线之外的物体是可能的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Impact of the Internet of Medical Things on Artificial Intelligence-enhanced medical imaging systems from 2019 to 2023 Advancements in adversarial generative text-to-image models: a review Enhancing image encryption security through integration multi-chaotic systems and mixed pixel-bit level Unsupervised low-light image enhancement by data augmentation and contrastive learning Minimum error threshold segmentation method for SAR image based on Rayleigh distribution assumption
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1