Object Detection based Approach for an Efficient Video Summarization with System Statistics over Cloud

Alok Negi, Krishan Kumar, Parul Saini, Shamal Kashid
{"title":"Object Detection based Approach for an Efficient Video Summarization with System Statistics over Cloud","authors":"Alok Negi, Krishan Kumar, Parul Saini, Shamal Kashid","doi":"10.1109/UPCON56432.2022.9986376","DOIUrl":null,"url":null,"abstract":"The tremendous volume of video data generated by industrial surveillance networks presents a number of difficulties when examining such videos for a variety of purposes, including video summarization (VS), analysis, indexing and retrieval. The task of creating video summaries is extremely difficult because of the huge amount of data, redundancy, interleaved views and light variations. Multiple object detection and identification in video is difficult for machines to recognize and classify. To address all such issues, multiple low-feature and clustering-based machine learning strategies that fail to completely exploit VS are recommended. In this work, we achieved VS by embedding deep neural network-based soft computing methods. Firstly, the objects in extracted frames are detected using YOLOv5, and then the frames without objects (useless frames) are removed. Video summary generation occurs with the help of frames containing Objects. To check the quality of the proposed work Summary length, precision, recall, PR curve, and mean average precision (mAP) are used and system resource utilization during the model training are also tracked. As a result, the proposed work was able to identify the most effective video summarization framework with best summary length under varying conditions.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"321 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986376","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The tremendous volume of video data generated by industrial surveillance networks presents a number of difficulties when examining such videos for a variety of purposes, including video summarization (VS), analysis, indexing and retrieval. The task of creating video summaries is extremely difficult because of the huge amount of data, redundancy, interleaved views and light variations. Multiple object detection and identification in video is difficult for machines to recognize and classify. To address all such issues, multiple low-feature and clustering-based machine learning strategies that fail to completely exploit VS are recommended. In this work, we achieved VS by embedding deep neural network-based soft computing methods. Firstly, the objects in extracted frames are detected using YOLOv5, and then the frames without objects (useless frames) are removed. Video summary generation occurs with the help of frames containing Objects. To check the quality of the proposed work Summary length, precision, recall, PR curve, and mean average precision (mAP) are used and system resource utilization during the model training are also tracked. As a result, the proposed work was able to identify the most effective video summarization framework with best summary length under varying conditions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于目标检测的云上系统统计高效视频摘要方法
工业监控网络产生的大量视频数据在为各种目的(包括视频摘要、分析、索引和检索)检查这些视频时提出了一些困难。由于大量的数据、冗余、交错的视图和光线变化,创建视频摘要的任务非常困难。视频中的多目标检测和识别是机器难以识别和分类的问题。为了解决所有这些问题,建议使用多种低特征和基于集群的机器学习策略,这些策略不能完全利用VS。在这项工作中,我们通过嵌入基于深度神经网络的软计算方法来实现VS。首先使用YOLOv5检测提取帧中的对象,然后去除没有对象的帧(无用帧)。视频摘要在包含对象的帧的帮助下生成。为了检查所提出的工作的质量,使用了摘要长度、精度、召回率、PR曲线和平均平均精度(mAP),并跟踪了模型训练过程中的系统资源利用率。结果表明,所提出的工作能够确定在不同条件下具有最佳摘要长度的最有效的视频摘要框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Mains Interface Circuit Design for Traveling Wave Tube Amplifier A Passive Technique for Detecting Islanding Using Voltage Sequence Component A Unified Framework for Covariance Adaptation with Multiple Source Domains Advance Sensor for Monitoring Electrolyte Leakage in Lithium-ion Batteries for Electric Vehicles A comparative study of survey papers based on energy efficient, coverage-aware, and fault tolerant in static sink node of WSN
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1