Xunsheng Du, Yuchen Jin, Xuqing Wu, Yu Liu, Xianping Wu, Omar Awan, Joey Roth, K. C. See, Nicolas Tognini, Jiefu Chen, Zhu Han
Title: The Embedded VGG-Net Video Stream Processing Framework for Real-Time Classification of Cutting Volume at Shale Shaker
DOI: 10.2523/IPTC-19312-MS (https://doi.org/10.2523/IPTC-19312-MS)
Published: 2019-03-22, in Day 3 Thu, March 28, 2019 (International Petroleum Technology Conference)
Citations: 0
Abstract
A deep learning framework is proposed and implemented for monitoring the cutting volume at the shale shaker in real time on a deep-water drilling rig. The framework performs classification and quantification on a live video stream. Compared to traditional video analytics methods, which are time-consuming, the proposed framework is more efficient and suitable for real-time video analysis. The real-time deep learning video analysis model consists of two parts. The first is a multi-threaded video processing engine. A modularized service named Rig-Site Virtual Presence (RSVP) provides real-time video streaming from the rig, and the engine performs real-time decoding, preprocessing, and encoding of the stream. The second is a customized deep classification model. Starting from a deep neural network (DNN), we make the following adaptations: 1) apply whitening and instance normalization to video frames; 2) optimize the number of convolutional layers and the number of nodes in the fully connected layers; 3) apply L2-norm regularization. The customized model is embedded in the multi-threaded video processing engine, which enables real-time inference. The deep learning model categorizes every video frame as "ExtraHeavy", "Heavy", "Light", or "None", and also outputs the corresponding numerical probability of each outcome. Training is performed on an NVIDIA GeForce 1070 GPU using a video stream with a 137 kbps bitrate, 5.84 frames/s, and a frame size of 720×486. With only a commodity CPU, inference with the pre-trained model can run in real time. Both labeled frames and numerical results are saved for later examination. Compared against manual labeling, the proposed deep learning framework achieves very promising results for analyzing video streams in real time.
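The abstract does not give the preprocessing code, so the exact recipe is unknown; a minimal NumPy sketch of the per-frame "whitening and instance normalization" step, under the assumption that each frame is normalized per channel to zero mean and unit variance independently of dataset statistics, might look like:

```python
import numpy as np

def instance_normalize(frame: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """Normalize one decoded video frame of shape (H, W, C) so that each
    channel has zero mean and unit variance.

    This is an illustrative assumption about the paper's whitening /
    instance-normalization step, not the authors' actual implementation.
    """
    frame = frame.astype(np.float32)
    mean = frame.mean(axis=(0, 1), keepdims=True)  # per-channel mean
    std = frame.std(axis=(0, 1), keepdims=True)    # per-channel std
    return (frame - mean) / (std + eps)            # whitened frame
```

Per-frame normalization is attractive here because rig-site lighting and camera exposure drift over a long stream, so dataset-level statistics would go stale.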
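The multi-threaded video processing engine is described only at a high level. A toy sketch of the idea, using a bounded queue to decouple frame production (decoding) from per-frame inference, could be structured as follows; `frames`, `classify`, and `run_pipeline` are illustrative names, not from the paper:

```python
import queue
import threading

_SENTINEL = object()  # tells workers the stream has ended

def run_pipeline(frames, classify, num_workers=2):
    """Toy multi-threaded frame pipeline (illustrative, not the paper's code).

    The main thread plays the role of the decoder, pushing (index, frame)
    pairs into a bounded queue; worker threads run `classify` on each frame.
    Results are returned in frame order.
    """
    work = queue.Queue(maxsize=16)  # bounded: back-pressure if inference lags
    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            item = work.get()
            if item is _SENTINEL:
                break
            idx, frame = item
            label = classify(frame)  # per-frame inference
            with lock:
                results[idx] = label

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for idx, frame in enumerate(frames):
        work.put((idx, frame))
    for _ in threads:
        work.put(_SENTINEL)  # one sentinel per worker
    for t in threads:
        t.join()
    return [results[i] for i in sorted(results)]
```

The bounded queue is the key design point: if inference falls behind the 5.84 frames/s stream, the decoder blocks instead of growing memory without limit.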
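The model reports a probability for each of the four classes. The paper does not state its output layer explicitly; assuming a standard softmax over four logits, the mapping from raw network outputs to the reported per-class probabilities can be sketched as:

```python
import numpy as np

# The four cutting-volume classes named in the paper.
CLASSES = ("ExtraHeavy", "Heavy", "Light", "None")

def class_probabilities(logits):
    """Map a frame's four raw outputs (logits) to per-class probabilities
    via softmax -- an assumed output layer, not confirmed by the paper."""
    z = np.asarray(logits, dtype=np.float64)
    z = z - z.max()                      # shift for numerical stability
    p = np.exp(z) / np.exp(z).sum()      # softmax: non-negative, sums to 1
    return dict(zip(CLASSES, p))
```

Saving these probabilities alongside the argmax label, as the framework does, lets an engineer later distinguish a confident "Heavy" call from a borderline one.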