{"title":"基于深度学习的视频监控系统研究","authors":"Chunfang Xue, Peng Liu, Weiping Liu","doi":"10.1109/IST48021.2019.9010234","DOIUrl":null,"url":null,"abstract":"This paper proposes a new video surveillance system designed for Deep Learning. The proposed system uses three steps to transfer RTSP streams to pictures for Deep Learning. First it decapsulates the streams, then decodes and converts color space & extracts frames. The proposed system has two ways to decode RTSP streams, hardware decoding and software decoding. By checking the processor's version of CPU firstly, system chooses a better way to decode. The proposed system has GPU and CPU. CPU is used to process RTSP streams, extract frames and do human-machine interaction. GPU is used for computing and analyzing the algorithms of Deep Learning. So the complex computing does not run on the CPU. The proposed system runs on Linux system and has Python interface, so it can easily connect with the models of Deep Learning. By running on multiple machines, the result shows that the proposed system can process up to 16 channels of stream. After 7*24 hours of testing on several machines, this system can run continuously without downtime and the delay time is less than 7 seconds.","PeriodicalId":117219,"journal":{"name":"2019 IEEE International Conference on Imaging Systems and Techniques (IST)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Studies on a Video Surveillance System Designed for Deep Learning\",\"authors\":\"Chunfang Xue, Peng Liu, Weiping Liu\",\"doi\":\"10.1109/IST48021.2019.9010234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a new video surveillance system designed for Deep Learning. The proposed system uses three steps to transfer RTSP streams to pictures for Deep Learning. First it decapsulates the streams, then decodes and converts color space & extracts frames. The proposed system has two ways to decode RTSP streams, hardware decoding and software decoding. By checking the processor's version of CPU firstly, system chooses a better way to decode. The proposed system has GPU and CPU. CPU is used to process RTSP streams, extract frames and do human-machine interaction. GPU is used for computing and analyzing the algorithms of Deep Learning. So the complex computing does not run on the CPU. The proposed system runs on Linux system and has Python interface, so it can easily connect with the models of Deep Learning. By running on multiple machines, the result shows that the proposed system can process up to 16 channels of stream. After 7*24 hours of testing on several machines, this system can run continuously without downtime and the delay time is less than 7 seconds.\",\"PeriodicalId\":117219,\"journal\":{\"name\":\"2019 IEEE International Conference on Imaging Systems and Techniques (IST)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Conference on Imaging Systems and Techniques (IST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IST48021.2019.9010234\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Imaging Systems and Techniques (IST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IST48021.2019.9010234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Studies on a Video Surveillance System Designed for Deep Learning
This paper proposes a new video surveillance system designed for Deep Learning. The proposed system uses three steps to transfer RTSP streams to pictures for Deep Learning. First it decapsulates the streams, then decodes and converts color space & extracts frames. The proposed system has two ways to decode RTSP streams, hardware decoding and software decoding. By checking the processor's version of CPU firstly, system chooses a better way to decode. The proposed system has GPU and CPU. CPU is used to process RTSP streams, extract frames and do human-machine interaction. GPU is used for computing and analyzing the algorithms of Deep Learning. So the complex computing does not run on the CPU. The proposed system runs on Linux system and has Python interface, so it can easily connect with the models of Deep Learning. By running on multiple machines, the result shows that the proposed system can process up to 16 channels of stream. After 7*24 hours of testing on several machines, this system can run continuously without downtime and the delay time is less than 7 seconds.