Almendra Prisila Laureano Lumba, Roy Roger Rios Nuñez, Isaac Ocampo Yahuarcani, Rodolfo Cárdenas Vigo, C. A. G. Cortegano, Alejandro Reategui Pezo, A. M. N. Satalaya, Edgar Gutiérrez Gómez, L. A. S. Llaja
{"title":"基于卷积神经网络、视频序列和高性能计算的实时基本暴力行为识别的计算解决方案","authors":"Almendra Prisila Laureano Lumba, Roy Roger Rios Nuñez, Isaac Ocampo Yahuarcani, Rodolfo Cárdenas Vigo, C. A. G. Cortegano, Alejandro Reategui Pezo, A. M. N. Satalaya, Edgar Gutiérrez Gómez, L. A. S. Llaja","doi":"10.1109/CLEI47609.2019.235100","DOIUrl":null,"url":null,"abstract":"Among the most important social problems that Peru is currently facing, the high rates of domestic violence (violence against women and children) and criminal acts (assaults and robberies) frequently carried out on the streets stand out. In this work, a real-time tool for the detection of two types of violent actions is proposed: kick and punch.This research proposes to use the CNN called YOLO (You Only Look Once). The methodology involves Supervised Learning and Transfer Learning techniques since there is a small batch of data for training. In addition, a database of 1000 images (Frames) has been generated from 90 video sequences showing violence, which were obtained from the internet (YouTube) and by own elaboration (video recording). Taking into account that conventional computers have many limitations and that this type of work requires a large computational capacity, the processing was carried out in the IIAP \"Manati\" Supercomputer, in this way the tool can be run in real time.This computer solution achieved 84% accuracy, to detect two main acts of violence: punch and kick; which shows an appropriate result for the use and application of the tool. The results are promising and show that the proposed strategy is adequate to reach a solution.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Computing Solution for the Recognition of Basic Actions of Violence in Real Time, from the use of Convolutional Neural Networks, Video Sequences and High Performance Computing\",\"authors\":\"Almendra Prisila Laureano Lumba, Roy Roger Rios Nuñez, Isaac Ocampo Yahuarcani, Rodolfo Cárdenas Vigo, C. A. G. Cortegano, Alejandro Reategui Pezo, A. M. N. Satalaya, Edgar Gutiérrez Gómez, L. A. S. Llaja\",\"doi\":\"10.1109/CLEI47609.2019.235100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Among the most important social problems that Peru is currently facing, the high rates of domestic violence (violence against women and children) and criminal acts (assaults and robberies) frequently carried out on the streets stand out. In this work, a real-time tool for the detection of two types of violent actions is proposed: kick and punch.This research proposes to use the CNN called YOLO (You Only Look Once). The methodology involves Supervised Learning and Transfer Learning techniques since there is a small batch of data for training. In addition, a database of 1000 images (Frames) has been generated from 90 video sequences showing violence, which were obtained from the internet (YouTube) and by own elaboration (video recording). Taking into account that conventional computers have many limitations and that this type of work requires a large computational capacity, the processing was carried out in the IIAP \\\"Manati\\\" Supercomputer, in this way the tool can be run in real time.This computer solution achieved 84% accuracy, to detect two main acts of violence: punch and kick; which shows an appropriate result for the use and application of the tool. The results are promising and show that the proposed strategy is adequate to reach a solution.\",\"PeriodicalId\":216193,\"journal\":{\"name\":\"2019 XLV Latin American Computing Conference (CLEI)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 XLV Latin American Computing Conference (CLEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CLEI47609.2019.235100\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 XLV Latin American Computing Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI47609.2019.235100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
秘鲁目前面临的最重要的社会问题之一是,经常在街头发生的家庭暴力(对妇女和儿童的暴力)和犯罪行为(袭击和抢劫)发生率很高。在这项工作中,提出了一种实时工具,用于检测两种类型的暴力行为:踢和打。本研究建议使用CNN的YOLO (You Only Look Once)。该方法涉及监督学习和迁移学习技术,因为有少量的数据用于训练。此外,从互联网(YouTube)和自己制作的(录像)中获得的90个暴力视频序列中产生了1000个图像(帧)的数据库。考虑到传统计算机有许多局限性,而且这类工作需要很大的计算能力,因此在IIAP“Manati”超级计算机中进行处理,这样该工具就可以实时运行。这个计算机解决方案达到了84%的准确率,检测两种主要的暴力行为:拳打脚踢;为该工具的使用和应用提供了合理的结果。结果是有希望的,表明所提出的策略足以达到解决方案。
Computing Solution for the Recognition of Basic Actions of Violence in Real Time, from the use of Convolutional Neural Networks, Video Sequences and High Performance Computing
Among the most important social problems that Peru is currently facing, the high rates of domestic violence (violence against women and children) and criminal acts (assaults and robberies) frequently carried out on the streets stand out. In this work, a real-time tool for the detection of two types of violent actions is proposed: kick and punch.This research proposes to use the CNN called YOLO (You Only Look Once). The methodology involves Supervised Learning and Transfer Learning techniques since there is a small batch of data for training. In addition, a database of 1000 images (Frames) has been generated from 90 video sequences showing violence, which were obtained from the internet (YouTube) and by own elaboration (video recording). Taking into account that conventional computers have many limitations and that this type of work requires a large computational capacity, the processing was carried out in the IIAP "Manati" Supercomputer, in this way the tool can be run in real time.This computer solution achieved 84% accuracy, to detect two main acts of violence: punch and kick; which shows an appropriate result for the use and application of the tool. The results are promising and show that the proposed strategy is adequate to reach a solution.