{"title":"基于深度卷积神经网络和互概率k近邻的视频摘要","authors":"Jimson L, Dr. J. P. Ananth","doi":"10.1080/0952813X.2022.2078888","DOIUrl":null,"url":null,"abstract":"ABSTRACT The video summarisation is an advanced mechanism for enabling users to handle and browse large videos in an effective manner. Various video summarisation methods are developed in recent days, in which handling of synchronisation and timing issues remain as the important challenge. The proposed video summarisation technique produces a short summary from the huge video stream. Initially, from an input database, the cricket videos containing number of frames are fed to keyframe extraction unit. Here, the keyframe extraction is done by the Euclidean distance and discrete cosine transform, and the best keyframes are selected based on the Euclidean distance. The residual frame is obtained by passing the input frames through deep convolutional neural network. Then, the similarity is calculated by Bhattacharyya distance. For video summarisation process, the optimal frameset is evaluated by matching residual keyframe with obtained keyframes. Here, input queries consisting of face object are subjected to object matching process, which is performed using the proposed mutual probability-based k-nearest neighbour (MP-KNN) to obtain relevant frames based on texture features. The performance of the proposed MP-KNN is superior based on precision, recall, and F-measure with values 0.963, 0.960, and 0.909, respectively.","PeriodicalId":15677,"journal":{"name":"Journal of Experimental & Theoretical Artificial Intelligence","volume":"26 1","pages":"1251 - 1267"},"PeriodicalIF":1.7000,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Video Summarization using Deep Convolutional Neural Networks and Mutual Probability-based K-Nearest Neighbour\",\"authors\":\"Jimson L, Dr. J. P. Ananth\",\"doi\":\"10.1080/0952813X.2022.2078888\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT The video summarisation is an advanced mechanism for enabling users to handle and browse large videos in an effective manner. Various video summarisation methods are developed in recent days, in which handling of synchronisation and timing issues remain as the important challenge. The proposed video summarisation technique produces a short summary from the huge video stream. Initially, from an input database, the cricket videos containing number of frames are fed to keyframe extraction unit. Here, the keyframe extraction is done by the Euclidean distance and discrete cosine transform, and the best keyframes are selected based on the Euclidean distance. The residual frame is obtained by passing the input frames through deep convolutional neural network. Then, the similarity is calculated by Bhattacharyya distance. For video summarisation process, the optimal frameset is evaluated by matching residual keyframe with obtained keyframes. Here, input queries consisting of face object are subjected to object matching process, which is performed using the proposed mutual probability-based k-nearest neighbour (MP-KNN) to obtain relevant frames based on texture features. The performance of the proposed MP-KNN is superior based on precision, recall, and F-measure with values 0.963, 0.960, and 0.909, respectively.\",\"PeriodicalId\":15677,\"journal\":{\"name\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"volume\":\"26 1\",\"pages\":\"1251 - 1267\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2022-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1080/0952813X.2022.2078888\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental & Theoretical Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1080/0952813X.2022.2078888","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Video Summarization using Deep Convolutional Neural Networks and Mutual Probability-based K-Nearest Neighbour
ABSTRACT The video summarisation is an advanced mechanism for enabling users to handle and browse large videos in an effective manner. Various video summarisation methods are developed in recent days, in which handling of synchronisation and timing issues remain as the important challenge. The proposed video summarisation technique produces a short summary from the huge video stream. Initially, from an input database, the cricket videos containing number of frames are fed to keyframe extraction unit. Here, the keyframe extraction is done by the Euclidean distance and discrete cosine transform, and the best keyframes are selected based on the Euclidean distance. The residual frame is obtained by passing the input frames through deep convolutional neural network. Then, the similarity is calculated by Bhattacharyya distance. For video summarisation process, the optimal frameset is evaluated by matching residual keyframe with obtained keyframes. Here, input queries consisting of face object are subjected to object matching process, which is performed using the proposed mutual probability-based k-nearest neighbour (MP-KNN) to obtain relevant frames based on texture features. The performance of the proposed MP-KNN is superior based on precision, recall, and F-measure with values 0.963, 0.960, and 0.909, respectively.
期刊介绍:
Journal of Experimental & Theoretical Artificial Intelligence (JETAI) is a world leading journal dedicated to publishing high quality, rigorously reviewed, original papers in artificial intelligence (AI) research.
The journal features work in all subfields of AI research and accepts both theoretical and applied research. Topics covered include, but are not limited to, the following:
• cognitive science
• games
• learning
• knowledge representation
• memory and neural system modelling
• perception
• problem-solving