{"title":"TV News Retrieval Based on Story Segmentation and Concept Association","authors":"Ruxandra Tapu, B. Mocanu, T. Zaharia","doi":"10.1109/SITIS.2016.60","DOIUrl":null,"url":null,"abstract":"In this paper we propose a novel method for TV news retrieval. A first stage concerns a temporal segmentation into stories units. Then, for each story the most relevant concepts are extracted based on a multimodal fusion between visual and textual information. By analyzing the video stream, we perform global frame representation, image retrieval and re-ranking, in order to determine, with high confidence, the segments boundaries. In addition, by using the video subtitle, we identify the most relevant concepts / topics addressed in each independent segment. The framework is evaluated using one week video archive of France Television and 20 journals from NBC and CNN TV stations. For the temporal video segmentation, our system returns high precision and recall scores, superior to 90%. Regarding the topic association technique, we obtain a mean average precision score superior to 0.5.","PeriodicalId":403704,"journal":{"name":"2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","volume":"359 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2016.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper we propose a novel method for TV news retrieval. A first stage concerns a temporal segmentation into stories units. Then, for each story the most relevant concepts are extracted based on a multimodal fusion between visual and textual information. By analyzing the video stream, we perform global frame representation, image retrieval and re-ranking, in order to determine, with high confidence, the segments boundaries. In addition, by using the video subtitle, we identify the most relevant concepts / topics addressed in each independent segment. The framework is evaluated using one week video archive of France Television and 20 journals from NBC and CNN TV stations. For the temporal video segmentation, our system returns high precision and recall scores, superior to 90%. Regarding the topic association technique, we obtain a mean average precision score superior to 0.5.