{"title":"用VOD演讲日语字幕的共现词估计电影片段的关联","authors":"Nobuyuki Kobayashi, Shingo Nakamura, Hiromitsu Shiina, Fumio Kitagawa","doi":"10.1109/IIAI-AAI.2014.15","DOIUrl":null,"url":null,"abstract":"Many universities are now using Video On Demand (VOD) lectures, which utilize the Internet to conduct lectures. However, because current systems are yet to implement search functionality for VOD, users must search manually by selecting movies based on their titles, playing a portion of the movies, and searching for the contents they are seeking. This study aims to create a search function that allows users to easily search important points and points to review. We propose a system in which the search function applies mixed beta distributions to the occurrence frequency of a search word based on the appearance frequency of the search word in subtitle data. Using the components of these distributions, the algorithm estimates the movie segments the users are searching for. The shapes of approximation distribution components differ in the mixed beta distribution. In this method, in addition to movie segment estimation for a search word, a frequency distribution is found for words appearing in the same text of the subtitles as the search word, called \"co-occurrence words,\" and by applying mixed beta distribution in the same manner to the frequency distribution of the search word, the movie segments common to the original search word and the co-occurrence word are estimated. However, in cases where few words appear, approximation is difficult with the mixed beta distribution, movie segments of the slides where the words appear are provided. Furthermore, by creating a scatter graph related to the occurrence time and distance of the search word and co-occurrence words, the emergence of the words can be comprehended visually, and the movies with high association and low association among the co-occurrence words, as well as the scope of the slides with high association and low association can be confirmed.","PeriodicalId":432222,"journal":{"name":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimating Association of Movie Segment by Co-occurrence Word from Japanese Subtitles of VOD Lecture\",\"authors\":\"Nobuyuki Kobayashi, Shingo Nakamura, Hiromitsu Shiina, Fumio Kitagawa\",\"doi\":\"10.1109/IIAI-AAI.2014.15\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many universities are now using Video On Demand (VOD) lectures, which utilize the Internet to conduct lectures. However, because current systems are yet to implement search functionality for VOD, users must search manually by selecting movies based on their titles, playing a portion of the movies, and searching for the contents they are seeking. This study aims to create a search function that allows users to easily search important points and points to review. We propose a system in which the search function applies mixed beta distributions to the occurrence frequency of a search word based on the appearance frequency of the search word in subtitle data. Using the components of these distributions, the algorithm estimates the movie segments the users are searching for. The shapes of approximation distribution components differ in the mixed beta distribution. In this method, in addition to movie segment estimation for a search word, a frequency distribution is found for words appearing in the same text of the subtitles as the search word, called \\\"co-occurrence words,\\\" and by applying mixed beta distribution in the same manner to the frequency distribution of the search word, the movie segments common to the original search word and the co-occurrence word are estimated. However, in cases where few words appear, approximation is difficult with the mixed beta distribution, movie segments of the slides where the words appear are provided. Furthermore, by creating a scatter graph related to the occurrence time and distance of the search word and co-occurrence words, the emergence of the words can be comprehended visually, and the movies with high association and low association among the co-occurrence words, as well as the scope of the slides with high association and low association can be confirmed.\",\"PeriodicalId\":432222,\"journal\":{\"name\":\"2014 IIAI 3rd International Conference on Advanced Applied Informatics\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IIAI 3rd International Conference on Advanced Applied Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAI-AAI.2014.15\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2014.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Estimating Association of Movie Segment by Co-occurrence Word from Japanese Subtitles of VOD Lecture
Many universities are now using Video On Demand (VOD) lectures, which utilize the Internet to conduct lectures. However, because current systems are yet to implement search functionality for VOD, users must search manually by selecting movies based on their titles, playing a portion of the movies, and searching for the contents they are seeking. This study aims to create a search function that allows users to easily search important points and points to review. We propose a system in which the search function applies mixed beta distributions to the occurrence frequency of a search word based on the appearance frequency of the search word in subtitle data. Using the components of these distributions, the algorithm estimates the movie segments the users are searching for. The shapes of approximation distribution components differ in the mixed beta distribution. In this method, in addition to movie segment estimation for a search word, a frequency distribution is found for words appearing in the same text of the subtitles as the search word, called "co-occurrence words," and by applying mixed beta distribution in the same manner to the frequency distribution of the search word, the movie segments common to the original search word and the co-occurrence word are estimated. However, in cases where few words appear, approximation is difficult with the mixed beta distribution, movie segments of the slides where the words appear are provided. Furthermore, by creating a scatter graph related to the occurrence time and distance of the search word and co-occurrence words, the emergence of the words can be comprehended visually, and the movies with high association and low association among the co-occurrence words, as well as the scope of the slides with high association and low association can be confirmed.