Design and implementation of one vertical video search engine
Yingyi Liang, Zhenyu He, Yi Li
Proceedings 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics (SPAC), 2014-12-15
DOI: 10.1109/SPAC.2014.6982660 (https://doi.org/10.1109/SPAC.2014.6982660)
This paper designs and implements a vertical video search engine based on the theory of vertical search engines. We first introduce vertical search engines and the state of research on them at home and abroad, analyze the principles behind implementing one, and present the key techniques used in this work: topic-specific information acquisition, Chinese word segmentation, and re-sorting of search results. We then describe how video resources are acquired and stored and how duplicate video resources are excluded. Next, we analyze Lucene, an information retrieval library with an advanced design and strong performance, and extend it with a Chinese word segmentation algorithm and a result sorting method. Unlike other current studies, our segmentation uses a variable-length matching strategy, with bidirectional matching for disambiguation. Compared with the latest open-source word segmentation algorithm, the segmentation algorithm designed in this paper performs better. Using the video resources fetched from the Internet and VKAnalyzer, the Chinese word segmenter designed and implemented in this paper as an extension of Lucene, we design several ways to re-sort related videos, such as by length, times, and comments, and implement sorting of search results according to users' various requirements. Experiments show that the search engine achieves a recall rate of 90% and an accuracy of 97%, which is satisfactory.
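The abstract does not give the details of VKAnalyzer, so the following is only a minimal sketch of the general technique it names: dictionary-based matching with a variable-length window, run in both directions, with a rule for choosing between the two results when they disagree. The function names, the toy dictionary, and the tie-breaking rule (fewer tokens, then fewer single-character tokens, then the backward result) are assumptions for illustration and are not taken from the paper.

```python
def forward_max_match(text, vocab, max_len):
    """Scan left to right, taking the longest dictionary word at each position."""
    tokens, i = [], 0
    while i < len(text):
        # Try the longest window first and shrink it until a dictionary
        # word is found; a single character is always accepted.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def backward_max_match(text, vocab, max_len):
    """Scan right to left, taking the longest dictionary word ending at each position."""
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()  # tokens were collected from the end of the text
    return tokens


def segment(text, vocab):
    """Run both directions and pick one result (assumed heuristic, not the paper's)."""
    max_len = max((len(word) for word in vocab), default=1)
    fwd = forward_max_match(text, vocab, max_len)
    bwd = backward_max_match(text, vocab, max_len)
    if fwd == bwd:
        return fwd

    def cost(tokens):
        # Prefer fewer tokens, then fewer single-character tokens.
        return (len(tokens), sum(1 for t in tokens if len(t) == 1))

    # On a full tie, fall back to the backward result.
    return fwd if cost(fwd) < cost(bwd) else bwd


if __name__ == "__main__":
    toy_vocab = {"研究", "研究生", "生命", "起源"}  # hypothetical dictionary
    print(segment("研究生命的起源", toy_vocab))
```

In this toy example the forward pass greedily takes "研究生" and leaves a stray "命", while the backward pass recovers "生命"; the two results have the same number of tokens, so the rule on single-character tokens selects the backward segmentation.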