Topic-Aware Automatic Snippet Generation for Resolving Multiple Meaning on Web Search Result

Hiroyuki Abe, Masafumi Matsuhara, G. Chakraborty, H. Mabuchi
{"title":"Topic-Aware Automatic Snippet Generation for Resolving Multiple Meaning on Web Search Result","authors":"Hiroyuki Abe, Masafumi Matsuhara, G. Chakraborty, H. Mabuchi","doi":"10.1109/ICAWST.2018.8517190","DOIUrl":null,"url":null,"abstract":"In recent years, the amount of information on the Web is growing exponentially with the spread of the Internet. We generally use search engines to search for the intended information. However, the search engine displays the Web pages including the entered search query in list format. It is difficult for the user to find out the intended information if the entered search query is a word whose meaning depends on the situation and location of the user. It needs the intended information to the multiple hidden topics. In this research, we classify Web search results based on each topic. The topic is defined as the latent meaning, and the contents included in the word. Moreover, our method displays automatically generated snippets for each topic with the Web search results to the user. It is easy to find required information from snippets, even though the intended information is ambiguous. It first classifies the Web search results by Latent Dirichlet Allocation (LDA) which is a major topic model method. It then generates the snippets using Conditional Variational AutoEncoder (Conditional VAE) based on the clustering of We search results. It is expected that using LDA for the clustering will group the Web search result according to the latent meanings of the search query. Also, we expect that proper snippets will be generated for each topic by Conditional VAE. In this paper, we show that LDA is effective for the clustering of Web search results. Moreover, the snippets generated by Conditional VAE is able to generate sentences considered each topic.","PeriodicalId":277939,"journal":{"name":"2018 9th International Conference on Awareness Science and Technology (iCAST)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 9th International Conference on Awareness Science and Technology (iCAST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAWST.2018.8517190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, the amount of information on the Web is growing exponentially with the spread of the Internet. We generally use search engines to search for the intended information. However, the search engine displays the Web pages including the entered search query in list format. It is difficult for the user to find out the intended information if the entered search query is a word whose meaning depends on the situation and location of the user. It needs the intended information to the multiple hidden topics. In this research, we classify Web search results based on each topic. The topic is defined as the latent meaning, and the contents included in the word. Moreover, our method displays automatically generated snippets for each topic with the Web search results to the user. It is easy to find required information from snippets, even though the intended information is ambiguous. It first classifies the Web search results by Latent Dirichlet Allocation (LDA) which is a major topic model method. It then generates the snippets using Conditional Variational AutoEncoder (Conditional VAE) based on the clustering of We search results. It is expected that using LDA for the clustering will group the Web search result according to the latent meanings of the search query. Also, we expect that proper snippets will be generated for each topic by Conditional VAE. In this paper, we show that LDA is effective for the clustering of Web search results. Moreover, the snippets generated by Conditional VAE is able to generate sentences considered each topic.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
主题感知的自动片段生成,用于解决Web搜索结果中的多个含义
近年来,随着互联网的普及,网络上的信息量呈指数级增长。我们通常使用搜索引擎来搜索所需的信息。但是,搜索引擎以列表格式显示包含输入的搜索查询的Web页面。如果输入的搜索查询是一个单词,其含义取决于用户的情况和位置,则用户很难找到想要的信息。它需要多个隐藏主题的预期信息。在本研究中,我们基于每个主题对Web搜索结果进行分类。主题被定义为隐含意义,并包含在词中的内容。此外,我们的方法将为每个主题自动生成的片段与Web搜索结果一起显示给用户。从代码片段中很容易找到所需的信息,即使预期的信息是不明确的。该方法首先采用一种主流的主题模型方法——潜狄利克雷分配(Latent Dirichlet Allocation, LDA)对网络搜索结果进行分类。然后使用条件变分自动编码器(Conditional Variational AutoEncoder, Conditional VAE)基于We搜索结果的聚类生成片段。期望使用LDA进行聚类,根据搜索查询的潜在含义对Web搜索结果进行分组。此外,我们期望通过条件VAE为每个主题生成适当的片段。在本文中,我们证明了LDA对于Web搜索结果的聚类是有效的。此外,条件VAE生成的片段能够生成考虑每个主题的句子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Assistance for Drug Dispensing Using LED Notification and IR Sensor-based Monitoring Methods Exploring a Topical Representation of Documents for Recommendation Systems Why Tourists Don’t Visit Again? Pre-accident Situation Analysis Based on Locally of Motion Estimation of Influence of Each Variable on User’s Evaluation in Interactive Evolutionary Computation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1