学习电子商务查询图像信息

U. Porwal
{"title":"学习电子商务查询图像信息","authors":"U. Porwal","doi":"10.1145/3372124.3372126","DOIUrl":null,"url":null,"abstract":"Computing similarity between a query and a document is fundamental in any information retrieval system. In search engines, computing query-document similarity is an essential step in both retrieval and ranking stages. In eBay search, document is an item and the query-item similarity can be computed by comparing different facets of the query-item pair. Query text can be compared with the text of the item title. Likewise, a category constraint applied on the query can be compared with the listing category of the item. However, images are one signal that are usually present in the items but are not present in the query. Images are one of the most intuitive signals used by users to determine the relevance of the item given a query. Including this signal in estimating similarity between the query-item pair is likely to improve the relevance of the search engine. We propose a novel way of deriving image information for queries. We attempt to learn image information for queries from item images instead of generating explicit image features or an image for queries. We use canonical correlation analysis (CCA) to learn a new subspace where projecting the original data will give us a new query and item representation. We hypothesize that this new query representation will also have image information about the query. We estimate the query-item similarity using a vector space model and report the performance of the proposed method on eBay's search data. We show 11.89% relevance improvement over the baseline using Area Under the Receiver Operating Characteristic curve (AUROC) as the evaluation metric. We also show 3.1% relevance improvement over the baseline with Area Under the Precision Recall Curve (AUPRC).","PeriodicalId":145556,"journal":{"name":"Proceedings of the 24th Australasian Document Computing Symposium","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Learning Image Information for eCommerce Queries\",\"authors\":\"U. Porwal\",\"doi\":\"10.1145/3372124.3372126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computing similarity between a query and a document is fundamental in any information retrieval system. In search engines, computing query-document similarity is an essential step in both retrieval and ranking stages. In eBay search, document is an item and the query-item similarity can be computed by comparing different facets of the query-item pair. Query text can be compared with the text of the item title. Likewise, a category constraint applied on the query can be compared with the listing category of the item. However, images are one signal that are usually present in the items but are not present in the query. Images are one of the most intuitive signals used by users to determine the relevance of the item given a query. Including this signal in estimating similarity between the query-item pair is likely to improve the relevance of the search engine. We propose a novel way of deriving image information for queries. We attempt to learn image information for queries from item images instead of generating explicit image features or an image for queries. We use canonical correlation analysis (CCA) to learn a new subspace where projecting the original data will give us a new query and item representation. We hypothesize that this new query representation will also have image information about the query. We estimate the query-item similarity using a vector space model and report the performance of the proposed method on eBay's search data. We show 11.89% relevance improvement over the baseline using Area Under the Receiver Operating Characteristic curve (AUROC) as the evaluation metric. We also show 3.1% relevance improvement over the baseline with Area Under the Precision Recall Curve (AUPRC).\",\"PeriodicalId\":145556,\"journal\":{\"name\":\"Proceedings of the 24th Australasian Document Computing Symposium\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 24th Australasian Document Computing Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3372124.3372126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 24th Australasian Document Computing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3372124.3372126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

计算查询和文档之间的相似性是任何信息检索系统的基础。在搜索引擎中,计算查询文档相似度是检索和排序阶段的重要步骤。在eBay搜索中,文档是一个项目,可以通过比较查询项目对的不同方面来计算查询项目的相似度。查询文本可以与项目标题的文本进行比较。同样,可以将应用于查询的类别约束与项目的列出类别进行比较。但是,图像是一种通常出现在项目中但不出现在查询中的信号。图像是用户用来确定给定查询项的相关性的最直观的信号之一。在估计查询项对之间的相似性时包含该信号可能会提高搜索引擎的相关性。我们提出了一种新的获取查询图像信息的方法。我们尝试从项目图像中学习图像信息,而不是为查询生成显式的图像特征或图像。我们使用典型相关分析(CCA)来学习一个新的子空间,在这个子空间中,投影原始数据将给我们一个新的查询和项目表示。我们假设这个新的查询表示也将包含关于查询的图像信息。我们使用向量空间模型估计查询项目相似度,并报告了该方法在eBay搜索数据上的性能。我们使用受试者工作特征曲线下面积(AUROC)作为评估指标,显示了比基线11.89%的相关性改善。我们还显示,与精确召回曲线下面积(AUPRC)的基线相比,相关性提高了3.1%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Learning Image Information for eCommerce Queries
Computing similarity between a query and a document is fundamental in any information retrieval system. In search engines, computing query-document similarity is an essential step in both retrieval and ranking stages. In eBay search, document is an item and the query-item similarity can be computed by comparing different facets of the query-item pair. Query text can be compared with the text of the item title. Likewise, a category constraint applied on the query can be compared with the listing category of the item. However, images are one signal that are usually present in the items but are not present in the query. Images are one of the most intuitive signals used by users to determine the relevance of the item given a query. Including this signal in estimating similarity between the query-item pair is likely to improve the relevance of the search engine. We propose a novel way of deriving image information for queries. We attempt to learn image information for queries from item images instead of generating explicit image features or an image for queries. We use canonical correlation analysis (CCA) to learn a new subspace where projecting the original data will give us a new query and item representation. We hypothesize that this new query representation will also have image information about the query. We estimate the query-item similarity using a vector space model and report the performance of the proposed method on eBay's search data. We show 11.89% relevance improvement over the baseline using Area Under the Receiver Operating Characteristic curve (AUROC) as the evaluation metric. We also show 3.1% relevance improvement over the baseline with Area Under the Precision Recall Curve (AUPRC).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Differences in language use: Insights from job and talent search Proceedings of the 24th Australasian Document Computing Symposium Taking Risks with Confidence Towards Automatically Classifying Case Law Citation Treatment Using Neural Networks Character Profiling in Low-Resource Language Documents
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1