Beyond Two-Tower: Attribute Guided Representation Learning for Candidate Retrieval

Proceedings of the ACM Web Conference 2023 Pub Date : 2023-04-30 DOI:10.1145/3543507.3583254

Hongyuan Shan, Qishen Zhang, Zhongyi Liu, Guannan Zhang, Chenliang Li

{"title":"Beyond Two-Tower: Attribute Guided Representation Learning for Candidate Retrieval","authors":"Hongyuan Shan, Qishen Zhang, Zhongyi Liu, Guannan Zhang, Chenliang Li","doi":"10.1145/3543507.3583254","DOIUrl":null,"url":null,"abstract":"Candidate retrieval is a key part of the modern search engines whose goal is to find candidate items that are semantically related to the query from a large item pool. The core difference against the later ranking stage is the requirement of low latency. Hence, two-tower structure with two parallel yet independent encoder for both query and item is prevalent in many systems. In these efforts, the semantic information of a query and a candidate item is fed into the corresponding encoder and then use their representations for retrieval. With the popularity of pre-trained semantic models, the state-of-the-art for semantic retrieval tasks has achieved the significant performance gain. However, the capacity of learning relevance signals is still limited by the isolation between the query and the item. The interaction-based modeling between the query and the item has been widely validated to be useful for the ranking stage, where more computation cost is affordable. Here, we are quite initerested in an demanding question: how to exploiting query-item interaction-based learning to enhance candidate retrieval and still maintain the low computation cost. Note that an item usually contain various heteorgeneous attributes which could help us understand the item characteristics more precisely. To this end, we propose a novel attribute guided representation learning framework (named AGREE) to enhance the candidate retrieval by exploiting query-attribute relevance. The key idea is to couple the query and item representation learning together during the training phase, but also enable easy decoupling for efficient inference. Specifically, we introduce an attribute fusion layer in the item side to identify most relevant item features for item representation. On the query side, an attribute-aware learning process is introduced to better infer the search intent also from these attributes. After model training, we then decouple the attribute information away from the query encoder, which guarantees the low latency for the inference phase. Extensive experiments over two real-world large-scale datasets demonstrate the superiority of the proposed AGREE against several state-of-the-art technical alternatives. Further online A/B test from AliPay search servise also show that AGREE achieves substantial performance gain over four business metrics. Currently, the proposed AGREE has been deployed online in AliPay for serving major traffic.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Web Conference 2023","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3543507.3583254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Candidate retrieval is a key part of the modern search engines whose goal is to find candidate items that are semantically related to the query from a large item pool. The core difference against the later ranking stage is the requirement of low latency. Hence, two-tower structure with two parallel yet independent encoder for both query and item is prevalent in many systems. In these efforts, the semantic information of a query and a candidate item is fed into the corresponding encoder and then use their representations for retrieval. With the popularity of pre-trained semantic models, the state-of-the-art for semantic retrieval tasks has achieved the significant performance gain. However, the capacity of learning relevance signals is still limited by the isolation between the query and the item. The interaction-based modeling between the query and the item has been widely validated to be useful for the ranking stage, where more computation cost is affordable. Here, we are quite initerested in an demanding question: how to exploiting query-item interaction-based learning to enhance candidate retrieval and still maintain the low computation cost. Note that an item usually contain various heteorgeneous attributes which could help us understand the item characteristics more precisely. To this end, we propose a novel attribute guided representation learning framework (named AGREE) to enhance the candidate retrieval by exploiting query-attribute relevance. The key idea is to couple the query and item representation learning together during the training phase, but also enable easy decoupling for efficient inference. Specifically, we introduce an attribute fusion layer in the item side to identify most relevant item features for item representation. On the query side, an attribute-aware learning process is introduced to better infer the search intent also from these attributes. After model training, we then decouple the attribute information away from the query encoder, which guarantees the low latency for the inference phase. Extensive experiments over two real-world large-scale datasets demonstrate the superiority of the proposed AGREE against several state-of-the-art technical alternatives. Further online A/B test from AliPay search servise also show that AGREE achieves substantial performance gain over four business metrics. Currently, the proposed AGREE has been deployed online in AliPay for serving major traffic.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

超越双塔:用于候选检索的属性引导表示学习

候选检索是现代搜索引擎的关键部分，其目标是从一个大的项目池中找到与查询在语义上相关的候选项目。与后面的排名阶段的核心区别是对低延迟的要求。因此，在许多系统中，查询和项目都采用两个并行但独立的编码器的双塔结构。在这些工作中，查询和候选项的语义信息被输入到相应的编码器中，然后使用它们的表示进行检索。随着预训练语义模型的普及，语义检索任务的性能得到了显著提高。然而，相关性信号的学习能力仍然受到查询和项目之间的隔离的限制。查询和项目之间基于交互的建模已被广泛验证，可用于排序阶段，因为在排序阶段可以负担得起更多的计算成本。如何利用基于查询项交互的学习来增强候选检索，同时保持较低的计算成本，这是我们非常感兴趣的问题。请注意，一个项目通常包含各种异构属性，这些属性可以帮助我们更准确地理解项目特征。为此，我们提出了一种新的属性引导表示学习框架(命名为AGREE)，通过利用查询-属性相关性来增强候选检索。关键思想是在训练阶段将查询和项表示学习耦合在一起，但也可以轻松解耦以实现有效的推理。具体来说，我们在项目端引入了一个属性融合层，以识别项目表示中最相关的项目特征。在查询端，引入了属性感知学习过程，从这些属性中更好地推断出搜索意图。在模型训练之后，我们将属性信息与查询编码器解耦，这保证了推理阶段的低延迟。在两个真实世界的大规模数据集上进行的大量实验表明，与几种最先进的技术替代方案相比，所提出的AGREE具有优越性。支付宝搜索服务的进一步在线A/B测试也表明，AGREE在四个业务指标上取得了实质性的性能提升。目前，提议的协议已经在线部署在支付宝中，用于服务主要流量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助