首页 > 最新文献

Proceedings of the ACM Web Conference 2023最新文献

英文 中文
Eligibility Mechanisms: Auctions Meet Information Retrieval 资格机制:拍卖满足信息检索
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583478
G. Goel, Renato Paes Leme, Jon Schneider, David R. M. Thompson, Hanrui Zhang
The design of internet advertisement systems is both an auction design problem and an information retrieval (IR) problem. As an auction, the designer needs to take the participants incentives into account. As an information retrieval problem, it needs to identify the ad that it is the most relevant to a user out of an enormous set of ad candidates. Those aspects are combined by first having an IR system narrow down the initial set of ad candidates to a manageable size followed by an auction that ranks and prices those candidates. If the IR system uses information about bids, agents could in principle manipulate the system by manipulating the IR stage even when the subsequent auction is truthful. In this paper we investigate the design of truthful IR mechanisms, which we term eligibility mechanisms. We model it as a truthful version of the stochastic probing problem. We show that there is a constant gap between the truthful and non-truthful versions of the stochastic probing problem and exhibit a constant approximation algorithm. En route, we also characterize the set of eligibility mechanisms, which provides necessary and sufficient conditions for an IR system to be truthful.
网络广告系统的设计既是一个拍卖设计问题,也是一个信息检索问题。作为拍卖,设计者需要考虑参与者的动机。作为一个信息检索问题,它需要从大量的候选广告中识别出与用户最相关的广告。这些方面首先通过IR系统将最初的候选广告缩小到可管理的规模,然后通过拍卖对这些候选广告进行排名和定价。如果IR系统使用有关出价的信息,则代理人原则上可以通过操纵IR阶段来操纵系统,即使随后的拍卖是真实的。在本文中,我们研究了真实的IR机制的设计,我们称之为资格机制。我们将其建模为随机探测问题的真实版本。我们证明了随机探测问题的真实和非真实版本之间存在一个恒定的差距,并展示了一个恒定的近似算法。在此过程中,我们还描述了一套资格机制,它为IR系统的真实性提供了必要和充分的条件。
{"title":"Eligibility Mechanisms: Auctions Meet Information Retrieval","authors":"G. Goel, Renato Paes Leme, Jon Schneider, David R. M. Thompson, Hanrui Zhang","doi":"10.1145/3543507.3583478","DOIUrl":"https://doi.org/10.1145/3543507.3583478","url":null,"abstract":"The design of internet advertisement systems is both an auction design problem and an information retrieval (IR) problem. As an auction, the designer needs to take the participants incentives into account. As an information retrieval problem, it needs to identify the ad that it is the most relevant to a user out of an enormous set of ad candidates. Those aspects are combined by first having an IR system narrow down the initial set of ad candidates to a manageable size followed by an auction that ranks and prices those candidates. If the IR system uses information about bids, agents could in principle manipulate the system by manipulating the IR stage even when the subsequent auction is truthful. In this paper we investigate the design of truthful IR mechanisms, which we term eligibility mechanisms. We model it as a truthful version of the stochastic probing problem. We show that there is a constant gap between the truthful and non-truthful versions of the stochastic probing problem and exhibit a constant approximation algorithm. En route, we also characterize the set of eligibility mechanisms, which provides necessary and sufficient conditions for an IR system to be truthful.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125418312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Hierarchy-Aware Graph Networks with Deep Dual Clustering for Session-based Recommendation 基于会话推荐的深度双聚类增强层次感知图网络
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583247
Jiajie Su, Chaochao Chen, Weiming Liu, Fei Wu, Xiaolin Zheng, Haoming Lyu
Session-based Recommendation aims at predicting the next interacted item based on short anonymous behavior sessions. However, existing solutions neglect to model two inherent properties of sequential representing distributions, i.e., hierarchy structures resulted from item popularity and collaborations existing in both intra- and inter-session. Tackling with these two factors at the same time is challenging. On the one hand, traditional Euclidean space utilized in previous studies fails to capture hierarchy structures due to a restricted representation ability. On the other hand, the intuitive apply of hyperbolic geometry could extract hierarchical patterns but more emphasis on degree distribution weakens intra- and inter-session collaborations. To address the challenges, we propose a Hierarchy-Aware Dual Clustering Graph Network (HADCG) model for session-based recommendation. Towards the first challenge, we design the hierarchy-aware graph modeling module which converts sessions into hyperbolic session graphs, adopting hyperbolic geometry in propagation and attention mechanism so as to integrate chronological and hierarchical information. As for the second challenge, we introduce the deep dual clustering module which develops a two-level clustering strategy, i.e., information regularizer for intra-session clustering and contrastive learner for inter-session clustering, to enhance hyperbolic representation learning from collaborative perspectives and further promote recommendation performance. Extensive experiments on three real-world datasets demonstrate the effectiveness of the proposed HADCG.
基于会话的推荐旨在基于简短的匿名行为会话预测下一个交互项。然而,现有的解决方案忽略了对序列表示分布的两个固有属性进行建模,即,由项目受欢迎程度和存在于会话内和会话间的协作引起的层次结构。同时处理这两个因素是具有挑战性的。一方面,以往研究中使用的传统欧几里得空间由于表征能力有限而无法捕捉层次结构。另一方面,双曲几何的直观应用可以提取层次模式,但更强调程度分布削弱了会议内和会议间的协作。为了解决这些挑战,我们提出了一种基于会话的推荐层次感知双聚类图网络(HADCG)模型。针对第一个挑战,我们设计了层次感知图建模模块,该模块将会话转换为双曲会话图,在传播和关注机制上采用双曲几何,以整合时间信息和层次信息。对于第二个挑战,我们引入了深度双聚类模块,该模块开发了两层聚类策略,即会话内聚类的信息正则化器和会话间聚类的对比学习器,从协作的角度增强双曲表示学习,进一步提高推荐性能。在三个真实数据集上的大量实验证明了所提出的hadgg的有效性。
{"title":"Enhancing Hierarchy-Aware Graph Networks with Deep Dual Clustering for Session-based Recommendation","authors":"Jiajie Su, Chaochao Chen, Weiming Liu, Fei Wu, Xiaolin Zheng, Haoming Lyu","doi":"10.1145/3543507.3583247","DOIUrl":"https://doi.org/10.1145/3543507.3583247","url":null,"abstract":"Session-based Recommendation aims at predicting the next interacted item based on short anonymous behavior sessions. However, existing solutions neglect to model two inherent properties of sequential representing distributions, i.e., hierarchy structures resulted from item popularity and collaborations existing in both intra- and inter-session. Tackling with these two factors at the same time is challenging. On the one hand, traditional Euclidean space utilized in previous studies fails to capture hierarchy structures due to a restricted representation ability. On the other hand, the intuitive apply of hyperbolic geometry could extract hierarchical patterns but more emphasis on degree distribution weakens intra- and inter-session collaborations. To address the challenges, we propose a Hierarchy-Aware Dual Clustering Graph Network (HADCG) model for session-based recommendation. Towards the first challenge, we design the hierarchy-aware graph modeling module which converts sessions into hyperbolic session graphs, adopting hyperbolic geometry in propagation and attention mechanism so as to integrate chronological and hierarchical information. As for the second challenge, we introduce the deep dual clustering module which develops a two-level clustering strategy, i.e., information regularizer for intra-session clustering and contrastive learner for inter-session clustering, to enhance hyperbolic representation learning from collaborative perspectives and further promote recommendation performance. Extensive experiments on three real-world datasets demonstrate the effectiveness of the proposed HADCG.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123677283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Geographic Information Retrieval Using Wikipedia Articles 地理信息检索使用维基百科文章
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583469
Amir Krause, S. Cohen
Assigning semantically relevant, real-world locations to documents opens new possibilities to perform geographic information retrieval. We propose a novel approach to automatically determine the latitude-longitude coordinates of appropriate Wikipedia articles with high accuracy, leveraging both text and metadata in the corpus. By examining articles whose base-truth coordinates are known, we show that our method attains a substantial improvement over state of the art works. We subsequently demonstrate how our approach could yield two benefits: (1) detecting significant geolocation errors in Wikipedia; and (2) proposing approximated coordinates for hundreds of thousands of articles which are not traditionally considered to be locations (such as events, ideas or people), opening new possibilities for conceptual geographic retrievals over Wikipedia.
将语义上相关的、真实世界的位置分配给文档,为执行地理信息检索提供了新的可能性。我们提出了一种利用语料库中的文本和元数据,高精度地自动确定适当维基百科文章的经纬度坐标的新方法。通过检查其基本真值坐标已知的文章,我们表明我们的方法比艺术作品的状态有了实质性的改进。我们随后展示了我们的方法如何产生两个好处:(1)检测维基百科中的重大地理定位错误;(2)为成千上万的传统上不被认为是位置(如事件、思想或人物)的文章提出近似坐标,为在维基百科上进行概念性地理检索开辟了新的可能性。
{"title":"Geographic Information Retrieval Using Wikipedia Articles","authors":"Amir Krause, S. Cohen","doi":"10.1145/3543507.3583469","DOIUrl":"https://doi.org/10.1145/3543507.3583469","url":null,"abstract":"Assigning semantically relevant, real-world locations to documents opens new possibilities to perform geographic information retrieval. We propose a novel approach to automatically determine the latitude-longitude coordinates of appropriate Wikipedia articles with high accuracy, leveraging both text and metadata in the corpus. By examining articles whose base-truth coordinates are known, we show that our method attains a substantial improvement over state of the art works. We subsequently demonstrate how our approach could yield two benefits: (1) detecting significant geolocation errors in Wikipedia; and (2) proposing approximated coordinates for hundreds of thousands of articles which are not traditionally considered to be locations (such as events, ideas or people), opening new possibilities for conceptual geographic retrievals over Wikipedia.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128576749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hidden Indicators of Collective Intelligence in Crowdfunding 众筹中集体智慧的隐性指标
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583414
Emőke-Ágnes Horvát, H. Dambanemuya, Jayaram Uparna, Brian Uzzi
Extensive literature argues that crowds possess essential collective intelligence benefits that allow superior decision-making by untrained individuals working in low-information environments. Classic wisdom of crowds theory is based on evidence gathered from studying large groups of diverse and independent decision-makers. Yet, most human decisions are reached in online settings of interconnected like-minded people that challenge these criteria. This observation raises a key question: Are there surprising expressions of collective intelligence online? Here, we explore whether crowds furnish collective intelligence benefits in crowdfunding systems. Crowdfunding has grown and diversified quickly over the past decade, expanding from funding aspirant creative works and supplying pro-social donations to enabling large citizen-funded urban projects and providing commercial interest-based unsecured loans. Using nearly 10 million loan contributions from a market-dominant lending platform, we find evidence for collective intelligence indicators in crowdfunding. Our results, which are based on a two-stage Heckman selection model, indicate that opinion diversity and the speed at which funds are contributed predict who gets funded and who repays, even after accounting for traditional measures of creditworthiness. Moreover, crowds work consistently well in correctly assessing the outcome of high-risk projects. Finally, diversity and speed serve as early warning signals when inferring fundraising based solely on the initial part of the campaign. Our findings broaden the field of crowd-aware system design and inform discussions about the augmentation of traditional financing systems with tech innovations.
大量文献认为,群体拥有基本的集体智慧优势,这使得未经训练的个人在低信息环境中工作时能够做出更好的决策。群体理论的经典智慧是基于对大量不同的、独立的决策者进行研究而得到的证据。然而,大多数人的决定都是在网络环境中做出的,这些环境中有志同道合的人相互联系,挑战了这些标准。这一观察提出了一个关键问题:网上是否存在集体智慧的惊人表现?在这里,我们探讨群体是否在众筹系统中提供集体智慧利益。在过去的十年中,众筹迅速发展并多样化,从资助有抱负的创意作品和提供亲社会捐赠,扩展到支持大型公民资助的城市项目,并提供基于商业利息的无抵押贷款。利用某市场主导型借贷平台近1000万笔贷款,我们发现了众筹中集体智慧指标存在的证据。我们基于两阶段赫克曼选择模型的研究结果表明,意见多样性和提供资金的速度预测了谁获得资金和谁偿还资金,即使在考虑了传统的信用衡量标准之后也是如此。此外,群体在正确评估高风险项目的结果方面一直表现良好。最后,多样性和速度可以作为早期预警信号,当仅仅根据活动的初始部分来推断筹款时。我们的研究结果拓宽了群体感知系统设计领域,并为有关利用技术创新增强传统融资系统的讨论提供了信息。
{"title":"Hidden Indicators of Collective Intelligence in Crowdfunding","authors":"Emőke-Ágnes Horvát, H. Dambanemuya, Jayaram Uparna, Brian Uzzi","doi":"10.1145/3543507.3583414","DOIUrl":"https://doi.org/10.1145/3543507.3583414","url":null,"abstract":"Extensive literature argues that crowds possess essential collective intelligence benefits that allow superior decision-making by untrained individuals working in low-information environments. Classic wisdom of crowds theory is based on evidence gathered from studying large groups of diverse and independent decision-makers. Yet, most human decisions are reached in online settings of interconnected like-minded people that challenge these criteria. This observation raises a key question: Are there surprising expressions of collective intelligence online? Here, we explore whether crowds furnish collective intelligence benefits in crowdfunding systems. Crowdfunding has grown and diversified quickly over the past decade, expanding from funding aspirant creative works and supplying pro-social donations to enabling large citizen-funded urban projects and providing commercial interest-based unsecured loans. Using nearly 10 million loan contributions from a market-dominant lending platform, we find evidence for collective intelligence indicators in crowdfunding. Our results, which are based on a two-stage Heckman selection model, indicate that opinion diversity and the speed at which funds are contributed predict who gets funded and who repays, even after accounting for traditional measures of creditworthiness. Moreover, crowds work consistently well in correctly assessing the outcome of high-risk projects. Finally, diversity and speed serve as early warning signals when inferring fundraising based solely on the initial part of the campaign. Our findings broaden the field of crowd-aware system design and inform discussions about the augmentation of traditional financing systems with tech innovations.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129411849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
CMINet: a Graph Learning Framework for Content-aware Multi-channel Influence Diffusion CMINet:面向内容感知的多通道影响扩散的图学习框架
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583465
Hsi-Wen Chen, De-Nian Yang, Wang-Chien Lee, P. Yu, Ming-Syan Chen
The phenomena of influence diffusion on social networks have received tremendous research interests in the past decade. While most prior works mainly focus on predicting the total influence spread on a single network, a marketing campaign that exploits influence diffusion often involves multiple channels with various information disseminated on different media. In this paper, we introduce a new influence estimation problem, namely Content-aware Multi-channel Influence Diffusion (CMID), and accordingly propose CMINet to predict newly influenced users, given a set of seed users with different multimedia contents. In CMINet, we first introduce DiffGNN to encode the influencing power of users (nodes) and Influence-aware Optimal Transport (IOT) to align the embeddings to address the distribution shift across different diffusion channels. Then, we transform CMID into a node classification problem and propose Social-based Multimedia Feature Extractor (SMFE) and Content-aware Multi-channel Influence Propagation (CMIP) to jointly learn the user preferences on multimedia contents and predict the susceptibility of users. Furthermore, we prove that CMINet preserves monotonicity and submodularity, thus enabling (1 − 1/e)-approximate solutions for influence maximization. Experimental results manifest that CMINet outperforms eleven baselines on three public datasets.
近十年来,社会网络上的影响扩散现象引起了人们极大的研究兴趣。以往的大部分工作主要集中在预测单个网络的总影响力传播,而利用影响力传播的营销活动往往涉及多个渠道,在不同的媒体上传播各种信息。本文引入了一个新的影响估计问题,即内容感知的多通道影响扩散(CMID),并在给定一组具有不同多媒体内容的种子用户的情况下,提出了CMINet来预测新影响用户。在CMINet中,我们首先引入DiffGNN对用户(节点)的影响力进行编码,并引入影响感知的最优传输(IOT)来对齐嵌入,以解决不同扩散通道之间的分布转移。然后,将CMID转化为节点分类问题,提出基于社交的多媒体特征提取器(SMFE)和内容感知多通道影响传播(CMIP),共同学习用户对多媒体内容的偏好,预测用户的敏感性。此外,我们证明了CMINet保持单调性和子模块化,从而实现(1−1/e)-近似解的影响最大化。实验结果表明,CMINet在三个公共数据集上优于11个基线。
{"title":"CMINet: a Graph Learning Framework for Content-aware Multi-channel Influence Diffusion","authors":"Hsi-Wen Chen, De-Nian Yang, Wang-Chien Lee, P. Yu, Ming-Syan Chen","doi":"10.1145/3543507.3583465","DOIUrl":"https://doi.org/10.1145/3543507.3583465","url":null,"abstract":"The phenomena of influence diffusion on social networks have received tremendous research interests in the past decade. While most prior works mainly focus on predicting the total influence spread on a single network, a marketing campaign that exploits influence diffusion often involves multiple channels with various information disseminated on different media. In this paper, we introduce a new influence estimation problem, namely Content-aware Multi-channel Influence Diffusion (CMID), and accordingly propose CMINet to predict newly influenced users, given a set of seed users with different multimedia contents. In CMINet, we first introduce DiffGNN to encode the influencing power of users (nodes) and Influence-aware Optimal Transport (IOT) to align the embeddings to address the distribution shift across different diffusion channels. Then, we transform CMID into a node classification problem and propose Social-based Multimedia Feature Extractor (SMFE) and Content-aware Multi-channel Influence Propagation (CMIP) to jointly learn the user preferences on multimedia contents and predict the susceptibility of users. Furthermore, we prove that CMINet preserves monotonicity and submodularity, thus enabling (1 − 1/e)-approximate solutions for influence maximization. Experimental results manifest that CMINet outperforms eleven baselines on three public datasets.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131160680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Differentiable Optimized Product Quantization and Beyond 可微优化积量化及其他
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583482
Zepu Lu, Defu Lian, Jin Zhang, Zaixin Zhang, Chao Feng, Hao Wang, Enhong Chen
Vector quantization techniques, such as Product Quantization (PQ), play a vital role in approximate nearest neighbor search (ANNs) and maximum inner product search (MIPS) owing to their remarkable search and storage efficiency. However, the indexes in vector quantization cannot be trained together with the inference models since data indexing is not differentiable. To this end, differentiable vector quantization approaches, such as DiffPQ and DeepPQ, have been recently proposed, but existing methods have two drawbacks. First, they do not impose any constraints on codebooks, such that the resultant codebooks lack diversity, leading to limited retrieval performance. Second, since data indexing resorts to operator, differentiability is usually achieved by either relaxation or Straight-Through Estimation (STE), which leads to biased gradient and slow convergence. To address these problems, we propose a Differentiable Optimized Product Quantization method (DOPQ) and beyond in this paper. Particularly, each data is projected into multiple orthogonal spaces, to generate multiple views of data. Thus, each codebook is learned with one view of data, guaranteeing the diversity of codebooks. Moreover, instead of simple differentiable relaxation, DOPQ optimizes the loss based on direct loss minimization, significantly reducing the gradient bias problem. Finally, DOPQ is evaluated with seven datasets of both recommendation and image search tasks. Extensive experimental results show that DOPQ outperforms state-of-the-art baselines by a large margin.
矢量量化技术,如积量化(PQ),由于其显著的搜索和存储效率,在近似最近邻搜索(ann)和最大内积搜索(MIPS)中起着至关重要的作用。然而,由于数据索引不可微,矢量量化中的索引不能与推理模型一起训练。为此,最近提出了可微矢量量化方法,如DiffPQ和DeepPQ,但现有方法存在两个缺点。首先,它们没有对码本施加任何约束,因此生成的码本缺乏多样性,导致检索性能受限。其次,由于数据索引依赖于算子,可微性通常通过松弛或直通估计(STE)来实现,这导致梯度偏置和收敛缓慢。为了解决这些问题,本文提出了一种可微优化积量化方法(DOPQ)。特别是,每个数据被投影到多个正交空间中,以生成数据的多个视图。因此,每个码本都是用一种数据视图来学习的,保证了码本的多样性。此外,DOPQ不是简单的可微松弛,而是基于直接损耗最小化来优化损耗,显著降低了梯度偏置问题。最后,利用推荐任务和图像搜索任务的7个数据集对DOPQ进行评估。大量的实验结果表明,DOPQ在很大程度上优于最先进的基线。
{"title":"Differentiable Optimized Product Quantization and Beyond","authors":"Zepu Lu, Defu Lian, Jin Zhang, Zaixin Zhang, Chao Feng, Hao Wang, Enhong Chen","doi":"10.1145/3543507.3583482","DOIUrl":"https://doi.org/10.1145/3543507.3583482","url":null,"abstract":"Vector quantization techniques, such as Product Quantization (PQ), play a vital role in approximate nearest neighbor search (ANNs) and maximum inner product search (MIPS) owing to their remarkable search and storage efficiency. However, the indexes in vector quantization cannot be trained together with the inference models since data indexing is not differentiable. To this end, differentiable vector quantization approaches, such as DiffPQ and DeepPQ, have been recently proposed, but existing methods have two drawbacks. First, they do not impose any constraints on codebooks, such that the resultant codebooks lack diversity, leading to limited retrieval performance. Second, since data indexing resorts to operator, differentiability is usually achieved by either relaxation or Straight-Through Estimation (STE), which leads to biased gradient and slow convergence. To address these problems, we propose a Differentiable Optimized Product Quantization method (DOPQ) and beyond in this paper. Particularly, each data is projected into multiple orthogonal spaces, to generate multiple views of data. Thus, each codebook is learned with one view of data, guaranteeing the diversity of codebooks. Moreover, instead of simple differentiable relaxation, DOPQ optimizes the loss based on direct loss minimization, significantly reducing the gradient bias problem. Finally, DOPQ is evaluated with seven datasets of both recommendation and image search tasks. Extensive experimental results show that DOPQ outperforms state-of-the-art baselines by a large margin.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126257004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
HybridEval: A Human-AI Collaborative Approach for Evaluating Design Ideas at Scale HybridEval:大规模评估设计理念的人类-人工智能协作方法
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583496
S. Mesbah, Ines Arous, Jie Yang, A. Bozzon
Evaluating design ideas is necessary to predict their success and assess their impact early on in the process. Existing methods rely either on metrics computed by systems that are effective but subject to errors and bias, or experts’ ratings, which are accurate but expensive and long to collect. Crowdsourcing offers a compelling way to evaluate a large number of design ideas in a short amount of time while being cost-effective. Workers’ evaluation is, however, less reliable and might substantially differ from experts’ evaluation. In this work, we investigate workers’ rating behavior and compare it with experts. First, we instrument a crowdsourcing study where we asked workers to evaluate design ideas from three innovation challenges. We show that workers share similar insights with experts but tend to rate more generously and weigh certain criteria more importantly. Next, we develop a hybrid human-AI approach that combines a machine learning model with crowdsourcing to evaluate ideas. Our approach models workers’ reliability and bias while leveraging ideas’ textual content to train a machine learning model. It is able to incorporate experts’ ratings whenever available, to supervise the model training and infer worker performance. Results show that our framework outperforms baseline methods and requires significantly less training data from experts, thus providing a viable solution for evaluating ideas at scale.
评估设计理念对于预测其成功和评估其在过程中的早期影响是必要的。现有的方法要么依赖于有效但存在误差和偏差的系统计算的指标,要么依赖于专家的评级,这是准确的,但昂贵且收集时间长。众包提供了一种令人信服的方法,可以在很短的时间内评估大量的设计理念,同时又具有成本效益。然而,工人的评价不太可靠,可能与专家的评价有很大的不同。在这项工作中,我们调查了工人的评级行为,并将其与专家进行了比较。首先,我们进行了一项众包研究,要求员工从三个创新挑战中评估设计想法。我们发现,员工与专家有着相似的见解,但往往更慷慨地打分,更看重某些标准。接下来,我们开发了一种混合的人类-人工智能方法,将机器学习模型与众包相结合来评估想法。我们的方法对工人的可靠性和偏见进行建模,同时利用思想的文本内容来训练机器学习模型。它能够随时结合专家的评级,监督模型培训并推断工人的表现。结果表明,我们的框架优于基线方法,并且需要的专家培训数据显着减少,从而为大规模评估想法提供了可行的解决方案。
{"title":"HybridEval: A Human-AI Collaborative Approach for Evaluating Design Ideas at Scale","authors":"S. Mesbah, Ines Arous, Jie Yang, A. Bozzon","doi":"10.1145/3543507.3583496","DOIUrl":"https://doi.org/10.1145/3543507.3583496","url":null,"abstract":"Evaluating design ideas is necessary to predict their success and assess their impact early on in the process. Existing methods rely either on metrics computed by systems that are effective but subject to errors and bias, or experts’ ratings, which are accurate but expensive and long to collect. Crowdsourcing offers a compelling way to evaluate a large number of design ideas in a short amount of time while being cost-effective. Workers’ evaluation is, however, less reliable and might substantially differ from experts’ evaluation. In this work, we investigate workers’ rating behavior and compare it with experts. First, we instrument a crowdsourcing study where we asked workers to evaluate design ideas from three innovation challenges. We show that workers share similar insights with experts but tend to rate more generously and weigh certain criteria more importantly. Next, we develop a hybrid human-AI approach that combines a machine learning model with crowdsourcing to evaluate ideas. Our approach models workers’ reliability and bias while leveraging ideas’ textual content to train a machine learning model. It is able to incorporate experts’ ratings whenever available, to supervise the model training and infer worker performance. Results show that our framework outperforms baseline methods and requires significantly less training data from experts, thus providing a viable solution for evaluating ideas at scale.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126423078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint Internal Multi-Interest Exploration and External Domain Alignment for Cross Domain Sequential Recommendation 跨领域顺序推荐的内部多兴趣探索与外部领域对齐
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583366
Weiming Liu, Xiaolin Zheng, Chaochao Chen, Jiajie Su, Xinting Liao, Mengling Hu, Yanchao Tan
Sequential Cross-Domain Recommendation (CDR) has been popularly studied to utilize different domain knowledge and users’ historical behaviors for the next-item prediction. In this paper, we focus on the cross-domain sequential recommendation problem. This commonly exist problem is rather challenging from two perspectives, i.e., the implicit user historical rating sequences are difficult in modeling and the users/items on different domains are mostly non-overlapped. Most previous sequential CDR approaches cannot solve the cross-domain sequential recommendation problem well, since (1) they cannot sufficiently depict the users’ actual preferences, (2) they cannot leverage and transfer useful knowledge across domains. To tackle the above issues, we propose joint Internal multi-interest exploration and External domain alignment for cross domain Sequential Recommendation model (IESRec). IESRec includes two main modules, i.e., internal multi-interest exploration module and external domain alignment module. To reflect the users’ diverse characteristics with multi-interests evolution, we first propose internal temporal optimal transport method in the internal multi-interest exploration module. We further propose external alignment optimal transport method in the external domain alignment module to reduce domain discrepancy for the item embeddings. Our empirical studies on Amazon datasets demonstrate that IESRec significantly outperforms the state-of-the-art models.
序贯跨领域推荐(CDR)是一种利用不同领域知识和用户历史行为进行下一项预测的方法。本文主要研究跨域顺序推荐问题。这一普遍存在的问题从两个方面具有挑战性,即隐式用户历史评级序列难以建模和不同域上的用户/项目大多不重叠。大多数以前的顺序CDR方法不能很好地解决跨领域的顺序推荐问题,因为(1)它们不能充分描述用户的实际偏好,(2)它们不能跨领域利用和转移有用的知识。为了解决上述问题,我们提出了跨领域顺序推荐模型(IESRec)的内部多兴趣探索和外部领域对齐联合方法。IESRec包括两个主要模块,即内部多兴趣探索模块和外部域对齐模块。为了反映用户多利益演化的多样性特征,我们首先在内部多利益探索模块中提出了内部时间最优传输方法。我们进一步在外部域对齐模块中提出了外部对齐最优传输方法,以减少项目嵌入的域差异。我们对亚马逊数据集的实证研究表明,IESRec显著优于最先进的模型。
{"title":"Joint Internal Multi-Interest Exploration and External Domain Alignment for Cross Domain Sequential Recommendation","authors":"Weiming Liu, Xiaolin Zheng, Chaochao Chen, Jiajie Su, Xinting Liao, Mengling Hu, Yanchao Tan","doi":"10.1145/3543507.3583366","DOIUrl":"https://doi.org/10.1145/3543507.3583366","url":null,"abstract":"Sequential Cross-Domain Recommendation (CDR) has been popularly studied to utilize different domain knowledge and users’ historical behaviors for the next-item prediction. In this paper, we focus on the cross-domain sequential recommendation problem. This commonly exist problem is rather challenging from two perspectives, i.e., the implicit user historical rating sequences are difficult in modeling and the users/items on different domains are mostly non-overlapped. Most previous sequential CDR approaches cannot solve the cross-domain sequential recommendation problem well, since (1) they cannot sufficiently depict the users’ actual preferences, (2) they cannot leverage and transfer useful knowledge across domains. To tackle the above issues, we propose joint Internal multi-interest exploration and External domain alignment for cross domain Sequential Recommendation model (IESRec). IESRec includes two main modules, i.e., internal multi-interest exploration module and external domain alignment module. To reflect the users’ diverse characteristics with multi-interests evolution, we first propose internal temporal optimal transport method in the internal multi-interest exploration module. We further propose external alignment optimal transport method in the external domain alignment module to reduce domain discrepancy for the item embeddings. Our empirical studies on Amazon datasets demonstrate that IESRec significantly outperforms the state-of-the-art models.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126874583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Multitask Peer Prediction With Task-dependent Strategies 基于任务依赖策略的多任务同伴预测
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583292
Yichi Zhang, G. Schoenebeck
Peer prediction aims to incentivize truthful reports from agents whose reports cannot be assessed with any objective ground truthful information. In the multi-task setting where each agent is asked multiple questions, a sequence of mechanisms have been proposed which are truthful — truth-telling is guaranteed to be an equilibrium, or even better, informed truthful — truth-telling is guaranteed to be one of the best-paid equilibria. However, these guarantees assume agents’ strategies are restricted to be task-independent: an agent’s report on a task is not affected by her information about other tasks. We provide the first discussion on how to design (informed) truthful mechanisms for task-dependent strategies, which allows the agents to report based on all her information on the assigned tasks. We call such stronger mechanisms (informed) omni-truthful. In particular, we propose the joint-disjoint task framework, a new paradigm which builds upon the previous penalty-bonus task framework. First, we show a natural reduction from mechanisms in the penalty-bonus task framework to mechanisms in the joint-disjoint task framework that maps every truthful mechanism to an omni-truthful mechanism. Such a reduction is non-trivial as we show that current penalty-bonus task mechanisms are not, in general, omni-truthful. Second, for a stronger truthful guarantee, we design the matching agreement (MA) mechanism which is informed omni-truthful. Finally, for the MA mechanism in the detail-free setting where no prior knowledge is assumed, we show how many tasks are required to (approximately) retain the truthful guarantees.
同伴预测的目的是激励那些无法用客观的、真实的信息来评估报告的代理人的真实报告。在多任务环境下,每个主体被问及多个问题,已经提出了一系列机制,这些机制保证说实话是一种均衡,或者更好地说,保证知情的说实话是一种报酬最高的均衡。然而,这些保证假设代理的策略被限制为任务独立的:代理对任务的报告不受她关于其他任务的信息的影响。我们首次讨论了如何为任务依赖策略设计(知情的)真实机制,该机制允许代理根据分配任务的所有信息进行报告。我们称这种更强大的机制(知情)为全真实。特别地,我们提出了联合-分离任务框架,这是在先前的惩罚-奖励任务框架的基础上建立的一种新的范式。首先,我们展示了从惩罚-奖励任务框架中的机制到联合-分离任务框架中的机制的自然还原,该框架将每个真实机制映射到一个全真实机制。这种减少不是微不足道的,因为我们表明,目前的惩罚-奖金任务机制通常不是完全真实的。其次,为了提供更强的真实保证,我们设计了知情全真实的匹配协议机制。最后,对于假设没有先验知识的无细节设置中的MA机制,我们展示了需要多少任务来(近似地)保留真实保证。
{"title":"Multitask Peer Prediction With Task-dependent Strategies","authors":"Yichi Zhang, G. Schoenebeck","doi":"10.1145/3543507.3583292","DOIUrl":"https://doi.org/10.1145/3543507.3583292","url":null,"abstract":"Peer prediction aims to incentivize truthful reports from agents whose reports cannot be assessed with any objective ground truthful information. In the multi-task setting where each agent is asked multiple questions, a sequence of mechanisms have been proposed which are truthful — truth-telling is guaranteed to be an equilibrium, or even better, informed truthful — truth-telling is guaranteed to be one of the best-paid equilibria. However, these guarantees assume agents’ strategies are restricted to be task-independent: an agent’s report on a task is not affected by her information about other tasks. We provide the first discussion on how to design (informed) truthful mechanisms for task-dependent strategies, which allows the agents to report based on all her information on the assigned tasks. We call such stronger mechanisms (informed) omni-truthful. In particular, we propose the joint-disjoint task framework, a new paradigm which builds upon the previous penalty-bonus task framework. First, we show a natural reduction from mechanisms in the penalty-bonus task framework to mechanisms in the joint-disjoint task framework that maps every truthful mechanism to an omni-truthful mechanism. Such a reduction is non-trivial as we show that current penalty-bonus task mechanisms are not, in general, omni-truthful. Second, for a stronger truthful guarantee, we design the matching agreement (MA) mechanism which is informed omni-truthful. Finally, for the MA mechanism in the detail-free setting where no prior knowledge is assumed, we show how many tasks are required to (approximately) retain the truthful guarantees.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126295337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Online Advertising in Ukraine and Russia During the 2022 Russian Invasion 2022年俄罗斯入侵期间乌克兰和俄罗斯的在线广告
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583484
Christina Yeung, U. Iqbal, Y. O'Neil, Tadayoshi Kohno, Franziska Roesner
Online ads are a major source of information on the web. The mass reach of online advertising is often leveraged for information dissemination, at times with an objective to influence public opinion (e.g., election misinformation). We hypothesized that online advertising, due to its reach and potential, might have been used to spread information around the 2022 Russian invasion of Ukraine. Thus, to understand the online ad ecosystem during this conflict, we conducted a five-month long large-scale measurement study of online advertising in Ukraine, Russia, and the US. We studied advertising trends of ad platforms that delivered ads in Ukraine, Russia, and the US and conducted an in-depth qualitative analysis of the conflict-related ad content. We found that prominent US-based advertisers continued to support Russian websites, and a portion of online ads were used to spread conflict-related information, including protesting the invasion, and spreading awareness, which might have otherwise potentially been censored in Russia.
在线广告是网络信息的主要来源。网上广告的广泛覆盖范围常常被用来传播信息,有时目的是影响公众舆论(例如,选举错误信息)。我们假设,由于网络广告的覆盖范围和潜力,它可能被用来传播有关2022年俄罗斯入侵乌克兰的信息。因此,为了了解这场冲突中的在线广告生态系统,我们对乌克兰、俄罗斯和美国的在线广告进行了为期五个月的大规模测量研究。我们研究了在乌克兰、俄罗斯和美国投放广告的广告平台的广告趋势,并对与冲突相关的广告内容进行了深入的定性分析。我们发现,美国著名的广告商继续支持俄罗斯网站,一部分在线广告被用来传播与冲突有关的信息,包括抗议入侵和传播意识,否则这些信息在俄罗斯可能会被审查。
{"title":"Online Advertising in Ukraine and Russia During the 2022 Russian Invasion","authors":"Christina Yeung, U. Iqbal, Y. O'Neil, Tadayoshi Kohno, Franziska Roesner","doi":"10.1145/3543507.3583484","DOIUrl":"https://doi.org/10.1145/3543507.3583484","url":null,"abstract":"Online ads are a major source of information on the web. The mass reach of online advertising is often leveraged for information dissemination, at times with an objective to influence public opinion (e.g., election misinformation). We hypothesized that online advertising, due to its reach and potential, might have been used to spread information around the 2022 Russian invasion of Ukraine. Thus, to understand the online ad ecosystem during this conflict, we conducted a five-month long large-scale measurement study of online advertising in Ukraine, Russia, and the US. We studied advertising trends of ad platforms that delivered ads in Ukraine, Russia, and the US and conducted an in-depth qualitative analysis of the conflict-related ad content. We found that prominent US-based advertisers continued to support Russian websites, and a portion of online ads were used to spread conflict-related information, including protesting the invasion, and spreading awareness, which might have otherwise potentially been censored in Russia.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"50 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120915905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the ACM Web Conference 2023
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1