Click-through rate (CTR) prediction plays an important role in online advertising and ad recommender systems. In the past decade, maximizing CTR has been the main focus of model development, and researchers and practitioners have proposed various models and solutions to enhance the effectiveness of CTR prediction. Most of the existing literature focuses on capturing either implicit or explicit feature interactions. Although some studies successfully capture implicit interactions, explicitly extracting both low-order and high-order feature interactions remains challenging. Unnecessary and irrelevant features may increase computation time and degrade prediction performance. Furthermore, certain features may perform well with specific predictive models while underperforming with others. Feature distributions may also fluctuate due to traffic variations. Most importantly, in live production environments resources are limited, and inference time is just as crucial as training time. For all these reasons, feature selection is one of the most important factors in enhancing CTR prediction model performance. Simple filter-based feature selection algorithms neither perform well nor suffice; an effective and efficient feature selection algorithm is needed to consistently filter the most useful features during live CTR prediction. In this paper, we propose a heuristic algorithm named Neighborhood Search with Heuristic-based Feature Selection (NeSHFS) to enhance CTR prediction performance while reducing dimensionality and training time costs. We conduct comprehensive experiments on three public datasets to validate the efficiency and effectiveness of our proposed solution.
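The abstract does not spell out the search procedure, but a wrapper-style feature selection driven by neighborhood search can be sketched roughly as follows; the hill-climbing move, the logistic-regression scorer, and the AUC objective are illustrative assumptions, not the NeSHFS algorithm itself.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

def score_subset(features, X_tr, y_tr, X_val, y_val):
    # Score a candidate feature subset by validation AUC, a common proxy for CTR model quality.
    model = LogisticRegression(max_iter=1000).fit(X_tr[:, features], y_tr)
    return roc_auc_score(y_val, model.predict_proba(X_val[:, features])[:, 1])

def neighborhood_search(X_tr, y_tr, X_val, y_val, n_iter=50, seed=0):
    rng = np.random.default_rng(seed)
    n_features = X_tr.shape[1]
    current = list(range(n_features))                     # start from the full feature set
    best = score_subset(current, X_tr, y_tr, X_val, y_val)
    for _ in range(n_iter):
        candidate = current.copy()
        f = int(rng.integers(n_features))                 # neighbor: toggle one feature in or out
        if f in candidate and len(candidate) > 1:
            candidate.remove(f)
        elif f not in candidate:
            candidate.append(f)
        s = score_subset(candidate, X_tr, y_tr, X_val, y_val)
        if s >= best:                                     # keep improving (or equal) neighbors
            current, best = candidate, s
    return current, best

A subset selected this way can then be fed to whichever CTR model is actually deployed, which is where the dimensionality and training-time savings mentioned above would come from.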
NeSHFS: Neighborhood Search with Heuristic-based Feature Selection for Click-Through Rate Prediction. Dogukan Aksu, Ismail Hakki Toroslu, Hasan Davulcu. arXiv:2409.08703, 2024-09-13.
Over the years, Music Information Retrieval (MIR) has proposed various models pretrained on large amounts of music data. Through transfer learning, these pretrained backend models have proven effective on a broad spectrum of downstream tasks, including auto-tagging and genre classification. However, MIR papers generally do not explore the efficiency of pretrained models for Music Recommender Systems (MRS), and the Recommender Systems community tends to favour traditional end-to-end neural network learning over these models. Our research addresses this gap and evaluates the applicability of six pretrained backend models (MusicFM, Music2Vec, MERT, EncodecMAE, Jukebox, and MusiCNN) in the context of MRS. We assess their performance using three recommendation models: K-nearest neighbours (KNN), a shallow neural network, and BERT4Rec. Our findings suggest that pretrained audio representations exhibit significant performance variability between traditional MIR tasks and MRS, indicating that the valuable aspects of musical information captured by backend models may differ depending on the task. This study establishes a foundation for further exploration of pretrained audio representations to enhance music recommendation systems.
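As an illustration of how such pretrained representations can feed a simple recommender, the sketch below builds a user profile from precomputed track embeddings and retrieves the nearest unheard tracks by cosine similarity; the embedding matrix and interaction indices are assumed inputs, and this KNN-style setup is only one of the three recommendation models mentioned above.

import numpy as np

def recommend_knn(track_emb, user_history, k=10):
    # track_emb: (n_tracks, d) embeddings from a pretrained audio model (assumed precomputed)
    # user_history: indices of tracks the user has interacted with
    emb = track_emb / np.linalg.norm(track_emb, axis=1, keepdims=True)  # work in cosine space
    profile = emb[user_history].mean(axis=0)                            # simple user profile vector
    scores = emb @ profile
    scores[user_history] = -np.inf                                      # exclude already-heard tracks
    return np.argsort(-scores)[:k]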
Comparative Analysis of Pretrained Audio Representations in Music Recommender Systems. Yan-Martin Tamm, Anna Aljanaki. arXiv:2409.08987, 2024-09-13.
Hang Pan, Shuxian Bi, Wenjie Wang, Haoxuan Li, Peng Wu, Fuli Feng, Xiangnan He
Recommending items that cater solely to users' historical interests narrows users' horizons. Recent works have considered steering target users beyond their historical interests by directly adjusting the items exposed to them. However, the items recommended for direct steering might not align perfectly with the evolution of users' interests, detrimentally affecting the target users' experience. To avoid this issue, we propose a new task named Proactive Recommendation in Social Networks (PRSN), which indirectly steers users' interests by utilizing the influence of social neighbors, i.e., by adjusting the exposure of a target item to the target users' neighbors. The key to PRSN lies in answering an interventional question: what would a target user's feedback on a target item be if the item were exposed to different neighbors of the user? To answer this question, we resort to causal inference and formalize PRSN as: (1) estimating a user's potential feedback on an item under the network interference caused by the item's exposure to the user's neighbors; and (2) adjusting the exposure of a target item to the target users' neighbors to trade off steering performance against the damage to the neighbors' experience. To this end, we propose a Neighbor Interference Recommendation (NIRec) framework with two key modules: (1) an interference representation-based estimation module for modeling potential feedback; and (2) a post-learning-based optimization module that optimizes a target item's exposure by greedy search to trade off steering performance and the neighbors' experience. We conduct extensive semi-simulation experiments based on three real-world datasets, validating the steering effectiveness of NIRec.
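The post-learning optimization step can be illustrated with a small greedy sketch: given per-neighbor estimates of steering gain and experience damage (assumed here to come from the interference-aware estimation module, with a made-up trade-off weight lam), the exposure set is grown one neighbor at a time.

def greedy_exposure(neighbors, steering_gain, experience_cost, budget, lam=0.5):
    # steering_gain[n]: estimated lift in the target user's feedback if neighbor n sees the item
    # experience_cost[n]: estimated harm to neighbor n's own experience
    chosen = []
    for _ in range(budget):
        best, best_util = None, 0.0
        for n in neighbors:
            if n in chosen:
                continue
            util = steering_gain[n] - lam * experience_cost[n]  # trade off steering vs. neighbor experience
            if util > best_util:
                best, best_util = n, util
        if best is None:                                        # stop when no neighbor adds positive utility
            break
        chosen.append(best)
    return chosen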
Proactive Recommendation in Social Networks: Steering User Interest via Neighbor Influence. arXiv:2409.08934, 2024-09-13.
With the growing success of Large Language Models (LLMs) in information-seeking scenarios, search engines are now adopting generative approaches to provide answers along with in-line citations as attribution. While existing work focuses mainly on attributed question answering, in this paper we target information-seeking scenarios, which are often more challenging due to the open-ended nature of the queries and the size of the label space, i.e., the diversity of candidate attributed answers per query. We propose a reproducible framework to evaluate and benchmark attributed information seeking, using any backbone LLM and different architectural designs: (1) Generate, (2) Retrieve then Generate, and (3) Generate then Retrieve. Experiments using HAGRID, an attributed information-seeking dataset, show the impact of the different scenarios on both the correctness and attributability of answers.
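The three architectural designs can be expressed as different wirings of an LLM and a retriever; in this sketch, llm and retriever are placeholder callables, and the prompt strings and top-k interface are assumptions rather than the framework's actual API.

def generate(llm, query):
    # (1) Generate: the LLM answers directly from parametric knowledge, with no retrieved evidence.
    return llm(f"Answer with citations: {query}"), []

def retrieve_then_generate(llm, retriever, query, k=5):
    # (2) Retrieve then Generate: ground the answer in top-k retrieved passages.
    passages = retriever(query, k)
    prompt = "Answer using only these passages:\n" + "\n".join(passages) + f"\n\nQuestion: {query}"
    return llm(prompt), passages

def generate_then_retrieve(llm, retriever, query, k=5):
    # (3) Generate then Retrieve: answer first, then retrieve passages to attribute the claims.
    answer = llm(f"Answer the question: {query}")
    return answer, retriever(answer, k)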
An Evaluation Framework for Attributed Information Retrieval using Large Language Models. Hanane Djeddal, Pierre Erbacher, Raouf Toukal, Laure Soulier, Karen Pinel-Sauvagnat, Sophia Katrenko, Lynda Tamine. arXiv:2409.08014, 2024-09-12.
Many organizations rely on Threat Intelligence (TI) feeds to assess the risk associated with security threats. Due to the volume and heterogeneity of data, it is prohibitive to manually analyze the threat information available in different loosely structured TI feeds. Thus, there is a need to develop automated methods to vet and extract actionable information from TI feeds. To this end, we present a machine learning pipeline to automatically detect vulnerability exploitation from TI feeds. We first model threat vocabulary in loosely structured TI feeds using state-of-the-art embedding techniques (Doc2Vec and BERT) and then use it to train a supervised machine learning classifier to detect exploitation of security vulnerabilities. We use our approach to identify exploitation events in 191 different TI feeds. Our longitudinal evaluation shows that it is able to accurately identify exploitation events from TI feeds only using past data for training and even on TI feeds withheld from training. Our proposed approach is useful for a variety of downstream tasks such as data-driven vulnerability risk assessment.
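A minimal version of the Doc2Vec branch of this pipeline might look like the sketch below, pairing gensim embeddings with a scikit-learn classifier; the tokenized feed entries, labels, and hyperparameters are all assumed, and the BERT branch is omitted.

from gensim.models.doc2vec import Doc2Vec, TaggedDocument
from sklearn.linear_model import LogisticRegression

def train_exploitation_detector(docs, labels):
    # docs: tokenized TI-feed entries; labels: 1 if the entry reports exploitation of a vulnerability
    tagged = [TaggedDocument(words=d, tags=[i]) for i, d in enumerate(docs)]
    d2v = Doc2Vec(tagged, vector_size=100, min_count=2, epochs=40)   # models the threat vocabulary
    X = [d2v.infer_vector(d) for d in docs]
    clf = LogisticRegression(max_iter=1000).fit(X, labels)           # supervised exploitation classifier
    return d2v, clf

def exploitation_probability(d2v, clf, doc_tokens):
    return clf.predict_proba([d2v.infer_vector(doc_tokens)])[0, 1]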
Harnessing TI Feeds for Exploitation Detection. Kajal Patel, Zubair Shafiq, Mateus Nogueira, Daniel Sadoc Menasché, Enrico Lovat, Taimur Kashif, Ashton Woiwood, Matheus Martins. arXiv:2409.07709, 2024-09-12.
Gabriel de Souza P. Moreira, Ronay Ak, Benedikt Schifferer, Mengyao Xu, Radek Osmulski, Even Oldridge
Ranking models play a crucial role in enhancing the overall accuracy of text retrieval systems. These multi-stage systems typically use either dense embedding models or sparse lexical indices to retrieve relevant passages for a given query, followed by ranking models that refine the ordering of the candidate passages by their relevance to the query. This paper benchmarks various publicly available ranking models and examines their impact on ranking accuracy. We focus on text retrieval for question-answering tasks, a common use case for Retrieval-Augmented Generation systems. Our evaluation benchmarks include models, some of which are commercially viable for industrial applications. We introduce a state-of-the-art ranking model, NV-RerankQA-Mistral-4B-v3, which achieves a significant accuracy increase of ~14% compared to pipelines with other rerankers. We also provide an ablation study comparing the fine-tuning of ranking models with different sizes, losses, and self-attention mechanisms. Finally, we discuss the challenges of text retrieval pipelines with ranking models in real-world industry applications, in particular the trade-offs among model size, ranking accuracy, and system requirements such as indexing and serving latency/throughput.
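The retrieve-then-rerank pattern evaluated here can be sketched with off-the-shelf components; the two model names below are public stand-ins, not NV-RerankQA-Mistral-4B-v3 or the paper's exact pipeline.

from sentence_transformers import SentenceTransformer, CrossEncoder, util

retriever = SentenceTransformer("all-MiniLM-L6-v2")                      # stage 1: dense retrieval
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")          # stage 2: reranking

def retrieve_and_rerank(query, passages, k_retrieve=50, k_final=5):
    p_emb = retriever.encode(passages, convert_to_tensor=True)
    q_emb = retriever.encode(query, convert_to_tensor=True)
    hits = util.semantic_search(q_emb, p_emb, top_k=k_retrieve)[0]       # candidate passages
    candidates = [passages[h["corpus_id"]] for h in hits]
    scores = reranker.predict([(query, p) for p in candidates])          # query-passage relevance scores
    return sorted(zip(candidates, scores), key=lambda x: -x[1])[:k_final]

The trade-off discussed above shows up directly in this layout: a larger reranker improves the final ordering but adds serving latency, since it scores every candidate pair.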
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG. arXiv:2409.07691, 2024-09-12.
Sümeyye Öztürk, Ahmed Burak Ercan, Resul Tugay, Şule Gündüz Öğüdücü
In today's world of globalized commerce, cross-market recommendation systems (CMRs) are crucial for providing personalized user experiences across diverse market segments. However, traditional recommendation algorithms struggle with market specificity and data sparsity, especially in new or emerging markets. In this paper, we propose the CrossGR model, which utilizes Graph Isomorphism Networks (GINs) to improve CMR systems. It outperforms existing benchmarks on the NDCG@10 and HR@10 metrics, demonstrating adaptability and accuracy across diverse market segments and making it well-suited to the complexities of cross-market recommendation tasks. Its robustness is demonstrated by consistent performance across different evaluation timeframes, indicating its potential to cater to evolving market trends and user preferences. Our findings suggest that GINs represent a promising direction for CMRs, paving the way for more sophisticated, personalized, and context-aware recommendation systems in the dynamic landscape of global e-commerce.
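For reference, the core GIN update that such a model builds on, h_v <- MLP((1 + eps) * h_v + sum of neighbor embeddings), can be written as a small PyTorch module; the layer sizes and dense adjacency representation are assumptions for illustration, not the CrossGR architecture.

import torch
import torch.nn as nn

class GINLayer(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.eps = nn.Parameter(torch.zeros(1))
        self.mlp = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, h, adj):
        # h: (n_nodes, dim) node embeddings; adj: (n_nodes, n_nodes) binary adjacency matrix
        neighbor_sum = adj @ h                              # sum-aggregation over neighbors
        return self.mlp((1 + self.eps) * h + neighbor_sum)

# e.g. users and items as nodes of a cross-market interaction graph:
# h_next = GINLayer(64)(node_features, adjacency)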
Enhancing Cross-Market Recommendation System with Graph Isomorphism Networks: A Novel Approach to Personalized User Experience. arXiv:2409.07850, 2024-09-12.
In hierarchical cognitive radio networks, edge or cloud servers utilize the data collected by edge devices for modulation classification, which, however, faces problems of transmission overhead, data privacy, and computation load. In this article, an edge learning (EL) based framework that jointly mobilizes the edge device and the edge server for intelligent co-inference is proposed to realize collaborative automatic modulation classification (C-AMC) between them. A spectrum semantic compression neural network (SSCNet) with a lightweight structure is designed for the edge device to compress the collected raw data into a compact semantic message that is then sent to the edge server via the wireless channel. On the edge server side, a modulation classification neural network (MCNet) combining bidirectional long short-term memory (Bi-LSTM) and multi-head attention layers is elaborated to determine the modulation type from the noisy semantic message. By leveraging the computation resources of both the edge device and the edge server, high transmission overhead and risks of data privacy leakage are avoided. The simulation results verify the effectiveness of the proposed C-AMC framework, which significantly reduces model size and computational complexity.
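A toy stand-in for the server-side classifier, combining a Bi-LSTM with multi-head self-attention as described, could look like the following; the feature dimension, sequence length, and number of modulation classes are assumptions, not the MCNet configuration.

import torch
import torch.nn as nn

class MCNetSketch(nn.Module):
    def __init__(self, feat_dim=32, hidden=64, n_heads=4, n_classes=11):
        super().__init__()
        self.bilstm = nn.LSTM(feat_dim, hidden, batch_first=True, bidirectional=True)
        self.attn = nn.MultiheadAttention(2 * hidden, n_heads, batch_first=True)
        self.head = nn.Linear(2 * hidden, n_classes)

    def forward(self, x):                    # x: (batch, seq_len, feat_dim) noisy semantic message
        h, _ = self.bilstm(x)                # bidirectional temporal features
        a, _ = self.attn(h, h, h)            # multi-head self-attention over time steps
        return self.head(a.mean(dim=1))      # pool and predict the modulation type

logits = MCNetSketch()(torch.randn(8, 16, 32))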
Collaborative Automatic Modulation Classification via Deep Edge Inference for Hierarchical Cognitive Radio Networks. Chaowei He, Peihao Dong, Fuhui Zhou, Qihui Wu. arXiv:2409.07946, 2024-09-12.
Chaoqun Yang, Wei Yuan, Liang Qu, Thanh Tam Nguyen
Federated recommender systems (FedRecs) have emerged as a popular research direction for protecting users' privacy in on-device recommendations. In FedRecs, users keep their data locally and only contribute their local collaborative information by uploading model parameters to a central server. While this rigid framework protects users' raw data during training, it severely compromises the recommendation model's performance due to the following reasons: (1) Due to the power law distribution nature of user behavior data, individual users have few data points to train a recommendation model, resulting in uploaded model updates that may be far from optimal; (2) As each user's uploaded parameters are learned from local data, which lacks global collaborative information, relying solely on parameter aggregation methods such as FedAvg to fuse global collaborative information may be suboptimal. To bridge this performance gap, we propose a novel federated recommendation framework, PDC-FRS. Specifically, we design a privacy-preserving data contribution mechanism that allows users to share their data with a differential privacy guarantee. Based on the shared but perturbed data, an auxiliary model is trained in parallel with the original federated recommendation process. This auxiliary model enhances FedRec by augmenting each user's local dataset and integrating global collaborative information. To demonstrate the effectiveness of PDC-FRS, we conduct extensive experiments on two widely used recommendation datasets. The empirical results showcase the superiority of PDC-FRS compared to baseline methods.
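One simple way to realize a differentially private data contribution, used here purely as an illustration of the idea rather than the mechanism PDC-FRS actually adopts, is randomized response applied to a user's binary interaction vector before it leaves the device; epsilon and the bit-flipping scheme are assumptions.

import numpy as np

def randomized_response(interactions, epsilon, seed=0):
    # Keep each bit with probability p = e^eps / (1 + e^eps) and flip it otherwise,
    # which satisfies eps-local differential privacy per bit.
    rng = np.random.default_rng(seed)
    p_keep = np.exp(epsilon) / (1.0 + np.exp(epsilon))
    flip = rng.random(interactions.shape) > p_keep
    return np.where(flip, 1 - interactions, interactions)

shared = randomized_response(np.array([0, 1, 1, 0, 0, 1]), epsilon=2.0)

The server can then train the auxiliary model on such perturbed vectors and combine it with the usual parameter aggregation of the federated recommender.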
PDC-FRS: Privacy-preserving Data Contribution for Federated Recommender System. arXiv:2409.07773, 2024-09-12.
Savvina Daniil, Manel Slokom, Mirjam Cuper, Cynthia C. S. Liem, Jacco van Ossenbruggen, Laura Hollink
Statements on the propagation of bias by recommender systems are often hard to verify or falsify. Research on bias tends to draw from a small pool of publicly available datasets and is therefore bound by their specific properties. Additionally, implementation choices are often not explicitly described or motivated in research, while they may have an effect on bias propagation. In this paper, we explore the challenges of measuring and reporting popularity bias. We showcase the impact of data properties and algorithm configurations on popularity bias by combining synthetic data with well known recommender systems frameworks that implement UserKNN. First, we identify data characteristics that might impact popularity bias, based on the functionality of UserKNN. Accordingly, we generate various datasets that combine these characteristics. Second, we locate UserKNN configurations that vary across implementations in literature. We evaluate popularity bias for five synthetic datasets and five UserKNN configurations, and offer insights on their joint effect. We find that, depending on the data characteristics, various UserKNN configurations can lead to different conclusions regarding the propagation of popularity bias. These results motivate the need for explicitly addressing algorithmic configuration and data properties when reporting and interpreting bias in recommender systems.
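One of the reporting choices highlighted here is how popularity bias is actually quantified; a common (but by no means the only) metric is the average popularity of recommended items, sketched below with assumed inputs.

import numpy as np

def average_recommendation_popularity(recommendations, train_item_ids):
    # recommendations: dict user -> list of recommended item ids
    # train_item_ids: array of item ids observed in the training interactions
    item_pop = np.bincount(train_item_ids)                      # popularity = interaction count per item
    per_user = [item_pop[items].mean() for items in recommendations.values()]
    return float(np.mean(per_user))                             # compare against item_pop.mean() for bias

Whether this, a Gini coefficient, or a long-tail coverage measure is reported is exactly the kind of configuration choice the study argues should be made explicit.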
On the challenges of studying bias in Recommender Systems: A UserKNN case study. arXiv:2409.08046, 2024-09-12.