首页 > 最新文献

ACM Transactions on Intelligent Systems and Technology最新文献

英文 中文
A Survey of Trustworthy Federated Learning: Issues, Solutions, and Challenges 值得信赖的联合学习调查:问题、解决方案和挑战
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-23 DOI: 10.1145/3678181
Yifei Zhang, Dun Zeng, Jinglong Luo, Xinyu Fu, Guanzhong Chen, Zenglin Xu, Irwin King
Trustworthy Artificial Intelligence (TAI) has proven invaluable in curbing potential negative repercussions tied to AI applications. Within the TAI spectrum, Federated Learning (FL) emerges as a promising solution to safeguard personal information in distributed settings across a multitude of practical contexts. However, the realm of FL is not without its challenges. Especially worrisome are adversarial attacks targeting its algorithmic robustness and systemic confidentiality. Moreover, the presence of biases and opacity in prediction outcomes further complicates FL’s broader adoption. Consequently, there is a growing expectation for FL to instill trust. To address this, we chart out a comprehensive road-map for Trustworthy Federated Learning (TFL) and provide an overview of existing efforts across four pivotal dimensions: Privacy & Security , Robustness , Fairness , and Explainability . For each dimension, we identify potential pitfalls that might undermine TFL and present a curated selection of defensive strategies, enriched by a discourse on technical solutions tailored for TFL. Furthermore, we present potential challenges and future directions to be explored for in-depth TFL research with broader impacts.
事实证明,可信赖的人工智能(TAI)在遏制与人工智能应用相关的潜在负面影响方面具有重要价值。在值得信赖的人工智能(TAI)领域中,联邦学习(FL)是一种前景广阔的解决方案,可在多种实际情况下保护分布式环境中的个人信息。然而,FL 领域并非没有挑战。尤其令人担忧的是针对其算法鲁棒性和系统保密性的对抗性攻击。此外,预测结果中存在的偏差和不透明也使 FL 的广泛应用变得更加复杂。因此,人们越来越期待 FL 能够赢得信任。为了解决这个问题,我们为值得信赖的联合学习(TFL)绘制了一个全面的路线图,并概述了四个关键方面的现有努力:隐私与安全、稳健性、公平性和可解释性。针对每个维度,我们确定了可能破坏 TFL 的潜在隐患,并介绍了经过精心挑选的防御策略,同时还讨论了为 TFL 量身定制的技术解决方案。此外,我们还提出了具有广泛影响的 TFL 深入研究的潜在挑战和未来探索方向。
{"title":"A Survey of Trustworthy Federated Learning: Issues, Solutions, and Challenges","authors":"Yifei Zhang, Dun Zeng, Jinglong Luo, Xinyu Fu, Guanzhong Chen, Zenglin Xu, Irwin King","doi":"10.1145/3678181","DOIUrl":"https://doi.org/10.1145/3678181","url":null,"abstract":"\u0000 Trustworthy Artificial Intelligence (TAI) has proven invaluable in curbing potential negative repercussions tied to AI applications. Within the TAI spectrum, Federated Learning (FL) emerges as a promising solution to safeguard personal information in distributed settings across a multitude of practical contexts. However, the realm of FL is not without its challenges. Especially worrisome are adversarial attacks targeting its algorithmic robustness and systemic confidentiality. Moreover, the presence of biases and opacity in prediction outcomes further complicates FL’s broader adoption. Consequently, there is a growing expectation for FL to instill trust. To address this, we chart out a comprehensive road-map for\u0000 Trustworthy Federated Learning (TFL)\u0000 and provide an overview of existing efforts across four pivotal dimensions:\u0000 Privacy & Security\u0000 ,\u0000 Robustness\u0000 ,\u0000 Fairness\u0000 , and\u0000 Explainability\u0000 . For each dimension, we identify potential pitfalls that might undermine TFL and present a curated selection of defensive strategies, enriched by a discourse on technical solutions tailored for TFL. Furthermore, we present potential challenges and future directions to be explored for in-depth TFL research with broader impacts.\u0000","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141812628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepSneak: User GPS Trajectory Reconstruction from Federated Route Recommendation Models DeepSneak:从联合路线推荐模型重构用户 GPS 轨迹
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-22 DOI: 10.1145/3670412
Thirasara Ariyarathna, Meisam Mohommady, Hye-young Paik, S. Kanhere
Decentralized machine learning, such as Federated Learning (FL), is widely adopted in many application domains. Especially in domains like recommendation systems, sharing gradients instead of private data has recently caught the research community’s attention. Personalized travel route recommendation utilizes users’ location data to recommend optimal travel routes. Location data is extremely privacy sensitive, presenting increased risks of exposing behavioural patterns and demographic attributes. FL for route recommendation can mitigate the sharing of location data. However, this paper shows that an adversary can recover the user trajectories used to train the federated recommendation models with high proximity accuracy. To this effect, we propose a novel attack called DeepSneak, which uses shared gradients obtained from global model training in FL to reconstruct private user trajectories. We formulate the attack as a regression problem and train a generative model by minimizing the distance between gradients. We validate the success of DeepSneak on two real-world trajectory datasets. The results show that we can recover the location trajectories of users with reasonable spatial and semantic accuracy.
去中心化机器学习(如联盟学习(FL))已在许多应用领域得到广泛采用。特别是在推荐系统等领域,共享梯度数据而不是私人数据最近引起了研究界的关注。个性化旅行路线推荐利用用户的位置数据来推荐最佳旅行路线。位置数据对隐私极为敏感,会增加暴露行为模式和人口属性的风险。用于路线推荐的 FL 可以减少位置数据的共享。然而,本文表明,敌方可以恢复用于训练联合推荐模型的用户轨迹,而且接近精度很高。为此,我们提出了一种名为 DeepSneak 的新型攻击,它利用从 FL 中的全局模型训练中获得的共享梯度来重建私有用户轨迹。我们将该攻击表述为回归问题,并通过最小化梯度间的距离来训练生成模型。我们在两个真实世界的轨迹数据集上验证了 DeepSneak 的成功。结果表明,我们能以合理的空间和语义精度恢复用户的位置轨迹。
{"title":"DeepSneak: User GPS Trajectory Reconstruction from Federated Route Recommendation Models","authors":"Thirasara Ariyarathna, Meisam Mohommady, Hye-young Paik, S. Kanhere","doi":"10.1145/3670412","DOIUrl":"https://doi.org/10.1145/3670412","url":null,"abstract":"Decentralized machine learning, such as Federated Learning (FL), is widely adopted in many application domains. Especially in domains like recommendation systems, sharing gradients instead of private data has recently caught the research community’s attention. Personalized travel route recommendation utilizes users’ location data to recommend optimal travel routes. Location data is extremely privacy sensitive, presenting increased risks of exposing behavioural patterns and demographic attributes. FL for route recommendation can mitigate the sharing of location data. However, this paper shows that an adversary can recover the user trajectories used to train the federated recommendation models with high proximity accuracy. To this effect, we propose a novel attack called DeepSneak, which uses shared gradients obtained from global model training in FL to reconstruct private user trajectories. We formulate the attack as a regression problem and train a generative model by minimizing the distance between gradients. We validate the success of DeepSneak on two real-world trajectory datasets. The results show that we can recover the location trajectories of users with reasonable spatial and semantic accuracy.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141815799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WC-SBERT: Zero-Shot Topic Classification Using SBERT and Light Self-Training on Wikipedia Categories WC-SBERT:使用 SBERT 和维基百科类别的轻度自我训练进行零镜头主题分类
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-18 DOI: 10.1145/3678183
Te-Yu Chi, Jyh-Shing Roger Jang
In NLP (natural language processing), zero-shot topic classification requires machines to understand the contextual meanings of texts in a downstream task without using the corresponding labeled texts for training, which is highly desirable for various applications [2]. In this paper, we propose a novel approach to construct a zero-shot task-specific model called WC-SBERT with satisfactory performance. The proposed approach is highly efficient since it uses light self-training requiring target labels (target class names of downstream tasks) only, which is distinct from other research that uses both the target labels and the unlabeled texts for training. In particular, during the pre-training stage, WC-SBERT uses contrastive learning with the multiple negative ranking loss [9] to construct the pre-trained model based on the similarity between Wiki categories. For the self-training stage, online contrastive loss is utilized to reduce the distance between a target label and Wiki categories of similar Wiki pages to the label. Experimental results indicate that compared to existing self-training models, WC-SBERT achieves rapid inference on approximately 6.45 million Wiki text entries by utilizing pre-stored Wikipedia text embeddings, significantly reducing inference time per sample by a factor of 2,746 to 16,746. During the fine-tuning step, the time required for each sample is reduced by a factor of 23 to 67. Overall, the total training time shows a maximum reduction of 27.5 times across different datasets. Most importantly, our model has achieved SOTA (state-of-the-art) accuracy on two of the three commonly used datasets for evaluating zero-shot classification, namely the AG News (0.84) and Yahoo! Answers (0.64) datasets. The code for WC-SBERT is publicly available on GitHub 1 , and the dataset can also be accessed on Hugging Face 2 .
在 NLP(自然语言处理)中,零镜头主题分类要求机器在下游任务中理解文本的上下文含义,而无需使用相应的标记文本进行训练,这在各种应用中都是非常理想的[2]。在本文中,我们提出了一种新颖的方法来构建一种名为 WC-SBERT 的零点任务特定模型,并取得了令人满意的效果。与其他同时使用目标标签和未标签文本进行训练的研究不同,本文提出的方法只需目标标签(下游任务的目标类名)即可进行轻量级自我训练,因此具有很高的效率。具体而言,在预训练阶段,WC-SBERT 利用多重负排序损失(multiple negative ranking loss)[9]进行对比学习,根据维基类别之间的相似性构建预训练模型。在自我训练阶段,则利用在线对比损失来减小目标标签与与该标签相似的维基页面类别之间的距离。实验结果表明,与现有的自我训练模型相比,WC-SBERT 利用预先存储的维基百科文本嵌入,在约 645 万个维基文本条目上实现了快速推理,将每个样本的推理时间显著减少了 2,746 到 16,746 倍。在微调步骤中,每个样本所需的时间减少了 23 倍,达到 67 倍。总体而言,在不同的数据集上,总训练时间最多减少了 27.5 倍。最重要的是,我们的模型在三个常用于评估零点分类的数据集中的两个数据集上达到了 SOTA(最先进)的准确率,即 AG 新闻(0.84)和雅虎答案(0.64)数据集。WC-SBERT 的代码可在 GitHub 1 上公开获取,数据集也可在 Hugging Face 2 上访问。
{"title":"WC-SBERT: Zero-Shot Topic Classification Using SBERT and Light Self-Training on Wikipedia Categories","authors":"Te-Yu Chi, Jyh-Shing Roger Jang","doi":"10.1145/3678183","DOIUrl":"https://doi.org/10.1145/3678183","url":null,"abstract":"\u0000 In NLP (natural language processing), zero-shot topic classification requires machines to understand the contextual meanings of texts in a downstream task without using the corresponding labeled texts for training, which is highly desirable for various applications [2]. In this paper, we propose a novel approach to construct a zero-shot task-specific model called WC-SBERT with satisfactory performance. The proposed approach is highly efficient since it uses light self-training requiring target labels (target class names of downstream tasks) only, which is distinct from other research that uses both the target labels and the unlabeled texts for training. In particular, during the pre-training stage, WC-SBERT uses contrastive learning with the multiple negative ranking loss [9] to construct the pre-trained model based on the similarity between Wiki categories. For the self-training stage, online contrastive loss is utilized to reduce the distance between a target label and Wiki categories of similar Wiki pages to the label. Experimental results indicate that compared to existing self-training models, WC-SBERT achieves rapid inference on approximately 6.45 million Wiki text entries by utilizing pre-stored Wikipedia text embeddings, significantly reducing inference time per sample by a factor of 2,746 to 16,746. During the fine-tuning step, the time required for each sample is reduced by a factor of 23 to 67. Overall, the total training time shows a maximum reduction of 27.5 times across different datasets. Most importantly, our model has achieved SOTA (state-of-the-art) accuracy on two of the three commonly used datasets for evaluating zero-shot classification, namely the AG News (0.84) and Yahoo! Answers (0.64) datasets. The code for WC-SBERT is publicly available on GitHub\u0000 \u0000 1\u0000 \u0000 , and the dataset can also be accessed on Hugging Face\u0000 \u0000 2\u0000 \u0000 .\u0000","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141825634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks 利用循环一致性对抗网络实现自监督文本风格转移
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-18 DOI: 10.1145/3678179
Moreno La Quatra, Giuseppe Gallipoli, Luca Cagliero
Text Style Transfer (TST) is a relevant branch of natural language processing that aims to control the style attributes of a piece of text while preserving its original content. To address TST in the absence of parallel data, Cycle-consistent Generative Adversarial Networks (CycleGANs) have recently emerged as promising solutions. Existing CycleGAN-based TST approaches suffer from the following limitations: (1) They apply self-supervision, based on the cycle-consistency principle, in the latent space. This approach turns out to be less robust to mixed-style inputs, i.e., when the source text is partly in the original and partly in the target style; (2) Generators and discriminators rely on recurrent networks, which are exposed to known issues with long-term text dependencies; (3) The target style is weakly enforced, as the discriminator distinguishes real from fake sentences without explicitly accounting for the generated text’s style. We propose a new CycleGAN-based TST approach that applies self-supervision directly at the sequence level to effectively handle mixed-style inputs and employs Transformers to leverage the attention mechanism for both text encoding and decoding. We also employ a pre-trained style classifier to guide the generation of text in the target style while maintaining the original content’s meaning. The experimental results achieved on the formality and sentiment transfer tasks show that our approach outperforms existing ones, both CycleGAN-based and not (including an open-source Large Language Model), on benchmark data and shows better robustness to mixed-style inputs.
文本风格转换(TST)是自然语言处理的一个相关分支,旨在控制一段文本的风格属性,同时保留其原始内容。为了在没有并行数据的情况下解决 TST 问题,循环一致性生成对抗网络(Cycle-consistent Generative Adversarial Networks,CycleGANs)最近成为一种很有前途的解决方案。现有的基于 CycleGAN 的 TST 方法存在以下局限性:(1) 它们根据循环一致性原则在潜在空间中应用自我监督。这种方法对混合风格输入的鲁棒性较差,即源文本部分为原始风格,部分为目标风格时;(2) 生成器和判别器依赖于递归网络,而递归网络存在已知的长期文本依赖性问题;(3) 目标风格执行不力,因为判别器在区分真假句子时没有明确考虑生成文本的风格。我们提出了一种新的基于 CycleGAN 的 TST 方法,该方法直接在序列级别应用自监督,以有效处理混合风格输入,并使用 Transformers 利用注意力机制进行文本编码和解码。我们还采用了一个预先训练好的文体分类器,以指导生成目标文体的文本,同时保持原始内容的含义。在格式和情感转换任务上取得的实验结果表明,我们的方法在基准数据上优于现有的基于 CycleGAN 的方法和非基于 CycleGAN 的方法(包括开源的大型语言模型),并对混合风格输入表现出更好的鲁棒性。
{"title":"Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks","authors":"Moreno La Quatra, Giuseppe Gallipoli, Luca Cagliero","doi":"10.1145/3678179","DOIUrl":"https://doi.org/10.1145/3678179","url":null,"abstract":"Text Style Transfer (TST) is a relevant branch of natural language processing that aims to control the style attributes of a piece of text while preserving its original content. To address TST in the absence of parallel data, Cycle-consistent Generative Adversarial Networks (CycleGANs) have recently emerged as promising solutions. Existing CycleGAN-based TST approaches suffer from the following limitations: (1) They apply self-supervision, based on the cycle-consistency principle, in the latent space. This approach turns out to be less robust to mixed-style inputs, i.e., when the source text is partly in the original and partly in the target style; (2) Generators and discriminators rely on recurrent networks, which are exposed to known issues with long-term text dependencies; (3) The target style is weakly enforced, as the discriminator distinguishes real from fake sentences without explicitly accounting for the generated text’s style. We propose a new CycleGAN-based TST approach that applies self-supervision directly at the sequence level to effectively handle mixed-style inputs and employs Transformers to leverage the attention mechanism for both text encoding and decoding. We also employ a pre-trained style classifier to guide the generation of text in the target style while maintaining the original content’s meaning. The experimental results achieved on the formality and sentiment transfer tasks show that our approach outperforms existing ones, both CycleGAN-based and not (including an open-source Large Language Model), on benchmark data and shows better robustness to mixed-style inputs.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141827121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Learning Survey: A Multi-Level Taxonomy of Aggregation Techniques, Experimental Insights, and Future Frontiers 联合学习调查:聚合技术、实验见解和未来前沿的多层次分类法
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-17 DOI: 10.1145/3678182
Meriem Arbaoui, Mohamed-el-Amine Brahmia, Abdellatif Rahmoun, M. Zghal
The emerging integration of IoT (Internet of Things) and AI (Artificial Intelligence) has unlocked numerous opportunities for innovation across diverse industries. However, growing privacy concerns and data isolation issues have inhibited this promising advancement. Unfortunately, traditional centralized machine learning (ML) methods have demonstrated their limitations in addressing these hurdles. In response to this ever-evolving landscape, Federated Learning (FL) has surfaced as a cutting-edge machine learning paradigm, enabling collaborative training across decentralized devices. FL allows users to jointly construct AI models without sharing their local raw data, ensuring data privacy, network scalability, and minimal data transfer. One essential aspect of FL revolves around proficient knowledge aggregation within a heterogeneous environment. Yet, the inherent characteristics of FL have amplified the complexity of its practical implementation compared to centralized ML. This survey delves into three prominent clusters of FL research contributions: personalization, optimization, and robustness. The objective is to provide a well-structured and fine-grained classification scheme related to these research areas through a unique methodology for selecting related work. Unlike other survey papers, we employed a hybrid approach that amalgamates bibliometric analysis and systematic scrutinizing to find the most influential work in the literature. Therefore, we examine challenges and contemporary techniques related to heterogeneity, efficiency, security, and privacy. Another valuable asset of this study is its comprehensive coverage of FL aggregation strategies, encompassing architectural features, synchronization methods, and several federation motivations. To further enrich our investigation, we provide practical insights into evaluating novel FL proposals and conduct experiments to assess and compare aggregation methods under IID and non-IID data distributions. Finally, we present a compelling set of research avenues that call for further exploration to open up a treasure of advancement.
新兴的物联网(IoT)与人工智能(AI)的融合为各行各业的创新带来了众多机遇。然而,日益增长的隐私问题和数据隔离问题阻碍了这一充满希望的进步。遗憾的是,传统的集中式机器学习(ML)方法在解决这些障碍方面已显示出其局限性。为了应对这种不断变化的局面,联邦学习(Federated Learning,FL)作为一种前沿的机器学习范式浮出水面,实现了跨分散设备的协作训练。联邦学习允许用户在不共享本地原始数据的情况下共同构建人工智能模型,从而确保数据隐私、网络可扩展性和最小化数据传输。FL 的一个重要方面是在异构环境中进行熟练的知识聚合。然而,与集中式人工智能相比,FL 的固有特性放大了其实际实施的复杂性。本调查深入探讨了 FL 研究的三个突出贡献集群:个性化、优化和鲁棒性。目的是通过选择相关工作的独特方法,提供与这些研究领域相关的结构合理、粒度精细的分类方案。与其他调查论文不同的是,我们采用了一种混合方法,将文献计量分析和系统性审查结合起来,以找到文献中最具影响力的工作。因此,我们研究了与异构性、效率、安全性和隐私相关的挑战和当代技术。本研究的另一个宝贵之处在于它全面涵盖了 FL 聚合策略,包括架构特点、同步方法和几种联盟动机。为了进一步丰富我们的研究,我们提供了评估新型 FL 建议的实用见解,并进行了实验,以评估和比较 IID 和非 IID 数据分布下的聚合方法。最后,我们提出了一系列令人信服的研究途径,这些途径需要进一步探索,以开辟前进的宝藏。
{"title":"Federated Learning Survey: A Multi-Level Taxonomy of Aggregation Techniques, Experimental Insights, and Future Frontiers","authors":"Meriem Arbaoui, Mohamed-el-Amine Brahmia, Abdellatif Rahmoun, M. Zghal","doi":"10.1145/3678182","DOIUrl":"https://doi.org/10.1145/3678182","url":null,"abstract":"The emerging integration of IoT (Internet of Things) and AI (Artificial Intelligence) has unlocked numerous opportunities for innovation across diverse industries. However, growing privacy concerns and data isolation issues have inhibited this promising advancement. Unfortunately, traditional centralized machine learning (ML) methods have demonstrated their limitations in addressing these hurdles. In response to this ever-evolving landscape, Federated Learning (FL) has surfaced as a cutting-edge machine learning paradigm, enabling collaborative training across decentralized devices. FL allows users to jointly construct AI models without sharing their local raw data, ensuring data privacy, network scalability, and minimal data transfer. One essential aspect of FL revolves around proficient knowledge aggregation within a heterogeneous environment. Yet, the inherent characteristics of FL have amplified the complexity of its practical implementation compared to centralized ML. This survey delves into three prominent clusters of FL research contributions: personalization, optimization, and robustness. The objective is to provide a well-structured and fine-grained classification scheme related to these research areas through a unique methodology for selecting related work. Unlike other survey papers, we employed a hybrid approach that amalgamates bibliometric analysis and systematic scrutinizing to find the most influential work in the literature. Therefore, we examine challenges and contemporary techniques related to heterogeneity, efficiency, security, and privacy. Another valuable asset of this study is its comprehensive coverage of FL aggregation strategies, encompassing architectural features, synchronization methods, and several federation motivations. To further enrich our investigation, we provide practical insights into evaluating novel FL proposals and conduct experiments to assess and compare aggregation methods under IID and non-IID data distributions. Finally, we present a compelling set of research avenues that call for further exploration to open up a treasure of advancement.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141828352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Spatial-Temporal EV Charging Scheduling with Incentive Promotion 利用激励机制进行在线时空电动汽车充电调度
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-17 DOI: 10.1145/3678180
Lo Pang-Yun Ting, Huan-Yang Wang, Jhe-Yun Jhang, Kun-Ta Chuang
The growing adoption of electric vehicles (EVs) has resulted in an increased demand for public EV charging infrastructure. Currently, the collaboration between these stations has become vital for efficient charging scheduling and cost reduction. However, most existing scheduling methods primarily focus on recommending charging stations without considering users’ charging preferences. Adopting these strategies may require considerable modifications to how people charge their EVs, which could lead to a reluctance to follow the scheduling plan from charging services in real-world situations. To address these challenges, we propose the POSKID framework in this paper. It focuses on spatial-temporal charging scheduling, aiming to recommend a feasible charging arrangement, including a charging station and a charging time slot, to each EV user while minimizing overall operating costs and ensuring users’ charging satisfaction. The framework adopts an online charging mechanism that provides recommendations without prior knowledge of future electricity information or charging requests. To enhance users’ willingness to accept the recommendations, POSKID incorporates an incentive strategy and a novel embedding method combined with Bayesian personalized analysis. These techniques reveal users’ implicit charging preferences, enhancing the success probability of the charging scheduling task. Furthermore, POSKID integrates an online candidate arrangement selection and an explore-exploit strategy to improve the charging arrangement recommendations based on users’ feedback. Experimental results using real-world datasets validate the effectiveness of POSKID in optimizing charging management, surpassing other strategies. The results demonstrate that POSKID benefits each charging station while ensuring user charging satisfaction.
随着电动汽车(EV)的日益普及,对公共电动汽车充电基础设施的需求也随之增加。目前,这些充电站之间的协作已成为高效充电调度和降低成本的关键。然而,大多数现有的调度方法主要侧重于推荐充电站,而不考虑用户的充电偏好。采用这些策略可能需要对人们为电动汽车充电的方式进行相当大的调整,这可能会导致人们在实际情况下不愿意遵循充电服务的调度计划。为了应对这些挑战,我们在本文中提出了 POSKID 框架。该框架侧重于空间-时间充电调度,旨在向每位电动汽车用户推荐可行的充电安排,包括充电站和充电时间段,同时最大限度地降低总体运营成本并确保用户的充电满意度。该框架采用在线充电机制,在事先不了解未来电力信息或充电请求的情况下提供建议。为提高用户接受建议的意愿,POSKID 采用了激励策略和新颖的嵌入方法,并结合贝叶斯个性化分析。这些技术揭示了用户隐含的充电偏好,提高了充电调度任务的成功概率。此外,POSKID 还整合了在线候选安排选择和探索-开发策略,以根据用户反馈改进充电安排推荐。使用真实数据集的实验结果验证了 POSKID 在优化充电管理方面的有效性,超过了其他策略。结果表明,POSKID 在确保用户充电满意度的同时,也使每个充电站受益。
{"title":"Online Spatial-Temporal EV Charging Scheduling with Incentive Promotion","authors":"Lo Pang-Yun Ting, Huan-Yang Wang, Jhe-Yun Jhang, Kun-Ta Chuang","doi":"10.1145/3678180","DOIUrl":"https://doi.org/10.1145/3678180","url":null,"abstract":"\u0000 The growing adoption of electric vehicles (EVs) has resulted in an increased demand for public EV charging infrastructure. Currently, the collaboration between these stations has become vital for efficient charging scheduling and cost reduction. However, most existing scheduling methods primarily focus on recommending charging stations without considering users’ charging preferences. Adopting these strategies may require considerable modifications to how people charge their EVs, which could lead to a reluctance to follow the scheduling plan from charging services in real-world situations. To address these challenges, we propose the\u0000 POSKID\u0000 framework in this paper. It focuses on spatial-temporal charging scheduling, aiming to recommend a feasible charging arrangement, including a charging station and a charging time slot, to each EV user while minimizing overall operating costs and ensuring users’ charging satisfaction. The framework adopts an online charging mechanism that provides recommendations without prior knowledge of future electricity information or charging requests. To enhance users’ willingness to accept the recommendations,\u0000 POSKID\u0000 incorporates an incentive strategy and a novel embedding method combined with Bayesian personalized analysis. These techniques reveal users’ implicit charging preferences, enhancing the success probability of the charging scheduling task. Furthermore,\u0000 POSKID\u0000 integrates an online candidate arrangement selection and an explore-exploit strategy to improve the charging arrangement recommendations based on users’ feedback. Experimental results using real-world datasets validate the effectiveness of\u0000 POSKID\u0000 in optimizing charging management, surpassing other strategies. The results demonstrate that\u0000 POSKID\u0000 benefits each charging station while ensuring user charging satisfaction.\u0000","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141828891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MedNER: Enhanced Named Entity Recognition in Medical Corpus via Optimized Balanced and Deep Active Learning MedNER:通过优化平衡和深度主动学习增强医学语料库中的命名实体识别能力
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-17 DOI: 10.1145/3678178
Zhuang Yan, Junyan Zhang, Ruogu Lu, Kunlun He, Xiuxing Li
Ever-growing electronic medical corpora provide unprecedented opportunities for researchers to analyze patient conditions and drug effects. Meanwhile, severe challenges emerged in the large-scale electronic medical records process phase. Primarily, emerging words for medical terms, including informal descriptions, are difficult to recognize. Moreover, although deep models can help in entity extraction on medical texts, it requires large-scale labels which are time-intensive to obtain and not always available in the medical domain. However, when encountering a situation where massive unseen concepts appear, or labeled data is insufficient, the performance of existing algorithms will suffer an intolerable decline. In this paper, we propose a balanced and deep active learning framework ( MedNER ) for Named Entity Recognition in the medical corpus to alleviate above problems. Specifically, to describe our selection strategy precisely, we first define the uncertainty of a medical sentence as a labeling loss predicted by a loss-prediction module and define diversity as the least text distance between pairs of sentences in a sample batch computed based on word-morpheme embeddings. Furthermore, aiming to make a trade-off between uncertainty and diversity, we formulate a Distinct-K optimization problem to maximize the slightest uncertainty and diversity of chosen sentences. Finally, we propose a threshold-based approximation selection algorithm, Distinct-K Filter , which selects the most beneficial training samples by balancing diversity and uncertainty. Extensive experimental results on real datasets demonstrate that MedNER significantly outperforms existing approaches.
不断增长的电子医疗库为研究人员分析患者病情和药物效果提供了前所未有的机会。与此同时,在大规模电子病历处理阶段也出现了严峻的挑战。首先,包括非正式描述在内的医学术语的新词很难识别。此外,虽然深度模型可以帮助医疗文本中的实体提取,但它需要大规模的标签,而这些标签的获取需要大量时间,而且在医疗领域并非总能获得。然而,当遇到出现大量未见概念或标记数据不足的情况时,现有算法的性能就会出现难以忍受的下降。为了解决上述问题,我们在本文中提出了一种用于医学语料库中命名实体识别的平衡深度主动学习框架(MedNER)。具体来说,为了准确描述我们的选择策略,我们首先将医学句子的不确定性定义为由损失预测模块预测的标记损失,并将多样性定义为基于词-词素嵌入计算的样本批次中成对句子之间的最小文本距离。此外,为了在不确定性和多样性之间做出权衡,我们提出了一个 Distinct-K 优化问题,以最大化所选句子的最小不确定性和多样性。最后,我们提出了一种基于阈值的近似选择算法 Distinct-K Filter,该算法通过平衡多样性和不确定性来选择最有利的训练样本。在真实数据集上的大量实验结果表明,MedNER 的性能明显优于现有方法。
{"title":"MedNER: Enhanced Named Entity Recognition in Medical Corpus via Optimized Balanced and Deep Active Learning","authors":"Zhuang Yan, Junyan Zhang, Ruogu Lu, Kunlun He, Xiuxing Li","doi":"10.1145/3678178","DOIUrl":"https://doi.org/10.1145/3678178","url":null,"abstract":"\u0000 Ever-growing electronic medical corpora provide unprecedented opportunities for researchers to analyze patient conditions and drug effects. Meanwhile, severe challenges emerged in the large-scale electronic medical records process phase. Primarily, emerging words for medical terms, including informal descriptions, are difficult to recognize. Moreover, although deep models can help in entity extraction on medical texts, it requires large-scale labels which are time-intensive to obtain and not always available in the medical domain. However, when encountering a situation where massive unseen concepts appear, or labeled data is insufficient, the performance of existing algorithms will suffer an intolerable decline. In this paper, we propose a balanced and deep active learning framework (\u0000 MedNER\u0000 ) for Named Entity Recognition in the medical corpus to alleviate above problems. Specifically, to describe our selection strategy precisely, we first define the uncertainty of a medical sentence as a labeling loss predicted by a loss-prediction module and define diversity as the least text distance between pairs of sentences in a sample batch computed based on word-morpheme embeddings. Furthermore, aiming to make a trade-off between uncertainty and diversity, we formulate a\u0000 Distinct-K\u0000 optimization problem to maximize the slightest uncertainty and diversity of chosen sentences. Finally, we propose a threshold-based approximation selection algorithm,\u0000 Distinct-K Filter\u0000 , which selects the most beneficial training samples by balancing diversity and uncertainty. Extensive experimental results on real datasets demonstrate that\u0000 MedNER\u0000 significantly outperforms existing approaches.\u0000","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141831230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recommender System-Induced Eating Disorder Relapse: Harmful Content and the Challenges of Responsible Recommendation 推荐系统诱发的饮食失调复发:有害内容与负责任推荐的挑战
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-05 DOI: 10.1145/3675404
Jennifer Golbeck
As users’ social media feeds have become increasingly driven by algorithmically recommended content, there is a need to understand the impact these recommendations have on users. People in recovery from eating disorders (ED) may try to avoid content that features severely underweight bodies or that encourages disordered eating. However, if recommender systems show them this type of content anyway, it may impact their recovery or even lead to relapse. In this study, we take a two-pronged approach to understanding the intersection of recommender systems, eating disorder content, and users in recovery. We performed a content analysis of tweets about recommended eating disorder content and conducted a small-scale study on Pinterest to show that eating disorder content is recommended in response to interaction with posts about eating disorder recovery. We discuss the implications for responsible recommendation and harm prevention.
随着用户的社交媒体推送越来越多地受到算法推荐内容的驱动,我们有必要了解这些推荐内容对用户的影响。处于饮食失调(ED)康复期的人可能会尽量避开那些显示体重严重不足或鼓励饮食失调的内容。但是,如果推荐系统还是向他们展示这类内容,就可能影响他们的康复,甚至导致复发。在本研究中,我们采取了双管齐下的方法来了解推荐系统、饮食失调内容和处于康复期的用户之间的交集。我们对有关饮食失调症推荐内容的推文进行了内容分析,并在 Pinterest 上进行了小规模研究,结果表明饮食失调症内容是在与有关饮食失调症康复的帖子互动后被推荐的。我们讨论了负责任的推荐和预防伤害的意义。
{"title":"Recommender System-Induced Eating Disorder Relapse: Harmful Content and the Challenges of Responsible Recommendation","authors":"Jennifer Golbeck","doi":"10.1145/3675404","DOIUrl":"https://doi.org/10.1145/3675404","url":null,"abstract":"As users’ social media feeds have become increasingly driven by algorithmically recommended content, there is a need to understand the impact these recommendations have on users. People in recovery from eating disorders (ED) may try to avoid content that features severely underweight bodies or that encourages disordered eating. However, if recommender systems show them this type of content anyway, it may impact their recovery or even lead to relapse. In this study, we take a two-pronged approach to understanding the intersection of recommender systems, eating disorder content, and users in recovery. We performed a content analysis of tweets about recommended eating disorder content and conducted a small-scale study on Pinterest to show that eating disorder content is recommended in response to interaction with posts about eating disorder recovery. We discuss the implications for responsible recommendation and harm prevention.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141673165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
M2SKD: Multi-to-Single Knowledge Distillation of Real-Time Epileptic Seizure Detection for Low-Power Wearable Systems M2SKD:针对低功耗可穿戴系统的癫痫发作实时检测的多对单知识提炼
IF 7.2 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-04 DOI: 10.1145/3675402
Saleh Baghersalimi, A. Amirshahi, Farnaz Forooghifar, T. Teijeiro, A. Aminifar, David Atienza
Integrating low-power wearable systems into routine health monitoring is an ongoing challenge. Recent advances in the computation capabilities of wearables make it possible to target complex scenarios by exploiting multiple biosignals and using high-performance algorithms, such as Deep Neural Networks (DNNs). However, there is a trade-off between the algorithms’ performance and the low-power requirements of platforms with limited resources. Besides, physically larger and multi-biosignal-based wearables bring significant discomfort to the patients. Consequently, reducing power consumption and discomfort is necessary for patients to use wearable devices continuously during everyday life. To overcome these challenges, in the context of epileptic seizure detection, we propose the M2SKD (Multi-to-Single Knowledge Distillation) approach targeting single-biosignal processing in wearable systems. The starting point is to train a highly-accurate multi-biosignal DNN, then apply M2SKD to develop a single-biosignal DNN solution for wearable systems that achieves an accuracy comparable to the original multi-biosignal DNN. To assess the practicality of our approach to real-life scenarios, we perform a comprehensive simulation experiment analysis on several edge computing platforms.
将低功耗可穿戴系统集成到常规健康监测中是一项持续的挑战。可穿戴设备计算能力的最新进展使得利用多种生物信号和高性能算法(如深度神经网络 (DNN))针对复杂场景成为可能。然而,算法的性能与资源有限的平台对低功耗的要求之间存在权衡。此外,体积较大和基于多生物信号的可穿戴设备会给患者带来明显不适。因此,要让患者在日常生活中持续使用可穿戴设备,就必须降低功耗和不适感。为了克服这些挑战,在癫痫发作检测方面,我们提出了针对可穿戴系统中单生物信号处理的 M2SKD(多对单知识蒸馏)方法。该方法的出发点是训练一个高精确度的多生物信号 DNN,然后应用 M2SKD 为可穿戴系统开发一个单生物信号 DNN 解决方案,其精确度可与原始的多生物信号 DNN 相媲美。为了评估我们的方法在现实生活场景中的实用性,我们在多个边缘计算平台上进行了全面的模拟实验分析。
{"title":"M2SKD: Multi-to-Single Knowledge Distillation of Real-Time Epileptic Seizure Detection for Low-Power Wearable Systems","authors":"Saleh Baghersalimi, A. Amirshahi, Farnaz Forooghifar, T. Teijeiro, A. Aminifar, David Atienza","doi":"10.1145/3675402","DOIUrl":"https://doi.org/10.1145/3675402","url":null,"abstract":"Integrating low-power wearable systems into routine health monitoring is an ongoing challenge. Recent advances in the computation capabilities of wearables make it possible to target complex scenarios by exploiting multiple biosignals and using high-performance algorithms, such as Deep Neural Networks (DNNs). However, there is a trade-off between the algorithms’ performance and the low-power requirements of platforms with limited resources. Besides, physically larger and multi-biosignal-based wearables bring significant discomfort to the patients. Consequently, reducing power consumption and discomfort is necessary for patients to use wearable devices continuously during everyday life. To overcome these challenges, in the context of epileptic seizure detection, we propose the M2SKD (Multi-to-Single Knowledge Distillation) approach targeting single-biosignal processing in wearable systems. The starting point is to train a highly-accurate multi-biosignal DNN, then apply M2SKD to develop a single-biosignal DNN solution for wearable systems that achieves an accuracy comparable to the original multi-biosignal DNN. To assess the practicality of our approach to real-life scenarios, we perform a comprehensive simulation experiment analysis on several edge computing platforms.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":7.2,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141677621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aspect-enhanced Explainable Recommendation with Multi-modal Contrastive Learning 通过多模态对比学习增强可解释推荐功能
IF 5 4区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-06-19 DOI: 10.1145/3673234
Hao Liao, Shuo Wang, Hao Cheng, Wei Zhang, Jiwei Zhang, Mingyang Zhou, Kezhong Lu, Rui Mao, Xing Xie

Explainable recommender systems (ERS) aim to enhance users’ trust in the systems by offering personalized recommendations with transparent explanations. This transparency provides users with a clear understanding of the rationale behind the recommendations, fostering a sense of confidence and reliability in the system’s outputs. Generally, the explanations are presented in a familiar and intuitive way, which is in the form of natural language, thus enhancing their accessibility to users. Recently, there has been an increasing focus on leveraging reviews as a valuable source of rich information in both modeling user-item preferences and generating textual interpretations, which can be performed simultaneously in a multi-task framework. Despite the progress made in these review-based recommendation systems, the integration of implicit feedback derived from user-item interactions and user-written text reviews has yet to be fully explored. To fill this gap, we propose a model named SERMON (Aspect-enhanced Explainable Recommendation with Multi-modal Contrast Learning). Our model explores the application of multimodal contrastive learning to facilitate reciprocal learning across two modalities, thereby enhancing the modeling of user preferences. Moreover, our model incorporates the aspect information extracted from the review, which provides two significant enhancements to our tasks. Firstly, the quality of the generated explanations is improved by incorporating the aspect characteristics into the explanations generated by a pre-trained model with controlled textual generation ability. Secondly, the commonly used user-item interactions are transformed into user-item-aspect interactions, which we refer to as interaction triple, resulting in a more nuanced representation of user preference. To validate the effectiveness of our model, we conduct extensive experiments on three real-world datasets. The experimental results show that our model outperforms state-of-the-art baselines, with a 2.0% improvement in prediction accuracy and a substantial 24.5% enhancement in explanation quality for the TripAdvisor dataset.

可解释推荐系统(ERS)旨在通过提供带有透明解释的个性化推荐,增强用户对系统的信任。这种透明度能让用户清楚地了解推荐背后的理由,从而增强用户对系统输出结果的信任感和可靠性。一般来说,解释都是以用户熟悉和直观的方式,即自然语言的形式呈现的,从而增强了用户的可访问性。最近,越来越多的人开始关注利用评论作为丰富信息的宝贵来源,为用户物品偏好建模并生成文本解释,这些工作可以在多任务框架中同时进行。尽管这些基于评论的推荐系统取得了进展,但对来自用户-物品交互的隐式反馈和用户撰写的文本评论的整合仍有待充分探索。为了填补这一空白,我们提出了一个名为 SERMON(多模态对比学习的方面增强可解释推荐)的模型。我们的模型探索了多模态对比学习的应用,以促进两种模态之间的互惠学习,从而增强对用户偏好的建模。此外,我们的模型还纳入了从评论中提取的方面信息,这为我们的任务提供了两个重大改进。首先,通过将方面特征纳入由具有可控文本生成能力的预训练模型生成的解释中,提高了生成解释的质量。其次,常用的用户-物品交互被转化为用户-物品-方面交互,我们称之为交互三重,从而更细致地反映了用户的偏好。为了验证我们模型的有效性,我们在三个真实世界的数据集上进行了广泛的实验。实验结果表明,我们的模型优于最先进的基线模型,在 TripAdvisor 数据集上,预测准确率提高了 2.0%,解释质量大幅提高了 24.5%。
{"title":"Aspect-enhanced Explainable Recommendation with Multi-modal Contrastive Learning","authors":"Hao Liao, Shuo Wang, Hao Cheng, Wei Zhang, Jiwei Zhang, Mingyang Zhou, Kezhong Lu, Rui Mao, Xing Xie","doi":"10.1145/3673234","DOIUrl":"https://doi.org/10.1145/3673234","url":null,"abstract":"<p>Explainable recommender systems (<b>ERS</b>) aim to enhance users’ trust in the systems by offering personalized recommendations with transparent explanations. This transparency provides users with a clear understanding of the rationale behind the recommendations, fostering a sense of confidence and reliability in the system’s outputs. Generally, the explanations are presented in a familiar and intuitive way, which is in the form of natural language, thus enhancing their accessibility to users. Recently, there has been an increasing focus on leveraging reviews as a valuable source of rich information in both modeling user-item preferences and generating textual interpretations, which can be performed simultaneously in a multi-task framework. Despite the progress made in these review-based recommendation systems, the integration of implicit feedback derived from user-item interactions and user-written text reviews has yet to be fully explored. To fill this gap, we propose a model named <b>SERMON</b> (A<b><underline>s</underline></b>pect-enhanced <b><underline>E</underline></b>xplainable <b><underline>R</underline></b>ecommendation with <b><underline>M</underline></b>ulti-modal C<b><underline>o</underline></b>ntrast Lear<b><underline>n</underline></b>ing). Our model explores the application of multimodal contrastive learning to facilitate reciprocal learning across two modalities, thereby enhancing the modeling of user preferences. Moreover, our model incorporates the aspect information extracted from the review, which provides two significant enhancements to our tasks. Firstly, the quality of the generated explanations is improved by incorporating the aspect characteristics into the explanations generated by a pre-trained model with controlled textual generation ability. Secondly, the commonly used user-item interactions are transformed into user-item-aspect interactions, which we refer to as interaction triple, resulting in a more nuanced representation of user preference. To validate the effectiveness of our model, we conduct extensive experiments on three real-world datasets. The experimental results show that our model outperforms state-of-the-art baselines, with a 2.0% improvement in prediction accuracy and a substantial 24.5% enhancement in explanation quality for the TripAdvisor dataset.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141508268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Transactions on Intelligent Systems and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1