ArXiv最新文献

英文中文

Human I/O: Towards a Unified Approach to Detecting Situational Impairments 人类输入/输出：采用统一方法检测情境障碍

ArXiv

Pub Date : 2024-03-06 DOI: 10.1145/3613904.3642065

Xingyu Bruce Liu, Jiahao Nick Li, David Kim, Xiang 'Anthony' Chen, Ruofei Du

Situationally Induced Impairments and Disabilities (SIIDs) can significantly hinder user experience in contexts such as poor lighting, noise, and multi-tasking. While prior research has introduced algorithms and systems to address these impairments, they predominantly cater to specific tasks or environments and fail to accommodate the diverse and dynamic nature of SIIDs. We introduce Human I/O, a unified approach to detecting a wide range of SIIDs by gauging the availability of human input/output channels. Leveraging egocentric vision, multimodal sensing and reasoning with large language models, Human I/O achieves a 0.22 mean absolute error and a 82% accuracy in availability prediction across 60 in-the-wild egocentric video recordings in 32 different scenarios. Furthermore, while the core focus of our work is on the detection of SIIDs rather than the creation of adaptive user interfaces, we showcase the efficacy of our prototype via a user study with 10 participants. Findings suggest that Human I/O significantly reduces effort and improves user experience in the presence of SIIDs, paving the way for more adaptive and accessible interactive systems in the future.

在光线不足、噪音和多任务处理等情况下，情境诱发的障碍和残疾（SIIDs）会严重影响用户体验。虽然之前的研究已经推出了解决这些障碍的算法和系统，但它们主要是针对特定的任务或环境，无法适应 SIIDs 的多样性和动态性。我们引入了人类输入/输出（Human I/O），这是一种通过测量人类输入/输出通道的可用性来检测各种 SIID 的统一方法。人类 I/O 利用以自我为中心的视觉、多模态传感和大型语言模型推理，在 32 种不同场景下的 60 个野外以自我为中心的视频记录中，实现了 0.22 的平均绝对误差和 82% 的可用性预测准确率。此外，虽然我们工作的核心重点是检测 SIID 而不是创建自适应用户界面，但我们通过对 10 名参与者的用户研究展示了我们原型的功效。研究结果表明，人类输入/输出（Human I/O）可以在出现 SIID 的情况下显著减少工作量并改善用户体验，为未来开发更具适应性和可访问性的交互系统铺平了道路。

{"title":"Human I/O: Towards a Unified Approach to Detecting Situational Impairments","authors":"Xingyu Bruce Liu, Jiahao Nick Li, David Kim, Xiang 'Anthony' Chen, Ruofei Du","doi":"10.1145/3613904.3642065","DOIUrl":"https://doi.org/10.1145/3613904.3642065","url":null,"abstract":"Situationally Induced Impairments and Disabilities (SIIDs) can significantly hinder user experience in contexts such as poor lighting, noise, and multi-tasking. While prior research has introduced algorithms and systems to address these impairments, they predominantly cater to specific tasks or environments and fail to accommodate the diverse and dynamic nature of SIIDs. We introduce Human I/O, a unified approach to detecting a wide range of SIIDs by gauging the availability of human input/output channels. Leveraging egocentric vision, multimodal sensing and reasoning with large language models, Human I/O achieves a 0.22 mean absolute error and a 82% accuracy in availability prediction across 60 in-the-wild egocentric video recordings in 32 different scenarios. Furthermore, while the core focus of our work is on the detection of SIIDs rather than the creation of adaptive user interfaces, we showcase the efficacy of our prototype via a user study with 10 participants. Findings suggest that Human I/O significantly reduces effort and improves user experience in the presence of SIIDs, paving the way for more adaptive and accessible interactive systems in the future.","PeriodicalId":513202,"journal":{"name":"ArXiv","volume":"1 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

I3DE: An IDE for Inspecting Inconsistencies in PL/SQL Code I3DE：用于检查 PL/SQL 代码中不一致之处的集成开发环境

ArXiv

Pub Date : 2024-03-06 DOI: 10.1145/3643796.3648461

Jiangshan Liu, Shuang Liu, Junjie Chen

In this paper, we introduce I3DE (Inconsistency Inspecting IDE) - an IDE plugin to inspect inconsistencies in PL/SQL code. We first observed the potential issues, e.g., misuses or bugs, that are introduced by the inconsistent understanding of PL/SQL semantics by PL/SQL programmers and DBMS developers, and propose a metamorphic testing-based approach for inspecting such inconsistencies in PL/SQL code. We design and implement our approach in I3DE, a widely usable plugin for the IntelliJ Platform. We conducted a comparative user study involving 16 participants, and the findings indicate that I3DE is consistently effective and efficient in helping programmers identify and avoid inconsistencies across different programming difficulties

本文介绍了 I3DE（不一致性检查集成开发环境）--一种用于检查 PL/SQL 代码中不一致性的集成开发环境插件。我们首先观察了由于 PL/SQL 程序员和 DBMS 开发人员对 PL/SQL 语义的理解不一致而导致的潜在问题（如误用或错误），并提出了一种基于元测试的方法来检查 PL/SQL 代码中的不一致之处。我们在 IntelliJ 平台的一个广泛使用的插件 I3DE 中设计并实现了我们的方法。我们进行了一项有 16 人参与的用户对比研究，研究结果表明，I3DE 在帮助程序员识别和避免不同编程困难中的不一致性方面始终是有效和高效的。

引用次数: 0

SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization SalienTime：用户驱动的大规模地理空间数据可视化突出时间步骤选择

ArXiv

Pub Date : 2024-03-06 DOI: 10.1145/3613904.3642944

Juntong Chen, Haiwen Huang, Huayuan Ye, Zhong Peng, Chenhui Li, Changbo Wang

The voluminous nature of geospatial temporal data from physical monitors and simulation models poses challenges to efficient data access, often resulting in cumbersome temporal selection experiences in web-based data portals. Thus, selecting a subset of time steps for prioritized visualization and pre-loading is highly desirable. Addressing this issue, this paper establishes a multifaceted definition of salient time steps via extensive need-finding studies with domain experts to understand their workflows. Building on this, we propose a novel approach that leverages autoencoders and dynamic programming to facilitate user-driven temporal selections. Structural features, statistical variations, and distance penalties are incorporated to make more flexible selections. User-specified priorities, spatial regions, and aggregations are used to combine different perspectives. We design and implement a web-based interface to enable efficient and context-aware selection of time steps and evaluate its efficacy and usability through case studies, quantitative evaluations, and expert interviews.

来自物理监测器和仿真模型的大量地理空间时间数据给高效数据访问带来了挑战，往往导致基于网络的数据门户在时间选择方面的繁琐。因此，选择一个时间步骤子集进行优先可视化和预加载是非常可取的。针对这一问题，本文通过与领域专家进行广泛的需求调查研究，了解他们的工作流程，从而确定了突出时间步骤的多方面定义。在此基础上，我们提出了一种新方法，利用自动编码器和动态编程来促进用户驱动的时间选择。结构特征、统计变化和距离惩罚都被纳入其中，以实现更灵活的选择。用户指定的优先级、空间区域和聚合被用来结合不同的视角。我们设计并实施了一个基于网络的界面，以实现高效和情境感知的时间步骤选择，并通过案例研究、定量评估和专家访谈对其有效性和可用性进行了评估。

引用次数: 0

KG-TREAT: Pre-training for Treatment Effect Estimation by Synergizing Patient Data with Knowledge Graphs KG-TREAT：通过患者数据与知识图谱的协同作用进行治疗效果估计的预训练

ArXiv

Pub Date : 2024-03-06 DOI: 10.1609/aaai.v38i8.28727

Ruoqi Liu, Lingfei Wu, Ping Zhang

Treatment effect estimation (TEE) is the task of determining the impact of various treatments on patient outcomes. Current TEE methods fall short due to reliance on limited labeled data and challenges posed by sparse and high-dimensional observational patient data. To address the challenges, we introduce a novel pre-training and fine-tuning framework, KG-TREAT, which synergizes large-scale observational patient data with biomedical knowledge graphs (KGs) to enhance TEE. Unlike previous approaches, KG-TREAT constructs dual-focus KGs and integrates a deep bi-level attention synergy method for in-depth information fusion, enabling distinct encoding of treatment-covariate and outcome-covariate relationships. KG-TREAT also incorporates two pre-training tasks to ensure a thorough grounding and contextualization of patient data and KGs. Evaluation on four downstream TEE tasks shows KG-TREAT’s superiority over existing methods, with an average improvement of 7% in Area under the ROC Curve (AUC) and 9% in Influence Function-based Precision of Estimating Heterogeneous Effects (IF-PEHE). The effectiveness of our estimated treatment effects is further affirmed by alignment with established randomized clinical trial findings.

治疗效果估计（TEE）是确定各种治疗方法对患者预后影响的任务。由于依赖于有限的标记数据以及稀疏和高维观察性患者数据带来的挑战，目前的 TEE 方法存在不足。为了应对这些挑战，我们引入了一个新颖的预训练和微调框架 KG-TREAT，该框架将大规模患者观察数据与生物医学知识图谱（KGs）协同作用，以增强 TEE。与以往的方法不同，KG-TREAT 构建了双焦点 KG，并集成了一种深度双级注意力协同方法，用于深度信息融合，从而实现治疗-协变量和结果-协变量关系的不同编码。KG-TREAT 还包含两个预训练任务，以确保患者数据和 KG 的全面基础化和情境化。对四项下游 TEE 任务的评估表明，KG-TREAT 比现有方法更具优势，ROC 曲线下面积（AUC）平均提高了 7%，基于影响函数的异质性效应估计精度（IF-PEHE）平均提高了 9%。我们估计的治疗效果与既定的随机临床试验结果一致，这进一步证实了我们估计的治疗效果的有效性。

{"title":"KG-TREAT: Pre-training for Treatment Effect Estimation by Synergizing Patient Data with Knowledge Graphs","authors":"Ruoqi Liu, Lingfei Wu, Ping Zhang","doi":"10.1609/aaai.v38i8.28727","DOIUrl":"https://doi.org/10.1609/aaai.v38i8.28727","url":null,"abstract":"Treatment effect estimation (TEE) is the task of determining the impact of various treatments on patient outcomes. Current TEE methods fall short due to reliance on limited labeled data and challenges posed by sparse and high-dimensional observational patient data. To address the challenges, we introduce a novel pre-training and fine-tuning framework, KG-TREAT, which synergizes large-scale observational patient data with biomedical knowledge graphs (KGs) to enhance TEE. Unlike previous approaches, KG-TREAT constructs dual-focus KGs and integrates a deep bi-level attention synergy method for in-depth information fusion, enabling distinct encoding of treatment-covariate and outcome-covariate relationships. KG-TREAT also incorporates two pre-training tasks to ensure a thorough grounding and contextualization of patient data and KGs. Evaluation on four downstream TEE tasks shows KG-TREAT’s superiority over existing methods, with an average improvement of 7% in Area under the ROC Curve (AUC) and 9% in Influence Function-based Precision of Estimating Heterogeneous Effects (IF-PEHE). The effectiveness of our estimated treatment effects is further affirmed by alignment with established randomized clinical trial findings.","PeriodicalId":513202,"journal":{"name":"ArXiv","volume":"5 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers PaperWeaver：通过将推荐论文与用户收集的论文联系起来，丰富专题论文提醒功能

ArXiv

Pub Date : 2024-03-05 DOI: 10.1145/3613904.3642196

Yoonjoo Lee, Hyeonsu B Kang, Matt Latzke, Juho Kim, Jonathan Bragg, Joseph Chee Chang, Pao Siangliulue

With the rapid growth of scholarly archives, researchers subscribe to"paper alert"systems that periodically provide them with recommendations of recently published papers that are similar to previously collected papers. However, researchers sometimes struggle to make sense of nuanced connections between recommended papers and their own research context, as existing systems only present paper titles and abstracts. To help researchers spot these connections, we present PaperWeaver, an enriched paper alerts system that provides contextualized text descriptions of recommended papers based on user-collected papers. PaperWeaver employs a computational method based on Large Language Models (LLMs) to infer users' research interests from their collected papers, extract context-specific aspects of papers, and compare recommended and collected papers on these aspects. Our user study (N=15) showed that participants using PaperWeaver were able to better understand the relevance of recommended papers and triage them more confidently when compared to a baseline that presented the related work sections from recommended papers.

随着学术档案的快速增长，研究人员订阅了 "论文提醒 "系统，该系统会定期向他们推荐最近发表的、与以前收集的论文相似的论文。然而，由于现有系统只提供论文标题和摘要，研究人员有时很难理解推荐论文与其自身研究背景之间的细微联系。为了帮助研究人员发现这些联系，我们推出了 PaperWeaver，这是一个丰富的论文提醒系统，它能根据用户收集的论文提供推荐论文的上下文文本描述。PaperWeaver 采用一种基于大型语言模型（LLM）的计算方法，从用户收集的论文中推断用户的研究兴趣，提取论文的特定上下文，并就这些方面对推荐论文和收集的论文进行比较。我们的用户研究（N=15）显示，使用PaperWeaver的参与者能够更好地理解推荐论文的相关性，并在与展示推荐论文中相关工作部分的基线相比时更有信心地对其进行分流。

{"title":"PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers","authors":"Yoonjoo Lee, Hyeonsu B Kang, Matt Latzke, Juho Kim, Jonathan Bragg, Joseph Chee Chang, Pao Siangliulue","doi":"10.1145/3613904.3642196","DOIUrl":"https://doi.org/10.1145/3613904.3642196","url":null,"abstract":"With the rapid growth of scholarly archives, researchers subscribe to\"paper alert\"systems that periodically provide them with recommendations of recently published papers that are similar to previously collected papers. However, researchers sometimes struggle to make sense of nuanced connections between recommended papers and their own research context, as existing systems only present paper titles and abstracts. To help researchers spot these connections, we present PaperWeaver, an enriched paper alerts system that provides contextualized text descriptions of recommended papers based on user-collected papers. PaperWeaver employs a computational method based on Large Language Models (LLMs) to infer users' research interests from their collected papers, extract context-specific aspects of papers, and compare recommended and collected papers on these aspects. Our user study (N=15) showed that participants using PaperWeaver were able to better understand the relevance of recommended papers and triage them more confidently when compared to a baseline that presented the related work sections from recommended papers.","PeriodicalId":513202,"journal":{"name":"ArXiv","volume":"332 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Recall-Oriented Continual Learning with Generative Adversarial Meta-Model 以回忆为导向的持续学习与生成式对抗元模型

ArXiv

Pub Date : 2024-03-05 DOI: 10.1609/aaai.v38i12.29202

Haneol Kang, Dong-Wan Choi

The stability-plasticity dilemma is a major challenge in continual learning, as it involves balancing the conflicting objectives of maintaining performance on previous tasks while learning new tasks. In this paper, we propose the recalloriented continual learning framework to address this challenge. Inspired by the human brain’s ability to separate the mechanisms responsible for stability and plasticity, our framework consists of a two-level architecture where an inference network effectively acquires new knowledge and a generative network recalls past knowledge when necessary. In particular, to maximize the stability of past knowledge, we investigate the complexity of knowledge depending on different representations, and thereby introducing generative adversarial meta-model (GAMM) that incrementally learns task-specific parameters instead of input data samples of the task. Through our experiments, we show that our framework not only effectively learns new knowledge without any disruption but also achieves high stability of previous knowledge in both task-aware and task-agnostic learning scenarios. Our code is available at: https://github.com/bigdata-inha/recall-orientedcl-framework.

稳定性与可塑性的两难问题是持续学习中的一大挑战，因为它涉及到在学习新任务的同时保持以前任务的成绩这一相互冲突的目标之间取得平衡。在本文中，我们提出了以回忆为导向的持续学习框架来解决这一难题。受人脑将稳定性和可塑性机制分开的能力启发，我们的框架由两级架构组成，其中推理网络有效地获取新知识，而生成网络则在必要时回顾过去的知识。特别是，为了最大限度地提高过去知识的稳定性，我们研究了知识的复杂性取决于不同的表征，从而引入了生成对抗元模型（GAMM），该模型以增量方式学习特定任务的参数，而不是任务的输入数据样本。通过实验，我们发现我们的框架不仅能有效地学习新知识，而且在任务感知和任务无关的学习场景中都能实现先前知识的高度稳定性。我们的代码可在以下网址获取：https://github.com/bigdata-inha/recall-orientedcl-framework.

{"title":"Recall-Oriented Continual Learning with Generative Adversarial Meta-Model","authors":"Haneol Kang, Dong-Wan Choi","doi":"10.1609/aaai.v38i12.29202","DOIUrl":"https://doi.org/10.1609/aaai.v38i12.29202","url":null,"abstract":"The stability-plasticity dilemma is a major challenge in continual learning, as it involves balancing the conflicting objectives of maintaining performance on previous tasks while learning new tasks. In this paper, we propose the recalloriented continual learning framework to address this challenge. Inspired by the human brain’s ability to separate the mechanisms responsible for stability and plasticity, our framework consists of a two-level architecture where an inference network effectively acquires new knowledge and a generative network recalls past knowledge when necessary. In particular, to maximize the stability of past knowledge, we investigate the complexity of knowledge depending on different representations, and thereby introducing generative adversarial meta-model (GAMM) that incrementally learns task-specific parameters instead of input data samples of the task. Through our experiments, we show that our framework not only effectively learns new knowledge without any disruption but also achieves high stability of previous knowledge in both task-aware and task-agnostic learning scenarios. Our code is available at: https://github.com/bigdata-inha/recall-orientedcl-framework.","PeriodicalId":513202,"journal":{"name":"ArXiv","volume":"329 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

"It's the only thing I can trust": Envisioning Large Language Model Use by Autistic Workers for Communication Assistance "这是我唯一可以信任的东西"：自闭症工人使用大型语言模型进行交流的设想

ArXiv

Pub Date : 2024-03-05 DOI: 10.1145/3613904.3642894

JiWoong Jang, Sanika Moharana, Patrick Carrington, Andrew Begel

Autistic adults often experience stigma and discrimination at work, leading them to seek social communication support from coworkers, friends, and family despite emotional risks. Large language models (LLMs) are increasingly considered an alternative. In this work, we investigate the phenomenon of LLM use by autistic adults at work and explore opportunities and risks of LLMs as a source of social communication advice. We asked 11 autistic participants to present questions about their own workplace-related social difficulties to (1) a GPT-4-based chatbot and (2) a disguised human confederate. Our evaluation shows that participants strongly preferred LLM over confederate interactions. However, a coach specializing in supporting autistic job-seekers raised concerns that the LLM was dispensing questionable advice. We highlight how this divergence in participant and practitioner attitudes reflects existing schisms in HCI on the relative privileging of end-user wants versus normative good and propose design considerations for LLMs to center autistic experiences.

患有自闭症的成年人在工作中经常会受到羞辱和歧视，这导致他们不顾情感风险，向同事、朋友和家人寻求社会交流支持。大语言模型（LLM）越来越多地被认为是一种替代方案。在这项研究中，我们调查了成人自闭症患者在工作中使用大语言模型的现象，并探讨了大语言模型作为社会沟通建议来源的机遇和风险。我们请 11 位自闭症参与者向（1）基于 GPT-4 的聊天机器人和（2）伪装的人类同声传译者提出与他们自身工作场所相关的社交困难问题。我们的评估结果表明，参与者非常喜欢 LLM 而不是密友互动。然而，一位专门为自闭症求职者提供支持的教练却担心 LLM 提供的建议有问题。我们强调了参与者和从业人员的态度分歧如何反映了人机交互领域在最终用户需求与规范性良好需求之间存在的分歧，并提出了以自闭症体验为中心的 LLM 设计注意事项。

引用次数: 0

Wrist-bound Guanxi, Jiazu, and Kuolie: Unpacking Chinese Adolescent Smartwatch-Mediated Socialization 手腕上的 "关西"、"甲午 "和 "国立"：解读中国青少年以智能手表为媒介的社交活动

ArXiv

Pub Date : 2024-03-05 DOI: 10.1145/3613904.3642044

Lanjing Liu, Chao Zhang, Zhicong Lu

Adolescent peer relationships, essential for their development, are increasingly mediated by digital technologies. As this trend continues, wearable devices, especially smartwatches tailored for adolescents, are reshaping their socialization. In China, smartwatches like XTC have gained wide popularity, introducing unique features such as"Bump-to-Connect"and exclusive social platforms. Nonetheless, how these devices influence adolescents' peer experience remains unknown. Addressing this, we interviewed 18 Chinese adolescents (age: 11 -- 16), discovering a smartwatch-mediated social ecosystem. Our findings highlight the ice-breaking role of smartwatches in friendship initiation and their use for secret messaging with local peers. Within the online smartwatch community, peer status is determined by likes and visibility, leading to diverse pursuit activities (i.e., chu guanxi, jiazu, kuolie) and negative social dynamics. We discuss the core affordances of smartwatches and Chinese cultural factors that influence adolescent social behavior and offer implications for designing future wearables that responsibly and safely support adolescent socialization.

青少年的同伴关系对他们的成长至关重要，但现在越来越多地以数字技术为媒介。随着这一趋势的发展，可穿戴设备，尤其是为青少年量身定制的智能手表，正在重塑他们的社交方式。在中国，像 XTC 这样的智能手表已经广受欢迎，并推出了 "碰一碰 "和专属社交平台等独特功能。然而，这些设备如何影响青少年的同伴体验仍是个未知数。为此，我们采访了 18 名中国青少年（年龄在 11-16 岁之间），发现了一个以智能手表为媒介的社交生态系统。我们的研究结果强调了智能手表在建立友谊过程中的破冰作用，以及它们在与本地同龄人进行秘密信息交流时的作用。在在线智能手表社区中，同伴地位由点赞数和可见度决定，这导致了多种多样的追求活动（即 "chu guanxi"、"jiazu"、"kuolie"）和消极的社会动态。我们讨论了智能手表的核心功能和影响青少年社交行为的中国文化因素，并为设计未来的可穿戴设备，以负责任和安全的方式支持青少年社交提供了启示。

引用次数: 0

Uplift Modeling for Target User Attacks on Recommender Systems 针对目标用户攻击推荐系统的上行建模

ArXiv

Pub Date : 2024-03-05 DOI: 10.1145/3589334.3645403

Wenjie Wang, Changsheng Wang, Fuli Feng, Wentao Shi, Daizong Ding, Tat-seng Chua

Recommender systems are vulnerable to injective attacks, which inject limited fake users into the platforms to manipulate the exposure of target items to all users. In this work, we identify that conventional injective attackers overlook the fact that each item has its unique potential audience, and meanwhile, the attack difficulty across different users varies. Blindly attacking all users will result in a waste of fake user budgets and inferior attack performance. To address these issues, we focus on an under-explored attack task called target user attacks, aiming at promoting target items to a particular user group. In addition, we formulate the varying attack difficulty as heterogeneous treatment effects through a causal lens and propose an Uplift-guided Budget Allocation (UBA) framework. UBA estimates the treatment effect on each target user and optimizes the allocation of fake user budgets to maximize the attack performance. Theoretical and empirical analysis demonstrates the rationality of treatment effect estimation methods of UBA. By instantiating UBA on multiple attackers, we conduct extensive experiments on three datasets under various settings with different target items, target users, fake user budgets, victim models, and defense models, validating the effectiveness and robustness of UBA.

推荐系统很容易受到注入式攻击，这种攻击将有限的虚假用户注入平台，以操纵目标项目对所有用户的曝光率。在这项工作中，我们发现传统的注入式攻击者忽视了一个事实，即每个项目都有其独特的潜在受众，同时不同用户的攻击难度也各不相同。盲目攻击所有用户会造成虚假用户预算的浪费和攻击性能的下降。为了解决这些问题，我们重点研究了一种尚未被充分开发的攻击任务，即目标用户攻击，旨在向特定用户群推广目标项目。此外，我们还通过因果视角将不同的攻击难度表述为异质处理效果，并提出了上行引导预算分配（UBA）框架。UBA 估算了对每个目标用户的处理效果，并优化了假用户预算的分配，使攻击性能最大化。理论和实证分析证明了 UBA 治疗效果估算方法的合理性。通过在多个攻击者身上实例化 UBA，我们在不同目标项目、目标用户、虚假用户预算、受害者模型和防御模型的各种设置下对三个数据集进行了广泛的实验，验证了 UBA 的有效性和鲁棒性。

{"title":"Uplift Modeling for Target User Attacks on Recommender Systems","authors":"Wenjie Wang, Changsheng Wang, Fuli Feng, Wentao Shi, Daizong Ding, Tat-seng Chua","doi":"10.1145/3589334.3645403","DOIUrl":"https://doi.org/10.1145/3589334.3645403","url":null,"abstract":"Recommender systems are vulnerable to injective attacks, which inject limited fake users into the platforms to manipulate the exposure of target items to all users. In this work, we identify that conventional injective attackers overlook the fact that each item has its unique potential audience, and meanwhile, the attack difficulty across different users varies. Blindly attacking all users will result in a waste of fake user budgets and inferior attack performance. To address these issues, we focus on an under-explored attack task called target user attacks, aiming at promoting target items to a particular user group. In addition, we formulate the varying attack difficulty as heterogeneous treatment effects through a causal lens and propose an Uplift-guided Budget Allocation (UBA) framework. UBA estimates the treatment effect on each target user and optimizes the allocation of fake user budgets to maximize the attack performance. Theoretical and empirical analysis demonstrates the rationality of treatment effect estimation methods of UBA. By instantiating UBA on multiple attackers, we conduct extensive experiments on three datasets under various settings with different target items, target users, fake user budgets, victim models, and defense models, validating the effectiveness and robustness of UBA.","PeriodicalId":513202,"journal":{"name":"ArXiv","volume":"362 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Single-Channel Robot Ego-Speech Filtering during Human-Robot Interaction 人机交互过程中的单通道机器人自我语音过滤

ArXiv

Pub Date : 2024-03-05 DOI: 10.1145/3648536.3648539

Yue Li, Koen V. Hindriks, Florian Kunneman

In this paper, we study how well human speech can automatically be filtered when this overlaps with the voice and fan noise of a social robot, Pepper. We ultimately aim for an HRI scenario where the microphone can remain open when the robot is speaking, enabling a more natural turn-taking scheme where the human can interrupt the robot. To respond appropriately, the robot would need to understand what the interlocutor said in the overlapping part of the speech, which can be accomplished by target speech extraction (TSE). To investigate how well TSE can be accomplished in the context of the popular social robot Pepper, we set out to manufacture a datase composed of a mixture of recorded speech of Pepper itself, its fan noise (which is close to the microphones), and human speech as recorded by the Pepper microphone, in a room with low reverberation and high reverberation. Comparing a signal processing approach, with and without post-filtering, and a convolutional recurrent neural network (CRNN) approach to a state-of-the-art speaker identification-based TSE model, we found that the signal processing approach without post-filtering yielded the best performance in terms of Word Error Rate on the overlapping speech signals with low reverberation, while the CRNN approach is more robust for reverberation. These results show that estimating the human voice in overlapping speech with a robot is possible in real-life application, provided that the room reverberation is low and the human speech has a high volume or high pitch.

在本文中，我们将研究当人类语音与社交机器人 Pepper 的声音和风扇噪音重叠时，人类语音的自动过滤效果如何。我们的最终目标是，在人机交互场景中，当机器人说话时，麦克风可以保持打开状态，从而实现更自然的轮流方案，让人类可以打断机器人。为了做出适当的回应，机器人需要理解对话者在重叠部分所说的话，这可以通过目标语音提取（TSE）来实现。为了研究目标语音提取在流行的社交机器人 Pepper 中的应用效果，我们在一个混响和高混响的房间里制作了一个数据集，该数据集由 Pepper 本身的语音录音、其风扇噪声（靠近麦克风）和 Pepper 麦克风录制的人类语音混合组成。我们比较了一种信号处理方法（包括后置滤波和不带滤波）和一种卷积递归神经网络（CRNN）方法，以及一种最先进的基于说话人识别的 TSE 模型，发现不带后置滤波的信号处理方法在低混响的重叠语音信号中的字错误率方面表现最佳，而 CRNN 方法对混响具有更强的鲁棒性。这些结果表明，在实际应用中，只要室内混响较小，且人类语音音量较大或音调较高，就有可能在与机器人的重叠语音中估计出人类的声音。

{"title":"Single-Channel Robot Ego-Speech Filtering during Human-Robot Interaction","authors":"Yue Li, Koen V. Hindriks, Florian Kunneman","doi":"10.1145/3648536.3648539","DOIUrl":"https://doi.org/10.1145/3648536.3648539","url":null,"abstract":"In this paper, we study how well human speech can automatically be filtered when this overlaps with the voice and fan noise of a social robot, Pepper. We ultimately aim for an HRI scenario where the microphone can remain open when the robot is speaking, enabling a more natural turn-taking scheme where the human can interrupt the robot. To respond appropriately, the robot would need to understand what the interlocutor said in the overlapping part of the speech, which can be accomplished by target speech extraction (TSE). To investigate how well TSE can be accomplished in the context of the popular social robot Pepper, we set out to manufacture a datase composed of a mixture of recorded speech of Pepper itself, its fan noise (which is close to the microphones), and human speech as recorded by the Pepper microphone, in a room with low reverberation and high reverberation. Comparing a signal processing approach, with and without post-filtering, and a convolutional recurrent neural network (CRNN) approach to a state-of-the-art speaker identification-based TSE model, we found that the signal processing approach without post-filtering yielded the best performance in terms of Word Error Rate on the overlapping speech signals with low reverberation, while the CRNN approach is more robust for reverberation. These results show that estimating the human voice in overlapping speech with a robot is possible in real-life application, provided that the room reverberation is low and the human speech has a high volume or high pitch.","PeriodicalId":513202,"journal":{"name":"ArXiv","volume":"358 15","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

ArXiv

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀