Pre-trained vision-language models like CLIP show great potential in learning representations that capture latent characteristics of users. A recently proposed method, Context Optimization (CoOp), introduces the concept of training prompts to adapt pre-trained vision-language models. Given the lightweight nature of this method, researchers have migrated the paradigm from centralized to decentralized systems to innovate the collaborative training framework of Federated Learning (FL). However, current prompt training in FL mainly focuses on modeling user consensus and lacks adaptation to user characteristics, leaving prompt personalization largely under-explored. Research over the past few years has applied personalized FL (pFL) approaches to customizing models for heterogeneous users. Unfortunately, we find that, owing to differences in modality and training behavior, directly applying pFL methods to prompt training leads to insufficient personalization and degraded performance. To bridge the gap, we present pFedPrompt, which leverages the unique advantage of multimodality in vision-language models by learning user consensus in the linguistic space and adapting to user characteristics in the visual space in a non-parametric manner. Through this dual collaboration, the learned prompt is fully personalized and aligned to the user's local characteristics. We conduct extensive experiments across various datasets under the FL setting with statistical heterogeneity. The results demonstrate the superiority of pFedPrompt over the alternative approaches, with robust performance.
{"title":"pFedPrompt: Learning Personalized Prompt for Vision-Language Models in Federated Learning","authors":"Tao Guo, Song Guo, Junxiao Wang","doi":"10.1145/3543507.3583518","DOIUrl":"https://doi.org/10.1145/3543507.3583518","url":null,"abstract":"Pre-trained vision-language models like CLIP show great potential in learning representations that capture latent characteristics of users. A recently proposed method called Contextual Optimization (CoOp) introduces the concept of training prompt for adapting pre-trained vision-language models. Given the lightweight nature of this method, researchers have migrated the paradigm from centralized to decentralized system to innovate the collaborative training framework of Federated Learning (FL). However, current prompt training in FL mainly focuses on modeling user consensus and lacks the adaptation to user characteristics, leaving the personalization of prompt largely under-explored. Researches over the past few years have applied personalized FL (pFL) approaches to customizing models for heterogeneous users. Unfortunately, we find that with the variation of modality and training behavior, directly applying the pFL methods to prompt training leads to insufficient personalization and performance. To bridge the gap, we present pFedPrompt, which leverages the unique advantage of multimodality in vision-language models by learning user consensus from linguistic space and adapting to user characteristics in visual space in a non-parametric manner. Through this dual collaboration, the learned prompt will be fully personalized and aligned to the user’s local characteristics. We conduct extensive experiments across various datasets under the FL setting with statistical heterogeneity. The results demonstrate the superiority of our pFedPrompt against the alternative approaches with robust performance.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115389310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Intensive monitoring and anomaly diagnosis have become a knotty problem for modern microservice architectures due to the dynamics of service dependency. While most previous studies rely heavily on ample monitoring metrics, we raise a fundamental but long-neglected issue: the diagnostic metric integrity problem. This paper solves the problem by proposing MicroCU, a novel approach to diagnosing microservice systems using very sparse API logs. We design a structure named dynamic causal curves to portray time-varying service dependencies, together with a temporal dynamics discovery algorithm based on Granger causal intervals. Our algorithm generates a smoother space of causal curves and introduces the concept of causal unimodalization to calibrate the causality infidelities brought by missing metrics. Finally, a path search algorithm on dynamic causality graphs is proposed to pinpoint the root cause. Experiments on commercial system cases show that MicroCU outperforms many state-of-the-art approaches and demonstrate the superiority of causal unimodalization over raw metric imputation.
{"title":"Look Deep into the Microservice System Anomaly through Very Sparse Logs","authors":"Xinrui Jiang, Yicheng Pan, Meng Ma, Ping Wang","doi":"10.1145/3543507.3583338","DOIUrl":"https://doi.org/10.1145/3543507.3583338","url":null,"abstract":"Intensive monitoring and anomaly diagnosis have become a knotty problem for modern microservice architecture due to the dynamics of service dependency. While most previous studies rely heavily on ample monitoring metrics, we raise a fundamental but always neglected issue - the diagnostic metric integrity problem. This paper solves the problem by proposing MicroCU – a novel approach to diagnose microservice systems using very sparse API logs. We design a structure named dynamic causal curves to portray time-varying service dependencies and a temporal dynamics discovery algorithm based on Granger causal intervals. Our algorithm generates a smoother space of causal curves and designs the concept of causal unimodalization to calibrate the causality infidelities brought by missing metrics. Finally, a path search algorithm on dynamic causality graphs is proposed to pinpoint the root cause. Experiments on commercial system cases show that MicroCU outperforms many state-of-the-art approaches and reflects the superiorities of causal unimodalization to raw metric imputation.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115468450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Deep Learning (DL) has been widely adopted in almost all domains, from threat recognition to medical diagnosis. Despite its superior model accuracy, DL imposes a heavy burden on devices because executing DL models incurs overwhelming system overhead, especially on Internet-of-Things (IoT) and edge devices. Collaborative inference is a promising approach to deploying DL models, in which the data owner (the victim) runs the first layers of the model on her local device and a cloud provider (the adversary) runs the remaining layers. Compared to offloading the entire model to the cloud, the collaborative inference approach better preserves data privacy, as the owner's model input is not exposed to outsiders. However, we show in this paper that the adversary can restore the victim's model input by exploiting the output of the victim's local model. Our attack is dubbed Ginver: Generative model inversion attacks against collaborative inference. Once trained, Ginver can infer the victim's unseen model inputs without retraining the inversion attack model and thus has a generative capability. We extensively evaluate Ginver under different settings (e.g., white-box and black-box access to the victim's local model) and applications (e.g., the CIFAR10 and FaceScrub datasets). The experimental results show that Ginver recovers high-quality images from the victims.
{"title":"Ginver: Generative Model Inversion Attacks Against Collaborative Inference","authors":"Yupeng Yin, Xianglong Zhang, Huanle Zhang, Feng Li, Yue Yu, Xiuzhen Cheng, Pengfei Hu","doi":"10.1145/3543507.3583306","DOIUrl":"https://doi.org/10.1145/3543507.3583306","url":null,"abstract":"Deep Learning (DL) has been widely adopted in almost all domains, from threat recognition to medical diagnosis. Albeit its supreme model accuracy, DL imposes a heavy burden on devices as it incurs overwhelming system overhead to execute DL models, especially on Internet-of-Things (IoT) and edge devices. Collaborative inference is a promising approach to supporting DL models, by which the data owner (the victim) runs the first layers of the model on her local device and then a cloud provider (the adversary) runs the remaining layers of the model. Compared to offloading the entire model to the cloud, the collaborative inference approach is more data privacy-preserving as the owner’s model input is not exposed to outsiders. However, we show in this paper that the adversary can restore the victim’s model input by exploiting the output of the victim’s local model. Our attack is dubbed Ginver 1: Generative model inversion attacks against collaborative inference. Once trained, Ginver can infer the victim’s unseen model inputs without remaking the inversion attack model and thus has the generative capability. We extensively evaluate Ginver under different settings (e.g., white-box and black-box of the victim’s local model) and applications (e.g., CIFAR10 and FaceScrub datasets). The experimental results show that Ginver recovers high-quality images from the victims.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124425549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This work initiates the study of property testing in signed graphs, where every edge has either a positive or a negative sign. We show that there exist sublinear query and time algorithms for testing three key properties of signed graphs: balance (or 2-clusterability), clusterability and signed triangle freeness. We consider both the dense graph model, where one queries the adjacency matrix entries of a signed graph, and the bounded-degree model, where one queries for the neighbors of a node and the sign of the connecting edge. Our algorithms use a variety of tools from unsigned graph property testing, as well as reductions from one setting to the other. Our main technical contribution is a sublinear algorithm for testing clusterability in the bounded-degree model. This contrasts with the property of k-clusterability in unsigned graphs, which is not testable with a sublinear number of queries in the bounded-degree model. We experimentally evaluate the complexity and usefulness of several of our testers on real-life and synthetic datasets.
{"title":"Testing Cluster Properties of Signed Graphs","authors":"Florian Adriaens, Simon Apers","doi":"10.1145/3543507.3583213","DOIUrl":"https://doi.org/10.1145/3543507.3583213","url":null,"abstract":"This work initiates the study of property testing in signed graphs, where every edge has either a positive or a negative sign. We show that there exist sublinear query and time algorithms for testing three key properties of signed graphs: balance (or 2-clusterability), clusterability and signed triangle freeness. We consider both the dense graph model, where one queries the adjacency matrix entries of a signed graph, and the bounded-degree model, where one queries for the neighbors of a node and the sign of the connecting edge. Our algorithms use a variety of tools from unsigned graph property testing, as well as reductions from one setting to the other. Our main technical contribution is a sublinear algorithm for testing clusterability in the bounded-degree model. This contrasts with the property of k-clusterability in unsigned graphs, which is not testable with a sublinear number of queries in the bounded-degree model. We experimentally evaluate the complexity and usefulness of several of our testers on real-life and synthetic datasets.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"138 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127381698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data augmentation has been widely studied as a way to improve the generalizability of graph representation learning models. However, existing works focus only on data augmentation for homogeneous graphs; data augmentation for heterogeneous graphs remains under-explored. Since heterogeneous graphs contain different types of nodes and links, ignoring the type information and directly applying data augmentation methods designed for homogeneous graphs leads to suboptimal results. In this paper, we propose a novel Multi-Aspect Heterogeneous Graph Augmentation framework named MAHGA. Specifically, MAHGA consists of two core augmentation strategies: structure-level augmentation and metapath-level augmentation. Structure-level augmentation attends to the network-schema aspect and designs a relation-aware conditional variational auto-encoder that generates synthetic neighbor features to augment the nodes and node types with scarce links. Metapath-level augmentation focuses on the metapath aspect: it constructs metapath reachable graphs for different metapaths and estimates their graphons. By sampling and mixing up based on the graphons, MAHGA yields intra-metapath and inter-metapath augmentation. Finally, we conduct extensive experiments on multiple benchmarks to validate the effectiveness of MAHGA. Experimental results demonstrate that our method improves performance across a range of heterogeneous graph learning models and datasets.
{"title":"Multi-Aspect Heterogeneous Graph Augmentation","authors":"Yuchen Zhou, Yanan Cao, Yongchao Liu, Yanmin Shang, P. Zhang, Zheng Lin, Yun Yue, Baokun Wang, Xingbo Fu, Weiqiang Wang","doi":"10.1145/3543507.3583208","DOIUrl":"https://doi.org/10.1145/3543507.3583208","url":null,"abstract":"Data augmentation has been widely studied as it can be used to improve the generalizability of graph representation learning models. However, existing works focus only on the data augmentation on homogeneous graphs. Data augmentation for heterogeneous graphs remains under-explored. Considering that heterogeneous graphs contain different types of nodes and links, ignoring the type information and directly applying the data augmentation methods of homogeneous graphs to heterogeneous graphs will lead to suboptimal results. In this paper, we propose a novel Multi-Aspect Heterogeneous Graph Augmentation framework named MAHGA. Specifically, MAHGA consists of two core augmentation strategies: structure-level augmentation and metapath-level augmentation. Structure-level augmentation pays attention to network schema aspect and designs a relation-aware conditional variational auto-encoder that can generate synthetic features of neighbors to augment the nodes and the node types with scarce links. Metapath-level augmentation concentrates on metapath aspect, which constructs metapath reachable graphs for different metapaths and estimates the graphons of them. By sampling and mixing up based on the graphons, MAHGA yields intra-metapath and inter-metapath augmentation. Finally, we conduct extensive experiments on multiple benchmarks to validate the effectiveness of MAHGA. Experimental results demonstrate that our method improves the performances across a set of heterogeneous graph learning models and datasets.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125038409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Web defacement is one of the major promotional channels for online underground economies. It regularly compromises benign websites and injects fraudulent content to promote illicit goods and services, inflicting significant harm on websites' reputations and revenues and potentially leading to legal ramifications. In this paper, we uncover proactive web defacements, in which the involved web pages (i.e., landing pages) proactively deface themselves within browsers using JavaScript (i.e., control scripts). Proactive web defacements have not yet received attention from research communities, anti-hacking organizations, or law-enforcement officials. To detect them, we designed a practical tool, PACTOR. It runs in the browser and intercepts JavaScript API calls that manipulate web page content; it takes snapshots of the rendered HTML source code immediately before and after the intercepted API calls and detects proactive web defacements by visually comparing every two consecutive snapshots. Our two-month empirical study of 2,454 incidents of proactive web defacements, using PACTOR, shows that they can evade existing URL safety-checking tools and effectively promote the ranking of their landing pages using legitimate content and keywords. We also investigated the vendor network behind proactive web defacements and reported all the involved domains to law-enforcement officials and URL safety-checking tools.
{"title":"The Chameleon on the Web: an Empirical Study of the Insidious Proactive Web Defacements","authors":"Rui Zhao","doi":"10.1145/3543507.3583377","DOIUrl":"https://doi.org/10.1145/3543507.3583377","url":null,"abstract":"Web defacement is one of the major promotional channels for online underground economies. It regularly compromises benign websites and injects fraudulent content to promote illicit goods and services. It inflicts significant harm to websites’ reputations and revenues and may lead to legal ramifications. In this paper, we uncover proactive web defacements, where the involved web pages (i.e., landing pages) proactively deface themselves within browsers using JavaScript (i.e., control scripts). Proactive web defacements have not yet received attention from research communities, anti-hacking organizations, or law-enforcement officials. To detect proactive web defacements, we designed a practical tool, PACTOR. It runs in the browser and intercepts JavaScript API calls that manipulate web page content. It takes snapshots of the rendered HTML source code immediately before and after the intercepted API calls and detects proactive web defacements by visually comparing every two consecutive snapshots. Our two-month empirical study, using PACTOR, on 2,454 incidents of proactive web defacements shows that they can evade existing URL safety-checking tools and effectively promote the ranking of their landing pages using legitimate content/keywords. We also investigated the vendor network of proactive web defacements and reported all the involved domains to law-enforcement officials and URL-safety checking tools.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115325521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Measuring and assessing the impact and “fairness” of recommendation algorithms is central to responsible recommendation efforts. However, the complexity of fairness definitions and the proliferation of fairness metrics in research literature have led to a complex decision-making space. This environment makes it challenging for practitioners to operationalize and pick metrics that work within their unique context. This suggests that practitioners require more decision-making support, but it is not clear what type of support would be beneficial. We conducted a literature review of 24 papers to gather metrics introduced by the research community for measuring fairness in recommendation and ranking systems. We organized these metrics into a ‘decision-tree style’ support framework designed to help practitioners scope fairness objectives and identify fairness metrics relevant to their recommendation domain and application context. To explore the feasibility of this approach, we conducted 15 semi-structured interviews using this framework to assess which challenges practitioners may face when scoping fairness objectives and metrics for their system, and which further support may be needed beyond such tools.
{"title":"Scoping Fairness Objectives and Identifying Fairness Metrics for Recommender Systems: The Practitioners’ Perspective","authors":"Jessie J. Smith, Lex Beattie, H. Cramer","doi":"10.1145/3543507.3583204","DOIUrl":"https://doi.org/10.1145/3543507.3583204","url":null,"abstract":"Measuring and assessing the impact and “fairness’’ of recommendation algorithms is central to responsible recommendation efforts. However, the complexity of fairness definitions and the proliferation of fairness metrics in research literature have led to a complex decision-making space. This environment makes it challenging for practitioners to operationalize and pick metrics that work within their unique context. This suggests that practitioners require more decision-making support, but it is not clear what type of support would be beneficial. We conducted a literature review of 24 papers to gather metrics introduced by the research community for measuring fairness in recommendation and ranking systems. We organized these metrics into a ‘decision-tree style’ support framework designed to help practitioners scope fairness objectives and identify fairness metrics relevant to their recommendation domain and application context. To explore the feasibility of this approach, we conducted 15 semi-structured interviews using this framework to assess which challenges practitioners may face when scoping fairness objectives and metrics for their system, and which further support may be needed beyond such tools.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122444318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Recognizing emotions from multi-modal data requires strong multi-modal representation ability. The general approach to this task is to train the representation model naturally on the training data, without intervention. However, such a natural training scheme is prone to modality bias in the representation (i.e., tending to over-encode some informative modalities while neglecting others) and data bias in training (i.e., tending to overfit the training data). These biases may lead to instability (e.g., performing poorly when a neglected modality is dominant for recognition) and weak generalization (e.g., performing poorly when unseen data is inconsistent with the overfitted data). To address these problems, this paper presents two adversarial training strategies that learn more robust multi-modal representations for multi-label emotion recognition. First, we propose an adversarial temporal masking strategy, which enhances the encoding of other modalities by masking the most emotion-related temporal units (e.g., words for text or frames for video) of the informative modality. Second, we propose an adversarial parameter perturbation strategy, which enhances generalization by adding an adversarial perturbation to the model parameters. Both strategies boost model performance on the benchmark MMER datasets CMU-MOSEI and NEMu. Experimental results demonstrate the effectiveness of the proposed method compared with the previous state-of-the-art method. Code will be released at https://github.com/ShipingGe/MMER.
{"title":"Learning Robust Multi-Modal Representation for Multi-Label Emotion Recognition via Adversarial Masking and Perturbation","authors":"Shiping Ge, Zhiwei Jiang, Zifeng Cheng, Cong Wang, Yafeng Yin, Qing Gu","doi":"10.1145/3543507.3583258","DOIUrl":"https://doi.org/10.1145/3543507.3583258","url":null,"abstract":"Recognizing emotions from multi-modal data is an emotion recognition task that requires strong multi-modal representation ability. The general approach to this task is to naturally train the representation model on training data without intervention. However, such natural training scheme is prone to modality bias of representation (i.e., tending to over-encode some informative modalities while neglecting other modalities) and data bias of training (i.e., tending to overfit training data). These biases may lead to instability (e.g., performing poorly when the neglected modality is dominant for recognition) and weak generalization (e.g., performing poorly when unseen data is inconsistent with overfitted data) of the model on unseen data. To address these problems, this paper presents two adversarial training strategies to learn more robust multi-modal representation for multi-label emotion recognition. Firstly, we propose an adversarial temporal masking strategy, which can enhance the encoding of other modalities by masking the most emotion-related temporal units (e.g., words for text or frames for video) of the informative modality. Secondly, we propose an adversarial parameter perturbation strategy, which can enhance the generalization of the model by adding the adversarial perturbation to the parameters of model. Both strategies boost model performance on the benchmark MMER datasets CMU-MOSEI and NEMu. Experimental results demonstrate the effectiveness of the proposed method compared with the previous state-of-the-art method. Code will be released at https://github.com/ShipingGe/MMER.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122030249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
With the widespread development of algorithmic fairness, there has been a surge of research interest in generalizing fairness notions from attributed data to relational data (graphs). The vast majority of existing work measures fairness in terms of low-order connectivity patterns (e.g., edges), while overlooking higher-order patterns (e.g., k-cliques) and the dynamic nature of real-world graphs. For example, preserving triangles from graph cuts during clustering is the key to detecting compact communities; however, if the clustering algorithm only pays attention to triangle-based compactness, then the returned communities lose the fairness guarantee for each group in the graph. Furthermore, in practice, when the graph topology (e.g., of a social network) constantly changes over time, a natural question is how to ensure compactness and demographic parity at each timestamp efficiently. To address these problems, we start from the static setting and propose a spectral method that preserves clique connections and simultaneously incorporates demographic fairness constraints in the returned clusters. To make this static method fit the dynamic setting, we propose two core techniques, Laplacian Update via Edge Filtering and Searching and Eigen-Pairs Update with Singularity Avoided. Finally, all proposed components are combined into an end-to-end clustering framework named F-SEGA, and we conduct extensive experiments to demonstrate its effectiveness, efficiency, and robustness.
{"title":"Fairness-Aware Clique-Preserving Spectral Clustering of Temporal Graphs","authors":"Dongqi Fu, Dawei Zhou, Ross Maciejewski, A. Croitoru, Marcus Boyd, Jingrui He","doi":"10.1145/3543507.3583423","DOIUrl":"https://doi.org/10.1145/3543507.3583423","url":null,"abstract":"With the widespread development of algorithmic fairness, there has been a surge of research interest that aims to generalize the fairness notions from the attributed data to the relational data (graphs). The vast majority of existing work considers the fairness measure in terms of the low-order connectivity patterns (e.g., edges), while overlooking the higher-order patterns (e.g., k-cliques) and the dynamic nature of real-world graphs. For example, preserving triangles from graph cuts during clustering is the key to detecting compact communities; however, if the clustering algorithm only pays attention to triangle-based compactness, then the returned communities lose the fairness guarantee for each group in the graph. Furthermore, in practice, when the graph (e.g., social networks) topology constantly changes over time, one natural question is how can we ensure the compactness and demographic parity at each timestamp efficiently. To address these problems, we start from the static setting and propose a spectral method that preserves clique connections and incorporates demographic fairness constraints in returned clusters at the same time. To make this static method fit for the dynamic setting, we propose two core techniques, Laplacian Update via Edge Filtering and Searching and Eigen-Pairs Update with Singularity Avoided. Finally, all proposed components are combined into an end-to-end clustering framework named F-SEGA, and we conduct extensive experiments to demonstrate the effectiveness, efficiency, and robustness of F-SEGA.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122158599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Extracting structured information from all manner of webpages is an important problem with the potential to automate many real-world applications. Recent work has shown the effectiveness of leveraging DOM trees and pre-trained language models to describe and encode webpages. However, they typically optimize the model to learn the semantic co-occurrence of elements and labels within the same webpage, so their effectiveness depends on sufficient labeled data, which is labor-intensive to obtain. In this paper, we further observe structural co-occurrences across different webpages of the same website: the same position in the DOM tree usually plays the same semantic role, and the DOM nodes in this position also share similar surface forms. Motivated by this, we propose a novel method, Structor, to effectively incorporate the structural co-occurrences over the DOM tree and surface form into pre-trained language models. Such structural co-occurrences help the model learn the task better under low-resource settings, and we study two challenging experimental scenarios, a website-level low-resource setting and a webpage-level low-resource setting, to evaluate our approach. Extensive experiments on the public SWDE dataset show that Structor significantly outperforms state-of-the-art models in both settings, and even achieves three times the performance of the strong baseline model under extreme scarcity of training data.
{"title":"Learning Structural Co-occurrences for Structured Web Data Extraction in Low-Resource Settings","authors":"Zhenyu Zhang, Yu Bowen, Tingwen Liu, Tianyun Liu, Yubin Wang, Li Guo","doi":"10.1145/3543507.3583387","DOIUrl":"https://doi.org/10.1145/3543507.3583387","url":null,"abstract":"Extracting structured information from all manner of webpages is an important problem with the potential to automate many real-world applications. Recent work has shown the effectiveness of leveraging DOM trees and pre-trained language models to describe and encode webpages. However, they typically optimize the model to learn the semantic co-occurrence of elements and labels in the same webpage, thus their effectiveness depends on sufficient labeled data, which is labor-intensive. In this paper, we further observe structural co-occurrences in different webpages of the same website: the same position in the DOM tree usually plays the same semantic role, and the DOM nodes in this position also share similar surface forms. Motivated by this, we propose a novel method, Structor, to effectively incorporate the structural co-occurrences over DOM tree and surface form into pre-trained language models. Such structural co-occurrences help the model learn the task better under low-resource settings, and we study two challenging experimental scenarios: website-level low-resource setting and webpage-level low-resource setting, to evaluate our approach. Extensive experiments on the public SWDE dataset show that Structor significantly outperforms the state-of-the-art models in both settings, and even achieves three times the performance of the strong baseline model in the case of extreme lack of training data.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129050180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}