首页 > 最新文献

Proceedings of the ACM Web Conference 2023最新文献

英文 中文
Optimizing Guided Traversal for Fast Learned Sparse Retrieval 快速学习稀疏检索的优化引导遍历
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583497
Yifan Qiao, Yingrui Yang, Haixin Lin, Tao Yang
Recent studies show that BM25-driven dynamic index skipping can greatly accelerate MaxScore-based document retrieval based on the learned sparse representation derived by DeepImpact. This paper investigates the effectiveness of such a traversal guidance strategy during top k retrieval when using other models such as SPLADE and uniCOIL, and finds that unconstrained BM25-driven skipping could have a visible relevance degradation when the BM25 model is not well aligned with a learned weight model or when retrieval depth k is small. This paper generalizes the previous work and optimizes the BM25 guided index traversal with a two-level pruning control scheme and model alignment for fast retrieval using a sparse representation. Although there can be a cost of increased latency, the proposed scheme is much faster than the original MaxScore method without BM25 guidance while retaining the relevance effectiveness. This paper analyzes the competitiveness of this two-level pruning scheme, and evaluates its tradeoff in ranking relevance and time efficiency when searching several test datasets.
最近的研究表明,bm25驱动的动态索引跳转可以极大地加速基于maxscore的基于DeepImpact派生的学习稀疏表示的文档检索。本文研究了在使用SPLADE和uniCOIL等其他模型检索top k时,这种遍历制导策略的有效性,并发现当BM25模型与学习权模型没有很好地对齐或检索深度k很小时,无约束BM25驱动的跳转可能会产生明显的相关性下降。本文在总结前人工作的基础上,对BM25导航索引遍历算法进行了优化,采用两级剪枝控制方案和模型对齐,实现了基于稀疏表示的快速检索。虽然可能会增加延迟的代价,但所提出的方案比没有BM25指导的原始MaxScore方法快得多,同时保留了相关性有效性。本文分析了该两级剪枝方案的竞争力,并评估了其在搜索多个测试数据集时在排序相关性和时间效率方面的权衡。
{"title":"Optimizing Guided Traversal for Fast Learned Sparse Retrieval","authors":"Yifan Qiao, Yingrui Yang, Haixin Lin, Tao Yang","doi":"10.1145/3543507.3583497","DOIUrl":"https://doi.org/10.1145/3543507.3583497","url":null,"abstract":"Recent studies show that BM25-driven dynamic index skipping can greatly accelerate MaxScore-based document retrieval based on the learned sparse representation derived by DeepImpact. This paper investigates the effectiveness of such a traversal guidance strategy during top k retrieval when using other models such as SPLADE and uniCOIL, and finds that unconstrained BM25-driven skipping could have a visible relevance degradation when the BM25 model is not well aligned with a learned weight model or when retrieval depth k is small. This paper generalizes the previous work and optimizes the BM25 guided index traversal with a two-level pruning control scheme and model alignment for fast retrieval using a sparse representation. Although there can be a cost of increased latency, the proposed scheme is much faster than the original MaxScore method without BM25 guidance while retaining the relevance effectiveness. This paper analyzes the competitiveness of this two-level pruning scheme, and evaluates its tradeoff in ranking relevance and time efficiency when searching several test datasets.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114444178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
BiSR: Bidirectionally Optimized Super-Resolution for Mobile Video Streaming BiSR:用于移动视频流的双向优化超分辨率
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583519
Q. Yu, Qing Li, Rui He, Gareth Tyson, Wanxin Shi, Jianhui Lv, Zhenhui Yuan, Peng Zhang, Yulong Lan, Zhicheng Li
The user experience of mobile web video streaming is often impacted by insufficient and dynamic network bandwidth. In this paper, we design Bidirectionally Optimized Super-Resolution (BiSR) to improve the quality of experience (QoE) for mobile web users under limited bandwidth. BiSR exploits a deep neural network (DNN)-based model to super-resolve key frames efficiently without changing the inter-frame spatial-temporal information. We then propose a downscaling DNN and a mobile-specific optimized lightweight super-resolution DNN to enhance the performance. Finally, a novel reinforcement learning-based adaptive bitrate (ABR) algorithm is proposed to verify the performance of BiSR on real network traces. Our evaluation, using a full system implementation, shows that BiSR saves 26% of bitrate compared to the traditional H.264 codec and improves the SSIM of video by 3.7% compared to the prior state-of-the-art. Overall, BiSR enhances the user-perceived quality of experience by up to 30.6%.
移动web视频流的用户体验经常受到网络带宽不足和动态的影响。本文设计了双向优化的超分辨率(BiSR),以提高有限带宽下移动web用户的体验质量(QoE)。BiSR利用基于深度神经网络(DNN)的模型在不改变帧间时空信息的情况下有效地超分辨关键帧。然后,我们提出了一个缩小的深度神经网络和一个针对移动设备优化的轻量级超分辨率深度神经网络来提高性能。最后,提出了一种新的基于强化学习的自适应比特率(ABR)算法,在实际网络轨迹上验证了BiSR算法的性能。我们的评估,使用完整的系统实现,表明BiSR比传统的H.264编解码器节省了26%的比特率,并将视频的SSIM比现有技术提高了3.7%。总体而言,BiSR将用户感知的体验质量提高了30.6%。
{"title":"BiSR: Bidirectionally Optimized Super-Resolution for Mobile Video Streaming","authors":"Q. Yu, Qing Li, Rui He, Gareth Tyson, Wanxin Shi, Jianhui Lv, Zhenhui Yuan, Peng Zhang, Yulong Lan, Zhicheng Li","doi":"10.1145/3543507.3583519","DOIUrl":"https://doi.org/10.1145/3543507.3583519","url":null,"abstract":"The user experience of mobile web video streaming is often impacted by insufficient and dynamic network bandwidth. In this paper, we design Bidirectionally Optimized Super-Resolution (BiSR) to improve the quality of experience (QoE) for mobile web users under limited bandwidth. BiSR exploits a deep neural network (DNN)-based model to super-resolve key frames efficiently without changing the inter-frame spatial-temporal information. We then propose a downscaling DNN and a mobile-specific optimized lightweight super-resolution DNN to enhance the performance. Finally, a novel reinforcement learning-based adaptive bitrate (ABR) algorithm is proposed to verify the performance of BiSR on real network traces. Our evaluation, using a full system implementation, shows that BiSR saves 26% of bitrate compared to the traditional H.264 codec and improves the SSIM of video by 3.7% compared to the prior state-of-the-art. Overall, BiSR enhances the user-perceived quality of experience by up to 30.6%.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125579125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
RSGNN: A Model-agnostic Approach for Enhancing the Robustness of Signed Graph Neural Networks RSGNN:一种增强签名图神经网络鲁棒性的模型不可知方法
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583221
Zeyu Zhang, Jiamou Liu, Xianda Zheng, Yifei Wang, Pengqian Han, Yupan Wang, Kaiqi Zhao, Zijian Zhang
Signed graphs model complex relations using both positive and negative edges. Signed graph neural networks (SGNN) are powerful tools to analyze signed graphs. We address the vulnerability of SGNN to potential edge noise in the input graph. Our goal is to strengthen existing SGNN allowing them to withstand edge noises by extracting robust representations for signed graphs. First, we analyze the expressiveness of SGNN using an extended Weisfeiler-Lehman (WL) graph isomorphism test and identify the limitations to SGNN over triangles that are unbalanced. Then, we design some structure-based regularizers to be used in conjunction with an SGNN that highlight intrinsic properties of a signed graph. The tools and insights above allow us to propose a novel framework, Robust Signed Graph Neural Network (RSGNN), which adopts a dual architecture that simultaneously denoises the graph while learning node representations. We validate the performance of our model empirically on four real-world signed graph datasets, i.e., Bitcoin_OTC, Bitcoin_Alpha, Epinion and Slashdot, RSGNN can clearly improve the robustness of popular SGNN models. When the signed graphs are affected by random noise, our method outperforms baselines by up to 9.35% Binary-F1 for link sign prediction. Our implementation is available in PyTorch1.
符号图用正边和负边对复杂关系建模。签名图神经网络(SGNN)是分析签名图的强大工具。我们解决了SGNN对输入图中潜在边缘噪声的脆弱性。我们的目标是通过提取签名图的鲁棒表示来增强现有的SGNN,使其能够承受边缘噪声。首先,我们使用扩展的Weisfeiler-Lehman (WL)图同构检验分析了SGNN的可表达性,并确定了SGNN在不平衡三角形上的局限性。然后,我们设计了一些基于结构的正则化器,用于与SGNN结合使用,以突出有符号图的内在属性。上述工具和见解使我们能够提出一个新的框架,稳健签名图神经网络(RSGNN),它采用双重架构,在学习节点表示的同时对图进行降噪。我们在Bitcoin_OTC、Bitcoin_Alpha、Epinion和Slashdot四个真实签名图数据集上验证了我们的模型的性能,结果表明RSGNN可以明显提高流行的SGNN模型的鲁棒性。当符号图受随机噪声影响时,我们的方法在链接符号预测方面优于基线高达9.35%的Binary-F1。我们的实现在PyTorch1中可用。
{"title":"RSGNN: A Model-agnostic Approach for Enhancing the Robustness of Signed Graph Neural Networks","authors":"Zeyu Zhang, Jiamou Liu, Xianda Zheng, Yifei Wang, Pengqian Han, Yupan Wang, Kaiqi Zhao, Zijian Zhang","doi":"10.1145/3543507.3583221","DOIUrl":"https://doi.org/10.1145/3543507.3583221","url":null,"abstract":"Signed graphs model complex relations using both positive and negative edges. Signed graph neural networks (SGNN) are powerful tools to analyze signed graphs. We address the vulnerability of SGNN to potential edge noise in the input graph. Our goal is to strengthen existing SGNN allowing them to withstand edge noises by extracting robust representations for signed graphs. First, we analyze the expressiveness of SGNN using an extended Weisfeiler-Lehman (WL) graph isomorphism test and identify the limitations to SGNN over triangles that are unbalanced. Then, we design some structure-based regularizers to be used in conjunction with an SGNN that highlight intrinsic properties of a signed graph. The tools and insights above allow us to propose a novel framework, Robust Signed Graph Neural Network (RSGNN), which adopts a dual architecture that simultaneously denoises the graph while learning node representations. We validate the performance of our model empirically on four real-world signed graph datasets, i.e., Bitcoin_OTC, Bitcoin_Alpha, Epinion and Slashdot, RSGNN can clearly improve the robustness of popular SGNN models. When the signed graphs are affected by random noise, our method outperforms baselines by up to 9.35% Binary-F1 for link sign prediction. Our implementation is available in PyTorch1.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126236918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Message Function Search for Knowledge Graph Embedding 知识图嵌入的消息函数搜索
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583546
Shimin Di, Lei Chen
Recently, many promising embedding models have been proposed to embed knowledge graphs (KGs) and their more general forms, such as n-ary relational data (NRD) and hyper-relational KG (HKG). To promote the data adaptability and performance of embedding models, KG searching methods propose to search for suitable models for a given KG data set. But they are restricted to a single KG form, and the searched models are restricted to a single type of embedding model. To tackle such issues, we propose to build a search space for the message function in graph neural networks (GNNs). However, it is a non-trivial task. Existing message function designs fix the structures and operators, which makes them difficult to handle different KG forms and data sets. Therefore, we first design a novel message function space, which enables both structures and operators to be searched for the given KG form (including KG, NRD, and HKG) and data. The proposed space can flexibly take different KG forms as inputs and is expressive to search for different types of embedding models. Especially, some existing message function designs and some classic KG embedding models can be instantiated as special cases of our space. We empirically show that the searched message functions are data-dependent, and can achieve leading performance on benchmark KGs, NRD, and HKGs.
近年来,人们提出了许多有前途的嵌入模型来嵌入知识图及其更一般的形式,如n元关系数据(NRD)和超关系知识图(HKG)。为了提高嵌入模型的数据适应性和性能,提出了KG搜索方法,针对给定的KG数据集搜索合适的模型。但是它们被限制为单一的KG形式,并且搜索的模型被限制为单一类型的嵌入模型。为了解决这些问题,我们提出在图神经网络(gnn)中为消息函数建立一个搜索空间。然而,这是一项不平凡的任务。现有的消息功能设计固定了结构和操作符,这使得它们难以处理不同的KG表单和数据集。因此,我们首先设计了一个新颖的消息函数空间,使结构和操作符都能够搜索给定的KG形式(包括KG、NRD和HKG)和数据。所提出的空间可以灵活地以不同的KG形式作为输入,并且具有搜索不同类型嵌入模型的表达能力。特别是,一些现有的消息函数设计和一些经典的KG嵌入模型可以作为我们空间的特例实例化。我们的经验表明,搜索到的消息函数是数据依赖的,并且可以在基准KGs、NRD和KGs上取得领先的性能。
{"title":"Message Function Search for Knowledge Graph Embedding","authors":"Shimin Di, Lei Chen","doi":"10.1145/3543507.3583546","DOIUrl":"https://doi.org/10.1145/3543507.3583546","url":null,"abstract":"Recently, many promising embedding models have been proposed to embed knowledge graphs (KGs) and their more general forms, such as n-ary relational data (NRD) and hyper-relational KG (HKG). To promote the data adaptability and performance of embedding models, KG searching methods propose to search for suitable models for a given KG data set. But they are restricted to a single KG form, and the searched models are restricted to a single type of embedding model. To tackle such issues, we propose to build a search space for the message function in graph neural networks (GNNs). However, it is a non-trivial task. Existing message function designs fix the structures and operators, which makes them difficult to handle different KG forms and data sets. Therefore, we first design a novel message function space, which enables both structures and operators to be searched for the given KG form (including KG, NRD, and HKG) and data. The proposed space can flexibly take different KG forms as inputs and is expressive to search for different types of embedding models. Especially, some existing message function designs and some classic KG embedding models can be instantiated as special cases of our space. We empirically show that the searched message functions are data-dependent, and can achieve leading performance on benchmark KGs, NRD, and HKGs.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126827698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unsupervised Anomaly Detection on Microservice Traces through Graph VAE 基于图VAE的微服务轨迹无监督异常检测
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583215
Zhe Xie, Haowen Xu, Wenxiao Chen, Wanxue Li, Huai Jiang, Lang Su, Hanzhang Wang, Dan Pei
The microservice architecture is widely employed in large Internet systems. For each user request, a few of the microservices are called, and a trace is formed to record the tree-like call dependencies among microservices and the time consumption at each call node. Traces are useful in diagnosing system failures, but their complex structures make it difficult to model their patterns and detect their anomalies. In this paper, we propose a novel dual-variable graph variational autoencoder (VAE) for unsupervised anomaly detection on microservice traces. To reconstruct the time consumption of nodes, we propose a novel dispatching layer. We find that the inversion of negative log-likelihood (NLL) appears for some anomalous samples, which makes the anomaly score infeasible for anomaly detection. To address this, we point out that the NLL can be decomposed into KL-divergence and data entropy, whereas lower-dimensional anomalies can introduce an entropy gap with normal inputs. We propose three techniques to mitigate this entropy gap for trace anomaly detection: Bernoulli & Categorical Scaling, Node Count Normalization, and Gaussian Std-Limit. On five trace datasets from a top Internet company, our proposed TraceVAE achieves excellent F-scores.
微服务架构广泛应用于大型互联网系统。对于每个用户请求,调用几个微服务,并形成跟踪以记录微服务之间的树状调用依赖关系和每个调用节点的时间消耗。迹线在诊断系统故障时很有用,但其复杂的结构使其模式建模和异常检测变得困难。本文提出了一种新的双变量图变分自编码器(VAE),用于微服务轨迹的无监督异常检测。为了重构节点的时间消耗,我们提出了一种新的调度层。我们发现一些异常样本出现负对数似然(NLL)反转,使得异常评分无法用于异常检测。为了解决这个问题,我们指出NLL可以分解为kl -散度和数据熵,而低维异常可以引入与正常输入的熵隙。我们提出了三种技术来缓解跟踪异常检测的熵差:伯努利和分类缩放,节点计数归一化和高斯标准限制。在来自一家顶级互联网公司的5个跟踪数据集上,我们提出的TraceVAE获得了优异的f分。
{"title":"Unsupervised Anomaly Detection on Microservice Traces through Graph VAE","authors":"Zhe Xie, Haowen Xu, Wenxiao Chen, Wanxue Li, Huai Jiang, Lang Su, Hanzhang Wang, Dan Pei","doi":"10.1145/3543507.3583215","DOIUrl":"https://doi.org/10.1145/3543507.3583215","url":null,"abstract":"The microservice architecture is widely employed in large Internet systems. For each user request, a few of the microservices are called, and a trace is formed to record the tree-like call dependencies among microservices and the time consumption at each call node. Traces are useful in diagnosing system failures, but their complex structures make it difficult to model their patterns and detect their anomalies. In this paper, we propose a novel dual-variable graph variational autoencoder (VAE) for unsupervised anomaly detection on microservice traces. To reconstruct the time consumption of nodes, we propose a novel dispatching layer. We find that the inversion of negative log-likelihood (NLL) appears for some anomalous samples, which makes the anomaly score infeasible for anomaly detection. To address this, we point out that the NLL can be decomposed into KL-divergence and data entropy, whereas lower-dimensional anomalies can introduce an entropy gap with normal inputs. We propose three techniques to mitigate this entropy gap for trace anomaly detection: Bernoulli & Categorical Scaling, Node Count Normalization, and Gaussian Std-Limit. On five trace datasets from a top Internet company, our proposed TraceVAE achieves excellent F-scores.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122768153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Improving Health Mention Classification Through Emphasising Literal Meanings: A Study Towards Diversity and Generalisation for Public Health Surveillance 通过强调字面意义来改进健康提及分类:公共卫生监测的多样性和概括性研究
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583877
O. T. Aduragba, Jialin Yu, A. Cristea, Yang Long
People often use disease or symptom terms on social media and online forums in ways other than to describe their health. Thus the NLP health mention classification (HMC) task aims to identify posts where users are discussing health conditions literally, not figuratively. Existing computational research typically only studies health mentions within well-represented groups in developed nations. Developing countries with limited health surveillance abilities fail to benefit from such data to manage public health crises. To advance the HMC research and benefit more diverse populations, we present the Nairaland health mention dataset (NHMD), a new dataset collected from a dedicated web forum for Nigerians. NHMD consists of 7,763 manually labelled posts extracted based on four prevalent diseases (HIV/AIDS, Malaria, Stroke and Tuberculosis) in Nigeria. With NHMD, we conduct extensive experiments using current state-of-the-art models for HMC and identify that, compared to existing public datasets, NHMD contains out-of-distribution examples. Hence, it is well suited for domain adaptation studies. The introduction of the NHMD dataset imposes better diversity coverage of vulnerable populations and generalisation for HMC tasks in a global public health surveillance setting. Additionally, we present a novel multi-task learning approach for HMC tasks by combining literal word meaning prediction as an auxiliary task. Experimental results demonstrate that the proposed approach outperforms state-of-the-art methods statistically significantly (p < 0.01, Wilcoxon test) in terms of F1 score over the state-of-the-art and shows that our new dataset poses a strong challenge to the existing HMC methods.
人们经常在社交媒体和在线论坛上使用疾病或症状术语,而不是用来描述他们的健康状况。因此,NLP健康提及分类(HMC)任务的目的是识别用户讨论健康状况的帖子,而不是象征性的。现有的计算研究通常只研究发达国家中有代表性的群体中提及的健康问题。卫生监测能力有限的发展中国家无法从这些数据中受益,从而管理公共卫生危机。为了推进HMC研究并使更多不同的人群受益,我们提出了尼日利亚健康提及数据集(NHMD),这是一个从尼日利亚人专用网络论坛收集的新数据集。国家卫生保健计划包括根据尼日利亚四种流行疾病(艾滋病毒/艾滋病、疟疾、中风和结核病)提取的7,763个人工标记帖子。对于NHMD,我们使用当前最先进的HMC模型进行了广泛的实验,并确定与现有的公共数据集相比,NHMD包含分布外示例。因此,它非常适合领域适应研究。NHMD数据集的引入提高了脆弱人群的多样性覆盖率,并在全球公共卫生监测环境中推广了HMC任务。此外,我们提出了一种新的HMC任务多任务学习方法,将字面词义预测作为辅助任务。实验结果表明,就F1得分而言,该方法在统计上显著优于最先进的方法(p < 0.01, Wilcoxon检验),这表明我们的新数据集对现有的HMC方法提出了强有力的挑战。
{"title":"Improving Health Mention Classification Through Emphasising Literal Meanings: A Study Towards Diversity and Generalisation for Public Health Surveillance","authors":"O. T. Aduragba, Jialin Yu, A. Cristea, Yang Long","doi":"10.1145/3543507.3583877","DOIUrl":"https://doi.org/10.1145/3543507.3583877","url":null,"abstract":"People often use disease or symptom terms on social media and online forums in ways other than to describe their health. Thus the NLP health mention classification (HMC) task aims to identify posts where users are discussing health conditions literally, not figuratively. Existing computational research typically only studies health mentions within well-represented groups in developed nations. Developing countries with limited health surveillance abilities fail to benefit from such data to manage public health crises. To advance the HMC research and benefit more diverse populations, we present the Nairaland health mention dataset (NHMD), a new dataset collected from a dedicated web forum for Nigerians. NHMD consists of 7,763 manually labelled posts extracted based on four prevalent diseases (HIV/AIDS, Malaria, Stroke and Tuberculosis) in Nigeria. With NHMD, we conduct extensive experiments using current state-of-the-art models for HMC and identify that, compared to existing public datasets, NHMD contains out-of-distribution examples. Hence, it is well suited for domain adaptation studies. The introduction of the NHMD dataset imposes better diversity coverage of vulnerable populations and generalisation for HMC tasks in a global public health surveillance setting. Additionally, we present a novel multi-task learning approach for HMC tasks by combining literal word meaning prediction as an auxiliary task. Experimental results demonstrate that the proposed approach outperforms state-of-the-art methods statistically significantly (p < 0.01, Wilcoxon test) in terms of F1 score over the state-of-the-art and shows that our new dataset poses a strong challenge to the existing HMC methods.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122768913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FedEdge: Accelerating Edge-Assisted Federated Learning 加速边缘辅助联邦学习
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583264
Kaibin Wang, Qiang He, Feifei Chen, Hai Jin, Yun Yang
Federated learning (FL) has been widely acknowledged as a promising solution to training machine learning (ML) model training with privacy preservation. To reduce the traffic overheads incurred by FL systems, edge servers have been included between clients and the parameter server to aggregate clients’ local models. Recent studies on this edge-assisted hierarchical FL scheme have focused on ensuring or accelerating model convergence by coping with various factors, e.g., uncertain network conditions, unreliable clients, heterogeneous compute resources, etc. This paper presents our three new discoveries of the edge-assisted hierarchical FL scheme: 1) it wastes significant time during its two-phase training rounds; 2) it does not recognize or utilize model diversity when producing a global model; and 3) it is vulnerable to model poisoning attacks. To overcome these drawbacks, we propose FedEdge, a novel edge-assisted hierarchical FL scheme that accelerates model training with asynchronous local federated training and adaptive model aggregation. Extensive experiments are conducted on two widely-used public datasets. The results demonstrate that, compared with state-of-the-art FL schemes, FedEdge accelerates model convergence by 1.14 × −3.20 ×, and improves model accuracy by 2.14% - 6.63%.
联邦学习(FL)已被广泛认为是具有隐私保护的机器学习(ML)模型训练的一种有前途的解决方案。为了减少FL系统带来的流量开销,在客户端和参数服务器之间包含了边缘服务器,以聚合客户端的本地模型。近年来对这种边缘辅助分层FL方案的研究主要集中在通过应对各种因素,如不确定的网络条件、不可靠的客户端、异构计算资源等,来保证或加速模型收敛。本文介绍了我们对边缘辅助分层FL方案的三个新发现:1)它在两阶段训练回合中浪费了大量时间;2)在生成全球模式时未识别或利用模式多样性;3)容易受到模型中毒攻击。为了克服这些缺点,我们提出了FedEdge,一种新的边缘辅助分层FL方案,通过异步本地联邦训练和自适应模型聚合加速模型训练。在两个广泛使用的公共数据集上进行了大量的实验。结果表明,与目前最先进的FL方案相比,FedEdge将模型收敛速度提高了1.14 × ~ 3.20 ×,将模型精度提高了2.14% ~ 6.63%。
{"title":"FedEdge: Accelerating Edge-Assisted Federated Learning","authors":"Kaibin Wang, Qiang He, Feifei Chen, Hai Jin, Yun Yang","doi":"10.1145/3543507.3583264","DOIUrl":"https://doi.org/10.1145/3543507.3583264","url":null,"abstract":"Federated learning (FL) has been widely acknowledged as a promising solution to training machine learning (ML) model training with privacy preservation. To reduce the traffic overheads incurred by FL systems, edge servers have been included between clients and the parameter server to aggregate clients’ local models. Recent studies on this edge-assisted hierarchical FL scheme have focused on ensuring or accelerating model convergence by coping with various factors, e.g., uncertain network conditions, unreliable clients, heterogeneous compute resources, etc. This paper presents our three new discoveries of the edge-assisted hierarchical FL scheme: 1) it wastes significant time during its two-phase training rounds; 2) it does not recognize or utilize model diversity when producing a global model; and 3) it is vulnerable to model poisoning attacks. To overcome these drawbacks, we propose FedEdge, a novel edge-assisted hierarchical FL scheme that accelerates model training with asynchronous local federated training and adaptive model aggregation. Extensive experiments are conducted on two widely-used public datasets. The results demonstrate that, compared with state-of-the-art FL schemes, FedEdge accelerates model convergence by 1.14 × −3.20 ×, and improves model accuracy by 2.14% - 6.63%.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121517630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
MMMLP: Multi-modal Multilayer Perceptron for Sequential Recommendations MMMLP:时序推荐的多模态多层感知器
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583378
Jiahao Liang, Xiangyu Zhao, Muyang Li, Zijian Zhang, Wanyu Wang, Haochen Liu, Zitao Liu
Sequential recommendation aims to offer potentially interesting products to users by capturing their historical sequence of interacted items. Although it has facilitated extensive physical scenarios, sequential recommendation for multi-modal sequences has long been neglected. Multi-modal data that depicts a user’s historical interactions exists ubiquitously, such as product pictures, textual descriptions, and interacted item sequences, providing semantic information from multiple perspectives that comprehensively describe a user’s preferences. However, existing sequential recommendation methods either fail to directly handle multi-modality or suffer from high computational complexity. To address this, we propose a novel Multi-Modal Multi-Layer Perceptron (MMMLP) for maintaining multi-modal sequences for sequential recommendation. MMMLP is a purely MLP-based architecture that consists of three modules - the Feature Mixer Layer, Fusion Mixer Layer, and Prediction Layer - and has an edge on both efficacy and efficiency. Extensive experiments show that MMMLP achieves state-of-the-art performance with linear complexity. We also conduct ablating analysis to verify the contribution of each component. Furthermore, compatible experiments are devised, and the results show that the multi-modal representation learned by our proposed model generally benefits other recommendation models, emphasizing our model’s ability to handle multi-modal information. We have made our code available online to ease reproducibility1.
顺序推荐旨在通过捕获用户交互项目的历史顺序,向用户提供潜在的有趣产品。虽然它促进了广泛的物理场景,但多模态序列的顺序推荐长期以来被忽视。描述用户历史交互的多模态数据无处不在,如产品图片、文本描述和交互项目序列,从多个角度提供语义信息,全面描述用户的偏好。然而,现有的顺序推荐方法要么不能直接处理多模态,要么计算量大。为了解决这个问题,我们提出了一种新的多模态多层感知器(MMMLP),用于维护多模态序列以进行顺序推荐。MMMLP是一个纯粹基于mlp的架构,由三个模块组成——特征混频器层、融合混频器层和预测层——在功效和效率方面都有优势。大量的实验表明,MMMLP在线性复杂度下达到了最先进的性能。我们还进行了烧蚀分析,以验证每个组件的贡献。此外,设计了兼容实验,结果表明,我们提出的模型学习的多模态表示总体上有利于其他推荐模型,强调了我们的模型处理多模态信息的能力。我们已经将代码放到了网上,以方便再现1。
{"title":"MMMLP: Multi-modal Multilayer Perceptron for Sequential Recommendations","authors":"Jiahao Liang, Xiangyu Zhao, Muyang Li, Zijian Zhang, Wanyu Wang, Haochen Liu, Zitao Liu","doi":"10.1145/3543507.3583378","DOIUrl":"https://doi.org/10.1145/3543507.3583378","url":null,"abstract":"Sequential recommendation aims to offer potentially interesting products to users by capturing their historical sequence of interacted items. Although it has facilitated extensive physical scenarios, sequential recommendation for multi-modal sequences has long been neglected. Multi-modal data that depicts a user’s historical interactions exists ubiquitously, such as product pictures, textual descriptions, and interacted item sequences, providing semantic information from multiple perspectives that comprehensively describe a user’s preferences. However, existing sequential recommendation methods either fail to directly handle multi-modality or suffer from high computational complexity. To address this, we propose a novel Multi-Modal Multi-Layer Perceptron (MMMLP) for maintaining multi-modal sequences for sequential recommendation. MMMLP is a purely MLP-based architecture that consists of three modules - the Feature Mixer Layer, Fusion Mixer Layer, and Prediction Layer - and has an edge on both efficacy and efficiency. Extensive experiments show that MMMLP achieves state-of-the-art performance with linear complexity. We also conduct ablating analysis to verify the contribution of each component. Furthermore, compatible experiments are devised, and the results show that the multi-modal representation learned by our proposed model generally benefits other recommendation models, emphasizing our model’s ability to handle multi-modal information. We have made our code available online to ease reproducibility1.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125002148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Maximizing Submodular Functions for Recommendation in the Presence of Biases 在存在偏差的情况下最大化推荐的子模函数
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583195
Anay Mehrotra, Nisheeth K. Vishnoi
Subset selection tasks, arise in recommendation systems and search engines and ask to select a subset of items that maximize the value for the user. The values of subsets often display diminishing returns, and hence, submodular functions have been used to model them. If the inputs defining the submodular function are known, then existing algorithms can be used. In many applications, however, inputs have been observed to have social biases that reduce the utility of the output subset. Hence, interventions to improve the utility are desired. Prior works focus on maximizing linear functions—a special case of submodular functions—and show that fairness constraint-based interventions can not only ensure proportional representation but also achieve near-optimal utility in the presence of biases. We study the maximization of a family of submodular functions that capture functions arising in the aforementioned applications. Our first result is that, unlike linear functions, constraint-based interventions cannot guarantee any constant fraction of the optimal utility for this family of submodular functions. Our second result is an algorithm for submodular maximization. The algorithm provably outputs subsets that have near-optimal utility for this family under mild assumptions and that proportionally represent items from each group. In empirical evaluation, with both synthetic and real-world data, we observe that this algorithm improves the utility of the output subset for this family of submodular functions over baselines.
子集选择任务,出现在推荐系统和搜索引擎中,要求选择对用户价值最大化的项目子集。子集的值经常显示递减的收益,因此,子模函数被用来对它们建模。如果定义子模块函数的输入是已知的,则可以使用现有的算法。然而,在许多应用中,已经观察到输入具有降低输出子集效用的社会偏见。因此,需要采取干预措施来提高效用。先前的研究集中在最大化线性函数上——子模函数的一个特例——并表明基于公平约束的干预不仅可以确保比例代表性,而且在存在偏差的情况下也能获得接近最优的效用。我们研究了一组子模函数的最大化,这些子模函数捕获了上述应用中出现的函数。我们的第一个结果是,与线性函数不同,基于约束的干预不能保证这类子模函数的最优效用的任何常数部分。我们的第二个结果是一个子模最大化算法。该算法可以证明,在温和的假设下,该算法输出的子集对该家族具有接近最优的效用,并且按比例表示每个组中的项目。在综合数据和实际数据的经验评估中,我们观察到该算法在基线上提高了该系列子模块函数的输出子集的效用。
{"title":"Maximizing Submodular Functions for Recommendation in the Presence of Biases","authors":"Anay Mehrotra, Nisheeth K. Vishnoi","doi":"10.1145/3543507.3583195","DOIUrl":"https://doi.org/10.1145/3543507.3583195","url":null,"abstract":"Subset selection tasks, arise in recommendation systems and search engines and ask to select a subset of items that maximize the value for the user. The values of subsets often display diminishing returns, and hence, submodular functions have been used to model them. If the inputs defining the submodular function are known, then existing algorithms can be used. In many applications, however, inputs have been observed to have social biases that reduce the utility of the output subset. Hence, interventions to improve the utility are desired. Prior works focus on maximizing linear functions—a special case of submodular functions—and show that fairness constraint-based interventions can not only ensure proportional representation but also achieve near-optimal utility in the presence of biases. We study the maximization of a family of submodular functions that capture functions arising in the aforementioned applications. Our first result is that, unlike linear functions, constraint-based interventions cannot guarantee any constant fraction of the optimal utility for this family of submodular functions. Our second result is an algorithm for submodular maximization. The algorithm provably outputs subsets that have near-optimal utility for this family under mild assumptions and that proportionally represent items from each group. In empirical evaluation, with both synthetic and real-world data, we observe that this algorithm improves the utility of the output subset for this family of submodular functions over baselines.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123185659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards Model Robustness: Generating Contextual Counterfactuals for Entities in Relation Extraction 模型鲁棒性:关系抽取中实体的上下文反事实生成
Pub Date : 2023-04-30 DOI: 10.1145/3543507.3583504
Mi Zhang, T. Qian, Ting Zhang, Xin Miao
The goal of relation extraction (RE) is to extract the semantic relations between/among entities in the text. As a fundamental task in information systems, it is crucial to ensure the robustness of RE models. Despite the high accuracy current deep neural models have achieved in RE tasks, they are easily affected by spurious correlations. One solution to this problem is to train the model with counterfactually augmented data (CAD) such that it can learn the causation rather than the confounding. However, no attempt has been made on generating counterfactuals for RE tasks. In this paper, we formulate the problem of automatically generating CAD for RE tasks from an entity-centric viewpoint, and develop a novel approach to derive contextual counterfactuals for entities. Specifically, we exploit two elementary topological properties, i.e., the centrality and the shortest path, in syntactic and semantic dependency graphs, to first identify and then intervene on the contextual causal features for entities. We conduct a comprehensive evaluation on four RE datasets by combining our proposed approach with a variety of RE backbones. Results prove that our approach not only improves the performance of the backbones but also makes them more robust in the out-of-domain test 1.
关系抽取(RE)的目标是抽取文本中实体之间的语义关系。作为信息系统的一项基础任务,保证可重构模型的鲁棒性至关重要。尽管目前深度神经模型在RE任务中已经取得了很高的精度,但它们很容易受到伪相关的影响。这个问题的一个解决方案是用反事实增强数据(CAD)训练模型,这样它就可以学习因果关系而不是混淆。然而,没有尝试为RE任务生成反事实。在本文中,我们从实体中心的角度阐述了自动生成可重构任务CAD的问题,并开发了一种新的方法来导出实体的上下文反事实。具体来说,我们利用句法和语义依赖图中的两个基本拓扑属性,即中心性和最短路径,首先识别实体的上下文因果特征,然后干预实体的上下文因果特征。我们将我们提出的方法与各种RE主干相结合,对四个RE数据集进行了综合评估。结果证明,我们的方法不仅提高了主干网的性能,而且使其在域外测试中具有更强的鲁棒性。
{"title":"Towards Model Robustness: Generating Contextual Counterfactuals for Entities in Relation Extraction","authors":"Mi Zhang, T. Qian, Ting Zhang, Xin Miao","doi":"10.1145/3543507.3583504","DOIUrl":"https://doi.org/10.1145/3543507.3583504","url":null,"abstract":"The goal of relation extraction (RE) is to extract the semantic relations between/among entities in the text. As a fundamental task in information systems, it is crucial to ensure the robustness of RE models. Despite the high accuracy current deep neural models have achieved in RE tasks, they are easily affected by spurious correlations. One solution to this problem is to train the model with counterfactually augmented data (CAD) such that it can learn the causation rather than the confounding. However, no attempt has been made on generating counterfactuals for RE tasks. In this paper, we formulate the problem of automatically generating CAD for RE tasks from an entity-centric viewpoint, and develop a novel approach to derive contextual counterfactuals for entities. Specifically, we exploit two elementary topological properties, i.e., the centrality and the shortest path, in syntactic and semantic dependency graphs, to first identify and then intervene on the contextual causal features for entities. We conduct a comprehensive evaluation on four RE datasets by combining our proposed approach with a variety of RE backbones. Results prove that our approach not only improves the performance of the backbones but also makes them more robust in the out-of-domain test 1.","PeriodicalId":296351,"journal":{"name":"Proceedings of the ACM Web Conference 2023","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126286392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the ACM Web Conference 2023
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1