首页 > 最新文献

ACM Transactions on Intelligent Systems and Technology最新文献

英文 中文
Explainability for Large Language Models: A Survey 大型语言模型的可解释性:调查
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-01-02 DOI: 10.1145/3639372
Haiyan Zhao, Hanjie Chen, Fan Yang, Ninghao Liu, Huiqi Deng, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Mengnan Du

Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this paper, we introduce a taxonomy of explainability techniques and provide a structured overview of methods for explaining Transformer-based language models. We categorize techniques based on the training paradigms of LLMs: traditional fine-tuning-based paradigm and prompting-based paradigm. For each paradigm, we summarize the goals and dominant approaches for generating local explanations of individual predictions and global explanations of overall model knowledge. We also discuss metrics for evaluating generated explanations, and discuss how explanations can be leveraged to debug models and improve performance. Lastly, we examine key challenges and emerging opportunities for explanation techniques in the era of LLMs in comparison to conventional deep learning models.

大型语言模型(LLMs)在自然语言处理方面表现出了令人印象深刻的能力。然而,它们的内部机制仍不清楚,这种缺乏透明度的情况给下游应用带来了不必要的风险。因此,理解和解释这些模型对于阐明其行为、局限性和社会影响至关重要。在本文中,我们介绍了可解释性技术分类法,并对解释基于变换器的语言模型的方法进行了结构化概述。我们根据 LLM 的训练范式对技术进行分类:基于微调的传统范式和基于提示的范式。针对每种范式,我们总结了生成单个预测的局部解释和整体模型知识的全局解释的目标和主要方法。我们还讨论了评估所生成解释的指标,并讨论了如何利用解释来调试模型和提高性能。最后,与传统深度学习模型相比,我们探讨了 LLM 时代解释技术面临的主要挑战和新兴机遇。
{"title":"Explainability for Large Language Models: A Survey","authors":"Haiyan Zhao, Hanjie Chen, Fan Yang, Ninghao Liu, Huiqi Deng, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Mengnan Du","doi":"10.1145/3639372","DOIUrl":"https://doi.org/10.1145/3639372","url":null,"abstract":"<p>Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this paper, we introduce a taxonomy of explainability techniques and provide a structured overview of methods for explaining Transformer-based language models. We categorize techniques based on the training paradigms of LLMs: traditional fine-tuning-based paradigm and prompting-based paradigm. For each paradigm, we summarize the goals and dominant approaches for generating local explanations of individual predictions and global explanations of overall model knowledge. We also discuss metrics for evaluating generated explanations, and discuss how explanations can be leveraged to debug models and improve performance. Lastly, we examine key challenges and emerging opportunities for explanation techniques in the era of LLMs in comparison to conventional deep learning models.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139084073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fairness-Driven Private Collaborative Machine Learning 公平驱动的私有协作机器学习
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-01-02 DOI: 10.1145/3639368
Dana Pessach, Tamir Tassa, Erez Shmueli

The performance of machine learning algorithms can be considerably improved when trained over larger datasets. In many domains, such as medicine and finance, larger datasets can be obtained if several parties, each having access to limited amounts of data, collaborate and share their data. However, such data sharing introduces significant privacy challenges. While multiple recent studies have investigated methods for private collaborative machine learning, the fairness of such collaborative algorithms was overlooked. In this work we suggest a feasible privacy-preserving pre-process mechanism for enhancing fairness of collaborative machine learning algorithms. An extensive evaluation of the proposed method shows that it is able to enhance fairness considerably with only a minor compromise in accuracy.

在较大的数据集上进行训练,机器学习算法的性能就会大大提高。在医学和金融等许多领域,如果多方合作并共享数据,就能获得更大的数据集。然而,这种数据共享带来了巨大的隐私挑战。虽然近期有多项研究调查了隐私协作机器学习的方法,但这种协作算法的公平性却被忽视了。在这项工作中,我们提出了一种可行的隐私保护预处理机制,以提高协作机器学习算法的公平性。对所提方法的广泛评估表明,该方法能显著提高公平性,而准确性只受到轻微影响。
{"title":"Fairness-Driven Private Collaborative Machine Learning","authors":"Dana Pessach, Tamir Tassa, Erez Shmueli","doi":"10.1145/3639368","DOIUrl":"https://doi.org/10.1145/3639368","url":null,"abstract":"<p>The performance of machine learning algorithms can be considerably improved when trained over larger datasets. In many domains, such as medicine and finance, larger datasets can be obtained if several parties, each having access to limited amounts of data, collaborate and share their data. However, such data sharing introduces significant privacy challenges. While multiple recent studies have investigated methods for private collaborative machine learning, the fairness of such collaborative algorithms was overlooked. In this work we suggest a feasible privacy-preserving pre-process mechanism for enhancing fairness of collaborative machine learning algorithms. An extensive evaluation of the proposed method shows that it is able to enhance fairness considerably with only a minor compromise in accuracy.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139084127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Distributed Knowledge Congruence in Proxy-data-free Federated Distillation 探索无代理数据联合蒸馏中的分布式知识一致性
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-29 DOI: 10.1145/3639369
Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Quyang Pan, Junbo Zhang, Zeju Li, Qingxiang Liu

Federated learning (FL) is a privacy-preserving machine learning paradigm in which the server periodically aggregates local model parameters from clients without assembling their private data. Constrained communication and personalization requirements pose severe challenges to FL. Federated distillation (FD) is proposed to simultaneously address the above two problems, which exchanges knowledge between the server and clients, supporting heterogeneous local models while significantly reducing communication overhead. However, most existing FD methods require a proxy dataset, which is often unavailable in reality. A few recent proxy-data-free FD approaches can eliminate the need for additional public data, but suffer from remarkable discrepancy among local knowledge due to client-side model heterogeneity, leading to ambiguous representation on the server and inevitable accuracy degradation. To tackle this issue, we propose a proxy-data-free FD algorithm based on distributed knowledge congruence (FedDKC). FedDKC leverages well-designed refinement strategies to narrow local knowledge differences into an acceptable upper bound, so as to mitigate the negative effects of knowledge incongruence. Specifically, from perspectives of peak probability and Shannon entropy of local knowledge, we design kernel-based knowledge refinement (KKR) and searching-based knowledge refinement (SKR) respectively, and theoretically guarantee that the refined-local knowledge can satisfy an approximately-similar distribution and be regarded as congruent. Extensive experiments conducted on three common datasets demonstrate that our proposed FedDKC significantly outperforms the state-of-the-art on various heterogeneous settings while evidently improving the convergence speed.

联合学习(FL)是一种保护隐私的机器学习范式,在这种范式中,服务器会定期汇总来自客户端的本地模型参数,而不会汇集他们的私人数据。受限的通信和个性化要求给联合学习带来了严峻的挑战。为了同时解决上述两个问题,有人提出了联合蒸馏(Federated distillation,FD)方法,即在服务器和客户端之间交换知识,支持异构本地模型,同时显著减少通信开销。然而,现有的大多数 FD 方法都需要代理数据集,而现实中往往没有代理数据集。最近出现的几种无代理数据 FD 方法不需要额外的公共数据,但由于客户端模型异构,本地知识之间存在显著差异,导致服务器上的表示模糊不清,精度不可避免地下降。为了解决这个问题,我们提出了一种基于分布式知识一致性的无代理数据 FD 算法(FedDKC)。FedDKC 利用精心设计的细化策略将局部知识差异缩小到可接受的上限,从而减轻知识不一致带来的负面影响。具体来说,我们从局部知识的峰值概率和香农熵的角度出发,分别设计了基于内核的知识细化(KKR)和基于搜索的知识细化(SKR),并从理论上保证了细化后的局部知识能够满足近似分布并被视为一致。在三个常见数据集上进行的大量实验表明,我们提出的 FedDKC 在各种异构环境下的性能明显优于最先进的技术,同时收敛速度也明显提高。
{"title":"Exploring the Distributed Knowledge Congruence in Proxy-data-free Federated Distillation","authors":"Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Quyang Pan, Junbo Zhang, Zeju Li, Qingxiang Liu","doi":"10.1145/3639369","DOIUrl":"https://doi.org/10.1145/3639369","url":null,"abstract":"<p>Federated learning (FL) is a privacy-preserving machine learning paradigm in which the server periodically aggregates local model parameters from clients without assembling their private data. Constrained communication and personalization requirements pose severe challenges to FL. Federated distillation (FD) is proposed to simultaneously address the above two problems, which exchanges knowledge between the server and clients, supporting heterogeneous local models while significantly reducing communication overhead. However, most existing FD methods require a proxy dataset, which is often unavailable in reality. A few recent proxy-data-free FD approaches can eliminate the need for additional public data, but suffer from remarkable discrepancy among local knowledge due to client-side model heterogeneity, leading to ambiguous representation on the server and inevitable accuracy degradation. To tackle this issue, we propose a proxy-data-free FD algorithm based on distributed knowledge congruence (FedDKC). FedDKC leverages well-designed refinement strategies to narrow local knowledge differences into an acceptable upper bound, so as to mitigate the negative effects of knowledge incongruence. Specifically, from perspectives of peak probability and Shannon entropy of local knowledge, we design kernel-based knowledge refinement (KKR) and searching-based knowledge refinement (SKR) respectively, and theoretically guarantee that the refined-local knowledge can satisfy an approximately-similar distribution and be regarded as congruent. Extensive experiments conducted on three common datasets demonstrate that our proposed FedDKC significantly outperforms the state-of-the-art on various heterogeneous settings while evidently improving the convergence speed.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strengthening Cooperative Consensus in Multi-Robot Confrontation 在多机器人对抗中加强合作共识
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-29 DOI: 10.1145/3639371
Meng Xu, Xinhong Chen, Yechao She, Yang Jin, Guanyi Zhao, Jianping Wang

Multi-agent reinforcement learning (MARL) has proven effective in training multi-robot confrontation, such as StarCraft and robot soccer games. However, the current joint action policies utilized in MARL have been unsuccessful in recognizing and preventing actions that often lead to failures on our side. This exacerbates the cooperation dilemma, ultimately resulting in our agents acting independently and being defeated individually by their opponents. To tackle this challenge, we propose a novel joint action policy, referred to as the consensus action policy (CAP). Specifically, CAP records the number of times each joint action has caused our side to fail in the past and computes a cooperation tendency, which is integrated with each agent’s Q-value and Nash bargaining solution to determine a joint action. The cooperation tendency promotes team cooperation by selecting joint actions that have a high tendency of cooperation and avoiding actions that may lead to team failure. Moreover, the proposed CAP policy can be extended to partially observable scenarios by combining it with Deep Q network (DQN) or actor-critic-based methods. We conducted extensive experiments to compare the proposed method with seven existing joint action policies, including four commonly used methods and three state-of-the-art (SOTA) methods, in terms of episode rewards, winning rates, and other metrics. Our results demonstrate that this approach holds great promise for multi-robot confrontation scenarios.

事实证明,多代理强化学习(MARL)在训练多机器人对抗(如《星际争霸》和机器人足球比赛)方面非常有效。然而,目前 MARL 中使用的联合行动策略无法成功识别和防止经常导致我方失败的行动。这加剧了合作困境,最终导致我们的代理各自为政,被对手击败。为了应对这一挑战,我们提出了一种新颖的联合行动策略,即共识行动策略(CAP)。具体来说,CAP 记录了每个联合行动在过去导致我方失败的次数,并计算出合作倾向,将其与每个代理的 Q 值和纳什讨价还价方案相结合,确定联合行动。合作倾向通过选择合作倾向高的联合行动,避免可能导致团队失败的行动,从而促进团队合作。此外,通过与深度 Q 网络(DQN)或基于行动者批判的方法相结合,所提出的 CAP 策略还可以扩展到部分可观测场景。我们进行了广泛的实验,将所提出的方法与现有的七种联合行动策略(包括四种常用方法和三种最先进的(SOTA)方法)在情节奖励、胜率和其他指标方面进行了比较。我们的结果表明,这种方法在多机器人对抗场景中大有可为。
{"title":"Strengthening Cooperative Consensus in Multi-Robot Confrontation","authors":"Meng Xu, Xinhong Chen, Yechao She, Yang Jin, Guanyi Zhao, Jianping Wang","doi":"10.1145/3639371","DOIUrl":"https://doi.org/10.1145/3639371","url":null,"abstract":"<p>Multi-agent reinforcement learning (MARL) has proven effective in training multi-robot confrontation, such as StarCraft and robot soccer games. However, the current joint action policies utilized in MARL have been unsuccessful in recognizing and preventing actions that often lead to failures on our side. This exacerbates the cooperation dilemma, ultimately resulting in our agents acting independently and being defeated individually by their opponents. To tackle this challenge, we propose a novel joint action policy, referred to as the consensus action policy (CAP). Specifically, CAP records the number of times each joint action has caused our side to fail in the past and computes a cooperation tendency, which is integrated with each agent’s Q-value and Nash bargaining solution to determine a joint action. The cooperation tendency promotes team cooperation by selecting joint actions that have a high tendency of cooperation and avoiding actions that may lead to team failure. Moreover, the proposed CAP policy can be extended to partially observable scenarios by combining it with Deep Q network (DQN) or actor-critic-based methods. We conducted extensive experiments to compare the proposed method with seven existing joint action policies, including four commonly used methods and three state-of-the-art (SOTA) methods, in terms of episode rewards, winning rates, and other metrics. Our results demonstrate that this approach holds great promise for multi-robot confrontation scenarios.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reconstructing Turbulent Flows Using Spatio-Temporal Physical Dynamics 利用时空物理动力学重建湍流
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-15 DOI: 10.1145/3637491
Shengyu Chen, Tianshu Bao, Peyman Givi, Can Zheng, Xiaowei Jia

Accurate simulation of turbulent flows is of crucial importance in many branches of science and engineering. Direct numerical simulation (DNS) provides the highest fidelity means of capturing all intricate physics of turbulent transport. However, the method is computationally expensive because of the wide range of turbulence scales that must be accounted for in such simulations. Large eddy simulation (LES) provides an alternative. In such simulations, the large scales of the flow are resolved and the effects of small scales are modelled. Reconstruction of the DNS field from the low-resolution LES is needed for a wide variety of applications. Thus the construction of super-resolution (SR) methodologies that can provide this reconstruction has become an area of active research. In this work, a new physics-guided neural network is developed for such a reconstruction. The method leverages the partial differential equation that underlies the flow dynamics in the design of spatio-temporal model architecture. A degradation-based refinement method is also developed to enforce physical constraints and to further reduce the accumulated reconstruction errors over long periods. Detailed DNS data on two turbulent flow configurations are used to assess the performance of the model.

湍流的精确模拟在许多科学和工程领域都至关重要。直接数值模拟(DNS)是捕捉湍流传输所有复杂物理现象的保真度最高的方法。然而,由于这种模拟必须考虑广泛的湍流尺度,因此计算成本高昂。大涡模拟(LES)提供了一种替代方法。在这种模拟中,流动的大尺度被解析,小尺度的影响被模拟。各种应用都需要从低分辨率 LES 中重建 DNS 场。因此,构建能够提供这种重构的超分辨率(SR)方法已成为一个活跃的研究领域。在这项工作中,为这种重建开发了一种新的物理引导神经网络。该方法在设计时空模型结构时利用了作为水流动力学基础的偏微分方程。此外,还开发了一种基于退化的细化方法,以执行物理约束并进一步减少长期累积的重建误差。两种湍流配置的详细 DNS 数据用于评估模型的性能。
{"title":"Reconstructing Turbulent Flows Using Spatio-Temporal Physical Dynamics","authors":"Shengyu Chen, Tianshu Bao, Peyman Givi, Can Zheng, Xiaowei Jia","doi":"10.1145/3637491","DOIUrl":"https://doi.org/10.1145/3637491","url":null,"abstract":"<p>Accurate simulation of turbulent flows is of crucial importance in many branches of science and engineering. Direct numerical simulation (DNS) provides the highest fidelity means of capturing all intricate physics of turbulent transport. However, the method is computationally expensive because of the wide range of turbulence scales that must be accounted for in such simulations. Large eddy simulation (LES) provides an alternative. In such simulations, the large scales of the flow are resolved and the effects of small scales are modelled. Reconstruction of the DNS field from the low-resolution LES is needed for a wide variety of applications. Thus the construction of super-resolution (SR) methodologies that can provide this reconstruction has become an area of active research. In this work, a new physics-guided neural network is developed for such a reconstruction. The method leverages the partial differential equation that underlies the flow dynamics in the design of spatio-temporal model architecture. A degradation-based refinement method is also developed to enforce physical constraints and to further reduce the accumulated reconstruction errors over long periods. Detailed DNS data on two turbulent flow configurations are used to assess the performance of the model.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138688694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generating Daily Activities with Need Dynamics 利用需求动态生成日常活动
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-14 DOI: 10.1145/3637493
Yuan Yuan, Jingtao Ding, Huandong Wang, Depeng Jin

Daily activity data recording individuals’ various activities in daily life are widely used in many applications such as activity scheduling, activity recommendation, and policymaking. Though with high value, its accessibility is limited due to high collection costs and potential privacy issues. Therefore, simulating human activities to produce massive high-quality data is of great importance. However, existing solutions, including rule-based methods with simplified behavior assumptions and data-driven methods directly fitting real-world data, both cannot fully qualify for matching reality. In this paper, motivated by the classic psychological theory, Maslow’s need theory describing human motivation, we propose a knowledge-driven simulation framework based on generative adversarial imitation learning. Our core idea is to model the evolution of human needs as the underlying mechanism that drives activity generation in the simulation model. Specifically, a hierarchical model structure that disentangles different need levels and the use of neural stochastic differential equations successfully capture the piecewise-continuous characteristics of need dynamics. Extensive experiments demonstrate that our framework outperforms the state-of-the-art baselines regarding data fidelity and utility. We also present the insightful interpretability of the need modeling. Moreover, privacy preservation evaluations validate that the generated data does not leak individual privacy. The code is available at https://github.com/tsinghua-fib-lab/Activity-Simulation-SAND.

日常活动数据记录了个体在日常生活中的各种活动,广泛应用于活动调度、活动推荐、政策制定等领域。虽然具有很高的价值,但由于高昂的收集成本和潜在的隐私问题,其可访问性受到限制。因此,模拟人类活动产生大量高质量数据具有重要意义。然而,现有的解决方案,包括简化行为假设的基于规则的方法和直接拟合现实世界数据的数据驱动方法,都不能完全符合现实。本文以描述人类动机的经典心理学理论马斯洛需求理论为启发,提出了一种基于生成式对抗模仿学习的知识驱动模拟框架。我们的核心思想是将人类需求的演变建模为驱动模拟模型中活动生成的潜在机制。具体来说,一种分层模型结构将不同的需求层次分离开来,并使用神经随机微分方程成功地捕获了需求动力学的分段连续特征。大量的实验表明,我们的框架在数据保真度和实用性方面优于最先进的基线。我们还提出了需求建模的深刻的可解释性。此外,隐私保护评估验证了生成的数据不会泄露个人隐私。代码可在https://github.com/tsinghua-fib-lab/Activity-Simulation-SAND上获得。
{"title":"Generating Daily Activities with Need Dynamics","authors":"Yuan Yuan, Jingtao Ding, Huandong Wang, Depeng Jin","doi":"10.1145/3637493","DOIUrl":"https://doi.org/10.1145/3637493","url":null,"abstract":"<p>Daily activity data recording individuals’ various activities in daily life are widely used in many applications such as activity scheduling, activity recommendation, and policymaking. Though with high value, its accessibility is limited due to high collection costs and potential privacy issues. Therefore, simulating human activities to produce massive high-quality data is of great importance. However, existing solutions, including rule-based methods with simplified behavior assumptions and data-driven methods directly fitting real-world data, both cannot fully qualify for matching reality. In this paper, motivated by the classic psychological theory, Maslow’s need theory describing human motivation, we propose a knowledge-driven simulation framework based on generative adversarial imitation learning. Our core idea is to model the evolution of human needs as the underlying mechanism that drives activity generation in the simulation model. Specifically, a hierarchical model structure that disentangles different need levels and the use of neural stochastic differential equations successfully capture the piecewise-continuous characteristics of need dynamics. Extensive experiments demonstrate that our framework outperforms the state-of-the-art baselines regarding data fidelity and utility. We also present the insightful interpretability of the need modeling. Moreover, privacy preservation evaluations validate that the generated data does not leak individual privacy. The code is available at https://github.com/tsinghua-fib-lab/Activity-Simulation-SAND.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138630639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nationwide Air Pollution Forecasting with Heterogeneous Graph Neural Networks 利用异构图神经网络进行全国空气污染预测
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-14 DOI: 10.1145/3637492
Fernando Terroso-Saenz, Juan Morales-García, Andres Muñoz

Nowadays, air pollution is one of the most relevant environmental problems in most urban settings. Due to the utility in operational terms of anticipating certain pollution levels, several predictors based on Graph Neural Networks (GNN) have been proposed for the last years. Most of these solutions usually encode the relationships among stations in terms of their spatial distance, but they fail when it comes to capture other spatial and feature-based contextual factors. Besides, they assume a homogeneous setting where all the stations are able to capture the same pollutants. However, large-scale settings frequently comprise different types of stations, each one with different measurement capabilities. For that reason, the present paper introduces a novel GNN framework able to capture the similarities among stations related to the land use of their locations and their primary source of pollution. Furthermore, we define a methodology to deal with heterogeneous settings on the top of the GNN architecture. Finally, the proposal has been tested with a nation-wide Spanish air-pollution dataset with very promising results.

如今,空气污染是大多数城市环境中最相关的环境问题之一。由于在预测某些污染水平的操作方面的效用,在过去几年中已经提出了几种基于图神经网络(GNN)的预测器。大多数解决方案通常根据站点之间的空间距离编码站点之间的关系,但是当涉及到捕获其他基于空间和特征的上下文因素时,它们就失败了。此外,他们假设一个均匀的设置,所有的站能够捕获相同的污染物。然而,大尺度设置往往包括不同类型的台站,每一个都有不同的测量能力。因此,本文介绍了一种新的GNN框架,能够捕捉到站点之间与其位置的土地利用及其主要污染源有关的相似性。此外,我们定义了一种方法来处理GNN体系结构顶部的异构设置。最后,该提案已经在西班牙全国空气污染数据集上进行了测试,结果非常有希望。
{"title":"Nationwide Air Pollution Forecasting with Heterogeneous Graph Neural Networks","authors":"Fernando Terroso-Saenz, Juan Morales-García, Andres Muñoz","doi":"10.1145/3637492","DOIUrl":"https://doi.org/10.1145/3637492","url":null,"abstract":"<p>Nowadays, air pollution is one of the most relevant environmental problems in most urban settings. Due to the utility in operational terms of anticipating certain pollution levels, several predictors based on Graph Neural Networks (GNN) have been proposed for the last years. Most of these solutions usually encode the relationships among stations in terms of their spatial distance, but they fail when it comes to capture other spatial and feature-based contextual factors. Besides, they assume a <i>homogeneous</i> setting where all the stations are able to capture the same pollutants. However, large-scale settings frequently comprise different types of stations, each one with different measurement capabilities. For that reason, the present paper introduces a novel GNN framework able to capture the similarities among stations related to the land use of their locations and their primary source of pollution. Furthermore, we define a methodology to deal with heterogeneous settings on the top of the GNN architecture. Finally, the proposal has been tested with a nation-wide Spanish air-pollution dataset with very promising results.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138630644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explicit Knowledge Graph Reasoning for Conversational Recommendation 用于对话式推荐的显式知识图谱推理
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-11 DOI: 10.1145/3637216
Xuhui Ren, Tong Chen, Quoc Viet Hung Nguyen, Lizhen Cui, Zi Huang, Hongzhi Yin

Traditional recommender systems estimate user preference on items purely based on historical interaction records, thus failing to capture fine-grained yet dynamic user interests and letting users receive recommendation only passively. Recent conversational recommender systems (CRSs) tackle those limitations by enabling recommender systems to interact with the user to obtain her/his current preference through a sequence of clarifying questions. Recently, there has been a rise of using knowledge graphs (KGs) for CRSs, where the core motivation is to incorporate the abundant side information carried by a KG into both the recommendation and conversation processes. However, existing KG-based CRSs are subject to two defects: (1) there is a semantic gap between the learned representations of utterances and KG entities, hindering the retrieval of relevant KG information; (2) the reasoning over KG is mostly performed with the implicitly learned user interests, overlooking the explicit signals from the entities actually mentioned in the conversation.

To address these drawbacks, we propose a new CRS framework, namely the Knowledge Enhanced Conversational Reasoning (KECR) model. As a user can reflect her/his preferences via both attribute- and item-level expressions, KECR jointly embeds the structured knowledge from two levels in the KG. A mutual information maximization constraint is further proposed for semantic alignment between the embedding spaces of utterances and KG entities. Meanwhile, KECR utilizes the connectivity within the KG to conduct explicit reasoning of the user demand, making the model less dependent on the user’s feedback to clarifying questions. As such, the semantic alignment and explicit KG reasoning can jointly facilitate accurate recommendation and quality dialogue generation. By comparing with strong baselines on two real-world datasets, we demonstrate that KECR obtains state-of-the-art recommendation effectiveness, as well as competitive dialogue generation performance.

传统的推荐系统纯粹根据历史交互记录来估算用户对项目的偏好,因此无法捕捉细粒度但动态的用户兴趣,用户只能被动地接受推荐。最近的对话式推荐系统(CRS)解决了这些局限性,它使推荐系统能够与用户互动,通过一连串的澄清问题获得用户当前的偏好。最近,在会话推荐系统中使用知识图谱(KGs)的做法开始兴起,其核心动机是将知识图谱所携带的丰富侧边信息纳入推荐和会话过程。然而,现有的基于知识图谱的 CRS 有两个缺陷:(1) 语篇的学习表示与知识图谱实体之间存在语义鸿沟,阻碍了相关知识图谱信息的检索;(2) 对知识图谱的推理大多是通过隐式学习的用户兴趣来进行的,忽略了对话中实际提及的实体的显式信号。为了解决这些问题,我们提出了一种新的会话推理框架,即知识增强会话推理(KECR)模型。由于用户可以通过属性级和项目级表达来反映自己的偏好,因此 KECR 将两个级别的结构化知识共同嵌入到 KG 中。为实现语篇嵌入空间与 KG 实体之间的语义对齐,进一步提出了互信息最大化约束。同时,KECR 利用 KG 内部的连通性对用户需求进行显式推理,使模型不再依赖于用户对澄清问题的反馈。因此,语义对齐和明确的 KG 推理可以共同促进准确的推荐和高质量的对话生成。通过在两个真实世界数据集上与强大的基线进行比较,我们证明了 KECR 获得了最先进的推荐效果以及具有竞争力的对话生成性能。
{"title":"Explicit Knowledge Graph Reasoning for Conversational Recommendation","authors":"Xuhui Ren, Tong Chen, Quoc Viet Hung Nguyen, Lizhen Cui, Zi Huang, Hongzhi Yin","doi":"10.1145/3637216","DOIUrl":"https://doi.org/10.1145/3637216","url":null,"abstract":"<p>Traditional recommender systems estimate user preference on items purely based on historical interaction records, thus failing to capture fine-grained yet dynamic user interests and letting users receive recommendation only passively. Recent conversational recommender systems (CRSs) tackle those limitations by enabling recommender systems to interact with the user to obtain her/his current preference through a sequence of clarifying questions. Recently, there has been a rise of using knowledge graphs (KGs) for CRSs, where the core motivation is to incorporate the abundant side information carried by a KG into both the recommendation and conversation processes. However, existing KG-based CRSs are subject to two defects: (1) there is a semantic gap between the learned representations of utterances and KG entities, hindering the retrieval of relevant KG information; (2) the reasoning over KG is mostly performed with the implicitly learned user interests, overlooking the explicit signals from the entities actually mentioned in the conversation. </p><p>To address these drawbacks, we propose a new CRS framework, namely the <b>K</b>nowledge <b>E</b>nhanced <b>C</b>onversational <b>R</b>easoning (KECR) model. As a user can reflect her/his preferences via both attribute- and item-level expressions, KECR jointly embeds the structured knowledge from two levels in the KG. A mutual information maximization constraint is further proposed for semantic alignment between the embedding spaces of utterances and KG entities. Meanwhile, KECR utilizes the connectivity within the KG to conduct explicit reasoning of the user demand, making the model less dependent on the user’s feedback to clarifying questions. As such, the semantic alignment and explicit KG reasoning can jointly facilitate accurate recommendation and quality dialogue generation. By comparing with strong baselines on two real-world datasets, we demonstrate that KECR obtains state-of-the-art recommendation effectiveness, as well as competitive dialogue generation performance.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138572359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalized Fashion Recommendations for Diverse Body Shapes and Local Preferences with Contrastive Multimodal Cross-Attention Network 利用对比多模态交叉注意力网络,针对不同体型和地方偏好提供个性化时尚推荐
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-11 DOI: 10.1145/3637217
Jianghong Ma, Huiyue Sun, Dezhao Yang, Haijun Zhang

Fashion recommendation has become a prominent focus in the realm of online shopping, with various tasks being explored to enhance the customer experience. Recent research has particularly emphasized fashion recommendation based on body shapes, yet a critical aspect of incorporating multimodal data relevance has been overlooked. In this paper, we present the Contrastive Multimodal Cross-Attention Network, a novel approach specifically designed for fashion recommendation catering to diverse body shapes. By incorporating multimodal representation learning and leveraging contrastive learning techniques, our method effectively captures both inter- and intra-sample relationships, resulting in improved accuracy in fashion recommendations tailored to individual body types. Additionally, we propose a locality-aware cross-attention module to align and understand the local preferences between body shapes and clothing items, thus enhancing the matching process. Experimental results conducted on a diverse dataset demonstrate the state-of-the-art performance achieved by our approach, reinforcing its potential to significantly enhance the personalized online shopping experience for consumers with varying body shapes and preferences.

时尚推荐已成为在线购物领域的一个突出焦点,人们正在探索各种任务来提升客户体验。最近的研究特别强调基于体型的时尚推荐,但却忽略了结合多模态数据相关性的一个重要方面。在本文中,我们介绍了对比多模态交叉注意力网络,这是一种新颖的方法,专门用于针对不同体形的时尚推荐。通过结合多模态表征学习和利用对比学习技术,我们的方法有效地捕捉了样本间和样本内的关系,从而提高了针对不同体型的时尚推荐的准确性。此外,我们还提出了一个局部感知交叉关注模块,以调整和理解体型与服装之间的局部偏好,从而增强匹配过程。在一个多样化数据集上进行的实验结果表明,我们的方法达到了最先进的性能,增强了其为具有不同体型和偏好的消费者显著提升个性化在线购物体验的潜力。
{"title":"Personalized Fashion Recommendations for Diverse Body Shapes and Local Preferences with Contrastive Multimodal Cross-Attention Network","authors":"Jianghong Ma, Huiyue Sun, Dezhao Yang, Haijun Zhang","doi":"10.1145/3637217","DOIUrl":"https://doi.org/10.1145/3637217","url":null,"abstract":"<p>Fashion recommendation has become a prominent focus in the realm of online shopping, with various tasks being explored to enhance the customer experience. Recent research has particularly emphasized fashion recommendation based on body shapes, yet a critical aspect of incorporating multimodal data relevance has been overlooked. In this paper, we present the Contrastive Multimodal Cross-Attention Network, a novel approach specifically designed for fashion recommendation catering to diverse body shapes. By incorporating multimodal representation learning and leveraging contrastive learning techniques, our method effectively captures both inter- and intra-sample relationships, resulting in improved accuracy in fashion recommendations tailored to individual body types. Additionally, we propose a locality-aware cross-attention module to align and understand the local preferences between body shapes and clothing items, thus enhancing the matching process. Experimental results conducted on a diverse dataset demonstrate the state-of-the-art performance achieved by our approach, reinforcing its potential to significantly enhance the personalized online shopping experience for consumers with varying body shapes and preferences.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138572071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EMG-Based Automatic Gesture Recognition Using Lipschitz-Regularized Neural Networks 使用 Lipschitz-Regularized 神经网络进行基于肌电图的自动手势识别
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2023-12-08 DOI: 10.1145/3635159
Ana NEACȘU, Jean-Christophe Pesquet, Corneliu Burileanu

This paper introduces a novel approach for building a robust Automatic Gesture Recognition system based on Surface Electromyographic (sEMG) signals, acquired at the forearm level. Our main contribution is to propose new constrained learning strategies that ensure robustness against adversarial perturbations by controlling the Lipschitz constant of the classifier. We focus on nonnegative neural networks for which accurate Lipschitz bounds can be derived, and we propose different spectral norm constraints offering robustness guarantees from a theoretical viewpoint. Experimental results on four publicly available datasets highlight that a good trade-off in terms of accuracy and performance is achieved. We then demonstrate the robustness of our models, compared to standard trained classifiers in four scenarios, considering both white-box and black-box attacks.

本文介绍了一种基于前臂水平采集的表面肌电图(sEMG)信号构建稳健的自动手势识别系统的新方法。我们的主要贡献在于提出了新的约束学习策略,通过控制分类器的 Lipschitz 常量来确保鲁棒性,从而抵御对抗性扰动。我们将重点放在非负神经网络上,可以为其推导出精确的 Lipschitz 定界,我们还提出了不同的谱规范约束,从理论上保证了鲁棒性。在四个公开数据集上的实验结果表明,我们在准确性和性能方面实现了很好的权衡。然后,我们展示了我们的模型与标准训练分类器在四种情况下的鲁棒性,同时考虑了白盒攻击和黑盒攻击。
{"title":"EMG-Based Automatic Gesture Recognition Using Lipschitz-Regularized Neural Networks","authors":"Ana NEACȘU, Jean-Christophe Pesquet, Corneliu Burileanu","doi":"10.1145/3635159","DOIUrl":"https://doi.org/10.1145/3635159","url":null,"abstract":"<p>This paper introduces a novel approach for building a robust Automatic Gesture Recognition system based on Surface Electromyographic (sEMG) signals, acquired at the forearm level. Our main contribution is to propose new constrained learning strategies that ensure robustness against adversarial perturbations by controlling the Lipschitz constant of the classifier. We focus on nonnegative neural networks for which accurate Lipschitz bounds can be derived, and we propose different spectral norm constraints offering robustness guarantees from a theoretical viewpoint. Experimental results on four publicly available datasets highlight that a good trade-off in terms of accuracy and performance is achieved. We then demonstrate the robustness of our models, compared to standard trained classifiers in four scenarios, considering both white-box and black-box attacks.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138553238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Transactions on Intelligent Systems and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1