ACM Transactions on Intelligent Systems and Technology最新文献_第8页

VesNet: a Vessel Network for Jointly Learning Route Pattern and Future Trajectory VesNet：联合学习路线模式和未来轨迹的容器网络

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2024-01-18 DOI: 10.1145/3639370

Fenyu Jiang, Huandong Wang, Yong Li

Vessel trajectory prediction is the key to maritime applications such as traffic surveillance, collision avoidance, anomaly detection, etc. Making predictions more precisely requires a better understanding of the moving trend for a particular vessel since the movement is affected by multiple factors like marine environment, vessel type, and vessel behavior. In this paper, we propose a model named VesNet, based on the attentional seq2seq framework, to predict vessel future movement sequence by observing the current trajectory. Firstly, we extract the route patterns from the raw AIS data during preprocessing. Then, we design a multi-task learning structure to learn how to implement route pattern classification and vessel trajectory prediction simultaneously. By comparing with representative baseline models, we find that our VesNet has the best performance in terms of long-term prediction precision. Additionally, VesNet can recognize the route pattern by capturing the implicit moving characteristics. The experimental results prove that the proposed multi-task learning assists the vessel trajectory prediction mission.

船舶轨迹预测是交通监控、避免碰撞、异常检测等海事应用的关键。要更精确地进行预测，就必须更好地了解特定船只的运动趋势，因为运动受海洋环境、船只类型和船只行为等多种因素的影响。在本文中，我们基于注意力 seq2seq 框架提出了一个名为 VesNet 的模型，通过观察当前轨迹来预测船舶未来的移动序列。首先，我们在预处理过程中从原始 AIS 数据中提取航线模式。然后，我们设计了一种多任务学习结构，学习如何同时实现航线模式分类和船舶轨迹预测。通过与具有代表性的基线模型进行比较，我们发现 VesNet 在长期预测精度方面表现最佳。此外，VesNet 还能通过捕捉隐含的移动特征来识别航线模式。实验结果证明，所提出的多任务学习方法有助于完成船舶轨迹预测任务。

引用次数: 0

Evolving Knowledge Graph Representation Learning with Multiple Attention Strategies for Citation Recommendation System 针对引文推荐系统的进化知识图谱表示学习与多重关注策略

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2024-01-13 DOI: 10.1145/3635273

Jhih-Chen Liu, Chiao-Ting Chen, Chi Lee, Szu-Hao Huang

The growing number of publications in the field of artificial intelligence highlights the need for researchers to enhance their efficiency in searching for relevant articles. Most paper recommendation models either rely on simplistic citation relationships among papers or focus on content-based approaches, both of which overlook interactions within academic networks. To address the aforementioned problem, knowledge graph embedding (KGE) methods have been used for citation recommendations because recent research proving that graph representations can effectively improve recommendation model accuracy. However, academic networks are dynamic, leading to changes in the representations of users and items over time. The majority of KGE-based citation recommendations are primarily designed for static graphs, thus failing to capture the evolution of dynamic knowledge graph (DKG) structures. To address these challenges, we introduced the evolving knowledge graph embedding (EKGE) method. In this methodology, evolving knowledge graphs are input into time-series models to learn the patterns of structural evolution. The model has the capability to generate embeddings for each entity at various time points, thereby overcoming limitation of static models that require retraining to acquire embeddings at each specific time point. To enhance the efficiency of feature extraction, we employed a multiple attention strategy. This helped the model find recommendation lists that are closely related to a user’s needs, leading to improved recommendation accuracy. Various experiments conducted on a citation recommendation dataset revealed that the EKGE model exhibits a 1.13% increase in prediction accuracy compared to other KGE methods. Moreover, the model’s accuracy can be further increased by an additional 0.84% through the incorporation of an attention mechanism.

人工智能领域的论文数量日益增多，这凸显了研究人员提高搜索相关文章效率的必要性。大多数论文推荐模型要么依赖于论文之间简单的引用关系，要么专注于基于内容的方法，这两种方法都忽略了学术网络内部的互动。为了解决上述问题，知识图嵌入（KGE）方法被用于引文推荐，因为最近的研究证明图表示法可以有效提高推荐模型的准确性。然而，学术网络是动态的，随着时间的推移，用户和项目的表征会发生变化。大多数基于知识图谱的引文推荐主要是针对静态图谱设计的，因此无法捕捉动态知识图谱（DKG）结构的演变。为了应对这些挑战，我们引入了演化知识图嵌入（EKGE）方法。在这种方法中，不断演化的知识图谱被输入到时间序列模型中，以学习结构演化的模式。该模型能够在不同的时间点为每个实体生成嵌入，从而克服了静态模型需要重新训练以获取每个特定时间点的嵌入的局限性。为了提高特征提取的效率，我们采用了多重关注策略。这有助于模型找到与用户需求密切相关的推荐列表，从而提高推荐准确率。在引文推荐数据集上进行的各种实验表明，与其他 KGE 方法相比，EKGE 模型的预测准确率提高了 1.13%。此外，通过加入关注机制，该模型的准确率还能再提高 0.84%。

{"title":"Evolving Knowledge Graph Representation Learning with Multiple Attention Strategies for Citation Recommendation System","authors":"Jhih-Chen Liu, Chiao-Ting Chen, Chi Lee, Szu-Hao Huang","doi":"10.1145/3635273","DOIUrl":"https://doi.org/10.1145/3635273","url":null,"abstract":"The growing number of publications in the field of artificial intelligence highlights the need for researchers to enhance their efficiency in searching for relevant articles. Most paper recommendation models either rely on simplistic citation relationships among papers or focus on content-based approaches, both of which overlook interactions within academic networks. To address the aforementioned problem, knowledge graph embedding (KGE) methods have been used for citation recommendations because recent research proving that graph representations can effectively improve recommendation model accuracy. However, academic networks are dynamic, leading to changes in the representations of users and items over time. The majority of KGE-based citation recommendations are primarily designed for static graphs, thus failing to capture the evolution of dynamic knowledge graph (DKG) structures. To address these challenges, we introduced the evolving knowledge graph embedding (EKGE) method. In this methodology, evolving knowledge graphs are input into time-series models to learn the patterns of structural evolution. The model has the capability to generate embeddings for each entity at various time points, thereby overcoming limitation of static models that require retraining to acquire embeddings at each specific time point. To enhance the efficiency of feature extraction, we employed a multiple attention strategy. This helped the model find recommendation lists that are closely related to a user’s needs, leading to improved recommendation accuracy. Various experiments conducted on a citation recommendation dataset revealed that the EKGE model exhibits a 1.13% increase in prediction accuracy compared to other KGE methods. Moreover, the model’s accuracy can be further increased by an additional 0.84% through the incorporation of an attention mechanism.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"2 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139464262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Explainability for Large Language Models: A Survey 大型语言模型的可解释性：调查

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2024-01-02 DOI: 10.1145/3639372

Haiyan Zhao, Hanjie Chen, Fan Yang, Ninghao Liu, Huiqi Deng, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Mengnan Du

Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this paper, we introduce a taxonomy of explainability techniques and provide a structured overview of methods for explaining Transformer-based language models. We categorize techniques based on the training paradigms of LLMs: traditional fine-tuning-based paradigm and prompting-based paradigm. For each paradigm, we summarize the goals and dominant approaches for generating local explanations of individual predictions and global explanations of overall model knowledge. We also discuss metrics for evaluating generated explanations, and discuss how explanations can be leveraged to debug models and improve performance. Lastly, we examine key challenges and emerging opportunities for explanation techniques in the era of LLMs in comparison to conventional deep learning models.

大型语言模型（LLMs）在自然语言处理方面表现出了令人印象深刻的能力。然而，它们的内部机制仍不清楚，这种缺乏透明度的情况给下游应用带来了不必要的风险。因此，理解和解释这些模型对于阐明其行为、局限性和社会影响至关重要。在本文中，我们介绍了可解释性技术分类法，并对解释基于变换器的语言模型的方法进行了结构化概述。我们根据 LLM 的训练范式对技术进行分类：基于微调的传统范式和基于提示的范式。针对每种范式，我们总结了生成单个预测的局部解释和整体模型知识的全局解释的目标和主要方法。我们还讨论了评估所生成解释的指标，并讨论了如何利用解释来调试模型和提高性能。最后，与传统深度学习模型相比，我们探讨了 LLM 时代解释技术面临的主要挑战和新兴机遇。

{"title":"Explainability for Large Language Models: A Survey","authors":"Haiyan Zhao, Hanjie Chen, Fan Yang, Ninghao Liu, Huiqi Deng, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Mengnan Du","doi":"10.1145/3639372","DOIUrl":"https://doi.org/10.1145/3639372","url":null,"abstract":"Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this paper, we introduce a taxonomy of explainability techniques and provide a structured overview of methods for explaining Transformer-based language models. We categorize techniques based on the training paradigms of LLMs: traditional fine-tuning-based paradigm and prompting-based paradigm. For each paradigm, we summarize the goals and dominant approaches for generating local explanations of individual predictions and global explanations of overall model knowledge. We also discuss metrics for evaluating generated explanations, and discuss how explanations can be leveraged to debug models and improve performance. Lastly, we examine key challenges and emerging opportunities for explanation techniques in the era of LLMs in comparison to conventional deep learning models.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"8 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139084073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fairness-Driven Private Collaborative Machine Learning 公平驱动的私有协作机器学习

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2024-01-02 DOI: 10.1145/3639368

Dana Pessach, Tamir Tassa, Erez Shmueli

The performance of machine learning algorithms can be considerably improved when trained over larger datasets. In many domains, such as medicine and finance, larger datasets can be obtained if several parties, each having access to limited amounts of data, collaborate and share their data. However, such data sharing introduces significant privacy challenges. While multiple recent studies have investigated methods for private collaborative machine learning, the fairness of such collaborative algorithms was overlooked. In this work we suggest a feasible privacy-preserving pre-process mechanism for enhancing fairness of collaborative machine learning algorithms. An extensive evaluation of the proposed method shows that it is able to enhance fairness considerably with only a minor compromise in accuracy.

在较大的数据集上进行训练，机器学习算法的性能就会大大提高。在医学和金融等许多领域，如果多方合作并共享数据，就能获得更大的数据集。然而，这种数据共享带来了巨大的隐私挑战。虽然近期有多项研究调查了隐私协作机器学习的方法，但这种协作算法的公平性却被忽视了。在这项工作中，我们提出了一种可行的隐私保护预处理机制，以提高协作机器学习算法的公平性。对所提方法的广泛评估表明，该方法能显著提高公平性，而准确性只受到轻微影响。

引用次数: 0

Exploring the Distributed Knowledge Congruence in Proxy-data-free Federated Distillation 探索无代理数据联合蒸馏中的分布式知识一致性

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2023-12-29 DOI: 10.1145/3639369

Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Quyang Pan, Junbo Zhang, Zeju Li, Qingxiang Liu

Federated learning (FL) is a privacy-preserving machine learning paradigm in which the server periodically aggregates local model parameters from clients without assembling their private data. Constrained communication and personalization requirements pose severe challenges to FL. Federated distillation (FD) is proposed to simultaneously address the above two problems, which exchanges knowledge between the server and clients, supporting heterogeneous local models while significantly reducing communication overhead. However, most existing FD methods require a proxy dataset, which is often unavailable in reality. A few recent proxy-data-free FD approaches can eliminate the need for additional public data, but suffer from remarkable discrepancy among local knowledge due to client-side model heterogeneity, leading to ambiguous representation on the server and inevitable accuracy degradation. To tackle this issue, we propose a proxy-data-free FD algorithm based on distributed knowledge congruence (FedDKC). FedDKC leverages well-designed refinement strategies to narrow local knowledge differences into an acceptable upper bound, so as to mitigate the negative effects of knowledge incongruence. Specifically, from perspectives of peak probability and Shannon entropy of local knowledge, we design kernel-based knowledge refinement (KKR) and searching-based knowledge refinement (SKR) respectively, and theoretically guarantee that the refined-local knowledge can satisfy an approximately-similar distribution and be regarded as congruent. Extensive experiments conducted on three common datasets demonstrate that our proposed FedDKC significantly outperforms the state-of-the-art on various heterogeneous settings while evidently improving the convergence speed.

联合学习（FL）是一种保护隐私的机器学习范式，在这种范式中，服务器会定期汇总来自客户端的本地模型参数，而不会汇集他们的私人数据。受限的通信和个性化要求给联合学习带来了严峻的挑战。为了同时解决上述两个问题，有人提出了联合蒸馏（Federated distillation，FD）方法，即在服务器和客户端之间交换知识，支持异构本地模型，同时显著减少通信开销。然而，现有的大多数 FD 方法都需要代理数据集，而现实中往往没有代理数据集。最近出现的几种无代理数据 FD 方法不需要额外的公共数据，但由于客户端模型异构，本地知识之间存在显著差异，导致服务器上的表示模糊不清，精度不可避免地下降。为了解决这个问题，我们提出了一种基于分布式知识一致性的无代理数据 FD 算法（FedDKC）。FedDKC 利用精心设计的细化策略将局部知识差异缩小到可接受的上限，从而减轻知识不一致带来的负面影响。具体来说，我们从局部知识的峰值概率和香农熵的角度出发，分别设计了基于内核的知识细化（KKR）和基于搜索的知识细化（SKR），并从理论上保证了细化后的局部知识能够满足近似分布并被视为一致。在三个常见数据集上进行的大量实验表明，我们提出的 FedDKC 在各种异构环境下的性能明显优于最先进的技术，同时收敛速度也明显提高。

{"title":"Exploring the Distributed Knowledge Congruence in Proxy-data-free Federated Distillation","authors":"Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Quyang Pan, Junbo Zhang, Zeju Li, Qingxiang Liu","doi":"10.1145/3639369","DOIUrl":"https://doi.org/10.1145/3639369","url":null,"abstract":"Federated learning (FL) is a privacy-preserving machine learning paradigm in which the server periodically aggregates local model parameters from clients without assembling their private data. Constrained communication and personalization requirements pose severe challenges to FL. Federated distillation (FD) is proposed to simultaneously address the above two problems, which exchanges knowledge between the server and clients, supporting heterogeneous local models while significantly reducing communication overhead. However, most existing FD methods require a proxy dataset, which is often unavailable in reality. A few recent proxy-data-free FD approaches can eliminate the need for additional public data, but suffer from remarkable discrepancy among local knowledge due to client-side model heterogeneity, leading to ambiguous representation on the server and inevitable accuracy degradation. To tackle this issue, we propose a proxy-data-free FD algorithm based on distributed knowledge congruence (FedDKC). FedDKC leverages well-designed refinement strategies to narrow local knowledge differences into an acceptable upper bound, so as to mitigate the negative effects of knowledge incongruence. Specifically, from perspectives of peak probability and Shannon entropy of local knowledge, we design kernel-based knowledge refinement (KKR) and searching-based knowledge refinement (SKR) respectively, and theoretically guarantee that the refined-local knowledge can satisfy an approximately-similar distribution and be regarded as congruent. Extensive experiments conducted on three common datasets demonstrate that our proposed FedDKC significantly outperforms the state-of-the-art on various heterogeneous settings while evidently improving the convergence speed.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"45 2 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Strengthening Cooperative Consensus in Multi-Robot Confrontation 在多机器人对抗中加强合作共识

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2023-12-29 DOI: 10.1145/3639371

Meng Xu, Xinhong Chen, Yechao She, Yang Jin, Guanyi Zhao, Jianping Wang

Multi-agent reinforcement learning (MARL) has proven effective in training multi-robot confrontation, such as StarCraft and robot soccer games. However, the current joint action policies utilized in MARL have been unsuccessful in recognizing and preventing actions that often lead to failures on our side. This exacerbates the cooperation dilemma, ultimately resulting in our agents acting independently and being defeated individually by their opponents. To tackle this challenge, we propose a novel joint action policy, referred to as the consensus action policy (CAP). Specifically, CAP records the number of times each joint action has caused our side to fail in the past and computes a cooperation tendency, which is integrated with each agent’s Q-value and Nash bargaining solution to determine a joint action. The cooperation tendency promotes team cooperation by selecting joint actions that have a high tendency of cooperation and avoiding actions that may lead to team failure. Moreover, the proposed CAP policy can be extended to partially observable scenarios by combining it with Deep Q network (DQN) or actor-critic-based methods. We conducted extensive experiments to compare the proposed method with seven existing joint action policies, including four commonly used methods and three state-of-the-art (SOTA) methods, in terms of episode rewards, winning rates, and other metrics. Our results demonstrate that this approach holds great promise for multi-robot confrontation scenarios.

事实证明，多代理强化学习（MARL）在训练多机器人对抗（如《星际争霸》和机器人足球比赛）方面非常有效。然而，目前 MARL 中使用的联合行动策略无法成功识别和防止经常导致我方失败的行动。这加剧了合作困境，最终导致我们的代理各自为政，被对手击败。为了应对这一挑战，我们提出了一种新颖的联合行动策略，即共识行动策略（CAP）。具体来说，CAP 记录了每个联合行动在过去导致我方失败的次数，并计算出合作倾向，将其与每个代理的 Q 值和纳什讨价还价方案相结合，确定联合行动。合作倾向通过选择合作倾向高的联合行动，避免可能导致团队失败的行动，从而促进团队合作。此外，通过与深度 Q 网络（DQN）或基于行动者批判的方法相结合，所提出的 CAP 策略还可以扩展到部分可观测场景。我们进行了广泛的实验，将所提出的方法与现有的七种联合行动策略（包括四种常用方法和三种最先进的（SOTA）方法）在情节奖励、胜率和其他指标方面进行了比较。我们的结果表明，这种方法在多机器人对抗场景中大有可为。

{"title":"Strengthening Cooperative Consensus in Multi-Robot Confrontation","authors":"Meng Xu, Xinhong Chen, Yechao She, Yang Jin, Guanyi Zhao, Jianping Wang","doi":"10.1145/3639371","DOIUrl":"https://doi.org/10.1145/3639371","url":null,"abstract":"Multi-agent reinforcement learning (MARL) has proven effective in training multi-robot confrontation, such as StarCraft and robot soccer games. However, the current joint action policies utilized in MARL have been unsuccessful in recognizing and preventing actions that often lead to failures on our side. This exacerbates the cooperation dilemma, ultimately resulting in our agents acting independently and being defeated individually by their opponents. To tackle this challenge, we propose a novel joint action policy, referred to as the consensus action policy (CAP). Specifically, CAP records the number of times each joint action has caused our side to fail in the past and computes a cooperation tendency, which is integrated with each agent’s Q-value and Nash bargaining solution to determine a joint action. The cooperation tendency promotes team cooperation by selecting joint actions that have a high tendency of cooperation and avoiding actions that may lead to team failure. Moreover, the proposed CAP policy can be extended to partially observable scenarios by combining it with Deep Q network (DQN) or actor-critic-based methods. We conducted extensive experiments to compare the proposed method with seven existing joint action policies, including four commonly used methods and three state-of-the-art (SOTA) methods, in terms of episode rewards, winning rates, and other metrics. Our results demonstrate that this approach holds great promise for multi-robot confrontation scenarios.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"194 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Reconstructing Turbulent Flows Using Spatio-Temporal Physical Dynamics 利用时空物理动力学重建湍流

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2023-12-15 DOI: 10.1145/3637491

Shengyu Chen, Tianshu Bao, Peyman Givi, Can Zheng, Xiaowei Jia

Accurate simulation of turbulent flows is of crucial importance in many branches of science and engineering. Direct numerical simulation (DNS) provides the highest fidelity means of capturing all intricate physics of turbulent transport. However, the method is computationally expensive because of the wide range of turbulence scales that must be accounted for in such simulations. Large eddy simulation (LES) provides an alternative. In such simulations, the large scales of the flow are resolved and the effects of small scales are modelled. Reconstruction of the DNS field from the low-resolution LES is needed for a wide variety of applications. Thus the construction of super-resolution (SR) methodologies that can provide this reconstruction has become an area of active research. In this work, a new physics-guided neural network is developed for such a reconstruction. The method leverages the partial differential equation that underlies the flow dynamics in the design of spatio-temporal model architecture. A degradation-based refinement method is also developed to enforce physical constraints and to further reduce the accumulated reconstruction errors over long periods. Detailed DNS data on two turbulent flow configurations are used to assess the performance of the model.

湍流的精确模拟在许多科学和工程领域都至关重要。直接数值模拟（DNS）是捕捉湍流传输所有复杂物理现象的保真度最高的方法。然而，由于这种模拟必须考虑广泛的湍流尺度，因此计算成本高昂。大涡模拟（LES）提供了一种替代方法。在这种模拟中，流动的大尺度被解析，小尺度的影响被模拟。各种应用都需要从低分辨率 LES 中重建 DNS 场。因此，构建能够提供这种重构的超分辨率（SR）方法已成为一个活跃的研究领域。在这项工作中，为这种重建开发了一种新的物理引导神经网络。该方法在设计时空模型结构时利用了作为水流动力学基础的偏微分方程。此外，还开发了一种基于退化的细化方法，以执行物理约束并进一步减少长期累积的重建误差。两种湍流配置的详细 DNS 数据用于评估模型的性能。

{"title":"Reconstructing Turbulent Flows Using Spatio-Temporal Physical Dynamics","authors":"Shengyu Chen, Tianshu Bao, Peyman Givi, Can Zheng, Xiaowei Jia","doi":"10.1145/3637491","DOIUrl":"https://doi.org/10.1145/3637491","url":null,"abstract":"Accurate simulation of turbulent flows is of crucial importance in many branches of science and engineering. Direct numerical simulation (DNS) provides the highest fidelity means of capturing all intricate physics of turbulent transport. However, the method is computationally expensive because of the wide range of turbulence scales that must be accounted for in such simulations. Large eddy simulation (LES) provides an alternative. In such simulations, the large scales of the flow are resolved and the effects of small scales are modelled. Reconstruction of the DNS field from the low-resolution LES is needed for a wide variety of applications. Thus the construction of super-resolution (SR) methodologies that can provide this reconstruction has become an area of active research. In this work, a new physics-guided neural network is developed for such a reconstruction. The method leverages the partial differential equation that underlies the flow dynamics in the design of spatio-temporal model architecture. A degradation-based refinement method is also developed to enforce physical constraints and to further reduce the accumulated reconstruction errors over long periods. Detailed DNS data on two turbulent flow configurations are used to assess the performance of the model.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"36 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138688694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Generating Daily Activities with Need Dynamics 利用需求动态生成日常活动

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2023-12-14 DOI: 10.1145/3637493

Yuan Yuan, Jingtao Ding, Huandong Wang, Depeng Jin

Daily activity data recording individuals’ various activities in daily life are widely used in many applications such as activity scheduling, activity recommendation, and policymaking. Though with high value, its accessibility is limited due to high collection costs and potential privacy issues. Therefore, simulating human activities to produce massive high-quality data is of great importance. However, existing solutions, including rule-based methods with simplified behavior assumptions and data-driven methods directly fitting real-world data, both cannot fully qualify for matching reality. In this paper, motivated by the classic psychological theory, Maslow’s need theory describing human motivation, we propose a knowledge-driven simulation framework based on generative adversarial imitation learning. Our core idea is to model the evolution of human needs as the underlying mechanism that drives activity generation in the simulation model. Specifically, a hierarchical model structure that disentangles different need levels and the use of neural stochastic differential equations successfully capture the piecewise-continuous characteristics of need dynamics. Extensive experiments demonstrate that our framework outperforms the state-of-the-art baselines regarding data fidelity and utility. We also present the insightful interpretability of the need modeling. Moreover, privacy preservation evaluations validate that the generated data does not leak individual privacy. The code is available at https://github.com/tsinghua-fib-lab/Activity-Simulation-SAND.

日常活动数据记录了个体在日常生活中的各种活动，广泛应用于活动调度、活动推荐、政策制定等领域。虽然具有很高的价值，但由于高昂的收集成本和潜在的隐私问题，其可访问性受到限制。因此，模拟人类活动产生大量高质量数据具有重要意义。然而，现有的解决方案，包括简化行为假设的基于规则的方法和直接拟合现实世界数据的数据驱动方法，都不能完全符合现实。本文以描述人类动机的经典心理学理论马斯洛需求理论为启发，提出了一种基于生成式对抗模仿学习的知识驱动模拟框架。我们的核心思想是将人类需求的演变建模为驱动模拟模型中活动生成的潜在机制。具体来说，一种分层模型结构将不同的需求层次分离开来，并使用神经随机微分方程成功地捕获了需求动力学的分段连续特征。大量的实验表明，我们的框架在数据保真度和实用性方面优于最先进的基线。我们还提出了需求建模的深刻的可解释性。此外，隐私保护评估验证了生成的数据不会泄露个人隐私。代码可在https://github.com/tsinghua-fib-lab/Activity-Simulation-SAND上获得。

{"title":"Generating Daily Activities with Need Dynamics","authors":"Yuan Yuan, Jingtao Ding, Huandong Wang, Depeng Jin","doi":"10.1145/3637493","DOIUrl":"https://doi.org/10.1145/3637493","url":null,"abstract":"Daily activity data recording individuals’ various activities in daily life are widely used in many applications such as activity scheduling, activity recommendation, and policymaking. Though with high value, its accessibility is limited due to high collection costs and potential privacy issues. Therefore, simulating human activities to produce massive high-quality data is of great importance. However, existing solutions, including rule-based methods with simplified behavior assumptions and data-driven methods directly fitting real-world data, both cannot fully qualify for matching reality. In this paper, motivated by the classic psychological theory, Maslow’s need theory describing human motivation, we propose a knowledge-driven simulation framework based on generative adversarial imitation learning. Our core idea is to model the evolution of human needs as the underlying mechanism that drives activity generation in the simulation model. Specifically, a hierarchical model structure that disentangles different need levels and the use of neural stochastic differential equations successfully capture the piecewise-continuous characteristics of need dynamics. Extensive experiments demonstrate that our framework outperforms the state-of-the-art baselines regarding data fidelity and utility. We also present the insightful interpretability of the need modeling. Moreover, privacy preservation evaluations validate that the generated data does not leak individual privacy. The code is available at https://github.com/tsinghua-fib-lab/Activity-Simulation-SAND.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"33 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138630639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Nationwide Air Pollution Forecasting with Heterogeneous Graph Neural Networks 利用异构图神经网络进行全国空气污染预测

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2023-12-14 DOI: 10.1145/3637492

Fernando Terroso-Saenz, Juan Morales-García, Andres Muñoz

Nowadays, air pollution is one of the most relevant environmental problems in most urban settings. Due to the utility in operational terms of anticipating certain pollution levels, several predictors based on Graph Neural Networks (GNN) have been proposed for the last years. Most of these solutions usually encode the relationships among stations in terms of their spatial distance, but they fail when it comes to capture other spatial and feature-based contextual factors. Besides, they assume a homogeneous setting where all the stations are able to capture the same pollutants. However, large-scale settings frequently comprise different types of stations, each one with different measurement capabilities. For that reason, the present paper introduces a novel GNN framework able to capture the similarities among stations related to the land use of their locations and their primary source of pollution. Furthermore, we define a methodology to deal with heterogeneous settings on the top of the GNN architecture. Finally, the proposal has been tested with a nation-wide Spanish air-pollution dataset with very promising results.

如今，空气污染是大多数城市环境中最相关的环境问题之一。由于在预测某些污染水平的操作方面的效用，在过去几年中已经提出了几种基于图神经网络(GNN)的预测器。大多数解决方案通常根据站点之间的空间距离编码站点之间的关系，但是当涉及到捕获其他基于空间和特征的上下文因素时，它们就失败了。此外，他们假设一个均匀的设置，所有的站能够捕获相同的污染物。然而，大尺度设置往往包括不同类型的台站，每一个都有不同的测量能力。因此，本文介绍了一种新的GNN框架，能够捕捉到站点之间与其位置的土地利用及其主要污染源有关的相似性。此外，我们定义了一种方法来处理GNN体系结构顶部的异构设置。最后，该提案已经在西班牙全国空气污染数据集上进行了测试，结果非常有希望。

{"title":"Nationwide Air Pollution Forecasting with Heterogeneous Graph Neural Networks","authors":"Fernando Terroso-Saenz, Juan Morales-García, Andres Muñoz","doi":"10.1145/3637492","DOIUrl":"https://doi.org/10.1145/3637492","url":null,"abstract":"Nowadays, air pollution is one of the most relevant environmental problems in most urban settings. Due to the utility in operational terms of anticipating certain pollution levels, several predictors based on Graph Neural Networks (GNN) have been proposed for the last years. Most of these solutions usually encode the relationships among stations in terms of their spatial distance, but they fail when it comes to capture other spatial and feature-based contextual factors. Besides, they assume a homogeneous setting where all the stations are able to capture the same pollutants. However, large-scale settings frequently comprise different types of stations, each one with different measurement capabilities. For that reason, the present paper introduces a novel GNN framework able to capture the similarities among stations related to the land use of their locations and their primary source of pollution. Furthermore, we define a methodology to deal with heterogeneous settings on the top of the GNN architecture. Finally, the proposal has been tested with a nation-wide Spanish air-pollution dataset with very promising results.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"4 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138630644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Explicit Knowledge Graph Reasoning for Conversational Recommendation 用于对话式推荐的显式知识图谱推理

IF 5 4区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Intelligent Systems and Technology

Pub Date : 2023-12-11 DOI: 10.1145/3637216

Xuhui Ren, Tong Chen, Quoc Viet Hung Nguyen, Lizhen Cui, Zi Huang, Hongzhi Yin

Traditional recommender systems estimate user preference on items purely based on historical interaction records, thus failing to capture fine-grained yet dynamic user interests and letting users receive recommendation only passively. Recent conversational recommender systems (CRSs) tackle those limitations by enabling recommender systems to interact with the user to obtain her/his current preference through a sequence of clarifying questions. Recently, there has been a rise of using knowledge graphs (KGs) for CRSs, where the core motivation is to incorporate the abundant side information carried by a KG into both the recommendation and conversation processes. However, existing KG-based CRSs are subject to two defects: (1) there is a semantic gap between the learned representations of utterances and KG entities, hindering the retrieval of relevant KG information; (2) the reasoning over KG is mostly performed with the implicitly learned user interests, overlooking the explicit signals from the entities actually mentioned in the conversation.

To address these drawbacks, we propose a new CRS framework, namely the Knowledge Enhanced Conversational Reasoning (KECR) model. As a user can reflect her/his preferences via both attribute- and item-level expressions, KECR jointly embeds the structured knowledge from two levels in the KG. A mutual information maximization constraint is further proposed for semantic alignment between the embedding spaces of utterances and KG entities. Meanwhile, KECR utilizes the connectivity within the KG to conduct explicit reasoning of the user demand, making the model less dependent on the user’s feedback to clarifying questions. As such, the semantic alignment and explicit KG reasoning can jointly facilitate accurate recommendation and quality dialogue generation. By comparing with strong baselines on two real-world datasets, we demonstrate that KECR obtains state-of-the-art recommendation effectiveness, as well as competitive dialogue generation performance.

传统的推荐系统纯粹根据历史交互记录来估算用户对项目的偏好，因此无法捕捉细粒度但动态的用户兴趣，用户只能被动地接受推荐。最近的对话式推荐系统（CRS）解决了这些局限性，它使推荐系统能够与用户互动，通过一连串的澄清问题获得用户当前的偏好。最近，在会话推荐系统中使用知识图谱（KGs）的做法开始兴起，其核心动机是将知识图谱所携带的丰富侧边信息纳入推荐和会话过程。然而，现有的基于知识图谱的 CRS 有两个缺陷：(1) 语篇的学习表示与知识图谱实体之间存在语义鸿沟，阻碍了相关知识图谱信息的检索；(2) 对知识图谱的推理大多是通过隐式学习的用户兴趣来进行的，忽略了对话中实际提及的实体的显式信号。为了解决这些问题，我们提出了一种新的会话推理框架，即知识增强会话推理（KECR）模型。由于用户可以通过属性级和项目级表达来反映自己的偏好，因此 KECR 将两个级别的结构化知识共同嵌入到 KG 中。为实现语篇嵌入空间与 KG 实体之间的语义对齐，进一步提出了互信息最大化约束。同时，KECR 利用 KG 内部的连通性对用户需求进行显式推理，使模型不再依赖于用户对澄清问题的反馈。因此，语义对齐和明确的 KG 推理可以共同促进准确的推荐和高质量的对话生成。通过在两个真实世界数据集上与强大的基线进行比较，我们证明了 KECR 获得了最先进的推荐效果以及具有竞争力的对话生成性能。

{"title":"Explicit Knowledge Graph Reasoning for Conversational Recommendation","authors":"Xuhui Ren, Tong Chen, Quoc Viet Hung Nguyen, Lizhen Cui, Zi Huang, Hongzhi Yin","doi":"10.1145/3637216","DOIUrl":"https://doi.org/10.1145/3637216","url":null,"abstract":"Traditional recommender systems estimate user preference on items purely based on historical interaction records, thus failing to capture fine-grained yet dynamic user interests and letting users receive recommendation only passively. Recent conversational recommender systems (CRSs) tackle those limitations by enabling recommender systems to interact with the user to obtain her/his current preference through a sequence of clarifying questions. Recently, there has been a rise of using knowledge graphs (KGs) for CRSs, where the core motivation is to incorporate the abundant side information carried by a KG into both the recommendation and conversation processes. However, existing KG-based CRSs are subject to two defects: (1) there is a semantic gap between the learned representations of utterances and KG entities, hindering the retrieval of relevant KG information; (2) the reasoning over KG is mostly performed with the implicitly learned user interests, overlooking the explicit signals from the entities actually mentioned in the conversation. To address these drawbacks, we propose a new CRS framework, namely the Knowledge Enhanced Conversational Reasoning (KECR) model. As a user can reflect her/his preferences via both attribute- and item-level expressions, KECR jointly embeds the structured knowledge from two levels in the KG. A mutual information maximization constraint is further proposed for semantic alignment between the embedding spaces of utterances and KG entities. Meanwhile, KECR utilizes the connectivity within the KG to conduct explicit reasoning of the user demand, making the model less dependent on the user’s feedback to clarifying questions. As such, the semantic alignment and explicit KG reasoning can jointly facilitate accurate recommendation and quality dialogue generation. By comparing with strong baselines on two real-world datasets, we demonstrate that KECR obtains state-of-the-art recommendation effectiveness, as well as competitive dialogue generation performance.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"7 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138572359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0