ACM Computing Surveys最新文献

英文中文

Resource-efficient Algorithms and Systems of Foundation Models: A Survey 资源效率算法和基础模型系统：综述

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-29 DOI: 10.1145/3706418

Mengwei Xu, Dongqi Cai, Wangsong Yin, Shangguang Wang, Xin Jin, Xuanzhe Liu

Large foundation models, including large language models, vision transformers, diffusion, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment. However, the substantial advancements in versatility and performance these models offer come at a significant cost in terms of hardware resources. To support the growth of these large models in a scalable and environmentally sustainable way, there has been a considerable focus on developing resource-efficient strategies. This survey delves into the critical importance of such research, examining both algorithmic and systemic aspects. It offers a comprehensive analysis and valuable insights gleaned from existing literature, encompassing a broad array of topics from cutting-edge model architectures and training/serving algorithms to practical system designs and implementations. The goal of this survey is to provide an overarching understanding of how current approaches are tackling the resource challenges posed by large foundation models and to potentially inspire future breakthroughs in this field.

大型基础模型，包括大型语言模型、视觉转换、扩散和基于llm的多模态模型，正在彻底改变整个机器学习生命周期，从训练到部署。然而，这些模型在多功能性和性能方面的巨大进步是以硬件资源方面的巨大成本为代价的。为了以可扩展和环境可持续的方式支持这些大型模型的增长，开发资源节约型战略受到了相当大的关注。本调查深入研究了此类研究的关键重要性，检查了算法和系统方面。它提供了从现有文献中收集的全面分析和有价值的见解，涵盖了从前沿模型架构和培训/服务算法到实际系统设计和实现的广泛主题。本调查的目的是提供一个总体的理解，当前的方法是如何解决大型基础模型带来的资源挑战的，并有可能激发该领域未来的突破。

引用次数: 0

SoK: Access Control Policy Generation from High-level Natural Language Requirements 从高级自然语言需求生成访问控制策略

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-28 DOI: 10.1145/3706057

Sakuna Harinda Jayasundara, Nalin Asanka Gamagedara Arachchilage, Giovanni Russello

Administrator-centered access control failures can cause data breaches, putting organizations at risk of financial loss and reputation damage. Existing graphical policy configuration tools and automated policy generation frameworks attempt to help administrators configure and generate access control policies by avoiding such failures. However, graphical policy configuration tools are prone to human errors, making them unusable. On the other hand, automated policy generation frameworks are prone to erroneous predictions, making them unreliable. Therefore, to find ways to improve their usability and reliability, we conducted a Systematic Literature Review analyzing 49 publications. The thematic analysis of the publications revealed that graphical policy configuration tools are developed to write and visualize policies manually. Moreover, automated policy generation frameworks are developed using machine learning (ML) and natural language processing (NLP) techniques to automatically generate access control policies from high-level requirement specifications. Despite their utility in the access control domain, limitations of these tools, such as the lack of flexibility, and limitations of frameworks, such as the lack of domain adaptation, negatively affect their usability and reliability, respectively. Our study offers recommendations to address these limitations through real-world applications and recent advancements in the NLP domain, paving the way for future research.

以管理员为中心的访问控制失败可能导致数据泄露，使组织面临财务损失和声誉受损的风险。现有的图形策略配置工具和自动策略生成框架试图通过避免此类故障来帮助管理员配置和生成访问控制策略。但是，图形化策略配置工具容易出现人为错误，使其无法使用。另一方面，自动化策略生成框架容易产生错误的预测，使其不可靠。因此，为了寻找提高其可用性和可靠性的方法，我们对49篇文献进行了系统的文献回顾分析。对出版物的专题分析显示，开发了图形化策略配置工具来手动编写和可视化策略。此外，使用机器学习（ML）和自然语言处理（NLP）技术开发了自动策略生成框架，以从高级需求规范自动生成访问控制策略。尽管这些工具在访问控制领域很有用，但它们的局限性，如缺乏灵活性，以及框架的局限性，如缺乏领域适应性，分别对它们的可用性和可靠性产生了负面影响。我们的研究通过实际应用和NLP领域的最新进展提出了解决这些限制的建议，为未来的研究铺平了道路。

{"title":"SoK: Access Control Policy Generation from High-level Natural Language Requirements","authors":"Sakuna Harinda Jayasundara, Nalin Asanka Gamagedara Arachchilage, Giovanni Russello","doi":"10.1145/3706057","DOIUrl":"https://doi.org/10.1145/3706057","url":null,"abstract":"Administrator-centered access control failures can cause data breaches, putting organizations at risk of financial loss and reputation damage. Existing graphical policy configuration tools and automated policy generation frameworks attempt to help administrators configure and generate access control policies by avoiding such failures. However, graphical policy configuration tools are prone to human errors, making them unusable. On the other hand, automated policy generation frameworks are prone to erroneous predictions, making them unreliable. Therefore, to find ways to improve their usability and reliability, we conducted a Systematic Literature Review analyzing 49 publications. The thematic analysis of the publications revealed that graphical policy configuration tools are developed to write and visualize policies manually. Moreover, automated policy generation frameworks are developed using machine learning (ML) and natural language processing (NLP) techniques to automatically generate access control policies from high-level requirement specifications. Despite their utility in the access control domain, limitations of these tools, such as the lack of flexibility, and limitations of frameworks, such as the lack of domain adaptation, negatively affect their usability and reliability, respectively. Our study offers recommendations to address these limitations through real-world applications and recent advancements in the NLP domain, paving the way for future research.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"14 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142742597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

AI-powered Fraud Detection in Decentralized Finance: A Project Life Cycle Perspective 去中心化金融中的人工智能欺诈检测：项目生命周期视角

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-25 DOI: 10.1145/3705296

Bingqiao Luo, Zhen Zhang, Qian Wang, Anli Ke, Shengliang Lu, Bingsheng He

Decentralized finance (DeFi) represents a novel financial system but faces significant fraud challenges, leading to substantial losses. Recent advancements in artificial intelligence (AI) show potential for complex fraud detection. Despite growing interest, a systematic review of these methods is lacking. This survey correlates fraud types with DeFi project stages, presenting a taxonomy based on the project life cycle. We evaluate AI techniques, revealing notable findings such as the superiority of tree-based and graph-related models. Based on these insights, we offer recommendations and outline future research directions to aid researchers, practitioners, and regulators in enhancing DeFi security.

去中心化金融（DeFi）是一种新颖的金融系统，但也面临着巨大的欺诈挑战，导致大量损失。人工智能（AI）的最新进展显示了复杂欺诈检测的潜力。尽管人们对这些方法的兴趣与日俱增，但却缺乏对这些方法的系统回顾。本调查将欺诈类型与 DeFi 项目阶段相关联，提出了基于项目生命周期的分类法。我们对人工智能技术进行了评估，发现了基于树的模型和图相关模型的优越性等显著发现。基于这些见解，我们提出了建议并概述了未来的研究方向，以帮助研究人员、从业人员和监管机构提高 DeFi 的安全性。

引用次数: 0

Wi-Fi Sensing Techniques for Human Activity Recognition: Brief Survey, Potential Challenges, and Research Directions 用于人类活动识别的 Wi-Fi 传感技术：简要调查、潜在挑战和研究方向

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-25 DOI: 10.1145/3705893

Fucheng Miao, Youxiang Huang, Zhiyi Lu, Tomoaki Ohtsuki, Guan Gui, Hikmet Sari

Recent advancements in wireless communication technologies have made Wi-Fi signals indispensable in both personal and professional settings. The utilization of these signals for Human Activity Recognition (HAR) has emerged as a cutting-edge technology. By leveraging the fluctuations in Wi-Fi signals for HAR, this approach offers enhanced privacy compared to traditional visual surveillance methods. The essence of this technique lies in detecting subtle changes when Wi-Fi signals interact with the human body, which are then captured and interpreted by advanced algorithms. This paper initially provides an overview of the key methodologies in HAR and the evolution of non-contact sensing, introducing sensor-based recognition, computer vision, and Wi-Fi signal-based approaches, respectively. It then explores tools for Wi-Fi-based HAR signal collection and lists several high-quality datasets. Subsequently, the paper reviews various sensing tasks enabled by Wi-Fi signal recognition, highlighting the application of deep learning networks in Wi-Fi signal detection. The fourth section presents experimental results that assess the capabilities of different networks. The findings indicate significant variability in the generalization capacities of neural networks and notable differences in test accuracy for various motion analyses.

无线通信技术的最新进展使 Wi-Fi 信号在个人和专业环境中都变得不可或缺。利用这些信号进行人类活动识别（HAR）已成为一项尖端技术。与传统的视觉监控方法相比，利用 Wi-Fi 信号的波动进行人类活动识别（HAR）可增强隐私性。这种技术的精髓在于检测 Wi-Fi 信号与人体相互作用时的微妙变化，然后通过先进的算法捕捉和解读这些变化。本文首先概述了 HAR 的主要方法和非接触式传感的发展，分别介绍了基于传感器的识别、计算机视觉和基于 Wi-Fi 信号的方法。然后，论文探讨了基于 Wi-Fi 的 HAR 信号采集工具，并列出了几个高质量的数据集。随后，论文回顾了通过 Wi-Fi 信号识别实现的各种传感任务，重点介绍了深度学习网络在 Wi-Fi 信号检测中的应用。第四部分介绍了评估不同网络能力的实验结果。研究结果表明，神经网络的泛化能力存在很大差异，各种运动分析的测试准确性也存在明显差异。

{"title":"Wi-Fi Sensing Techniques for Human Activity Recognition: Brief Survey, Potential Challenges, and Research Directions","authors":"Fucheng Miao, Youxiang Huang, Zhiyi Lu, Tomoaki Ohtsuki, Guan Gui, Hikmet Sari","doi":"10.1145/3705893","DOIUrl":"https://doi.org/10.1145/3705893","url":null,"abstract":"Recent advancements in wireless communication technologies have made Wi-Fi signals indispensable in both personal and professional settings. The utilization of these signals for Human Activity Recognition (HAR) has emerged as a cutting-edge technology. By leveraging the fluctuations in Wi-Fi signals for HAR, this approach offers enhanced privacy compared to traditional visual surveillance methods. The essence of this technique lies in detecting subtle changes when Wi-Fi signals interact with the human body, which are then captured and interpreted by advanced algorithms. This paper initially provides an overview of the key methodologies in HAR and the evolution of non-contact sensing, introducing sensor-based recognition, computer vision, and Wi-Fi signal-based approaches, respectively. It then explores tools for Wi-Fi-based HAR signal collection and lists several high-quality datasets. Subsequently, the paper reviews various sensing tasks enabled by Wi-Fi signal recognition, highlighting the application of deep learning networks in Wi-Fi signal detection. The fourth section presents experimental results that assess the capabilities of different networks. The findings indicate significant variability in the generalization capacities of neural networks and notable differences in test accuracy for various motion analyses.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"191 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142713368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Causal Discovery from Temporal Data: An Overview and New Perspectives 从时态数据中发现因果关系：概述与新视角

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-23 DOI: 10.1145/3705297

Chang Gong, Chuzhe Zhang, Di Yao, Jingping Bi, Wenbin Li, YongJun Xu

Temporal data, representing chronological observations of complex systems, has always been a typical data structure that can be widely generated by many domains, such as industry, finance, healthcare and climatology. Analyzing the underlying structures, i.e. , the causal relations, could be extremely valuable for various applications. Recently, causal discovery from temporal data has been considered as an interesting yet critical task and attracted much research attention. According to the nature and structure of temporal data, existing causal discovery works can be divided into two highly correlated categories i.e. , multivariate time series causal discovery, and event sequence causal discovery. However, most previous surveys are only focused on the multivariate time series causal discovery but ignore the second category. In this paper, we specify the similarity between the two categories and provide an overview of existing solutions. Furthermore, we provide public datasets, evaluation metrics and new perspectives for temporal data causal discovery.

代表复杂系统按时间顺序观测结果的时态数据一直是一种典型的数据结构，可广泛应用于工业、金融、医疗保健和气候学等多个领域。分析底层结构，即因果关系，对各种应用都极具价值。最近，从时态数据中发现因果关系被认为是一项有趣而又关键的任务，吸引了大量研究人员的关注。根据时态数据的性质和结构，现有的因果发现工作可分为两个高度相关的类别，即多变量时间序列因果发现和事件序列因果发现。然而，以往的研究大多只关注多变量时间序列因果发现，而忽略了第二类因果发现。在本文中，我们明确了这两类发现之间的相似性，并概述了现有的解决方案。此外，我们还为时态数据因果发现提供了公共数据集、评估指标和新视角。

引用次数: 0

Explainable Artificial Intelligence: Importance, Use Domains, Stages, Output Shapes, and Challenges 可解释的人工智能：重要性、使用领域、阶段、输出形状和挑战

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-23 DOI: 10.1145/3705724

Naeem Ullah, Javed Ali Khan, Ivanoe De Falco, Giovanna Sannino

There is an urgent need in many application areas for eXplainable ArtificiaI Intelligence (XAI) approaches to boost people’s confidence and trust in Artificial Intelligence methods. Current works concentrate on specific aspects of XAI and avoid a comprehensive perspective. This study undertakes a systematic survey of importance, approaches, methods, and application domains to address this gap and provide a comprehensive understanding of the XAI domain. Applying the Systematic Literature Review approach has resulted in finding and discussing 155 papers, allowing a wide discussion on the strengths, limitations, and challenges of XAI methods and future research directions.

许多应用领域迫切需要可解释的人工智能（XAI）方法，以增强人们对人工智能方法的信心和信任。目前的研究主要集中在 XAI 的特定方面，缺乏全面的视角。本研究对重要性、方式、方法和应用领域进行了系统调查，以弥补这一不足，并提供对 XAI 领域的全面了解。通过应用系统文献综述方法，我们找到并讨论了 155 篇论文，从而就 XAI 方法的优势、局限性和挑战以及未来研究方向展开了广泛讨论。

引用次数: 0

Racial Bias within Face Recognition: A Survey 人脸识别中的种族偏见：一项调查

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-22 DOI: 10.1145/3705295

Seyma Yucer, Furkan Tektas, Noura Al Moubayed, Toby Breckon

Facial recognition is one of the most academically studied and industrially developed areas within computer vision where we readily find associated applications deployed globally. This widespread adoption has uncovered significant performance variation across subjects of different racial profiles leading to focused research attention on racial bias within face recognition spanning both current causation and future potential solutions. In support, this study provides an extensive taxonomic review of research on racial bias within face recognition exploring every aspect and stage of the associated facial processing pipeline. Firstly, we discuss the problem definition of racial bias, starting with race definition, grouping strategies, and the societal implications of using race or race-related groupings. Secondly, we divide the common face recognition processing pipeline into four stages: image acquisition, face localisation, face representation, face verification and identification, and review the relevant corresponding literature associated with each stage. The overall aim is to provide comprehensive coverage of the racial bias problem with respect to each and every stage of the face recognition processing pipeline whilst also highlighting the potential pitfalls and limitations of contemporary mitigation strategies that need to be considered within future research endeavours or commercial applications alike.

人脸识别是计算机视觉领域中学术研究和工业开发最多的领域之一，我们很容易在全球范围内发现相关的应用。这种广泛的应用发现了不同种族被试之间的显著性能差异，从而引发了对人脸识别中种族偏见的集中研究，包括当前的成因和未来潜在的解决方案。为此，本研究对人脸识别中的种族偏见研究进行了广泛的分类综述，探讨了相关面部处理管道的各个方面和阶段。首先，我们讨论了种族偏见的问题定义，从种族定义、分组策略以及使用种族或种族相关分组的社会影响入手。其次，我们将常见的人脸识别处理流程分为四个阶段：图像采集、人脸定位、人脸表示、人脸验证和识别，并回顾了与每个阶段相关的相应文献。我们的总体目标是在人脸识别处理流程的每一个阶段全面覆盖种族偏见问题，同时强调当代缓解策略的潜在隐患和局限性，这些都需要在未来的研究工作或商业应用中加以考虑。

{"title":"Racial Bias within Face Recognition: A Survey","authors":"Seyma Yucer, Furkan Tektas, Noura Al Moubayed, Toby Breckon","doi":"10.1145/3705295","DOIUrl":"https://doi.org/10.1145/3705295","url":null,"abstract":"Facial recognition is one of the most academically studied and industrially developed areas within computer vision where we readily find associated applications deployed globally. This widespread adoption has uncovered significant performance variation across subjects of different racial profiles leading to focused research attention on racial bias within face recognition spanning both current causation and future potential solutions. In support, this study provides an extensive taxonomic review of research on racial bias within face recognition exploring every aspect and stage of the associated facial processing pipeline. Firstly, we discuss the problem definition of racial bias, starting with race definition, grouping strategies, and the societal implications of using race or race-related groupings. Secondly, we divide the common face recognition processing pipeline into four stages: image acquisition, face localisation, face representation, face verification and identification, and review the relevant corresponding literature associated with each stage. The overall aim is to provide comprehensive coverage of the racial bias problem with respect to each and every stage of the face recognition processing pipeline whilst also highlighting the potential pitfalls and limitations of contemporary mitigation strategies that need to be considered within future research endeavours or commercial applications alike.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"115 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142690743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Survey of Machine Learning for Urban Decision Making: Applications in Planning, Transportation, and Healthcare 城市决策机器学习概览》：规划、交通和医疗领域的应用

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-22 DOI: 10.1145/3695986

Yu Zheng, Qianyue Hao, Jingwei Wang, Changzheng Gao, Jinwei Chen, Depeng Jin, Yong Li

Developing smart cities is vital for ensuring sustainable development and improving human well-being. One critical aspect of building smart cities is designing intelligent methods to address various decision-making problems that arise in urban areas. As machine learning techniques continue to advance rapidly, a growing body of research has been focused on utilizing these methods to achieve intelligent urban decision making. In this survey, we conduct a systematic literature review on the application of machine learning methods in urban decision making, with a focus on planning, transportation, and healthcare. First, we provide a taxonomy based on typical applications of machine learning methods for urban decision making. We then present background knowledge on these tasks and the machine learning techniques that have been adopted to solve them. Next, we examine the challenges and advantages of applying machine learning in urban decision making, including issues related to urban complexity, urban heterogeneity and computational cost. Afterward and primarily, we elaborate on the existing machine learning methods that aim to solve urban decision making tasks in planning, transportation, and healthcare, highlighting their strengths and limitations. Finally, we discuss open problems and the future directions of applying machine learning to enable intelligent urban decision making, such as developing foundation models and combining reinforcement learning algorithms with human feedback. We hope this survey can help researchers in related fields understand the recent progress made in existing works, and inspire novel applications of machine learning in smart cities.

发展智慧城市对于确保可持续发展和改善人类福祉至关重要。建设智慧城市的一个重要方面是设计智能方法，以解决城市地区出现的各种决策问题。随着机器学习技术的不断快速发展，越来越多的研究集中于利用这些方法实现智能城市决策。在本调查中，我们对机器学习方法在城市决策中的应用进行了系统的文献综述，重点关注规划、交通和医疗保健领域。首先，我们根据机器学习方法在城市决策中的典型应用进行了分类。然后，我们介绍了这些任务的背景知识以及解决这些任务所采用的机器学习技术。接下来，我们探讨了在城市决策中应用机器学习所面临的挑战和优势，包括与城市复杂性、城市异质性和计算成本相关的问题。随后，我们主要阐述了旨在解决规划、交通和医疗保健领域城市决策任务的现有机器学习方法，并强调了这些方法的优势和局限性。最后，我们讨论了应用机器学习实现智能城市决策的未决问题和未来方向，如开发基础模型、将强化学习算法与人工反馈相结合等。我们希望这份调查报告能帮助相关领域的研究人员了解现有工作的最新进展，并激发机器学习在智慧城市中的新应用。

{"title":"A Survey of Machine Learning for Urban Decision Making: Applications in Planning, Transportation, and Healthcare","authors":"Yu Zheng, Qianyue Hao, Jingwei Wang, Changzheng Gao, Jinwei Chen, Depeng Jin, Yong Li","doi":"10.1145/3695986","DOIUrl":"https://doi.org/10.1145/3695986","url":null,"abstract":"Developing smart cities is vital for ensuring sustainable development and improving human well-being. One critical aspect of building smart cities is designing intelligent methods to address various decision-making problems that arise in urban areas. As machine learning techniques continue to advance rapidly, a growing body of research has been focused on utilizing these methods to achieve intelligent urban decision making. In this survey, we conduct a systematic literature review on the application of machine learning methods in urban decision making, with a focus on planning, transportation, and healthcare. First, we provide a taxonomy based on typical applications of machine learning methods for urban decision making. We then present background knowledge on these tasks and the machine learning techniques that have been adopted to solve them. Next, we examine the challenges and advantages of applying machine learning in urban decision making, including issues related to urban complexity, urban heterogeneity and computational cost. Afterward and primarily, we elaborate on the existing machine learning methods that aim to solve urban decision making tasks in planning, transportation, and healthcare, highlighting their strengths and limitations. Finally, we discuss open problems and the future directions of applying machine learning to enable intelligent urban decision making, such as developing foundation models and combining reinforcement learning algorithms with human feedback. We hope this survey can help researchers in related fields understand the recent progress made in existing works, and inspire novel applications of machine learning in smart cities.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"15 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142691069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tool Learning with Foundation Models 利用基础模型进行工具学习

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-21 DOI: 10.1145/3704435

Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Xuanhe Zhou, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Guoliang Li, Zhiyuan Liu, Maosong Sun

Humans possess an extraordinary ability to create and utilize tools. With the advent of foundation models, artificial intelligence systems have the potential to be equally adept in tool use as humans. This paradigm, which is dubbed as tool learning with foundation models , combines the strengths of tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. This paper presents a systematic investigation and comprehensive review of tool learning. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing tool learning research and formulate a general framework: starting from understanding the user instruction, models should learn to decompose a complex task into several subtasks, dynamically adjust their plan through reasoning, and effectively conquer each sub-task by selecting appropriate tools. We also discuss how to train models for improved tool-use capabilities and facilitate generalization in tool learning. Finally, we discuss several open problems that require further investigation, such as ensuring trustworthy tool use, enabling tool creation with foundation models, and addressing personalization challenges. Overall, we hope this paper could inspire future research in integrating tools with foundation models.

人类拥有创造和使用工具的非凡能力。随着基础模型的出现，人工智能系统有可能像人类一样善于使用工具。这种模式被称为 "工具学习与基础模型"，它结合了工具和基础模型的优势，从而提高了解决问题的准确性、效率和自动化程度。本文对工具学习进行了系统研究和全面评述。我们首先介绍了工具学习的背景，包括其认知起源、基础模型的范式转变以及工具和模型的互补作用。然后，我们回顾了现有的工具学习研究，并提出了一个总体框架：从理解用户指令开始，模型应学会将复杂任务分解为多个子任务，通过推理动态调整计划，并通过选择适当的工具有效地完成每个子任务。我们还讨论了如何训练模型以提高工具使用能力，并促进工具学习的泛化。最后，我们讨论了几个需要进一步研究的开放性问题，如确保工具使用的可信度、利用基础模型创建工具以及应对个性化挑战。总之，我们希望本文能对未来将工具与基础模型相结合的研究有所启发。

{"title":"Tool Learning with Foundation Models","authors":"Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Xuanhe Zhou, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Guoliang Li, Zhiyuan Liu, Maosong Sun","doi":"10.1145/3704435","DOIUrl":"https://doi.org/10.1145/3704435","url":null,"abstract":"Humans possess an extraordinary ability to create and utilize tools. With the advent of foundation models, artificial intelligence systems have the potential to be equally adept in tool use as humans. This paradigm, which is dubbed as tool learning with foundation models , combines the strengths of tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. This paper presents a systematic investigation and comprehensive review of tool learning. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing tool learning research and formulate a general framework: starting from understanding the user instruction, models should learn to decompose a complex task into several subtasks, dynamically adjust their plan through reasoning, and effectively conquer each sub-task by selecting appropriate tools. We also discuss how to train models for improved tool-use capabilities and facilitate generalization in tool learning. Finally, we discuss several open problems that require further investigation, such as ensuring trustworthy tool use, enabling tool creation with foundation models, and addressing personalization challenges. Overall, we hope this paper could inspire future research in integrating tools with foundation models.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"15 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142684949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Collaborative Distributed Machine Learning 协作式分布机器学习

IF 16.6 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

ACM Computing Surveys

Pub Date : 2024-11-20 DOI: 10.1145/3704807

David Jin, Niclas Kannengießer, Sascha Rank, Ali Sunyaev

Various collaborative distributed machine learning (CDML) systems, including federated learning systems and swarm learning systems, with different key traits were developed to leverage resources for the development and use of machine learning (ML) models in a confidentiality-preserving way. To meet use case requirements, suitable CDML systems need to be selected. However, comparison between CDML systems to assess their suitability for use cases is often difficult. To support comparison of CDML systems and introduce scientific and practical audiences to the principal functioning and key traits of CDML systems, this work presents a CDML system conceptualization and CDML archetypes.

为了以保密方式利用资源开发和使用机器学习（ML）模型，开发了各种具有不同关键特征的协作分布式机器学习（CDML）系统，包括联合学习系统和群学习系统。为满足用例要求，需要选择合适的 CDML 系统。然而，对 CDML 系统进行比较以评估其是否适合用例往往很困难。为了支持对 CDML 系统进行比较，并向科学界和实际受众介绍 CDML 系统的主要功能和关键特征，这项工作提出了 CDML 系统概念化和 CDML 原型。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

ACM Computing Surveys

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀