首页 > 最新文献

ACM Computing Surveys最新文献

英文 中文
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications, and Opportunities 法学硕士、法学硕士及其他领域的模型合并:方法、理论、应用和机遇
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-10 DOI: 10.1145/3787849
Enneng Yang, Li Shen, Guibing Guo, Xingwei Wang, Xiaochun Cao, Jie Zhang, Dacheng Tao
Model merging is an efficient empowerment technique in the machine learning community that does not require the collection of raw training data and does not require expensive computation. As model merging becomes increasingly prevalent across various fields, it is crucial to understand the available model merging techniques comprehensively. However, there is a significant gap in the literature regarding a systematic and thorough review of these techniques. This survey provides a comprehensive overview of model merging methods and theories, their applications in various domains and settings, and future research directions. Specifically, we first propose a new taxonomic approach that exhaustively discusses existing model merging methods. Secondly, we discuss the application of model merging techniques in large language models, multimodal large language models, and more than ten machine learning subfields, including continual learning, multi-task learning, few-shot learning, etc. Finally, we highlight the remaining challenges of model merging and discuss future research directions. A comprehensive list of papers about model merging is available at https://github.com/EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications .
模型合并是机器学习社区中一种有效的授权技术,它不需要收集原始训练数据,也不需要昂贵的计算。随着模型合并在各个领域变得越来越普遍,全面了解可用的模型合并技术是至关重要的。然而,关于这些技术的系统和彻底的审查的文献有一个显著的差距。本文全面概述了模型融合的方法和理论、在不同领域和环境中的应用以及未来的研究方向。具体来说,我们首先提出了一种新的分类方法,详尽地讨论了现有的模型合并方法。其次,讨论了模型合并技术在大型语言模型、多模态大型语言模型以及持续学习、多任务学习、少镜头学习等十多个机器学习子领域中的应用。最后,我们强调了模型融合仍存在的挑战,并讨论了未来的研究方向。关于模型合并的论文的综合列表可以在https://github.com/EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications上找到。
{"title":"Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications, and Opportunities","authors":"Enneng Yang, Li Shen, Guibing Guo, Xingwei Wang, Xiaochun Cao, Jie Zhang, Dacheng Tao","doi":"10.1145/3787849","DOIUrl":"https://doi.org/10.1145/3787849","url":null,"abstract":"Model merging is an efficient empowerment technique in the machine learning community that does not require the collection of raw training data and does not require expensive computation. As model merging becomes increasingly prevalent across various fields, it is crucial to understand the available model merging techniques comprehensively. However, there is a significant gap in the literature regarding a systematic and thorough review of these techniques. This survey provides a comprehensive overview of model merging methods and theories, their applications in various domains and settings, and future research directions. Specifically, we first propose a new taxonomic approach that exhaustively discusses existing model merging methods. Secondly, we discuss the application of model merging techniques in large language models, multimodal large language models, and more than ten machine learning subfields, including continual learning, multi-task learning, few-shot learning, etc. Finally, we highlight the remaining challenges of model merging and discuss future research directions. A comprehensive list of papers about model merging is available at <jats:italic toggle=\"yes\">https://github.com/EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications</jats:italic> .","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"94 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145947217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Training Data For Large Language Models: A Survey 大型语言模型的训练数据检测:综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-07 DOI: 10.1145/3779430
Chen Yang, Junyi Li, Shulin Lan, Yingchao Wang, Hongyang Du, Congcheng Gong, Xingshan Yao, Dusit (Tao) Niyato, Liehuang Zhu
As large language models (LLMs) continue to evolve, the scope and diversity of data used for training are expanding significantly. However, the training dataset of LLMs may inevitably contain sensitive information such as personal data or copyrighted material, leading to privacy leakage or copyright infringement risks if the model generates highly similar or identical text to these sources. This has drawn attention to the issue of detecting whether the text data is used for LLM training. To date, research on detecting training data usage in artificial intelligence (AI) models has mainly focused on traditional machine learning (ML) models. However, studies on LLMs remain relatively immature. The lack of understanding of research progress in this area has hindered the development of more effective detection methods. Therefore, this article aims to address this gap by conducting the analysis of detecting training data for LLM. Specifically, we analyze the available LLM’s information to the detector, the main detection methods, determination metrics, and discuss the technical challenges and potential directions for future research in this field.
随着大型语言模型(llm)的不断发展,用于训练的数据的范围和多样性正在显著扩大。然而,法学硕士的训练数据集可能不可避免地包含敏感信息,如个人数据或受版权保护的材料,如果模型生成的文本与这些来源高度相似或相同,则会导致隐私泄露或版权侵权风险。这引起了人们对检测文本数据是否用于LLM训练的关注。迄今为止,人工智能(AI)模型中训练数据使用检测的研究主要集中在传统的机器学习(ML)模型上。然而,法学硕士的研究还相对不成熟。对这一领域的研究进展缺乏了解,阻碍了更有效检测方法的发展。因此,本文旨在通过对LLM检测训练数据进行分析来解决这一空白。具体来说,我们分析了LLM对检测器的可用信息、主要检测方法、测定指标,并讨论了该领域未来研究的技术挑战和潜在方向。
{"title":"Detecting Training Data For Large Language Models: A Survey","authors":"Chen Yang, Junyi Li, Shulin Lan, Yingchao Wang, Hongyang Du, Congcheng Gong, Xingshan Yao, Dusit (Tao) Niyato, Liehuang Zhu","doi":"10.1145/3779430","DOIUrl":"https://doi.org/10.1145/3779430","url":null,"abstract":"As large language models (LLMs) continue to evolve, the scope and diversity of data used for training are expanding significantly. However, the training dataset of LLMs may inevitably contain sensitive information such as personal data or copyrighted material, leading to privacy leakage or copyright infringement risks if the model generates highly similar or identical text to these sources. This has drawn attention to the issue of detecting whether the text data is used for LLM training. To date, research on detecting training data usage in artificial intelligence (AI) models has mainly focused on traditional machine learning (ML) models. However, studies on LLMs remain relatively immature. The lack of understanding of research progress in this area has hindered the development of more effective detection methods. Therefore, this article aims to address this gap by conducting the analysis of detecting training data for LLM. Specifically, we analyze the available LLM’s information to the detector, the main detection methods, determination metrics, and discuss the technical challenges and potential directions for future research in this field.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"45 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145920261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Models for Computer-Aided Design: A Survey 计算机辅助设计的大型语言模型:综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-06 DOI: 10.1145/3787499
Licheng Zhang, Bach Le, Naveed Akhtar, Siew-Kei Lam, Duc Ngo
Large Language Models (LLMs) have seen rapid advancements in recent years, with models like ChatGPT and DeepSeek, showcasing their remarkable capabilities across diverse domains. While substantial research has been conducted on LLMs in various fields, a comprehensive review focusing on their integration with Computer-Aided Design (CAD) remains notably absent. CAD is the industry standard for 3D modeling and plays a vital role in the design and development of products across different industries. As the complexity of modern designs increases, the potential for LLMs to enhance and streamline CAD workflows presents an exciting frontier. This article presents the first systematic survey exploring the intersection of LLMs and CAD. We begin by outlining the industrial significance of CAD, highlighting the need for Artificial Intelligence (AI)-driven innovation. Next, we provide a detailed overview of the foundation of LLMs. We also examine both closed-source LLMs as well as publicly available models. The core of this review focuses on the various applications of LLMs in CAD, providing a taxonomy of six key areas where these models are making considerable impact. We also provide a comprehensive study of CAD evaluation, reviewing existing methods and metrics in detail. In our analysis, we also examine common data modalities, model usage trends, dataset sources, and industrial application domains to provide a well-rounded picture of the field. Finally, we propose several promising future directions for further advancements, which offer vast opportunities for innovation and are poised to shape the future of CAD technology. Github: https://github.com/lichengzhanguom/LLMs-CAD-Survey-Taxonomy
近年来,大型语言模型(llm)发展迅速,ChatGPT和DeepSeek等模型展示了它们在不同领域的卓越能力。虽然对法学硕士在各个领域进行了大量的研究,但对其与计算机辅助设计(CAD)的集成进行全面的审查仍然明显缺乏。CAD是3D建模的行业标准,在不同行业的产品设计和开发中起着至关重要的作用。随着现代设计复杂性的增加,法学硕士增强和简化CAD工作流程的潜力呈现出令人兴奋的前沿。本文首次系统地探讨了法学硕士与CAD的交叉。我们首先概述了CAD的工业意义,强调了对人工智能(AI)驱动的创新的需求。接下来,我们将详细介绍法学硕士的基础。我们还研究了闭源法学硕士和公开可用的模型。这篇综述的核心集中在法学硕士在CAD中的各种应用,提供了这些模型产生相当大影响的六个关键领域的分类。我们还提供了CAD评估的全面研究,详细回顾了现有的方法和指标。在我们的分析中,我们还研究了常见的数据模式、模型使用趋势、数据集来源和工业应用领域,以提供该领域的全面图景。最后,我们提出了几个有希望的未来发展方向,这些方向为创新提供了巨大的机会,并准备塑造CAD技术的未来。Github: https://github.com/lichengzhanguom/LLMs-CAD-Survey-Taxonomy
{"title":"Large Language Models for Computer-Aided Design: A Survey","authors":"Licheng Zhang, Bach Le, Naveed Akhtar, Siew-Kei Lam, Duc Ngo","doi":"10.1145/3787499","DOIUrl":"https://doi.org/10.1145/3787499","url":null,"abstract":"Large Language Models (LLMs) have seen rapid advancements in recent years, with models like ChatGPT and DeepSeek, showcasing their remarkable capabilities across diverse domains. While substantial research has been conducted on LLMs in various fields, a comprehensive review focusing on their integration with Computer-Aided Design (CAD) remains notably absent. CAD is the industry standard for 3D modeling and plays a vital role in the design and development of products across different industries. As the complexity of modern designs increases, the potential for LLMs to enhance and streamline CAD workflows presents an exciting frontier. This article presents the first systematic survey exploring the intersection of LLMs and CAD. We begin by outlining the industrial significance of CAD, highlighting the need for Artificial Intelligence (AI)-driven innovation. Next, we provide a detailed overview of the foundation of LLMs. We also examine both closed-source LLMs as well as publicly available models. The core of this review focuses on the various applications of LLMs in CAD, providing a taxonomy of six key areas where these models are making considerable impact. We also provide a comprehensive study of CAD evaluation, reviewing existing methods and metrics in detail. In our analysis, we also examine common data modalities, model usage trends, dataset sources, and industrial application domains to provide a well-rounded picture of the field. Finally, we propose several promising future directions for further advancements, which offer vast opportunities for innovation and are poised to shape the future of CAD technology. Github: https://github.com/lichengzhanguom/LLMs-CAD-Survey-Taxonomy","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"41 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145908000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DNN Partitioning for Cooperative Inference in Edge Intelligence: Modeling, Solutions, Toolchains 边缘智能协同推理的深度神经网络划分:建模,解决方案,工具链
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-30 DOI: 10.1145/3786145
Yuntao Hao, Nan Ding, Weiguo Xia, Hongwei Ge, Li Xu
With rapid advancements in artificial intelligence and Internet of Things technologies, the deployment of deep neural network (DNN) models on the edge nodes and the end nodes has become an essential trend. However, the limited computational power, storage capacity, and resource constraints of these devices present significant challenges for deep learning inference. Traditional acceleration methods, such as model compression and hardware optimization, often struggle to balance real-time performance, accuracy, and cost-effectiveness. To address these challenges, collaborative inference through DNN partitioning has emerged as a promising solution. This paper provides a comprehensive overview of architectural frameworks for DNN partitioning in collaborative inference. We establish a unified mathematical framework to describe various architectures, DNN models, and their associated optimization problems. In addition, we systematically classify and analyze existing partitioning strategies based on partition count and granularity. Furthermore, we summarize commonly used experimental setups and tools, offering practical insight into implementation. Finally, we discuss key challenges and open issues in DNN partitioning for collaborative inference, such as ensuring data security and privacy and efficiently partitioning large-scale models, providing valuable guidance for future research.
随着人工智能和物联网技术的飞速发展,深度神经网络(DNN)模型在边缘节点和终端节点上的部署已成为必然趋势。然而,这些设备有限的计算能力、存储容量和资源限制对深度学习推理提出了重大挑战。传统的加速方法,如模型压缩和硬件优化,往往难以平衡实时性能、准确性和成本效益。为了应对这些挑战,通过DNN划分进行协作推理已经成为一种很有前途的解决方案。本文提供了协作推理中DNN划分的体系结构框架的全面概述。我们建立了一个统一的数学框架来描述各种架构、深度神经网络模型及其相关的优化问题。此外,我们还基于分区计数和粒度对现有的分区策略进行了系统的分类和分析。此外,我们总结了常用的实验设置和工具,为实现提供了实际的见解。最后,我们讨论了协同推理中DNN划分的关键挑战和开放问题,如确保数据安全和隐私以及有效划分大规模模型,为未来的研究提供了有价值的指导。
{"title":"DNN Partitioning for Cooperative Inference in Edge Intelligence: Modeling, Solutions, Toolchains","authors":"Yuntao Hao, Nan Ding, Weiguo Xia, Hongwei Ge, Li Xu","doi":"10.1145/3786145","DOIUrl":"https://doi.org/10.1145/3786145","url":null,"abstract":"With rapid advancements in artificial intelligence and Internet of Things technologies, the deployment of deep neural network (DNN) models on the edge nodes and the end nodes has become an essential trend. However, the limited computational power, storage capacity, and resource constraints of these devices present significant challenges for deep learning inference. Traditional acceleration methods, such as model compression and hardware optimization, often struggle to balance real-time performance, accuracy, and cost-effectiveness. To address these challenges, collaborative inference through DNN partitioning has emerged as a promising solution. This paper provides a comprehensive overview of architectural frameworks for DNN partitioning in collaborative inference. We establish a unified mathematical framework to describe various architectures, DNN models, and their associated optimization problems. In addition, we systematically classify and analyze existing partitioning strategies based on partition count and granularity. Furthermore, we summarize commonly used experimental setups and tools, offering practical insight into implementation. Finally, we discuss key challenges and open issues in DNN partitioning for collaborative inference, such as ensuring data security and privacy and efficiently partitioning large-scale models, providing valuable guidance for future research.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"29 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Review of Brain-inspired Navigation 脑启发导航的综合综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-26 DOI: 10.1145/3786344
Xu He, Xiaolin Meng, Lingfei Mo, Youdong Zhang, Fangwen Yu, Jingnan Liu
Intelligent navigation is essential for unmanned systems. Yet nowadays navigation technologies still fall short of animals’ innate navigation prowess, characterized by continuous, efficient, adaptive, low-power navigating across complex terrains, despite technological advancements. Neuroscience's half-century exploration has revealed the brain's innate “Global Positioning System (GPS),” instigating research into Brain-Inspired Navigation (BIN). BIN, is a cutting-edge navigation technology, that bridges disciplines but lacks a cohesive guide for its interdisciplinary study. In this paper, we offer a comprehensive BIN review, mapping its neural basis, computational foundations, current progress, and implementation conditions, providing a general framework for researchers alongside forward-looking recommendations for future development in the domain. The highlights of this paper can be available at https://binucoe.github.io/Awesome-Brain-inspired-Navigation/ .
智能导航对于无人驾驶系统至关重要。然而,如今的导航技术仍不及动物天生的导航能力,这种能力的特点是在复杂的地形上进行持续、高效、自适应、低功耗的导航。神经科学半个世纪的探索揭示了大脑固有的“全球定位系统(GPS)”,激发了对大脑启发导航(BIN)的研究。BIN是一种前沿的导航技术,它连接了多个学科,但缺乏一个有凝聚力的跨学科研究指南。在本文中,我们提供了一个全面的BIN综述,映射其神经基础、计算基础、当前进展和实施条件,为研究人员提供了一个总体框架,并为该领域的未来发展提供了前瞻性建议。本文的重点内容可在https://binucoe.github.io/Awesome-Brain-inspired-Navigation/上获得。
{"title":"A Comprehensive Review of Brain-inspired Navigation","authors":"Xu He, Xiaolin Meng, Lingfei Mo, Youdong Zhang, Fangwen Yu, Jingnan Liu","doi":"10.1145/3786344","DOIUrl":"https://doi.org/10.1145/3786344","url":null,"abstract":"Intelligent navigation is essential for unmanned systems. Yet nowadays navigation technologies still fall short of animals’ innate navigation prowess, characterized by continuous, efficient, adaptive, low-power navigating across complex terrains, despite technological advancements. Neuroscience's half-century exploration has revealed the brain's innate “Global Positioning System (GPS),” instigating research into Brain-Inspired Navigation (BIN). BIN, is a cutting-edge navigation technology, that bridges disciplines but lacks a cohesive guide for its interdisciplinary study. In this paper, we offer a comprehensive BIN review, mapping its neural basis, computational foundations, current progress, and implementation conditions, providing a general framework for researchers alongside forward-looking recommendations for future development in the domain. The highlights of this paper can be available at <jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" ext-link-type=\"uri\" xlink:href=\"https://binucoe.github.io/Awesome-Brain-inspired-Navigation/\">https://binucoe.github.io/Awesome-Brain-inspired-Navigation/</jats:ext-link> .","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"8 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit Aspect Extraction: A Systematic Review 隐式方面提取:系统综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-25 DOI: 10.1145/3786590
Meghna Chaudhary, Tempestt Neal
The subject of one’s opinions expressed in textual data provides rich information regarding their attitudes and behaviors. Many natural language processing tasks leverage such information to, for example, study product purchasing behaviors or extract insights during global events. The task of identifying these subjects is referred to as aspect extraction . Aspect extraction approaches typically focus on the identification of explicitly stated aspects in a text sample. However, it is suggested that implicit aspects , or those that must be inferred by the context provided in the text, comprise more than 20% of all aspects in a given dataset and that identification of implicit aspects is important for accurate aspect-based analyses such as aspect-based sentiment analysis. As such, this paper surveys recent work in implicit aspect extraction. We define and describe commonly used datasets and algorithmic approaches and detail various challenges which have thus far led to limited research in implicit aspect extraction as compared to explicit aspect extraction, like fewer benchmark datasets and limited use of powerful attention models.
一个人的观点在文本数据中表达的主题提供了关于他们的态度和行为的丰富信息。许多自然语言处理任务利用这些信息,例如,研究产品购买行为或在全球事件中提取见解。识别这些主题的任务称为方面提取。方面提取方法通常侧重于识别文本样本中显式陈述的方面。然而,建议隐式方面,或必须由文本中提供的上下文推断的方面,占给定数据集中所有方面的20%以上,并且隐式方面的识别对于准确的基于方面的分析(如基于方面的情感分析)非常重要。因此,本文综述了隐式方面提取的最新研究进展。我们定义和描述了常用的数据集和算法方法,并详细介绍了迄今为止导致隐式方面提取研究有限的各种挑战,与显式方面提取相比,如较少的基准数据集和有限的使用强大的注意力模型。
{"title":"Implicit Aspect Extraction: A Systematic Review","authors":"Meghna Chaudhary, Tempestt Neal","doi":"10.1145/3786590","DOIUrl":"https://doi.org/10.1145/3786590","url":null,"abstract":"The subject of one’s opinions expressed in textual data provides rich information regarding their attitudes and behaviors. Many natural language processing tasks leverage such information to, for example, study product purchasing behaviors or extract insights during global events. The task of identifying these subjects is referred to as <jats:italic toggle=\"yes\">aspect extraction</jats:italic> . Aspect extraction approaches typically focus on the identification of explicitly stated aspects in a text sample. However, it is suggested that <jats:italic toggle=\"yes\">implicit aspects</jats:italic> , or those that must be inferred by the context provided in the text, comprise more than 20% of all aspects in a given dataset and that identification of implicit aspects is important for accurate aspect-based analyses such as aspect-based sentiment analysis. As such, this paper surveys recent work in implicit aspect extraction. We define and describe commonly used datasets and algorithmic approaches and detail various challenges which have thus far led to limited research in implicit aspect extraction as compared to explicit aspect extraction, like fewer benchmark datasets and limited use of powerful attention models.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"4 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey of Single Image Blind Motion Deblurring from Traditional to Deep Learning 从传统到深度学习的单幅图像盲运动去模糊研究综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-25 DOI: 10.1145/3785655
Tingting Zhang, Jiawei Lu, Qiyu Jin, Tieyong Zeng
Single image blind motion deblurring, a cornerstone of low-level computer vision, seeks to recover a sharp image from a single blurred observation, addressing challenges posed by motion-induced degradation. This survey provides a comprehensive review of the field, spanning traditional methodologies and deep learning (DL) methods. We begin by defining the problem, outlining its significance, and tracing its research evolution. The paper systematically examines traditional approaches–including prior-based, edge-detection, patch-based, and specialized deblurring techniques–followed by an in-depth exploration of DL-based methods, categorized into hybrid model-driven/data-driven frameworks and fully data-driven architectures. Key datasets, loss functions, and quantitative performance evaluations of both classic and state-of-the-art methods on benchmarks are presented to offer practical insights. We conclude by summarizing advancements, identifying persistent challenges such as handling complex real-world data and computational efficiency, and proposing future research directions. This survey serves as a valuable resource for researchers, providing a holistic understanding of blind motion deblurring and fostering innovation in this dynamic domain.
单图像盲运动去模糊是低级计算机视觉的基础,旨在从单个模糊的观察中恢复清晰的图像,解决由运动引起的退化带来的挑战。该调查提供了该领域的全面回顾,涵盖了传统方法和深度学习(DL)方法。我们首先定义问题,概述其重要性,并追溯其研究演变。本文系统地研究了传统方法,包括基于先验的、边缘检测的、基于补丁的和专门的去模糊技术,然后深入探索了基于dl的方法,分为混合模型驱动/数据驱动框架和完全数据驱动架构。关键数据集,损失函数,以及经典和最先进的基准方法的定量性能评估,以提供实际的见解。最后,我们总结了研究进展,指出了处理复杂现实世界数据和计算效率等持续存在的挑战,并提出了未来的研究方向。这项调查为研究人员提供了宝贵的资源,提供了对盲运动去模糊的整体理解,并促进了这一动态领域的创新。
{"title":"A Survey of Single Image Blind Motion Deblurring from Traditional to Deep Learning","authors":"Tingting Zhang, Jiawei Lu, Qiyu Jin, Tieyong Zeng","doi":"10.1145/3785655","DOIUrl":"https://doi.org/10.1145/3785655","url":null,"abstract":"Single image blind motion deblurring, a cornerstone of low-level computer vision, seeks to recover a sharp image from a single blurred observation, addressing challenges posed by motion-induced degradation. This survey provides a comprehensive review of the field, spanning traditional methodologies and deep learning (DL) methods. We begin by defining the problem, outlining its significance, and tracing its research evolution. The paper systematically examines traditional approaches–including prior-based, edge-detection, patch-based, and specialized deblurring techniques–followed by an in-depth exploration of DL-based methods, categorized into hybrid model-driven/data-driven frameworks and fully data-driven architectures. Key datasets, loss functions, and quantitative performance evaluations of both classic and state-of-the-art methods on benchmarks are presented to offer practical insights. We conclude by summarizing advancements, identifying persistent challenges such as handling complex real-world data and computational efficiency, and proposing future research directions. This survey serves as a valuable resource for researchers, providing a holistic understanding of blind motion deblurring and fostering innovation in this dynamic domain.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"47 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Event Camera Meets Mobile Embodied Perception: Abstraction, Algorithm, Acceleration, Application 事件相机与移动具身感知:抽象、算法、加速、应用
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-25 DOI: 10.1145/3786332
Haoyang Wang, Ruishan Guo, Pengtao Ma, Ciyu Ruan, Xinyu Luo, Wenhua Ding, Tianyang Zhong, Jingao Xu, Yunhao Liu, Xinlei Chen
With the evolution of mobile embodied intelligence, agents such as drones and autonomous robots are transitioning toward high agility. This shift imposes stringent demands on embodied perception, requiring high-accuracy and low-latency feedback loops for reliable interaction. Event-based vision has emerged as a transformative paradigm. Its microsecond-level temporal resolution and high dynamic range render it ideal for embodied perception tasks on high-agility mobile platforms. However, asynchronous nature, substantial noise, lack of persistent semantic information, and large data volume pose challenges for processing on resource-constrained mobile agents. This paper surveys the literature from 2014-2025 and presents a comprehensive overview of event-based mobile embodied perception. We organize review around four key pillars: event abstraction methods, perception algorithm advancements, hardware and software acceleration strategies, and mobile applications . We discuss critical tasks including visual odometry, object tracking, optical flow, and 3D reconstruction, while highlighting challenges associated with sensor fusion and real-time deployment. Furthermore, we outline future research directions, such as improving event cameras with advanced optics and leveraging neuromorphic computing for efficient processing. To support ongoing research, we provide an open-source Online Sheet with recent developments. We hope this survey serves as a reference, facilitating adoption of event-based vision across diverse mobile embodied applications.
随着移动具身智能的发展,无人机和自主机器人等智能体正在向高敏捷性过渡。这种转变对具身感知提出了严格的要求,需要高精度和低延迟的反馈循环来进行可靠的交互。基于事件的愿景已经成为一种变革范式。其微秒级的时间分辨率和高动态范围使其成为高敏捷移动平台上体现感知任务的理想选择。然而,异步特性、大量噪声、缺乏持久语义信息和大数据量给资源受限的移动代理的处理带来了挑战。本文对2014-2025年的文献进行了调查,并对基于事件的移动体现感知进行了全面概述。我们围绕四个关键支柱组织审查:事件抽象方法,感知算法的进步,硬件和软件加速策略,以及移动应用。我们讨论了关键任务,包括视觉里程计,目标跟踪,光流和3D重建,同时强调了与传感器融合和实时部署相关的挑战。此外,我们概述了未来的研究方向,例如用先进的光学技术改进事件相机,利用神经形态计算进行有效的处理。为了支持正在进行的研究,我们提供了一个开源的在线表格,其中包含最近的发展。我们希望这项调查能提供参考,促进在各种移动嵌入应用程序中采用基于事件的愿景。
{"title":"Event Camera Meets Mobile Embodied Perception: Abstraction, Algorithm, Acceleration, Application","authors":"Haoyang Wang, Ruishan Guo, Pengtao Ma, Ciyu Ruan, Xinyu Luo, Wenhua Ding, Tianyang Zhong, Jingao Xu, Yunhao Liu, Xinlei Chen","doi":"10.1145/3786332","DOIUrl":"https://doi.org/10.1145/3786332","url":null,"abstract":"With the evolution of mobile embodied intelligence, agents such as drones and autonomous robots are transitioning toward high agility. This shift imposes stringent demands on embodied perception, requiring high-accuracy and low-latency feedback loops for reliable interaction. Event-based vision has emerged as a transformative paradigm. Its microsecond-level temporal resolution and high dynamic range render it ideal for embodied perception tasks on high-agility mobile platforms. However, asynchronous nature, substantial noise, lack of persistent semantic information, and large data volume pose challenges for processing on resource-constrained mobile agents. This paper surveys the literature from 2014-2025 and presents a comprehensive overview of event-based mobile embodied perception. We organize review around four key pillars: event <jats:italic toggle=\"yes\">abstraction</jats:italic> methods, perception <jats:italic toggle=\"yes\">algorithm</jats:italic> advancements, hardware and software <jats:italic toggle=\"yes\">acceleration</jats:italic> strategies, and mobile <jats:italic toggle=\"yes\">applications</jats:italic> . We discuss critical tasks including visual odometry, object tracking, optical flow, and 3D reconstruction, while highlighting challenges associated with sensor fusion and real-time deployment. Furthermore, we outline future research directions, such as improving event cameras with advanced optics and leveraging neuromorphic computing for efficient processing. To support ongoing research, we provide an open-source <jats:italic toggle=\"yes\">Online Sheet</jats:italic> with recent developments. We hope this survey serves as a reference, facilitating adoption of event-based vision across diverse mobile embodied applications.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"16 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey on Large Language Models for Mathematical Reasoning 用于数学推理的大型语言模型综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-25 DOI: 10.1145/3786333
Peng-Yuan Wang, Tian-Shuo Liu, Chenyang Wang, Ziniu Li, Yidi Wang, Shu Yan, Chengxing Jia, Xu-Hui Liu, Xinwei Chen, Jiacheng Xu, Yang Yu
Mathematical reasoning has long represented one of the most fundamental and challenging frontiers in artificial intelligence research. In recent years, large language models (LLMs) have achieved significant advances in this area. This survey examines the development of mathematical reasoning abilities in LLMs through two high-level cognitive phases: comprehension, where models gain mathematical understanding via diverse pretraining strategies, and answer generation, which has progressed from direct prediction to step-by-step Chain-of-Thought (CoT) reasoning. We review methods for enhancing mathematical reasoning, ranging from training-free prompting to fine-tuning approaches such as supervised fine-tuning and reinforcement learning, and discuss recent work on extended CoT and “test-time scaling”. Despite notable progress, fundamental challenges remain in terms of capacity, efficiency, and generalization. To address these issues, we highlight promising research directions, including advanced pretraining and knowledge augmentation techniques, formal reasoning frameworks, and meta-generalization through principled learning paradigms. This survey tries to provide some insights for researchers interested in enhancing reasoning capabilities of LLMs and for those seeking to apply these techniques to other domains.
长期以来,数学推理一直是人工智能研究中最基本、最具挑战性的前沿之一。近年来,大型语言模型(llm)在这一领域取得了重大进展。本调查通过两个高级认知阶段考察法学硕士数学推理能力的发展:理解,模型通过各种预训练策略获得数学理解,以及答案生成,从直接预测到逐步的思维链(CoT)推理。我们回顾了增强数学推理的方法,从无训练提示到微调方法,如监督微调和强化学习,并讨论了最近在扩展CoT和“测试时间缩放”方面的工作。尽管取得了显著进展,但在能力、效率和泛化方面仍然存在根本性的挑战。为了解决这些问题,我们强调了有前途的研究方向,包括先进的预训练和知识增强技术、形式推理框架和通过原则学习范式的元概括。这项调查试图为那些对增强法学硕士推理能力感兴趣的研究人员以及那些寻求将这些技术应用于其他领域的研究人员提供一些见解。
{"title":"A Survey on Large Language Models for Mathematical Reasoning","authors":"Peng-Yuan Wang, Tian-Shuo Liu, Chenyang Wang, Ziniu Li, Yidi Wang, Shu Yan, Chengxing Jia, Xu-Hui Liu, Xinwei Chen, Jiacheng Xu, Yang Yu","doi":"10.1145/3786333","DOIUrl":"https://doi.org/10.1145/3786333","url":null,"abstract":"Mathematical reasoning has long represented one of the most fundamental and challenging frontiers in artificial intelligence research. In recent years, large language models (LLMs) have achieved significant advances in this area. This survey examines the development of mathematical reasoning abilities in LLMs through two high-level cognitive phases: comprehension, where models gain mathematical understanding via diverse pretraining strategies, and answer generation, which has progressed from direct prediction to step-by-step Chain-of-Thought (CoT) reasoning. We review methods for enhancing mathematical reasoning, ranging from training-free prompting to fine-tuning approaches such as supervised fine-tuning and reinforcement learning, and discuss recent work on extended CoT and “test-time scaling”. Despite notable progress, fundamental challenges remain in terms of capacity, efficiency, and generalization. To address these issues, we highlight promising research directions, including advanced pretraining and knowledge augmentation techniques, formal reasoning frameworks, and meta-generalization through principled learning paradigms. This survey tries to provide some insights for researchers interested in enhancing reasoning capabilities of LLMs and for those seeking to apply these techniques to other domains.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"56 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Survey and Taxonomy of Cybersecurity Challenges and Proactive Measures for IoD 网络安全挑战与主动措施的综合调查与分类
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-12-25 DOI: 10.1145/3785658
Arnolnt Spyros, Periklis Chatzimisios, Dimitrios Kavallieros, Theodora Tsikrika, Stefanos Vrochidis, Yiannis Kompatsiaris
The use of Unmanned Aerial Vehicles (UAVs) in various domains is continuously increasing. The benefits offered by UAVs enable the facilitation of functions or execution of tasks, such as search and rescue, which would otherwise be challenging or financially restrictive. Furthermore, UAVs are important for the military domain being considered as efficient weapons. Therefore, many countries and retail companies are investing in the massive production of UAVs for market as well as government purposes. Nevertheless, most UAVs in all domains, lack even basic cybersecurity mechanisms, thus, posing cybersecurity threats that significantly affect cybersecurity aspects or result in physical damage and even loss of human life. Many cybersecurity solutions have been proposed in the literature, encompassing both reactive and proactive measures, with the latter being preferable. Added to this, threat modeling emerges as an effective proactive measure with minimal cost and complexity. The current work conducts extensive and comprehensive research concerning the domain of UAVs. Initially, the paper outlines the historical development of UAVs and the different taxonomies in which they are classified according to different characteristics (e.g., number of rotors). The next sections detail a comprehensive description of the Internet of Drones (IoD) environment and the various relevant cybersecurity issues. Subsequently, the paper examines a wide range of diverse threat modeling approaches found in the literature, while it categorises relevant papers based on whether they describe solutions that integrate threat modeling or propose novel approaches for threat modeling.
无人驾驶飞行器(uav)在各个领域的应用不断增加。无人机提供的优势能够促进功能或任务的执行,例如搜索和救援,否则将具有挑战性或财务限制。此外,无人机作为一种高效的武器,在军事领域具有重要意义。因此,许多国家和零售公司正在投资于无人机的大规模生产,以满足市场和政府的目的。然而,在所有领域,大多数无人机甚至缺乏基本的网络安全机制,从而构成网络安全威胁,严重影响网络安全方面或导致物理损坏甚至人员伤亡。文献中提出了许多网络安全解决方案,包括被动措施和主动措施,后者更可取。此外,威胁建模作为一种有效的主动措施出现,成本和复杂性都很低。目前的工作对无人机领域进行了广泛而全面的研究。首先,本文概述了无人机的历史发展和不同的分类法,其中根据不同的特征(例如,旋翼数量)进行分类。接下来的部分详细介绍了无人机互联网(IoD)环境和各种相关网络安全问题的全面描述。随后,本文研究了文献中发现的各种威胁建模方法,同时根据它们是否描述了集成威胁建模的解决方案或提出了新的威胁建模方法对相关论文进行了分类。
{"title":"A Comprehensive Survey and Taxonomy of Cybersecurity Challenges and Proactive Measures for IoD","authors":"Arnolnt Spyros, Periklis Chatzimisios, Dimitrios Kavallieros, Theodora Tsikrika, Stefanos Vrochidis, Yiannis Kompatsiaris","doi":"10.1145/3785658","DOIUrl":"https://doi.org/10.1145/3785658","url":null,"abstract":"The use of Unmanned Aerial Vehicles (UAVs) in various domains is continuously increasing. The benefits offered by UAVs enable the facilitation of functions or execution of tasks, such as search and rescue, which would otherwise be challenging or financially restrictive. Furthermore, UAVs are important for the military domain being considered as efficient weapons. Therefore, many countries and retail companies are investing in the massive production of UAVs for market as well as government purposes. Nevertheless, most UAVs in all domains, lack even basic cybersecurity mechanisms, thus, posing cybersecurity threats that significantly affect cybersecurity aspects or result in physical damage and even loss of human life. Many cybersecurity solutions have been proposed in the literature, encompassing both reactive and proactive measures, with the latter being preferable. Added to this, threat modeling emerges as an effective proactive measure with minimal cost and complexity. The current work conducts extensive and comprehensive research concerning the domain of UAVs. Initially, the paper outlines the historical development of UAVs and the different taxonomies in which they are classified according to different characteristics (e.g., number of rotors). The next sections detail a comprehensive description of the Internet of Drones (IoD) environment and the various relevant cybersecurity issues. Subsequently, the paper examines a wide range of diverse threat modeling approaches found in the literature, while it categorises relevant papers based on whether they describe solutions that integrate threat modeling or propose novel approaches for threat modeling.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"4 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Computing Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1