首页 > 最新文献

ACM Computing Surveys最新文献

英文 中文
Selective Forgetting in Machine Learning and Beyond: A Survey 选择性遗忘在机器学习和超越:一项调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-10 DOI: 10.1145/3796542
Alyssa Sha, Bernardo Nunes, Armin Haller
This survey investigates the multifaceted nature of selective forgetting in machine learning, drawing insights from neuroscientific research that posits forgetting as an adaptive function rather than a defect, enhancing the learning process and preventing overfitting. This survey focuses on the benefits of selective forgetting and its applications across various machine learning sub-fields that can help improve model performance and enhance data privacy. Moreover, the paper discusses current challenges, future directions, and ethical considerations regarding the integration of selective forgetting mechanisms into machine learning models. We present a comprehensive taxonomy that bridges diverse selective forgetting-related research in machine learning, systematically categorising approaches along key dimensions. Our work synthesises theories of forgetting from different knowledge areas to establish theoretical foundations for forgetting mechanisms in machine learning, providing a unified framework for understanding selective forgetting processes.
本研究调查了机器学习中选择性遗忘的多面性,从神经科学研究中得出了一些见解,这些研究认为遗忘是一种自适应功能,而不是一种缺陷,从而增强了学习过程并防止了过拟合。本调查的重点是选择性遗忘的好处及其在各种机器学习子领域的应用,可以帮助提高模型性能和增强数据隐私。此外,本文还讨论了将选择性遗忘机制整合到机器学习模型中的当前挑战、未来方向和伦理考虑。我们提出了一个全面的分类法,它连接了机器学习中不同的选择性遗忘相关研究,沿着关键维度系统地对方法进行分类。我们的工作综合了不同知识领域的遗忘理论,为机器学习中的遗忘机制建立了理论基础,为理解选择性遗忘过程提供了统一的框架。
{"title":"Selective Forgetting in Machine Learning and Beyond: A Survey","authors":"Alyssa Sha, Bernardo Nunes, Armin Haller","doi":"10.1145/3796542","DOIUrl":"https://doi.org/10.1145/3796542","url":null,"abstract":"This survey investigates the multifaceted nature of selective forgetting in machine learning, drawing insights from neuroscientific research that posits forgetting as an adaptive function rather than a defect, enhancing the learning process and preventing overfitting. This survey focuses on the benefits of selective forgetting and its applications across various machine learning sub-fields that can help improve model performance and enhance data privacy. Moreover, the paper discusses current challenges, future directions, and ethical considerations regarding the integration of selective forgetting mechanisms into machine learning models. We present a comprehensive taxonomy that bridges diverse selective forgetting-related research in machine learning, systematically categorising approaches along key dimensions. Our work synthesises theories of forgetting from different knowledge areas to establish theoretical foundations for forgetting mechanisms in machine learning, providing a unified framework for understanding selective forgetting processes.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"186 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alignment of Diffusion Models: Fundamentals, Challenges, and Future 扩散模型的一致性:基础、挑战和未来
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-10 DOI: 10.1145/3796982
Buhua Liu, Shitong Shao, Bao Li, Lichen Bai, Zhiqiang Xu, Haoyi Xiong, James T. Kwok, Sumi Helal, Zeke Xie
Diffusion models have emerged as the leading paradigm in generative modeling, excelling in various applications. Despite their success, these models often misalign with human intentions and generate results with undesired properties or even harmful content. Inspired by the success and popularity of alignment in tuning large language models, recent studies have investigated aligning diffusion models with human expectations and preferences. This work mainly reviews alignment of diffusion models, covering advancements in fundamentals of alignment, alignment techniques of diffusion models, preference benchmarks, and evaluation for diffusion models. Moreover, we discuss key perspectives on current challenges and promising future directions on solving the remaining challenges in alignment of diffusion models. To the best of our knowledge, our work is the first comprehensive review paper for researchers and engineers to comprehend, practice, and research alignment of diffusion models.
扩散模型已经成为生成建模的主要范例,在各种应用中表现优异。尽管这些模型取得了成功,但它们经常与人类的意图不一致,并产生具有不希望的属性甚至有害内容的结果。受调整大型语言模型的成功和流行的启发,最近的研究调查了将扩散模型与人类的期望和偏好对齐。本文主要回顾了扩散模型的对齐,涵盖了对齐基础、扩散模型的对齐技术、偏好基准和扩散模型的评估方面的进展。此外,我们讨论了当前挑战的关键观点和解决扩散模型对齐中剩余挑战的有希望的未来方向。据我们所知,我们的工作是研究人员和工程师理解、实践和研究扩散模型对齐的第一篇综合综述论文。
{"title":"Alignment of Diffusion Models: Fundamentals, Challenges, and Future","authors":"Buhua Liu, Shitong Shao, Bao Li, Lichen Bai, Zhiqiang Xu, Haoyi Xiong, James T. Kwok, Sumi Helal, Zeke Xie","doi":"10.1145/3796982","DOIUrl":"https://doi.org/10.1145/3796982","url":null,"abstract":"Diffusion models have emerged as the leading paradigm in generative modeling, excelling in various applications. Despite their success, these models often misalign with human intentions and generate results with undesired properties or even harmful content. Inspired by the success and popularity of alignment in tuning large language models, recent studies have investigated aligning diffusion models with human expectations and preferences. This work mainly reviews alignment of diffusion models, covering advancements in fundamentals of alignment, alignment techniques of diffusion models, preference benchmarks, and evaluation for diffusion models. Moreover, we discuss key perspectives on current challenges and promising future directions on solving the remaining challenges in alignment of diffusion models. To the best of our knowledge, our work is the first comprehensive review paper for researchers and engineers to comprehend, practice, and research alignment of diffusion models.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"26 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visual Adversarial Attacks and Defenses in the Physical World: A Survey 物理世界中的视觉对抗性攻击和防御:综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-10 DOI: 10.1145/3793659
Xingxing Wei, Bangzheng Pu, Shiji Zhao, Jiefan Lu, Baoyuan Wu
Although Deep Neural Networks (DNNs) have been widely applied in various real-world scenarios, they remain vulnerable to adversarial examples. Adversarial attacks in computer vision can be categorized into digital attacks and physical attacks based on their different forms. Compared to digital attacks, which generate perturbations in digital pixels, physical attacks are more practical in real-world settings. Due to the serious security risks posed by physically adversarial examples, many studies have been conducted to evaluate the physically adversarial robustness of DNNs in recent years. In this paper, we provide a comprehensive survey of current physically adversarial attacks and defenses in computer vision. We establish a taxonomy by organizing physical attacks according to attack tasks, attack forms, and attack methods. This approach offers readers a systematic understanding of the topic from multiple perspectives. For physical defenses, we categorize them into pre-processing, in-processing, and post-processing for DNN models to ensure comprehensive coverage of adversarial defenses. Based on this survey, we discuss the challenges facing this research field and provide an outlook on future directions.
尽管深度神经网络(dnn)已广泛应用于各种现实场景,但它们仍然容易受到对抗性示例的影响。计算机视觉中的对抗性攻击根据其不同的形式可以分为数字攻击和物理攻击。与在数字像素中产生扰动的数字攻击相比,物理攻击在现实世界中更实用。由于物理对抗示例带来的严重安全风险,近年来进行了许多研究来评估dnn的物理对抗鲁棒性。在本文中,我们提供了一个全面的调查目前物理对抗性攻击和防御在计算机视觉。我们根据攻击任务、攻击形式和攻击方法对物理攻击进行组织,建立了分类法。这种方法为读者提供了从多个角度对主题的系统理解。对于物理防御,我们将其分类为DNN模型的预处理,处理中和后处理,以确保对抗性防御的全面覆盖。在此基础上,我们讨论了该研究领域面临的挑战,并展望了未来的发展方向。
{"title":"Visual Adversarial Attacks and Defenses in the Physical World: A Survey","authors":"Xingxing Wei, Bangzheng Pu, Shiji Zhao, Jiefan Lu, Baoyuan Wu","doi":"10.1145/3793659","DOIUrl":"https://doi.org/10.1145/3793659","url":null,"abstract":"Although Deep Neural Networks (DNNs) have been widely applied in various real-world scenarios, they remain vulnerable to adversarial examples. Adversarial attacks in computer vision can be categorized into digital attacks and physical attacks based on their different forms. Compared to digital attacks, which generate perturbations in digital pixels, physical attacks are more practical in real-world settings. Due to the serious security risks posed by physically adversarial examples, many studies have been conducted to evaluate the physically adversarial robustness of DNNs in recent years. In this paper, we provide a comprehensive survey of current physically adversarial attacks and defenses in computer vision. We establish a taxonomy by organizing physical attacks according to attack tasks, attack forms, and attack methods. This approach offers readers a systematic understanding of the topic from multiple perspectives. For physical defenses, we categorize them into pre-processing, in-processing, and post-processing for DNN models to ensure comprehensive coverage of adversarial defenses. Based on this survey, we discuss the challenges facing this research field and provide an outlook on future directions.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"299 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey on Inference Optimization Techniques for Mixture of Experts Models 混合专家模型推理优化技术综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-09 DOI: 10.1145/3794845
Jiacheng Liu, Peng Tang, Wenfeng Wang, Yuhang Ren, Xiaofeng Hou, Pheng Ann Heng, Minyi Guo, Chao Li
The emergence of large-scale Mixture of Experts (MoE) models represents a significant advancement in artificial intelligence, offering larger model capacity and computational efficiency through conditional computation. However, deploying and running inference on these models presents significant challenges in computational resources, latency, and energy efficiency. This comprehensive survey analyzes optimization techniques for MoE models across the entire system stack. We first establish a taxonomical framework that categorizes optimization approaches into model-level, system-level, and hardware-level optimizations. At the model level, we examine architectural innovations including efficient expert design, attention mechanisms, various compression techniques such as pruning, quantization, and knowledge distillation, as well as algorithm improvement including dynamic routing strategies and expert merging methods. At the system level, we investigate distributed computing approaches, load balancing mechanisms, and efficient scheduling algorithms that enable scalable deployment. Furthermore, we delve into hardware-specific optimizations and co-design strategies that maximize throughput and energy efficiency. This survey provides both a structured overview of existing solutions and identifies key challenges and promising research directions in MoE inference optimization. To facilitate ongoing updates and the sharing of cutting-edge advances in MoE inference optimization research, we have established a repository accessible at https://github.com/MoE-Inf/awesome-moe-inference/.
大规模混合专家(MoE)模型的出现代表了人工智能的重大进步,通过条件计算提供了更大的模型容量和计算效率。然而,在这些模型上部署和运行推理在计算资源、延迟和能源效率方面提出了重大挑战。这个全面的调查分析了整个系统堆栈中MoE模型的优化技术。我们首先建立一个分类法框架,将优化方法分为模型级、系统级和硬件级优化。在模型层面,我们研究了架构创新,包括高效的专家设计、关注机制、各种压缩技术,如修剪、量化和知识蒸馏,以及算法改进,包括动态路由策略和专家合并方法。在系统级,我们研究了分布式计算方法、负载平衡机制和有效的调度算法,这些算法支持可扩展的部署。此外,我们还深入研究了特定于硬件的优化和协同设计策略,以最大限度地提高吞吐量和能源效率。本调查提供了现有解决方案的结构化概述,并确定了MoE推理优化中的关键挑战和有前途的研究方向。为了促进MoE推理优化研究的持续更新和前沿进展的共享,我们已经建立了一个可访问的存储库https://github.com/MoE-Inf/awesome-moe-inference/。
{"title":"A Survey on Inference Optimization Techniques for Mixture of Experts Models","authors":"Jiacheng Liu, Peng Tang, Wenfeng Wang, Yuhang Ren, Xiaofeng Hou, Pheng Ann Heng, Minyi Guo, Chao Li","doi":"10.1145/3794845","DOIUrl":"https://doi.org/10.1145/3794845","url":null,"abstract":"The emergence of large-scale Mixture of Experts (MoE) models represents a significant advancement in artificial intelligence, offering larger model capacity and computational efficiency through conditional computation. However, deploying and running inference on these models presents significant challenges in computational resources, latency, and energy efficiency. This comprehensive survey analyzes optimization techniques for MoE models across the entire system stack. We first establish a taxonomical framework that categorizes optimization approaches into model-level, system-level, and hardware-level optimizations. At the model level, we examine architectural innovations including efficient expert design, attention mechanisms, various compression techniques such as pruning, quantization, and knowledge distillation, as well as algorithm improvement including dynamic routing strategies and expert merging methods. At the system level, we investigate distributed computing approaches, load balancing mechanisms, and efficient scheduling algorithms that enable scalable deployment. Furthermore, we delve into hardware-specific optimizations and co-design strategies that maximize throughput and energy efficiency. This survey provides both a structured overview of existing solutions and identifies key challenges and promising research directions in MoE inference optimization. To facilitate ongoing updates and the sharing of cutting-edge advances in MoE inference optimization research, we have established a repository accessible at https://github.com/MoE-Inf/awesome-moe-inference/.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"4 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crystalline Material Discovery in the Era of Artificial Intelligence 人工智能时代晶体材料的发现
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-09 DOI: 10.1145/3794853
Zhenzhong Wang, Haowei Hua, Wanyu Lin, Ming Yang, Kay Chen Tan
Crystalline materials, with symmetrical and periodic structures, exhibit a wide spectrum of properties and have been widely used in numerous applications across electronics, energy, and beyond. For crystalline materials discovery, traditional experimental and computational approaches are time-consuming and expensive. In these years, thanks to the explosive amount of crystalline materials data, great interest has been given to data-driven materials discovery. Particularly, recent advancements have exploited the expressive representation ability of deep learning to model the highly complex atomic systems within crystalline materials, opening up new avenues for efficient and accurate materials discovery. These works main focus on four types of tasks, including physicochemical property prediction, generative design of crystalline materials, aiding characterization, and accelerating theoretical computations. Despite the remarkable progress, there is still a lack of systematic investigation to summarize their distinctions and limitations. To fill this gap, we systematically investigated the progress of crystalline materials discovery using artificial intelligence made in recent years. We first introduce several data representations of the crystalline materials. Based on the representations, we summarize various fundamental deep learning models and their tailored usages in various material discovery tasks. Finally, we highlight the remaining challenges and propose future directions.
晶体材料具有对称和周期性结构,具有广泛的特性,广泛应用于电子、能源等领域。对于晶体材料的发现,传统的实验和计算方法既耗时又昂贵。近年来,由于晶体材料数据的爆炸式增长,数据驱动的材料发现引起了极大的兴趣。特别是,最近的进展已经利用深度学习的表达能力来模拟晶体材料中高度复杂的原子系统,为有效和准确的材料发现开辟了新的途径。这些工作主要集中在四种类型的任务上,包括物理化学性质预测、晶体材料的生成设计、辅助表征和加速理论计算。尽管取得了显著的进展,但仍缺乏系统的研究来总结它们的区别和局限性。为了填补这一空白,我们系统地研究了近年来利用人工智能发现晶体材料的进展。我们首先介绍晶体材料的几个数据表示。基于这些表示,我们总结了各种基本的深度学习模型及其在各种材料发现任务中的定制用法。最后,我们强调了仍然存在的挑战,并提出了未来的方向。
{"title":"Crystalline Material Discovery in the Era of Artificial Intelligence","authors":"Zhenzhong Wang, Haowei Hua, Wanyu Lin, Ming Yang, Kay Chen Tan","doi":"10.1145/3794853","DOIUrl":"https://doi.org/10.1145/3794853","url":null,"abstract":"Crystalline materials, with symmetrical and periodic structures, exhibit a wide spectrum of properties and have been widely used in numerous applications across electronics, energy, and beyond. For crystalline materials discovery, traditional experimental and computational approaches are time-consuming and expensive. In these years, thanks to the explosive amount of crystalline materials data, great interest has been given to data-driven materials discovery. Particularly, recent advancements have exploited the expressive representation ability of deep learning to model the highly complex atomic systems within crystalline materials, opening up new avenues for efficient and accurate materials discovery. These works main focus on four types of tasks, including physicochemical property prediction, generative design of crystalline materials, aiding characterization, and accelerating theoretical computations. Despite the remarkable progress, there is still a lack of systematic investigation to summarize their distinctions and limitations. To fill this gap, we systematically investigated the progress of crystalline materials discovery using artificial intelligence made in recent years. We first introduce several data representations of the crystalline materials. Based on the representations, we summarize various fundamental deep learning models and their tailored usages in various material discovery tasks. Finally, we highlight the remaining challenges and propose future directions.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"71 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias-Free? An Empirical Study on Ethnicity, Gender, and Age Fairness in Deepfake Detection 没有偏见吗?深度造假检测中种族、性别和年龄公平性的实证研究
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-09 DOI: 10.1145/3796544
Aditi Panda, Tanusree Ghosh, Tushar Choudhary, Ruchira Naskar
In this study, we evaluate potential demographic bias in state-of-the-art deepfake image detection models across three key attributes: age, ethnicity, and gender. Unlike prior works that retrain detectors or analyse forensic manipulations, we systematically assess multiple pretrained checkpoints of leading deepfake detectors, each trained on different datasets, to ensure an unbiased evaluation framework. Our experiments employ synthetic images generated by recent diffusion and autoregressive models, alongside real images from balanced datasets, to measure subgroup-specific detection performance. Results reveal no systematic bias across demographic categories—variations in accuracy and precision remain within small statistical margins across all detectors and checkpoints. We further provide a taxonomy of image generative models, highlighting their evolution from pixel-space to latent-space diffusion architectures, to contextualize the diversity of synthetic data used in our evaluation. Overall, our findings suggest that modern deepfake image detectors, when tested in a cross-demographic setting using pretrained checkpoints, exhibit robust and fair performance across age, ethnicity, and gender.
在本研究中,我们评估了最先进的深度假图像检测模型中三个关键属性的潜在人口统计学偏差:年龄、种族和性别。与之前重新训练检测器或分析法医操作的工作不同,我们系统地评估了领先深度假检测器的多个预训练检查点,每个检查点都在不同的数据集上进行了训练,以确保公正的评估框架。我们的实验使用由最近的扩散和自回归模型生成的合成图像,以及来自平衡数据集的真实图像,来测量亚群特定的检测性能。结果显示,在人口统计类别中没有系统性偏差——在所有检测器和检查点中,准确度和精度的变化仍然在很小的统计范围内。我们进一步提供了图像生成模型的分类,强调了它们从像素空间到潜在空间扩散架构的演变,以将我们评估中使用的合成数据的多样性背景化。总的来说,我们的研究结果表明,当使用预训练的检查点在跨人口统计环境中进行测试时,现代深度假图像检测器在年龄、种族和性别方面都表现出稳健和公平的表现。
{"title":"Bias-Free? An Empirical Study on Ethnicity, Gender, and Age Fairness in Deepfake Detection","authors":"Aditi Panda, Tanusree Ghosh, Tushar Choudhary, Ruchira Naskar","doi":"10.1145/3796544","DOIUrl":"https://doi.org/10.1145/3796544","url":null,"abstract":"In this study, we evaluate potential demographic bias in state-of-the-art deepfake image detection models across three key attributes: age, ethnicity, and gender. Unlike prior works that retrain detectors or analyse forensic manipulations, we systematically assess multiple pretrained checkpoints of leading deepfake detectors, each trained on different datasets, to ensure an unbiased evaluation framework. Our experiments employ synthetic images generated by recent diffusion and autoregressive models, alongside real images from balanced datasets, to measure subgroup-specific detection performance. Results reveal no systematic bias across demographic categories—variations in accuracy and precision remain within small statistical margins across all detectors and checkpoints. We further provide a taxonomy of image generative models, highlighting their evolution from pixel-space to latent-space diffusion architectures, to contextualize the diversity of synthetic data used in our evaluation. Overall, our findings suggest that modern deepfake image detectors, when tested in a cross-demographic setting using pretrained checkpoints, exhibit robust and fair performance across age, ethnicity, and gender.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"40 4 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Review of Information Uncertainty Modelling in Domain Ontologies 领域本体中信息不确定性建模研究综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-07 DOI: 10.1145/3794841
Deemah Alomair, Ridha Khedri, Wendy MacCaull
Domain ontologies are essential for representing and reasoning about knowledge, yet addressing information uncertainty within them remains challenging. This review surveys approaches to modelling information uncertainty in domain ontologies from 2010 to 2024. It categorizes modelling formalisms, identifies information uncertainty types, and analyzes how information uncertainty is integrated into ontology components. It reviews reasoning techniques and emerging methods, including Machine Learning and Natural Language Processing. The review examines languages, tools, and evaluation strategies. The purpose is to map the landscape of information uncertainty modelling in domain ontologies, highlight research gaps and trends, and provide structured guidance for selecting suitable approaches.
领域本体对于知识的表示和推理至关重要,但在其中处理信息不确定性仍然具有挑战性。本文综述了2010年至2024年领域本体中信息不确定性建模的方法。对建模形式进行了分类,识别了信息不确定性类型,并分析了信息不确定性如何集成到本体组件中。它回顾了推理技术和新兴方法,包括机器学习和自然语言处理。审查审查语言、工具和评估策略。目的是绘制领域本体中信息不确定性建模的景观,突出研究差距和趋势,并为选择合适的方法提供结构化指导。
{"title":"A Comprehensive Review of Information Uncertainty Modelling in Domain Ontologies","authors":"Deemah Alomair, Ridha Khedri, Wendy MacCaull","doi":"10.1145/3794841","DOIUrl":"https://doi.org/10.1145/3794841","url":null,"abstract":"Domain ontologies are essential for representing and reasoning about knowledge, yet addressing information uncertainty within them remains challenging. This review surveys approaches to modelling information uncertainty in domain ontologies from 2010 to 2024. It categorizes modelling formalisms, identifies information uncertainty types, and analyzes how information uncertainty is integrated into ontology components. It reviews reasoning techniques and emerging methods, including Machine Learning and Natural Language Processing. The review examines languages, tools, and evaluation strategies. The purpose is to map the landscape of information uncertainty modelling in domain ontologies, highlight research gaps and trends, and provide structured guidance for selecting suitable approaches.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"41 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146135118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalizability of Large Language Model-Based Agents: A Comprehensive Survey 基于大型语言模型的智能体的可泛化性综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-07 DOI: 10.1145/3794858
Minxing Zhang, Yi Yang, Roy Xie, Bhuwan Dhingra, Shuyan Zhou, Jian Pei
Large Language Model (LLM)-based agents have recently emerged as a new paradigm that extends the capabilities of LLMs beyond text generation to dynamic interaction with external environments. A critical challenge lies in ensuring their generalizability – the ability to maintain consistently high performance across varied instructions, tasks, environments, and domains, especially those different from the agent’s fine-tuning data. Despite growing interest, the concept of generalizability in LLM-based agents remains underdefined, and systematic approaches to measure and improve it are lacking. We provide the first comprehensive review of generalizability in LLM-based agents. We begin by clarifying the definition and boundaries of agent generalizability. We then review existing benchmarks. Next, we categorize strategies for improving generalizability into three groups: methods targeting the backbone LLM, targeting agent components, and targeting their interactions. Furthermore, we introduce the distinction between generalizable frameworks and generalizable agents and outline how generalizable frameworks can be translated into agent-level generalizability. Finally, we identify future directions, including the development of standardized evaluation frameworks, variance- and cost-based metrics, and hybrid approaches that integrate methodological innovations with agent architecture-level designs. We aim to establish a foundation for principled research on building LLM-based agents that generalize reliably across diverse real-world applications.
基于大型语言模型(LLM)的代理最近作为一种新的范例出现,它将LLM的能力从文本生成扩展到与外部环境的动态交互。一个关键的挑战在于确保它们的泛化性——能够在不同的指令、任务、环境和领域(特别是那些与代理的微调数据不同的领域)中保持一致的高性能。尽管人们越来越感兴趣,但在基于llm的代理中,泛化的概念仍然没有得到充分的定义,并且缺乏测量和改进它的系统方法。我们首次全面回顾了基于法学硕士的药物的普遍性。我们首先澄清智能体可泛化性的定义和边界。然后我们审查现有的基准。接下来,我们将提高泛化性的策略分为三组:针对主干LLM的方法,针对代理组件的方法,以及针对它们之间相互作用的方法。此外,我们还介绍了可一般化框架和可一般化代理之间的区别,并概述了如何将可一般化框架转换为代理级的一般化。最后,我们确定了未来的方向,包括标准化评估框架的发展,基于方差和成本的度量,以及将方法创新与代理体系结构级设计相结合的混合方法。我们的目标是为构建基于llm的代理的原则研究奠定基础,这些代理可以在不同的实际应用中可靠地进行推广。
{"title":"Generalizability of Large Language Model-Based Agents: A Comprehensive Survey","authors":"Minxing Zhang, Yi Yang, Roy Xie, Bhuwan Dhingra, Shuyan Zhou, Jian Pei","doi":"10.1145/3794858","DOIUrl":"https://doi.org/10.1145/3794858","url":null,"abstract":"Large Language Model (LLM)-based agents have recently emerged as a new paradigm that extends the capabilities of LLMs beyond text generation to dynamic interaction with external environments. A critical challenge lies in ensuring their <jats:italic toggle=\"yes\">generalizability</jats:italic> – the ability to maintain consistently high performance across varied instructions, tasks, environments, and domains, especially those different from the agent’s fine-tuning data. Despite growing interest, the concept of generalizability in LLM-based agents remains underdefined, and systematic approaches to measure and improve it are lacking. We provide the first comprehensive review of generalizability in LLM-based agents. We begin by clarifying the definition and boundaries of agent generalizability. We then review existing benchmarks. Next, we categorize strategies for improving generalizability into three groups: methods targeting the backbone LLM, targeting agent components, and targeting their interactions. Furthermore, we introduce the distinction between <jats:italic toggle=\"yes\">generalizable frameworks</jats:italic> and <jats:italic toggle=\"yes\">generalizable agents</jats:italic> and outline how generalizable frameworks can be translated into agent-level generalizability. Finally, we identify future directions, including the development of standardized evaluation frameworks, variance- and cost-based metrics, and hybrid approaches that integrate methodological innovations with agent architecture-level designs. We aim to establish a foundation for principled research on building LLM-based agents that generalize reliably across diverse real-world applications.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"244 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Meta-Analysis of Music Emotion Recognition Studies 音乐情绪识别研究的元分析
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-07 DOI: 10.1145/3796518
Tuomas Eerola, Cameron Anderson
This meta-analysis examines music emotion recognition (MER) models published between 2014 and 2024, focusing on predictions of valence, arousal, and categorical emotions. A total of 553 studies were identified, of which 96 full-text articles were assessed, resulting in a final review of 34 studies. These studies reported 290 models, including 86 for emotion classification and 204 for regression. Using the best-performing model from each study, we found that valence and arousal were predicted with reasonable accuracy (r = 0.67 and r = 0.81, respectively), while classification models achieved an accuracy of 0.87 as measured with Matthews correlation coefficient. Across modelling approaches, linear and tree-based methods generally outperformed neural networks in regression tasks, whereas neural networks and support vector machines (SVMs) showed highest performance in classification tasks. We highlight key recommendations for future MER research, emphasizing the need for greater transparency, feature validation, and standardized reporting to improve comparability across studies.
本荟萃分析考察了2014年至2024年间发表的音乐情绪识别(MER)模型,重点关注价态、唤醒和分类情绪的预测。总共确定了553项研究,对其中96篇全文文章进行了评估,最终对34项研究进行了审查。这些研究报告了290个模型,其中情绪分类模型86个,回归模型204个。使用各研究中表现最好的模型,我们发现效价和唤醒的预测准确率合理(r = 0.67和r = 0.81),而分类模型的马修斯相关系数测量的准确率为0.87。在建模方法中,线性和基于树的方法在回归任务中通常优于神经网络,而神经网络和支持向量机(svm)在分类任务中表现出最高的性能。我们强调了未来MER研究的关键建议,强调需要更大的透明度、特征验证和标准化报告,以提高研究之间的可比性。
{"title":"A Meta-Analysis of Music Emotion Recognition Studies","authors":"Tuomas Eerola, Cameron Anderson","doi":"10.1145/3796518","DOIUrl":"https://doi.org/10.1145/3796518","url":null,"abstract":"This meta-analysis examines music emotion recognition (MER) models published between 2014 and 2024, focusing on predictions of valence, arousal, and categorical emotions. A total of 553 studies were identified, of which 96 full-text articles were assessed, resulting in a final review of 34 studies. These studies reported 290 models, including 86 for emotion classification and 204 for regression. Using the best-performing model from each study, we found that valence and arousal were predicted with reasonable accuracy (r = 0.67 and r = 0.81, respectively), while classification models achieved an accuracy of 0.87 as measured with Matthews correlation coefficient. Across modelling approaches, linear and tree-based methods generally outperformed neural networks in regression tasks, whereas neural networks and support vector machines (SVMs) showed highest performance in classification tasks. We highlight key recommendations for future MER research, emphasizing the need for greater transparency, feature validation, and standardized reporting to improve comparability across studies.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"72 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Entity Linking with Wikidata: A Systematic Literature Review 维基数据实体链接:系统文献综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-31 DOI: 10.1145/3795134
Philipp Scharpf, Corinna Breitinger, Andreas Spitz, Norman Meuschke, André Greiner-Petter, Moritz Schubotz, Bela Gipp
This article provides a comprehensive systematic review of the literature on entity linking using Wikidata as the grounding knowledge base. Our review extends the scope of previous studies from two to eight dimensions of entity linking, which we classify into the following categories: definitions, tasks, types, domains, approaches, datasets, applications, and challenges. We find that datasets primarily address question-answering and news domains but underutilize Wikidata’s capabilities for hyper-relations, multilingualism, and time dependence. The research gaps we identify include the need for more robust datasets, hybrid methods combining rule-based and learning-based approaches, and improved handling of ambiguity, sparse entity types, data noise, and knowledge graph evolution.
本文以维基数据为基础知识库,对实体链接的相关文献进行了系统的综述。我们的综述将先前研究的范围从实体链接的两个维度扩展到八个维度,我们将其分为以下几类:定义、任务、类型、领域、方法、数据集、应用和挑战。我们发现数据集主要用于问答和新闻领域,但没有充分利用维基数据在超关系、多语言和时间依赖性方面的能力。我们发现的研究差距包括需要更健壮的数据集,结合基于规则和基于学习的方法的混合方法,以及改进对歧义、稀疏实体类型、数据噪声和知识图进化的处理。
{"title":"Entity Linking with Wikidata: A Systematic Literature Review","authors":"Philipp Scharpf, Corinna Breitinger, Andreas Spitz, Norman Meuschke, André Greiner-Petter, Moritz Schubotz, Bela Gipp","doi":"10.1145/3795134","DOIUrl":"https://doi.org/10.1145/3795134","url":null,"abstract":"This article provides a comprehensive systematic review of the literature on entity linking using Wikidata as the grounding knowledge base. Our review extends the scope of previous studies from two to eight dimensions of entity linking, which we classify into the following categories: definitions, tasks, types, domains, approaches, datasets, applications, and challenges. We find that datasets primarily address question-answering and news domains but underutilize Wikidata’s capabilities for hyper-relations, multilingualism, and time dependence. The research gaps we identify include the need for more robust datasets, hybrid methods combining rule-based and learning-based approaches, and improved handling of ambiguity, sparse entity types, data noise, and knowledge graph evolution.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"9 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146095841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Computing Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1