首页 > 最新文献

ACM Computing Surveys最新文献

英文 中文
Efficient Automation of Neural Network Design: A Survey on Differentiable Neural Architecture Search 神经网络设计的高效自动化:可微分神经架构搜索调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-15 DOI: 10.1145/3665138
Alexandre Heuillet, Ahmad Nasser, Hichem Arioui, Hedi Tabia

In the past few years, Differentiable Neural Architecture Search (DNAS) rapidly imposed itself as the trending approach to automate the discovery of deep neural network architectures. This rise is mainly due to the popularity of DARTS (Differentiable ARchitecTure Search), one of the first major DNAS methods. In contrast with previous works based on Reinforcement Learning or Evolutionary Algorithms, DNAS is faster by several orders of magnitude and uses fewer computational resources. In this comprehensive survey, we focused specifically on DNAS and reviewed recent approaches in this field. Furthermore, we proposed a novel challenge-based taxonomy to classify DNAS methods. We also discussed the contributions brought to DNAS in the past few years and its impact on the global NAS field. Finally, we concluded by giving some insights into future research directions for the DNAS field.

在过去几年中,可微分神经架构搜索(DNAS)迅速成为自动发现深度神经网络架构的潮流方法。这种崛起主要归功于 DARTS(可微分神经架构搜索)的流行,它是最早的主要 DNAS 方法之一。与之前基于强化学习或进化算法的工作相比,DNAS 的速度快了几个数量级,而且使用的计算资源更少。在这份综合调查报告中,我们特别关注 DNAS,并回顾了该领域的最新方法。此外,我们还提出了一种新颖的基于挑战的分类法,用于对 DNAS 方法进行分类。我们还讨论了 DNAS 在过去几年中的贡献及其对全球 NAS 领域的影响。最后,我们对 DNAS 领域未来的研究方向提出了一些见解。
{"title":"Efficient Automation of Neural Network Design: A Survey on Differentiable Neural Architecture Search","authors":"Alexandre Heuillet, Ahmad Nasser, Hichem Arioui, Hedi Tabia","doi":"10.1145/3665138","DOIUrl":"https://doi.org/10.1145/3665138","url":null,"abstract":"<p>In the past few years, Differentiable Neural Architecture Search (DNAS) rapidly imposed itself as the trending approach to automate the discovery of deep neural network architectures. This rise is mainly due to the popularity of DARTS (Differentiable ARchitecTure Search), one of the first major DNAS methods. In contrast with previous works based on Reinforcement Learning or Evolutionary Algorithms, DNAS is faster by several orders of magnitude and uses fewer computational resources. In this comprehensive survey, we focused specifically on DNAS and reviewed recent approaches in this field. Furthermore, we proposed a novel challenge-based taxonomy to classify DNAS methods. We also discussed the contributions brought to DNAS in the past few years and its impact on the global NAS field. Finally, we concluded by giving some insights into future research directions for the DNAS field.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"8 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141251577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Overview of Privacy-Enhancing Technologies in Biometric Recognition 生物识别中的隐私增强技术概览
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-14 DOI: 10.1145/3664596
Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Ruben Vera, Christoph Busch

Privacy-enhancing technologies are technologies that implement fundamental data protection principles. With respect to biometric recognition, different types of privacy-enhancing technologies have been introduced for protecting stored biometric data which are generally classified as sensitive. In this regard, various taxonomies and conceptual categorizations have been proposed and standardisation activities have been carried out. However, these efforts have mainly been devoted to certain sub-categories of privacy-enhancing technologies and therefore lack generalization. This work provides an overview of concepts of privacy-enhancing technologies for biometric recognition in a unified framework. Key properties and differences between existing concepts are highlighted in detail at each processing step. Fundamental characteristics and limitations of existing technologies are discussed and related to data protection techniques and principles. Moreover, scenarios and methods for the assessment of privacy-enhancing technologies for biometric recognition are presented. This paper is meant as a point of entry to the field of data protection for biometric recognition applications and is directed towards experienced researchers as well as non-experts.

隐私增强技术是实施基本数据保护原则的技术。在生物识别方面,已经引入了不同类型的隐私增强技术来保护存储的生物识别数据,这些数据通常被归类为敏感数据。在这方面,已经提出了各种分类法和概念分类,并开展了标准化活动。不过,这些工作主要针对隐私增强技术的某些子类别,因此缺乏普遍性。这项工作在一个统一的框架内概述了用于生物识别的隐私增强技术的概念。在每个处理步骤中都详细强调了现有概念的关键特性和差异。讨论了现有技术的基本特征和局限性,并将其与数据保护技术和原则联系起来。此外,还介绍了用于生物识别的隐私增强技术的评估方案和方法。本文旨在作为生物识别应用数据保护领域的切入点,面向有经验的研究人员和非专业人员。
{"title":"An Overview of Privacy-Enhancing Technologies in Biometric Recognition","authors":"Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Ruben Vera, Christoph Busch","doi":"10.1145/3664596","DOIUrl":"https://doi.org/10.1145/3664596","url":null,"abstract":"<p>Privacy-enhancing technologies are technologies that implement fundamental data protection principles. With respect to biometric recognition, different types of privacy-enhancing technologies have been introduced for protecting stored biometric data which are generally classified as sensitive. In this regard, various taxonomies and conceptual categorizations have been proposed and standardisation activities have been carried out. However, these efforts have mainly been devoted to certain sub-categories of privacy-enhancing technologies and therefore lack generalization. This work provides an overview of concepts of privacy-enhancing technologies for biometric recognition in a unified framework. Key properties and differences between existing concepts are highlighted in detail at each processing step. Fundamental characteristics and limitations of existing technologies are discussed and related to data protection techniques and principles. Moreover, scenarios and methods for the assessment of privacy-enhancing technologies for biometric recognition are presented. This paper is meant as a point of entry to the field of data protection for biometric recognition applications and is directed towards experienced researchers as well as non-experts.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"104 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141256840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recent Advances for Aerial Object Detection: A Survey 航空物体探测的最新进展:调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-13 DOI: 10.1145/3664598
jiaxu leng, Yongming Ye, Mengjingcheng MO, Chenqiang Gao, Ji Gan, Bin Xiao, Xinbo Gao

Aerial object detection, as object detection in aerial images captured from an overhead perspective, has been widely applied in urban management, industrial inspection, and other aspects. However, the performance of existing aerial object detection algorithms is hindered by variations in object scales and orientations attributed to the aerial perspective. This survey presents a comprehensive review of recent advances in aerial object detection. We start with some basic concepts of aerial object detection and then summarize the five imbalance problems of aerial object detection, including scale imbalance, spatial imbalance, objective imbalance, semantic imbalance, and class imbalance. Moreover, we classify and analyze relevant methods and especially introduce the applications of aerial object detection in practical scenarios. Finally, the performance evaluation is presented on two popular aerial object detection datasets VisDrone-DET and DOTA, and we discuss several future directions that could facilitate the development of aerial object detection.

航空物体检测是指从俯瞰角度拍摄的航空图像中进行物体检测,已被广泛应用于城市管理、工业检测等领域。然而,现有航空物体检测算法的性能受到航空视角导致的物体比例和方向变化的影响。本研究全面回顾了航空物体检测的最新进展。我们首先介绍了空中物体检测的一些基本概念,然后总结了空中物体检测的五个不平衡问题,包括比例不平衡、空间不平衡、目标不平衡、语义不平衡和类别不平衡。此外,我们还对相关方法进行了分类和分析,并特别介绍了航空物体检测在实际场景中的应用。最后,我们对 VisDrone-DET 和 DOTA 这两个流行的航空物体检测数据集进行了性能评估,并讨论了未来促进航空物体检测发展的几个方向。
{"title":"Recent Advances for Aerial Object Detection: A Survey","authors":"jiaxu leng, Yongming Ye, Mengjingcheng MO, Chenqiang Gao, Ji Gan, Bin Xiao, Xinbo Gao","doi":"10.1145/3664598","DOIUrl":"https://doi.org/10.1145/3664598","url":null,"abstract":"<p>Aerial object detection, as object detection in aerial images captured from an overhead perspective, has been widely applied in urban management, industrial inspection, and other aspects. However, the performance of existing aerial object detection algorithms is hindered by variations in object scales and orientations attributed to the aerial perspective. This survey presents a comprehensive review of recent advances in aerial object detection. We start with some basic concepts of aerial object detection and then summarize the five imbalance problems of aerial object detection, including scale imbalance, spatial imbalance, objective imbalance, semantic imbalance, and class imbalance. Moreover, we classify and analyze relevant methods and especially introduce the applications of aerial object detection in practical scenarios. Finally, the performance evaluation is presented on two popular aerial object detection datasets VisDrone-DET and DOTA, and we discuss several future directions that could facilitate the development of aerial object detection.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"59 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140915139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making 顺序决策的符号、次符号和混合方法综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-11 DOI: 10.1145/3663366
Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares

In the field of Sequential Decision Making (SDM), two paradigms have historically vied for supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of reconciliation, this paper reviews AP, RL and hybrid methods (e.g., novel learn to plan techniques) for solving Sequential Decision Processes (SDPs), focusing on their knowledge representation: symbolic, subsymbolic or a combination. Additionally, it also covers methods for learning the SDP structure. Finally, we compare the advantages and drawbacks of the existing methods and conclude that neurosymbolic AI poses a promising approach for SDM, since it combines AP and RL with a hybrid knowledge representation.

在顺序决策(SDM)领域,历来有两种范式争夺主导地位:自动规划(AP)和强化学习(RL)。本着和解的精神,本文回顾了用于解决序列决策过程(SDP)的自动规划、强化学习和混合方法(如新颖的学习规划技术),重点是它们的知识表示:符号表示、次符号表示或组合表示。此外,它还包括学习 SDP 结构的方法。最后,我们比较了现有方法的优缺点,并得出结论:神经符号人工智能是一种很有前途的 SDM 方法,因为它将 AP 和 RL 与混合知识表示法相结合。
{"title":"A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making","authors":"Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares","doi":"10.1145/3663366","DOIUrl":"https://doi.org/10.1145/3663366","url":null,"abstract":"<p>In the field of Sequential Decision Making (SDM), two paradigms have historically vied for supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of reconciliation, this paper reviews AP, RL and hybrid methods (e.g., novel learn to plan techniques) for solving Sequential Decision Processes (SDPs), focusing on their knowledge representation: symbolic, subsymbolic or a combination. Additionally, it also covers methods for learning the SDP structure. Finally, we compare the advantages and drawbacks of the existing methods and conclude that neurosymbolic AI poses a promising approach for SDM, since it combines AP and RL with a hybrid knowledge representation.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"191 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140907232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight Deep Learning for Resource-Constrained Environments: A Survey 资源受限环境下的轻量级深度学习:调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-11 DOI: 10.1145/3657282
Hou-I Liu, Marco Galindo, Hongxia Xie, Lai-Kuan Wong, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng

Over the past decade, the dominance of deep learning has prevailed across various domains of artificial intelligence, including natural language processing, computer vision, and biomedical signal processing. While there have been remarkable improvements in model accuracy, deploying these models on lightweight devices, such as mobile phones and microcontrollers, is constrained by limited resources. In this survey, we provide comprehensive design guidance tailored for these devices, detailing the meticulous design of lightweight models, compression methods, and hardware acceleration strategies. The principal goal of this work is to explore methods and concepts for getting around hardware constraints without compromising the model’s accuracy. Additionally, we explore two notable paths for lightweight deep learning in the future: deployment techniques for TinyML and Large Language Models. Although these paths undoubtedly have potential, they also present significant challenges, encouraging research into unexplored areas.

在过去十年中,深度学习在人工智能的各个领域都占据了主导地位,包括自然语言处理、计算机视觉和生物医学信号处理。虽然模型的准确性有了显著提高,但在手机和微控制器等轻型设备上部署这些模型却受到有限资源的限制。在本研究中,我们为这些设备提供了全面的设计指导,详细介绍了轻量级模型的精心设计、压缩方法和硬件加速策略。这项工作的主要目标是探索在不影响模型准确性的前提下绕过硬件限制的方法和概念。此外,我们还探索了未来轻量级深度学习的两条显著路径:TinyML 和大型语言模型的部署技术。尽管这些路径无疑具有潜力,但它们也提出了巨大的挑战,鼓励对未开发领域进行研究。
{"title":"Lightweight Deep Learning for Resource-Constrained Environments: A Survey","authors":"Hou-I Liu, Marco Galindo, Hongxia Xie, Lai-Kuan Wong, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng","doi":"10.1145/3657282","DOIUrl":"https://doi.org/10.1145/3657282","url":null,"abstract":"<p>Over the past decade, the dominance of deep learning has prevailed across various domains of artificial intelligence, including natural language processing, computer vision, and biomedical signal processing. While there have been remarkable improvements in model accuracy, deploying these models on lightweight devices, such as mobile phones and microcontrollers, is constrained by limited resources. In this survey, we provide comprehensive design guidance tailored for these devices, detailing the meticulous design of lightweight models, compression methods, and hardware acceleration strategies. The principal goal of this work is to explore methods and concepts for getting around hardware constraints without compromising the model’s accuracy. Additionally, we explore two notable paths for lightweight deep learning in the future: deployment techniques for TinyML and Large Language Models. Although these paths undoubtedly have potential, they also present significant challenges, encouraging research into unexplored areas.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"122 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140907237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Task Learning in Natural Language Processing: An Overview 自然语言处理中的多任务学习:概述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-11 DOI: 10.1145/3663363
Shijie Chen, Yu Zhang, Qiang Yang

Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffer from overfitting and data scarcity problems that are pervasive in NLP tasks. In recent years, Multi-Task Learning (MTL), which can leverage useful information of related tasks to achieve simultaneous performance improvement on these tasks, has been used to handle these problems. In this paper, we give an overview of the use of MTL in NLP tasks. We first review MTL architectures used in NLP tasks and categorize them into four classes, including parallel architecture, hierarchical architecture, modular architecture, and generative adversarial architecture. Then we present optimization techniques on loss construction, gradient regularization, data sampling, and task scheduling to properly train a multi-task model. After presenting applications of MTL in a variety of NLP tasks, we introduce some benchmark datasets. Finally, we make a conclusion and discuss several possible research directions in this field.

深度学习方法在自然语言处理(NLP)领域取得了巨大成功。然而,直接训练深度神经模型往往会遇到过拟合和数据稀缺的问题,而这些问题在 NLP 任务中普遍存在。近年来,多任务学习(Multi-Task Learning,MTL)被用来处理这些问题,它可以利用相关任务的有用信息,实现这些任务性能的同步提升。本文概述了 MTL 在 NLP 任务中的应用。我们首先回顾了在 NLP 任务中使用的 MTL 架构,并将其分为四类,包括并行架构、分层架构、模块化架构和生成式对抗架构。然后,我们介绍了损失构建、梯度正则化、数据采样和任务调度方面的优化技术,以正确训练多任务模型。在介绍了 MTL 在各种 NLP 任务中的应用后,我们介绍了一些基准数据集。最后,我们做出结论,并讨论了该领域可能的几个研究方向。
{"title":"Multi-Task Learning in Natural Language Processing: An Overview","authors":"Shijie Chen, Yu Zhang, Qiang Yang","doi":"10.1145/3663363","DOIUrl":"https://doi.org/10.1145/3663363","url":null,"abstract":"<p>Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffer from overfitting and data scarcity problems that are pervasive in NLP tasks. In recent years, Multi-Task Learning (MTL), which can leverage useful information of related tasks to achieve simultaneous performance improvement on these tasks, has been used to handle these problems. In this paper, we give an overview of the use of MTL in NLP tasks. We first review MTL architectures used in NLP tasks and categorize them into four classes, including parallel architecture, hierarchical architecture, modular architecture, and generative adversarial architecture. Then we present optimization techniques on loss construction, gradient regularization, data sampling, and task scheduling to properly train a multi-task model. After presenting applications of MTL in a variety of NLP tasks, we introduce some benchmark datasets. Finally, we make a conclusion and discuss several possible research directions in this field.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"122 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140907233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of explainable fashion compatibility modeling methods 可解释时尚兼容性建模方法综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-11 DOI: 10.1145/3664614
Karolina Selwon, Julian Szyma?ski

The paper reviews methods used in the fashion compatibility recommendation domain. We select methods based on reproducibility, explainability, and novelty aspects and then organize them chronologically and thematically. We presented general characteristics of publicly available datasets that are related to the fashion compatibility recommendation task. Finally, we analyzed the representation bias of datasets, fashion-based algorithms’ sustainability, and explainable model assessment. The paper describes practical problem explanations, methodologies, and published datasets that may serve as an inspiration for further research. The proposed structure of the survey organizes knowledge in the fashion recommendation domain and will be beneficial for those who want to learn the topic from scratch, expand their knowledge, or find a new field for research. Furthermore, the information included in this paper could contribute to developing an effective and ethical fashion-based recommendation system.

本文回顾了时尚兼容性推荐领域所使用的方法。我们根据可重复性、可解释性和新颖性等方面来选择方法,然后按时间顺序和主题来组织这些方法。我们介绍了与时尚兼容性推荐任务相关的公开可用数据集的一般特征。最后,我们分析了数据集的代表性偏差、基于时尚的算法的可持续性以及可解释模型评估。本文介绍了实际问题的解释、方法和已发布的数据集,这些数据集可作为进一步研究的灵感来源。本文提出的调查结构对时尚推荐领域的知识进行了梳理,对于那些想从头开始学习该主题、扩展知识面或寻找新的研究领域的人来说都将大有裨益。此外,本文所包含的信息还有助于开发有效且符合道德规范的时尚推荐系统。
{"title":"A review of explainable fashion compatibility modeling methods","authors":"Karolina Selwon, Julian Szyma?ski","doi":"10.1145/3664614","DOIUrl":"https://doi.org/10.1145/3664614","url":null,"abstract":"<p>The paper reviews methods used in the fashion compatibility recommendation domain. We select methods based on reproducibility, explainability, and novelty aspects and then organize them chronologically and thematically. We presented general characteristics of publicly available datasets that are related to the fashion compatibility recommendation task. Finally, we analyzed the representation bias of datasets, fashion-based algorithms’ sustainability, and explainable model assessment. The paper describes practical problem explanations, methodologies, and published datasets that may serve as an inspiration for further research. The proposed structure of the survey organizes knowledge in the fashion recommendation domain and will be beneficial for those who want to learn the topic from scratch, expand their knowledge, or find a new field for research. Furthermore, the information included in this paper could contribute to developing an effective and ethical fashion-based recommendation system.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"50 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140907242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creativity and Machine Learning: A Survey 创造力与机器学习:一项调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-11 DOI: 10.1145/3664595
Giorgio Franceschelli, Mirco Musolesi

There is a growing interest in the area of machine learning and creativity. This survey presents an overview of the history and the state of the art of computational creativity theories, key machine learning techniques (including generative deep learning), and corresponding automatic evaluation methods. After presenting a critical discussion of the key contributions in this area, we outline the current research challenges and emerging opportunities in this field.

人们对机器学习和创造力领域的兴趣与日俱增。本调查报告概述了计算创造力理论、关键机器学习技术(包括生成式深度学习)以及相应的自动评估方法的历史和现状。在对该领域的主要贡献进行批判性讨论之后,我们概述了该领域当前的研究挑战和新出现的机遇。
{"title":"Creativity and Machine Learning: A Survey","authors":"Giorgio Franceschelli, Mirco Musolesi","doi":"10.1145/3664595","DOIUrl":"https://doi.org/10.1145/3664595","url":null,"abstract":"<p>There is a growing interest in the area of machine learning and creativity. This survey presents an overview of the history and the state of the art of computational creativity theories, key machine learning techniques (including generative deep learning), and corresponding automatic evaluation methods. After presenting a critical discussion of the key contributions in this area, we outline the current research challenges and emerging opportunities in this field.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"8 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140907186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Natural Language Reasoning, A Survey 自然语言推理,概览
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-09 DOI: 10.1145/3664194
Fei Yu, Hongbo Zhang, Prayag Tiwari, Benyou Wang

This survey paper proposes a clearer view of natural language reasoning in the field of Natural Language Processing (NLP), both conceptually and practically. Conceptually, we provide a distinct definition for natural language reasoning in NLP, based on both philosophy and NLP scenarios, discuss what types of tasks require reasoning, and introduce a taxonomy of reasoning. Practically, we conduct a comprehensive literature review on natural language reasoning in NLP, mainly covering classical logical reasoning, natural language inference, multi-hop question answering, and commonsense reasoning. The paper also identifies and views backward reasoning, a powerful paradigm for multi-step reasoning, and introduces defeasible reasoning as one of the most important future directions in natural language reasoning research. We focus on single-modality unstructured natural language text, excluding neuro-symbolic research and mathematical reasoning.

本调查报告从概念和实践两方面,对自然语言处理(NLP)领域的自然语言推理提出了更清晰的看法。在概念上,我们基于哲学和 NLP 场景,为 NLP 中的自然语言推理提供了一个独特的定义,讨论了哪些类型的任务需要推理,并介绍了推理分类法。在实践中,我们对 NLP 中的自然语言推理进行了全面的文献综述,主要涉及经典逻辑推理、自然语言推理、多跳问题解答和常识推理。此外,本文还指出并阐述了后向推理这一强大的多步推理范式,并介绍了作为自然语言推理研究未来最重要方向之一的可败推理。我们专注于单模态非结构化自然语言文本,不包括神经符号研究和数学推理。
{"title":"Natural Language Reasoning, A Survey","authors":"Fei Yu, Hongbo Zhang, Prayag Tiwari, Benyou Wang","doi":"10.1145/3664194","DOIUrl":"https://doi.org/10.1145/3664194","url":null,"abstract":"<p>This survey paper proposes a clearer view of natural language reasoning in the field of Natural Language Processing (NLP), both conceptually and practically. Conceptually, we provide a distinct definition for natural language reasoning in NLP, based on both philosophy and NLP scenarios, discuss what types of tasks require reasoning, and introduce a taxonomy of reasoning. Practically, we conduct a comprehensive literature review on natural language reasoning in NLP, mainly covering classical logical reasoning, natural language inference, multi-hop question answering, and commonsense reasoning. The paper also identifies and views backward reasoning, a powerful paradigm for multi-step reasoning, and introduces defeasible reasoning as one of the most important future directions in natural language reasoning research. We focus on single-modality unstructured natural language text, excluding neuro-symbolic research and mathematical reasoning.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"53 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140902989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthetic Data for Deep Learning in Computer Vision & Medical Imaging: A Means to Reduce Data Bias 计算机视觉与医学影像深度学习的合成数据:减少数据偏差的方法
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-05-09 DOI: 10.1145/3663759
Anthony Paproki, Olivier Salvado, Clinton Fookes

Deep-learning (DL) performs well in computer-vision and medical-imaging automated decision-making applications. A bottleneck of DL stems from the large amount of labelled data required to train accurate models that generalise well. Data scarcity and imbalance are common problems in imaging applications that can lead DL models towards biased decision making. A solution to this problem is synthetic data. Synthetic data is an inexpensive substitute to real data for improved accuracy and generalisability of DL models. This survey reviews the recent methods published in relation to the creation and use of synthetic data for computer-vision and medical-imaging DL applications. The focus will be on applications that utilised synthetic data to improve DL models by either incorporating an increased diversity of data that is difficult to obtain in real life, or by reducing a bias caused by class imbalance. Computer-graphics software and generative networks are the most popular data generation techniques encountered in the literature. We highlight their suitability for typical computer-vision and medical-imaging applications, and present promising avenues for research to overcome their computational and theoretical limitations.

深度学习(DL)在计算机视觉和医学影像自动决策应用中表现出色。深度学习的瓶颈在于,要训练出具有良好泛化能力的精确模型,需要大量标记数据。数据稀缺和不平衡是成像应用中的常见问题,会导致 DL 模型的决策出现偏差。解决这一问题的方法是合成数据。合成数据是真实数据的廉价替代品,可提高 DL 模型的准确性和通用性。本调查回顾了最近发表的与计算机视觉和医学成像 DL 应用中合成数据的创建和使用有关的方法。重点将放在利用合成数据改进 DL 模型的应用上,这些方法要么是通过增加现实生活中难以获得的数据的多样性,要么是通过减少类别不平衡造成的偏差。计算机图形软件和生成网络是文献中最常用的数据生成技术。我们重点介绍了它们在典型计算机视觉和医学影像应用中的适用性,并提出了克服其计算和理论局限性的可行研究途径。
{"title":"Synthetic Data for Deep Learning in Computer Vision & Medical Imaging: A Means to Reduce Data Bias","authors":"Anthony Paproki, Olivier Salvado, Clinton Fookes","doi":"10.1145/3663759","DOIUrl":"https://doi.org/10.1145/3663759","url":null,"abstract":"<p>Deep-learning (DL) performs well in computer-vision and medical-imaging automated decision-making applications. A bottleneck of DL stems from the large amount of labelled data required to train accurate models that generalise well. Data scarcity and imbalance are common problems in imaging applications that can lead DL models towards biased decision making. A solution to this problem is synthetic data. Synthetic data is an inexpensive substitute to real data for improved accuracy and generalisability of DL models. This survey reviews the recent methods published in relation to the creation and use of synthetic data for computer-vision and medical-imaging DL applications. The focus will be on applications that utilised synthetic data to improve DL models by either incorporating an increased diversity of data that is difficult to obtain in real life, or by reducing a bias caused by class imbalance. Computer-graphics software and generative networks are the most popular data generation techniques encountered in the literature. We highlight their suitability for typical computer-vision and medical-imaging applications, and present promising avenues for research to overcome their computational and theoretical limitations.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"121 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140903047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Computing Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1