首页 > 最新文献

ACM Computing Surveys最新文献

英文 中文
State of the Art and Potentialities of Graph-level Learning 图层面学习的现状与潜力
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-09-13 DOI: 10.1145/3695863
Zhenyu Yang, Ge Zhang, Jia Wu, Jian Yang, Quan Z. Sheng, Shan Xue, Chuan Zhou, Charu Aggarwal, Hao Peng, Wenbin Hu, Edwin Hancock, Pietro Liò
Graphs have a superior ability to represent relational data, like chemical compounds, proteins, and social networks. Hence, graph-level learning, which takes a set of graphs as input, has been applied to many tasks including comparison, regression, classification, and more. Traditional approaches to learning a set of graphs heavily rely on hand-crafted features, such as substructures. While these methods benefit from good interpretability, they often suffer from computational bottlenecks as they cannot skirt the graph isomorphism problem. Conversely, deep learning has helped graph-level learning adapt to the growing scale of graphs by extracting features automatically and encoding graphs into low-dimensional representations. As a result, these deep graph learning methods have been responsible for many successes. Yet, no comprehensive survey reviews graph-level learning starting with traditional learning and moving through to the deep learning approaches. This article fills this gap and frames the representative algorithms into a systematic taxonomy covering traditional learning, graph-level deep neural networks, graph-level graph neural networks, and graph pooling. In addition, the evolution and interaction between methods from these four branches within their developments are examined to provide an in-depth analysis. This is followed by a brief review of the benchmark datasets, evaluation metrics, and common downstream applications. Finally, the survey concludes with an in-depth discussion of 12 current and future directions in this booming field.
图在表示化合物、蛋白质和社交网络等关系数据方面具有卓越的能力。因此,将一组图作为输入的图级学习已被应用到许多任务中,包括比较、回归、分类等。学习图集的传统方法严重依赖于手工创建的特征,例如子结构。虽然这些方法具有良好的可解释性,但由于无法解决图同构问题,因此往往会遭遇计算瓶颈。相反,深度学习通过自动提取特征并将图编码为低维表示,帮助图级学习适应图规模的不断扩大。因此,这些深度图学习方法取得了许多成功。然而,从传统学习到深度学习方法,还没有一份全面的调查报告对图级学习进行了回顾。本文填补了这一空白,并将具有代表性的算法归纳为一个系统的分类法,涵盖传统学习、图级深度神经网络、图级图神经网络和图池化。此外,文章还对这四个分支的方法在其发展过程中的演变和相互作用进行了深入分析。随后,简要回顾了基准数据集、评估指标和常见的下游应用。最后,调查报告对这一蓬勃发展领域的 12 个当前和未来方向进行了深入讨论。
{"title":"State of the Art and Potentialities of Graph-level Learning","authors":"Zhenyu Yang, Ge Zhang, Jia Wu, Jian Yang, Quan Z. Sheng, Shan Xue, Chuan Zhou, Charu Aggarwal, Hao Peng, Wenbin Hu, Edwin Hancock, Pietro Liò","doi":"10.1145/3695863","DOIUrl":"https://doi.org/10.1145/3695863","url":null,"abstract":"Graphs have a superior ability to represent relational data, like chemical compounds, proteins, and social networks. Hence, graph-level learning, which takes a set of graphs as input, has been applied to many tasks including comparison, regression, classification, and more. Traditional approaches to learning a set of graphs heavily rely on hand-crafted features, such as substructures. While these methods benefit from good interpretability, they often suffer from computational bottlenecks as they cannot skirt the graph isomorphism problem. Conversely, deep learning has helped graph-level learning adapt to the growing scale of graphs by extracting features automatically and encoding graphs into low-dimensional representations. As a result, these deep graph learning methods have been responsible for many successes. Yet, no comprehensive survey reviews graph-level learning starting with traditional learning and moving through to the deep learning approaches. This article fills this gap and frames the representative algorithms into a systematic taxonomy covering traditional learning, graph-level deep neural networks, graph-level graph neural networks, and graph pooling. In addition, the evolution and interaction between methods from these four branches within their developments are examined to provide an in-depth analysis. This is followed by a brief review of the benchmark datasets, evaluation metrics, and common downstream applications. Finally, the survey concludes with an in-depth discussion of 12 current and future directions in this booming field.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"5 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142374640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Image steganography approaches and their detection strategies: a survey 图像隐写术方法及其检测策略概览
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-09-10 DOI: 10.1145/3694965
Meike Helena Kombrink, Zeno Jean Marius Hubert Geradts, Marcel Worring
Steganography is the art and science of hidden (or covered) communication. In digital steganography, the bits of images, videos, audio and text files are tweaked to represent the information to hide. This paper covers the current methods for hiding information in images, alongside steganalysis methods which aim to detect the presence of steganography. By reviewing 456 references, this paper discusses the different approaches that can be taken toward steganography and its much less widely studied counterpart. Currently in research older steganography approaches are more widely used than newer methods even though these show greater potential. New methods do have flaws, therefore more research is needed to make these practically applicable. For steganalysis one of the greatest challenges is the generalisability. Often one scheme can detect the presence of one specific hiding method. More research is needed to combine current schemes and/or create new generalisable schemes. To allow readers to compare results between different papers in our work performance indications of all steganalysis methods are outlined and a comparison of performance is included. This comparison is given using ’topological sorting’ graphs, which compares detection results from all papers (as stated in the papers themselves) on different steganographic schemes.
隐写术是一门隐藏(或掩盖)通信的艺术和科学。在数字隐写术中,对图像、视频、音频和文本文件的比特进行调整,以表示要隐藏的信息。本文介绍了目前在图像中隐藏信息的方法,以及旨在检测是否存在隐写术的隐写分析方法。通过查阅 456 篇参考文献,本文讨论了可用于隐写术的不同方法及其研究较少的对应方法。目前在研究中,旧的隐写术方法比新的方法使用得更广泛,尽管这些方法显示出更大的潜力。新方法确实存在缺陷,因此需要进行更多的研究,使这些方法切实可行。对于隐写分析来说,最大的挑战之一就是通用性。通常情况下,一种方案只能检测到一种特定隐藏方法的存在。需要开展更多的研究,将现有的方案结合起来,并/或创建新的通用方案。为了让读者能够比较不同论文之间的结果,我们概述了所有隐写分析方法的性能指标,并对性能进行了比较。比较采用 "拓扑排序 "图,比较了所有论文(如论文本身所述)对不同隐写方案的检测结果。
{"title":"Image steganography approaches and their detection strategies: a survey","authors":"Meike Helena Kombrink, Zeno Jean Marius Hubert Geradts, Marcel Worring","doi":"10.1145/3694965","DOIUrl":"https://doi.org/10.1145/3694965","url":null,"abstract":"Steganography is the art and science of hidden (or covered) communication. In digital steganography, the bits of images, videos, audio and text files are tweaked to represent the information to hide. This paper covers the current methods for hiding information in images, alongside steganalysis methods which aim to detect the presence of steganography. By reviewing 456 references, this paper discusses the different approaches that can be taken toward steganography and its much less widely studied counterpart. Currently in research older steganography approaches are more widely used than newer methods even though these show greater potential. New methods do have flaws, therefore more research is needed to make these practically applicable. For steganalysis one of the greatest challenges is the generalisability. Often one scheme can detect the presence of one specific hiding method. More research is needed to combine current schemes and/or create new generalisable schemes. To allow readers to compare results between different papers in our work performance indications of all steganalysis methods are outlined and a comparison of performance is included. This comparison is given using ’topological sorting’ graphs, which compares detection results from all papers (as stated in the papers themselves) on different steganographic schemes.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"5 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142374641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Recommender Systems: A Survey 多模式推荐系统:调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-09-10 DOI: 10.1145/3695461
Qidong Liu, Jiaxi Hu, Yutian Xiao, Xiangyu Zhao, Jingtong Gao, Wanyu Wang, Qing Li, Jiliang Tang
The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc. , understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, M ultimodal R ecommender S ystem (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, i.e., Modality Encoder , Feature Interaction , Feature Enhancement and Model Optimization . Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.
推荐系统(RS)已成为在线服务不可或缺的工具包。它们配备了各种深度学习技术,可根据标识符和属性信息为用户偏好建模。随着短视频、新闻等多媒体服务的出现,在推荐时理解这些内容变得至关重要。随着短视频、新闻等多媒体服务的出现,在推荐时理解这些内容变得至关重要。此外,多模态特征还有助于缓解 RS 中数据稀疏的问题。因此,多模态推荐系统(Multimodal R ecommender S ystem,MRS)最近引起了学术界和产业界的广泛关注。本文将主要从技术角度对 MRS 模型进行全面考察。首先,我们总结了 MRS 的一般程序和主要挑战。然后,我们按照模态编码器、特征交互、特征增强和模型优化四个类别介绍了现有的 MRS 模型。此外,为了方便有志于该领域研究的人员,我们还总结了数据集和代码资源。最后,我们讨论了 MRS 未来的一些发展方向,并对本文进行了总结。为了获取更多调查论文的细节,如实现代码,我们开放了一个源代码库。
{"title":"Multimodal Recommender Systems: A Survey","authors":"Qidong Liu, Jiaxi Hu, Yutian Xiao, Xiangyu Zhao, Jingtong Gao, Wanyu Wang, Qing Li, Jiliang Tang","doi":"10.1145/3695461","DOIUrl":"https://doi.org/10.1145/3695461","url":null,"abstract":"The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and <jats:italic>etc.</jats:italic> , understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, M ultimodal R ecommender S ystem (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, <jats:italic>i.e.,</jats:italic> Modality Encoder , Feature Interaction , Feature Enhancement and Model Optimization . Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"40 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142374645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Survey on Biclustering-based Collaborative Filtering 基于双聚类的协同过滤综合调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-22 DOI: 10.1145/3674723
Miguel G. Silva, Sara C. Madeira, Rui Henriques

Collaborative Filtering (CF) is achieving a plateau of high popularity. Still, recommendation success is challenged by the diversity of user preferences, structural sparsity of user-item ratings, and inherent subjectivity of rating scales. The increasing user base and item dimensionality of e-commerce and e-entertainment platforms creates opportunities, while further raising generalization and scalability needs. Moved by the need to answer these challenges, user-based and item-based clustering approaches for CF became pervasive. However, classic clustering approaches assess user (item) rating similarity across all items (users), neglecting the rich diversity of item and user profiles. Instead, as preferences are generally simultaneously correlated on subsets of users and items, biclustering approaches provide a natural alternative, being successfully applied to CF for nearly two decades and synergistically integrated with emerging deep learning CF stances. Notwithstanding, biclustering-based CF principles are dispersed, causing state-of-the-art approaches to show accentuated behavioral differences. This work offers a structured view on how biclustering aspects impact recommendation success, coverage, and efficiency. To this end, we introduce a taxonomy to categorize contributions in this field and comprehensively survey state-of-the-art biclustering approaches to CF, highlighting their limitations and potentialities.

协同过滤技术(CF)正处于一个高流行度的阶段。然而,用户偏好的多样性、用户-物品评分的结构稀疏性以及评分尺度固有的主观性都给推荐的成功带来了挑战。电子商务和电子娱乐平台不断增加的用户群和项目维度创造了机会,同时也进一步提高了对通用性和可扩展性的需求。为了应对这些挑战,基于用户和项目的 CF 聚类方法变得非常普遍。然而,传统的聚类方法评估的是所有项目(用户)的用户(项目)评级相似性,忽略了项目和用户配置文件的丰富多样性。相反,由于用户和物品子集的偏好通常同时相关,双聚类方法提供了一个自然的替代方案,近二十年来已成功应用于 CF 领域,并与新兴的深度学习 CF 立场协同整合。尽管如此,基于双聚类的 CF 原理并不统一,导致最先进的方法表现出明显的行为差异。这项工作提供了一个结构化的视角,说明双聚类方面如何影响推荐的成功率、覆盖率和效率。为此,我们引入了一种分类法来对这一领域的贡献进行分类,并全面考察了最先进的双聚类方法,突出了它们的局限性和潜力。
{"title":"A Comprehensive Survey on Biclustering-based Collaborative Filtering","authors":"Miguel G. Silva, Sara C. Madeira, Rui Henriques","doi":"10.1145/3674723","DOIUrl":"https://doi.org/10.1145/3674723","url":null,"abstract":"<p>Collaborative Filtering (CF) is achieving a plateau of high popularity. Still, recommendation success is challenged by the diversity of user preferences, structural sparsity of user-item ratings, and inherent subjectivity of rating scales. The increasing user base and item dimensionality of e-commerce and e-entertainment platforms creates opportunities, while further raising generalization and scalability needs. Moved by the need to answer these challenges, user-based and item-based clustering approaches for CF became pervasive. However, classic clustering approaches assess user (item) rating similarity across all items (users), neglecting the rich diversity of item and user profiles. Instead, as preferences are generally simultaneously correlated on subsets of users and items, biclustering approaches provide a natural alternative, being successfully applied to CF for nearly two decades and synergistically integrated with emerging deep learning CF stances. Notwithstanding, biclustering-based CF principles are dispersed, causing state-of-the-art approaches to show accentuated behavioral differences. This work offers a structured view on how biclustering aspects impact recommendation success, coverage, and efficiency. To this end, we introduce a taxonomy to categorize contributions in this field and comprehensively survey state-of-the-art biclustering approaches to CF, highlighting their limitations and potentialities.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"82 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141439849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Object-Centric Learning with Capsule Networks: A Survey 利用胶囊网络进行以对象为中心的学习:调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-21 DOI: 10.1145/3674500
Fabio De Sousa Ribeiro, Kevin Duarte, Miles Everett, Georgios Leontidis, Mubarak Shah

Capsule networks emerged as a promising alternative to convolutional neural networks for learning object-centric representations. The idea is to explicitly model part-whole hierarchies by using groups of neurons called capsules to encode visual entities, then learn the relationships between these entities dynamically from data. However, a major hurdle for capsule network research has been the lack of a reliable point of reference for understanding their foundational ideas and motivations. This survey provides a comprehensive and critical overview of capsule networks which aims to serve as a main point of reference going forward. To that end, we introduce the fundamental concepts and motivations behind capsule networks, such as equivariant inference. We then cover various technical advances in capsule routing algorithms as well as alternative geometric and generative formulations. We provide a detailed explanation of how capsule networks relate to the attention mechanism in Transformers and uncover non-trivial conceptual similarities between them in the context of object-centric representation learning. We also review the extensive applications of capsule networks in computer vision, video and motion, graph representation learning, natural language processing, medical imaging, and many others. To conclude, we provide an in-depth discussion highlighting promising directions for future work.

在学习以对象为中心的表征方面,"胶囊 "网络是卷积神经网络的一种有前途的替代方案。其理念是通过使用被称为 "胶囊 "的神经元组来编码视觉实体,然后从数据中动态学习这些实体之间的关系,从而明确建立部分-整体层次结构模型。然而,胶囊网络研究的一个主要障碍是缺乏一个可靠的参照点来了解其基本思想和动机。本调查报告对胶囊网络进行了全面和批判性的概述,旨在作为今后研究的主要参考点。为此,我们将介绍胶囊网络背后的基本概念和动机,例如等变量推理。然后,我们将介绍胶囊路由算法的各种技术进展,以及其他几何和生成公式。我们详细解释了胶囊网络与《变形金刚》中的注意力机制之间的关系,并揭示了在以对象为中心的表征学习方面,胶囊网络与注意力机制在概念上的相似之处。我们还回顾了胶囊网络在计算机视觉、视频与运动、图表示学习、自然语言处理、医学成像等领域的广泛应用。最后,我们还深入讨论了未来工作的发展方向。
{"title":"Object-Centric Learning with Capsule Networks: A Survey","authors":"Fabio De Sousa Ribeiro, Kevin Duarte, Miles Everett, Georgios Leontidis, Mubarak Shah","doi":"10.1145/3674500","DOIUrl":"https://doi.org/10.1145/3674500","url":null,"abstract":"<p>Capsule networks emerged as a promising alternative to convolutional neural networks for learning object-centric representations. The idea is to explicitly model part-whole hierarchies by using groups of neurons called <i>capsules</i> to encode visual entities, then learn the relationships between these entities dynamically from data. However, a major hurdle for capsule network research has been the lack of a reliable point of reference for understanding their foundational ideas and motivations. This survey provides a comprehensive and critical overview of capsule networks which aims to serve as a main point of reference going forward. To that end, we introduce the fundamental concepts and motivations behind capsule networks, such as <i>equivariant inference</i>. We then cover various technical advances in capsule routing algorithms as well as alternative geometric and generative formulations. We provide a detailed explanation of how capsule networks relate to the attention mechanism in Transformers and uncover non-trivial conceptual similarities between them in the context of object-centric representation learning. We also review the extensive applications of capsule networks in computer vision, video and motion, graph representation learning, natural language processing, medical imaging, and many others. To conclude, we provide an in-depth discussion highlighting promising directions for future work.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"75 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141435748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey of 3D Space Path-Planning Methods and Algorithms 三维空间路径规划方法和算法概览
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-20 DOI: 10.1145/3673896
Hakimeh mazaheri, salman goli, ali nourollah

Due to their agility, cost-effectiveness, and high maneuverability, Unmanned Aerial Vehicles (UAVs) have attracted considerable attention from researchers and investors alike. Path planning is one of the practical subsets of motion planning for UAVs. It prevents collisions and ensures complete coverage of an area. This study provides a structured review of applicable algorithms and coverage path planning solutions in Three-Dimensional (3D) space, presenting state-of-the-art technologies related to heuristic decomposition approaches for UAVs and the forefront challenges. Additionally, it introduces a comprehensive and novel classification of practical methods and representational techniques for path-planning algorithms. This depends on environmental characteristics and optimal parameters in the real world. The first category presents a classification of semi-accurate decomposition approaches as the most practical decomposition method, along with the data structure of these practices, categorized by phases. The second category illustrates path-planning processes based on symbolic techniques in 3D space. Additionally, it provides a critical analysis of crucial influential approaches based on their importance in path quality and researchers' attention, highlighting their limitations and research gaps. Furthermore, it will provide the most pertinent recommendations for future work for researchers. The studies demonstrate an apparent inclination among experimenters towards using the semi-accurate cellular decomposition approach to improve 3D path planning.

无人驾驶飞行器(UAV)因其灵活性、成本效益和高机动性,吸引了研究人员和投资者的极大关注。路径规划是无人飞行器运动规划的实用子集之一。它可以防止碰撞并确保完全覆盖一个区域。本研究对三维(3D)空间中的适用算法和覆盖路径规划解决方案进行了结构化回顾,介绍了与无人飞行器启发式分解方法相关的最新技术和前沿挑战。此外,它还对路径规划算法的实用方法和表示技术进行了全面而新颖的分类。这取决于现实世界中的环境特征和最佳参数。第一类介绍了半精确分解方法的分类,这是最实用的分解方法,同时还介绍了这些做法的数据结构,并按阶段进行了分类。第二类介绍基于三维空间符号技术的路径规划过程。此外,它还根据路径质量的重要性和研究人员的关注度,对具有重要影响的方法进行了批判性分析,强调了这些方法的局限性和研究空白。此外,它还将为研究人员今后的工作提供最中肯的建议。研究表明,实验人员明显倾向于使用半精确蜂窝分解方法来改进三维路径规划。
{"title":"A survey of 3D Space Path-Planning Methods and Algorithms","authors":"Hakimeh mazaheri, salman goli, ali nourollah","doi":"10.1145/3673896","DOIUrl":"https://doi.org/10.1145/3673896","url":null,"abstract":"<p>Due to their agility, cost-effectiveness, and high maneuverability, Unmanned Aerial Vehicles (UAVs) have attracted considerable attention from researchers and investors alike. Path planning is one of the practical subsets of motion planning for UAVs. It prevents collisions and ensures complete coverage of an area. This study provides a structured review of applicable algorithms and coverage path planning solutions in Three-Dimensional (3D) space, presenting state-of-the-art technologies related to heuristic decomposition approaches for UAVs and the forefront challenges. Additionally, it introduces a comprehensive and novel classification of practical methods and representational techniques for path-planning algorithms. This depends on environmental characteristics and optimal parameters in the real world. The first category presents a classification of semi-accurate decomposition approaches as the most practical decomposition method, along with the data structure of these practices, categorized by phases. The second category illustrates path-planning processes based on symbolic techniques in 3D space. Additionally, it provides a critical analysis of crucial influential approaches based on their importance in path quality and researchers' attention, highlighting their limitations and research gaps. Furthermore, it will provide the most pertinent recommendations for future work for researchers. The studies demonstrate an apparent inclination among experimenters towards using the semi-accurate cellular decomposition approach to improve 3D path planning.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"44 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141430377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-Based Affective Music Generation Systems: A Review of Methods and Challenges 基于人工智能的情感音乐生成系统:方法与挑战综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-17 DOI: 10.1145/3672554
Adyasha Dash, Kathleen Agres

Music is a powerful medium for altering the emotional state of the listener. In recent years, with significant advancements in computing capabilities, artificial intelligence-based (AI-based) approaches have become popular for creating affective music generation (AMG) systems. Entertainment, healthcare, and sensor-integrated interactive system design are a few of the areas in which AI-based affective music generation (AI-AMG) systems may have a significant impact. Given the surge of interest in this topic, this article aims to provide a comprehensive review of controllable AI-AMG systems. The main building blocks of an AI-AMG system are discussed, and existing systems are formally categorized based on the core algorithm used for music generation. In addition, this article discusses the main musical features employed to compose affective music, along with the respective AI-based approaches used for tailoring them. Lastly, the main challenges and open questions in this field, as well as their potential solutions, are presented to guide future research. We hope that this review will be useful for readers seeking to understand the state-of-the-art in AI-AMG systems, and gain an overview of the methods used for developing them, thereby helping them explore this field in the future.

音乐是改变听众情绪状态的强大媒介。近年来,随着计算能力的大幅提升,基于人工智能(AI)的方法已成为创建情感音乐生成(AMG)系统的流行方法。娱乐、医疗保健和传感器集成互动系统设计是基于人工智能的情感音乐生成(AI-AMG)系统可能产生重大影响的几个领域。鉴于人们对这一主题的兴趣激增,本文旨在对可控人工智能情感音乐生成系统进行全面评述。文章讨论了人工智能-AMG 系统的主要构件,并根据音乐生成所使用的核心算法对现有系统进行了正式分类。此外,本文还讨论了用于创作情感音乐的主要音乐特征,以及用于调整这些特征的基于人工智能的方法。最后,本文提出了这一领域的主要挑战和悬而未决的问题,以及潜在的解决方案,以指导未来的研究。我们希望这篇综述能帮助读者了解人工智能-AMG 系统的最新进展,并对开发这些系统所使用的方法有一个全面的认识,从而帮助他们在未来探索这一领域。
{"title":"AI-Based Affective Music Generation Systems: A Review of Methods and Challenges","authors":"Adyasha Dash, Kathleen Agres","doi":"10.1145/3672554","DOIUrl":"https://doi.org/10.1145/3672554","url":null,"abstract":"<p>Music is a powerful medium for altering the emotional state of the listener. In recent years, with significant advancements in computing capabilities, artificial intelligence-based (AI-based) approaches have become popular for creating affective music generation (AMG) systems. Entertainment, healthcare, and sensor-integrated interactive system design are a few of the areas in which AI-based affective music generation (AI-AMG) systems may have a significant impact. Given the surge of interest in this topic, this article aims to provide a comprehensive review of controllable AI-AMG systems. The main building blocks of an AI-AMG system are discussed, and existing systems are formally categorized based on the core algorithm used for music generation. In addition, this article discusses the main musical features employed to compose affective music, along with the respective AI-based approaches used for tailoring them. Lastly, the main challenges and open questions in this field, as well as their potential solutions, are presented to guide future research. We hope that this review will be useful for readers seeking to understand the state-of-the-art in AI-AMG systems, and gain an overview of the methods used for developing them, thereby helping them explore this field in the future.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"12 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141333730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward a Privacy-Preserving Face Recognition System: A Survey of Leakages and Solutions 实现保护隐私的人脸识别系统:泄漏与解决方案调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-17 DOI: 10.1145/3673224
Lamyanba Laishram, Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

Abstract Recent advancements in face recognition (FR) technology in surveillance systems make it possible to monitor a person as they move around. FR gathers a lot of information depending on the quantity and data sources. The most severe privacy concern with FR technology is its use to identify people in real-time public monitoring applications or via an aggregation of datasets without their consent. Due to the importance of private data leakage in the FR environment, academia and business have given it a lot of attention, leading to the creation of several research initiatives meant to solve the corresponding challenges. As a result, this study aims to look at privacy-preserving face recognition (PPFR) methods. We propose a detailed and systematic study of the PPFR based on our suggested six-level framework. Along with all the levels, more emphasis is given to the processing of face images as it is more crucial for FR technology. We explore the privacy leakage issues and offer an up-to-date and thorough summary of current research trends in the FR system from six perspectives. We also encourage additional research initiatives in this promising area for further investigation.

摘要 监控系统中人脸识别(FR)技术的最新进展使监视一个人的行动成为可能。根据数量和数据来源的不同,人脸识别技术可以收集大量信息。人脸识别技术最令人担忧的隐私问题是,它在实时公共监控应用中或在未经本人同意的情况下通过数据集的汇总来识别人的身份。由于私人数据泄漏在 FR 环境中的重要性,学术界和企业界对此给予了极大关注,并发起了多项旨在解决相应挑战的研究计划。因此,本研究旨在探讨保护隐私的人脸识别(PPFR)方法。我们根据建议的六级框架对 PPFR 进行了详细而系统的研究。在所有层次中,我们更加重视人脸图像的处理,因为这对于人脸识别技术来说更为关键。我们探讨了隐私泄露问题,并从六个方面对当前 FR 系统的研究趋势进行了最新、最全面的总结。我们还鼓励在这一前景广阔的领域开展更多的研究活动,以作进一步探讨。
{"title":"Toward a Privacy-Preserving Face Recognition System: A Survey of Leakages and Solutions","authors":"Lamyanba Laishram, Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung","doi":"10.1145/3673224","DOIUrl":"https://doi.org/10.1145/3673224","url":null,"abstract":"<p><b>Abstract</b> Recent advancements in face recognition (FR) technology in surveillance systems make it possible to monitor a person as they move around. FR gathers a lot of information depending on the quantity and data sources. The most severe privacy concern with FR technology is its use to identify people in real-time public monitoring applications or via an aggregation of datasets without their consent. Due to the importance of private data leakage in the FR environment, academia and business have given it a lot of attention, leading to the creation of several research initiatives meant to solve the corresponding challenges. As a result, this study aims to look at privacy-preserving face recognition (PPFR) methods. We propose a detailed and systematic study of the PPFR based on our suggested six-level framework. Along with all the levels, more emphasis is given to the processing of face images as it is more crucial for FR technology. We explore the privacy leakage issues and offer an up-to-date and thorough summary of current research trends in the FR system from six perspectives. We also encourage additional research initiatives in this promising area for further investigation.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"9 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141333700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure UAV (Drone) and the Great Promise of AI 安全无人机和人工智能的巨大前景
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-17 DOI: 10.1145/3673225
Behrouz Zolfaghari, Mostafa Abbasmollaei, Fahimeh Hajizadeh, Naoto Yanai, Khodakhast Bibak

UAVs have found their applications in numerous applications from recreational activities to business in addition to military and strategic fields. However, research on UAVs is not going on as quickly as the technology. Especially, when it comes to the security of these devices, the academia is lagging behind the industry. This gap motivates our work in this paper as a stepping stone for future research in this area. A comprehensive survey on the security of UAVs and UAV-based systems can help the research community keep pace with, or even lead the industry. Although there are several reviews on UAVs or related areas, there is no recent survey broadly covering various aspects of security. Moreover, none of the existing surveys highlights current and future trends with a focus on the role of an omnipresent technology such as AI. This paper endeavors to overcome these shortcomings. We conduct a comprehensive review on security challenges of UAVs as well as the related security controls. Then we develop a future roadmap for research in this area with a focus on the role of AI. The future roadmap is established based on the identified current trends, under-researched topics, and a future look-ahead.

除军事和战略领域外,无人驾驶飞行器还应用于从娱乐活动到商业等众多领域。然而,对无人飞行器的研究并没有像技术发展那样迅速。特别是在这些设备的安全性方面,学术界落后于产业界。这一差距促使我们在本文中开展工作,为这一领域的未来研究铺平道路。对无人机和基于无人机的系统的安全性进行全面调查,有助于研究界跟上甚至引领行业发展。虽然有几篇关于无人机或相关领域的综述,但近期还没有一份广泛涵盖各方面安全问题的调查报告。此外,现有的调查报告都没有突出当前和未来的趋势,重点关注人工智能等无所不在的技术的作用。本文致力于克服这些不足。我们全面回顾了无人机的安全挑战以及相关的安全控制。然后,我们以人工智能的作用为重点,为这一领域的研究制定了未来路线图。未来路线图是根据已确定的当前趋势、研究不足的主题和未来展望制定的。
{"title":"Secure UAV (Drone) and the Great Promise of AI","authors":"Behrouz Zolfaghari, Mostafa Abbasmollaei, Fahimeh Hajizadeh, Naoto Yanai, Khodakhast Bibak","doi":"10.1145/3673225","DOIUrl":"https://doi.org/10.1145/3673225","url":null,"abstract":"<p>UAVs have found their applications in numerous applications from recreational activities to business in addition to military and strategic fields. However, research on UAVs is not going on as quickly as the technology. Especially, when it comes to the security of these devices, the academia is lagging behind the industry. This gap motivates our work in this paper as a stepping stone for future research in this area. A comprehensive survey on the security of UAVs and UAV-based systems can help the research community keep pace with, or even lead the industry. Although there are several reviews on UAVs or related areas, there is no recent survey broadly covering various aspects of security. Moreover, none of the existing surveys highlights current and future trends with a focus on the role of an omnipresent technology such as AI. This paper endeavors to overcome these shortcomings. We conduct a comprehensive review on security challenges of UAVs as well as the related security controls. Then we develop a future roadmap for research in this area with a focus on the role of AI. The future roadmap is established based on the identified current trends, under-researched topics, and a future look-ahead.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"42 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141333580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Introduction to the Compute Express Link (CXL) Interconnect 计算高速链路 (CXL) 互连简介
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-06-14 DOI: 10.1145/3669900
Debendra Das Sharma, Robert Blankenship, Daniel Berger

The Compute Express Link (CXL) is an open industry-standard interconnect between processors and devices such as accelerators, memory buffers, smart network interfaces, persistent memory, and solid-state drives. CXL offers coherency and memory semantics with bandwidth that scales with PCIe bandwidth while achieving significantly lower latency than PCIe. All major CPU vendors, device vendors, and datacenter operators have adopted CXL as a common standard. This enables an inter-operable ecosystem that supports key computing use cases including highly efficient accelerators, server memory bandwidth and capacity expansion, multi-server resource pooling and sharing, and efficient peer-to-peer communication. This survey provides an introduction to CXL covering the standards CXL 1.0, CXL 2.0, and CXL 3.0. We further survey CXL implementations, discuss CXL's impact on the datacenter landscape, and future directions.

Compute Express Link(CXL)是处理器与加速器、内存缓冲器、智能网络接口、持久内存和固态硬盘等设备之间的开放式行业标准互连。CXL 提供一致性和内存语义,其带宽可与 PCIe 带宽相匹配,同时延迟大大低于 PCIe。所有主要 CPU 供应商、设备供应商和数据中心运营商都已采用 CXL 作为通用标准。这样就形成了一个可互操作的生态系统,支持包括高效加速器、服务器内存带宽和容量扩展、多服务器资源池和共享以及高效点对点通信在内的关键计算用例。本调查介绍了 CXL,包括 CXL 1.0、CXL 2.0 和 CXL 3.0 标准。我们将进一步调查 CXL 的实施情况,讨论 CXL 对数据中心格局的影响以及未来的发展方向。
{"title":"An Introduction to the Compute Express Link (CXL) Interconnect","authors":"Debendra Das Sharma, Robert Blankenship, Daniel Berger","doi":"10.1145/3669900","DOIUrl":"https://doi.org/10.1145/3669900","url":null,"abstract":"<p>The Compute Express Link (CXL) is an open industry-standard interconnect between processors and devices such as accelerators, memory buffers, smart network interfaces, persistent memory, and solid-state drives. CXL offers coherency and memory semantics with bandwidth that scales with PCIe bandwidth while achieving significantly lower latency than PCIe. All major CPU vendors, device vendors, and datacenter operators have adopted CXL as a common standard. This enables an inter-operable ecosystem that supports key computing use cases including highly efficient accelerators, server memory bandwidth and capacity expansion, multi-server resource pooling and sharing, and efficient peer-to-peer communication. This survey provides an introduction to CXL covering the standards CXL 1.0, CXL 2.0, and CXL 3.0. We further survey CXL implementations, discuss CXL's impact on the datacenter landscape, and future directions.</p>","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"34 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141319871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Computing Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1