首页 > 最新文献

ACM Computing Surveys最新文献

英文 中文
A Review on Trustworthiness of Digital Assistants for Personal Healthcare 个人医疗保健数字助理可信度研究综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-22 DOI: 10.1145/3714999
Tania Bailoni, Mauro Dragoni
Artificial Intelligence (AI) is widely used within the healthcare domain. One of the branches of digital health concerns the design and development of digital assistant solutions. AI-enabled digital assistants highlighted the need to be trustworthy given their intrusiveness within people’s lives. Such solutions aim to provide intelligent tools to ease the management of care pathways or to enhance the capabilities of healthcare organizations in deploying health prevention campaigns by monitoring the lifestyles of healthy people. In this work, we intend to analyze the recent literature concerning integrating AI techniques within digital assistants. We focused on the contribution published during the last ten years and we performed a careful analysis of whether and how trustworthy pillars have been addressed. We also discuss the risks of designing digital assistants without considering trustworthy pillars and present some recommendations to mitigate them.
人工智能(AI)广泛应用于医疗保健领域。数字健康的一个分支涉及数字助理解决方案的设计和开发。人工智能支持的数字助理强调了值得信赖的必要性,因为它们侵入了人们的生活。此类解决方案旨在提供智能工具,以简化护理路径的管理,或通过监测健康人的生活方式,增强医疗保健组织部署健康预防运动的能力。在这项工作中,我们打算分析有关在数字助理中集成人工智能技术的最新文献。我们专注于过去十年发表的贡献,并对是否以及如何解决值得信赖的支柱进行了仔细的分析。我们还讨论了在设计数字助理时没有考虑值得信赖的支柱的风险,并提出了一些减轻风险的建议。
{"title":"A Review on Trustworthiness of Digital Assistants for Personal Healthcare","authors":"Tania Bailoni, Mauro Dragoni","doi":"10.1145/3714999","DOIUrl":"https://doi.org/10.1145/3714999","url":null,"abstract":"Artificial Intelligence (AI) is widely used within the healthcare domain. One of the branches of digital health concerns the design and development of digital assistant solutions. AI-enabled digital assistants highlighted the need to be trustworthy given their intrusiveness within people’s lives. Such solutions aim to provide intelligent tools to ease the management of care pathways or to enhance the capabilities of healthcare organizations in deploying health prevention campaigns by monitoring the lifestyles of healthy people. In this work, we intend to analyze the recent literature concerning integrating AI techniques within digital assistants. We focused on the contribution published during the last ten years and we performed a careful analysis of whether and how trustworthy pillars have been addressed. We also discuss the risks of designing digital assistants without considering trustworthy pillars and present some recommendations to mitigate them.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"6 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143020701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Generative Models for Therapeutic Peptide Discovery: A Comprehensive Review 治疗性多肽发现的深度生成模型:综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-21 DOI: 10.1145/3714455
Leshan Lai, Yuansheng Liu, Bosheng Song, Keqin Li, Xiangxiang Zeng
Deep learning tools, especially deep generative models (DGMs), provide opportunities to accelerate and simplify the design of drugs. As drug candidates, peptides are superior to other biomolecules because they combine potency, selectivity, and low toxicity. This review examines the fundamental aspects of current DGMs for designing therapeutic peptide sequences. First, relevant databases in this field are introduced. Next, the current situation of data representation and where it can be optimized are discussed. Then, after introducing the basic principles and variants of diverse DGM algorithms, the applications of these methods to design and optimize peptides are stated. Finally, we present several challenges to devising a powerful model that can meet the requirements of learning the different biological properties of peptides, as well as future research directions to address these challenges.
深度学习工具,特别是深度生成模型(dgm),为加速和简化药物设计提供了机会。作为候选药物,肽优于其他生物分子,因为它们结合了效力、选择性和低毒性。本文综述了目前用于设计治疗性肽序列的dgm的基本方面。首先,介绍了该领域的相关数据库。接下来,讨论了数据表示的现状和可以优化的地方。然后,在介绍了各种DGM算法的基本原理和变体之后,阐述了这些方法在多肽设计和优化中的应用。最后,我们提出了几个挑战,设计一个强大的模型,可以满足学习肽的不同生物学特性的要求,以及未来的研究方向,以解决这些挑战。
{"title":"Deep Generative Models for Therapeutic Peptide Discovery: A Comprehensive Review","authors":"Leshan Lai, Yuansheng Liu, Bosheng Song, Keqin Li, Xiangxiang Zeng","doi":"10.1145/3714455","DOIUrl":"https://doi.org/10.1145/3714455","url":null,"abstract":"Deep learning tools, especially deep generative models (DGMs), provide opportunities to accelerate and simplify the design of drugs. As drug candidates, peptides are superior to other biomolecules because they combine potency, selectivity, and low toxicity. This review examines the fundamental aspects of current DGMs for designing therapeutic peptide sequences. First, relevant databases in this field are introduced. Next, the current situation of data representation and where it can be optimized are discussed. Then, after introducing the basic principles and variants of diverse DGM algorithms, the applications of these methods to design and optimize peptides are stated. Finally, we present several challenges to devising a powerful model that can meet the requirements of learning the different biological properties of peptides, as well as future research directions to address these challenges.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"45 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142992150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clustering on Attributed Graphs: From Single-view to Multi-view 属性图的聚类:从单视图到多视图
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-21 DOI: 10.1145/3714407
Mengyao Li, Zhibang Yang, Xu Zhou, Yixiang Fang, Kenli Li, Keqin Li
Attributed graphs with both topological information and node information have prevalent applications in the real world, including recommendation systems, biological networks, community analysis, and so on. Recently, with rapid development of information gathering and extraction technology, the sources of data become more extensive and multi-view data attracts growing attention. Consequently, attributed graphs can be divided into two categories: single-view attributed graphs and multi-view attributed graphs. Compared with single-view attributed graphs, multi-view attributed graphs can provide more complementary information but also pose challenges to fusing information of multi-views. Moreover, attributed graph clustering aims to reveal the inherent community structure of the graph, which is widely applied in fraud detection, crime recognition, and recommendation systems. Recently, numerous methods based on various ideas and techniques have appeared to cluster attributed graphs, thus there is an urgent need to summarize related methods. To this end, we make a timely and comprehensive review of recent methods. Furthermore, we provide a novel standard according to fusion results to classify related methods into three categories: Fusion on adjacency matrix methods, Fusion on embedding methods, and Model-based methods. Moreover, to conduct a comprehensive evaluation of existing methods, this paper evaluates these advanced methods with sufficient experimental results and theoretical analysis. Finally, we analyze the challenges and open opportunities to promote the future development of this field.
具有拓扑信息和节点信息的属性图在现实世界中有着广泛的应用,包括推荐系统、生物网络、社区分析等等。近年来,随着信息采集和提取技术的飞速发展,数据的来源越来越广泛,多视图数据越来越受到人们的关注。因此,属性图可以分为单视图属性图和多视图属性图两类。与单视图属性图相比,多视图属性图可以提供更多的互补信息,但也给多视图信息融合带来了挑战。此外,属性图聚类旨在揭示图的固有社区结构,广泛应用于欺诈检测、犯罪识别和推荐系统。近年来,基于各种思想和技术的属性图聚类方法层出不穷,迫切需要对相关方法进行总结。为此,我们对最近的方法进行了及时而全面的回顾。此外,我们还根据融合结果提出了一种新的标准,将相关方法分为三类:邻接矩阵融合方法、嵌入融合方法和基于模型的方法。此外,为了对现有方法进行综合评价,本文对这些先进的方法进行了充分的实验结果和理论分析。最后,分析了该领域未来发展面临的挑战和机遇。
{"title":"Clustering on Attributed Graphs: From Single-view to Multi-view","authors":"Mengyao Li, Zhibang Yang, Xu Zhou, Yixiang Fang, Kenli Li, Keqin Li","doi":"10.1145/3714407","DOIUrl":"https://doi.org/10.1145/3714407","url":null,"abstract":"Attributed graphs with both topological information and node information have prevalent applications in the real world, including recommendation systems, biological networks, community analysis, and so on. Recently, with rapid development of information gathering and extraction technology, the sources of data become more extensive and multi-view data attracts growing attention. Consequently, attributed graphs can be divided into two categories: single-view attributed graphs and multi-view attributed graphs. Compared with single-view attributed graphs, multi-view attributed graphs can provide more complementary information but also pose challenges to fusing information of multi-views. Moreover, attributed graph clustering aims to reveal the inherent community structure of the graph, which is widely applied in fraud detection, crime recognition, and recommendation systems. Recently, numerous methods based on various ideas and techniques have appeared to cluster attributed graphs, thus there is an urgent need to summarize related methods. To this end, we make a timely and comprehensive review of recent methods. Furthermore, we provide a novel standard according to fusion results to classify related methods into three categories: Fusion on adjacency matrix methods, Fusion on embedding methods, and Model-based methods. Moreover, to conduct a comprehensive evaluation of existing methods, this paper evaluates these advanced methods with sufficient experimental results and theoretical analysis. Finally, we analyze the challenges and open opportunities to promote the future development of this field.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"81 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142992149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Text Classification Using Graph Convolutional Networks: A Comprehensive Survey 使用图卷积网络的文本分类:一个全面的调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-21 DOI: 10.1145/3714456
Syed Mustafa Haider Rizvi, Ramsha Imran, Arif Mahmood
Text classification is a quintessential and practical problem in natural language processing with applications in diverse domains such as sentiment analysis, fake news detection, medical diagnosis, and document classification. A sizable body of recent works exists where researchers have studied and tackled text classification from different angles with varying degrees of success. Graph convolution network (GCN)-based approaches have gained a lot of traction in this domain over the last decade with many implementations achieving state-of-the-art performance in more recent literature and thus, warranting the need for an updated survey. This work aims to summarize and categorize various GCN-based Text Classification approaches with regard to the architecture and mode of supervision. It identifies their strengths and limitations and compares their performance on various benchmark datasets. We also discuss future research directions and the challenges that exist in this domain.
文本分类是自然语言处理中一个典型的实用问题,可应用于情感分析、假新闻检测、医疗诊断和文档分类等多个领域。近年来,研究人员从不同角度对文本分类进行了大量研究,并取得了不同程度的成功。在过去十年中,基于图卷积网络(GCN)的方法在这一领域获得了广泛的关注,许多实现方法在最近的文献中达到了最先进的性能,因此有必要对其进行更新调查。这项工作旨在总结和归类各种基于 GCN 的文本分类方法的架构和监督模式。它指出了这些方法的优势和局限性,并比较了它们在各种基准数据集上的性能。我们还讨论了该领域未来的研究方向和存在的挑战。
{"title":"Text Classification Using Graph Convolutional Networks: A Comprehensive Survey","authors":"Syed Mustafa Haider Rizvi, Ramsha Imran, Arif Mahmood","doi":"10.1145/3714456","DOIUrl":"https://doi.org/10.1145/3714456","url":null,"abstract":"Text classification is a quintessential and practical problem in natural language processing with applications in diverse domains such as sentiment analysis, fake news detection, medical diagnosis, and document classification. A sizable body of recent works exists where researchers have studied and tackled text classification from different angles with varying degrees of success. Graph convolution network (GCN)-based approaches have gained a lot of traction in this domain over the last decade with many implementations achieving state-of-the-art performance in more recent literature and thus, warranting the need for an updated survey. This work aims to summarize and categorize various GCN-based Text Classification approaches with regard to the architecture and mode of supervision. It identifies their strengths and limitations and compares their performance on various benchmark datasets. We also discuss future research directions and the challenges that exist in this domain.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"29 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142992151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recent Advances of Differential Privacy in Centralized Deep Learning: A Systematic Survey 集中式深度学习中差分隐私的最新进展:系统综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-21 DOI: 10.1145/3712000
Lea Demelius, Roman Kern, Andreas Trügler
Differential Privacy has become a widely popular method for data protection in machine learning, especially since it allows formulating strict mathematical privacy guarantees. This survey provides an overview of the state-of-the-art of differentially private centralized deep learning, thorough analyses of recent advances and open problems, as well as a discussion of potential future developments in the field. Based on a systematic literature review, the following topics are addressed: emerging application domains, differentially private generative models, auditing and evaluation methods for private models, protection against a broad range of threats and attacks, and improvements of privacy-utility trade-offs.
差分隐私已经成为机器学习中广泛流行的数据保护方法,特别是因为它允许制定严格的数学隐私保证。本调查概述了差异化私有集中式深度学习的最新进展,全面分析了最近的进展和开放的问题,并讨论了该领域未来的潜在发展。在系统文献综述的基础上,讨论了以下主题:新兴应用领域、差异私有生成模型、私有模型的审计和评估方法、针对广泛威胁和攻击的保护,以及隐私-效用权衡的改进。
{"title":"Recent Advances of Differential Privacy in Centralized Deep Learning: A Systematic Survey","authors":"Lea Demelius, Roman Kern, Andreas Trügler","doi":"10.1145/3712000","DOIUrl":"https://doi.org/10.1145/3712000","url":null,"abstract":"Differential Privacy has become a widely popular method for data protection in machine learning, especially since it allows formulating strict mathematical privacy guarantees. This survey provides an overview of the state-of-the-art of differentially private centralized deep learning, thorough analyses of recent advances and open problems, as well as a discussion of potential future developments in the field. Based on a systematic literature review, the following topics are addressed: emerging application domains, differentially private generative models, auditing and evaluation methods for private models, protection against a broad range of threats and attacks, and improvements of privacy-utility trade-offs.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"74 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143020696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward the Construction of Affective Brain-Computer Interface: A Systematic Review 情感脑机接口的构建:系统综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-20 DOI: 10.1145/3712259
Huayu Chen, Junxiang Li, Huanhuan He, Jing Zhu, Shuting Sun, Xiaowei Li, Bin Hu
Electroencephalogram(EEG)-based affective computing aims to recognize the emotional state, which is the core technology of affective brain-computer interface(aBCI). This concept encompasses aspects of physiological computing, human-computer interaction(HCI), mental health care, and brain-computer interfaces(BCI), presenting significant theoretical and practical value. However, the field reached a bottleneck stage due to EEG individual difference issues, causing various challenges to achieve a fundamental aBCI. In this review, we collected some representative works from 2019 to 2023. Combining the historical exploration process and research approaches of EEG-based emotion recognition, a comprehensive understand of current research status was conducted. Furthermore, we analyzed the main obstacles for emotion recognition modeling. To construct a reasonable aBCI, we envisioned the working scenarios, developmental stages, and key impact factors based on the existing EEG physiology knowledge. From the practical application perspective, we evaluated the theoretical significance, implementation difficulty, and real-world limitations of different approaches. By synthesizing the merits and drawbacks of various techniques, we proposed a theoretically feasible aBCI framework under the restrictions of real-world application scenarios. Finally, we suggested several research topics that have not been thoroughly investigated to broaden the research scope and accelerate the development of aBCIs.
基于脑电图的情感计算旨在识别情绪状态,是情感脑机接口(aBCI)的核心技术。这一概念涵盖了生理计算、人机交互(HCI)、精神卫生保健和脑机接口(BCI)等方面,具有重要的理论和实践价值。然而,由于EEG个体差异问题,该领域达到了瓶颈阶段,给实现基本的aBCI带来了各种挑战。在本次综述中,我们收集了2019年至2023年的一些代表性作品。结合基于脑电图的情感识别的历史探索过程和研究方法,对目前的研究现状进行了全面的了解。此外,我们还分析了情感识别建模的主要障碍。为了构建合理的aBCI,我们在现有脑电生理学知识的基础上,对aBCI的工作场景、发育阶段和关键影响因素进行了设想。从实际应用的角度,我们评估了不同方法的理论意义、实施难度和现实世界的局限性。在综合各种技术优缺点的基础上,提出了一个在实际应用场景限制下理论上可行的aBCI框架。最后,我们提出了几个尚未深入研究的研究课题,以拓宽研究范围,加快业务基础信息系统的发展。
{"title":"Toward the Construction of Affective Brain-Computer Interface: A Systematic Review","authors":"Huayu Chen, Junxiang Li, Huanhuan He, Jing Zhu, Shuting Sun, Xiaowei Li, Bin Hu","doi":"10.1145/3712259","DOIUrl":"https://doi.org/10.1145/3712259","url":null,"abstract":"Electroencephalogram(EEG)-based affective computing aims to recognize the emotional state, which is the core technology of affective brain-computer interface(aBCI). This concept encompasses aspects of physiological computing, human-computer interaction(HCI), mental health care, and brain-computer interfaces(BCI), presenting significant theoretical and practical value. However, the field reached a bottleneck stage due to EEG individual difference issues, causing various challenges to achieve a fundamental aBCI. In this review, we collected some representative works from 2019 to 2023. Combining the historical exploration process and research approaches of EEG-based emotion recognition, a comprehensive understand of current research status was conducted. Furthermore, we analyzed the main obstacles for emotion recognition modeling. To construct a reasonable aBCI, we envisioned the working scenarios, developmental stages, and key impact factors based on the existing EEG physiology knowledge. From the practical application perspective, we evaluated the theoretical significance, implementation difficulty, and real-world limitations of different approaches. By synthesizing the merits and drawbacks of various techniques, we proposed a theoretically feasible aBCI framework under the restrictions of real-world application scenarios. Finally, we suggested several research topics that have not been thoroughly investigated to broaden the research scope and accelerate the development of aBCIs.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"31 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142989903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey of Multimodal Learning: Methods, Applications, and Future 多模态学习研究:方法、应用与未来
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-18 DOI: 10.1145/3713070
Yuan Yuan, Zhaojian Li, Bin Zhao
The multimodal interplay of the five fundamental senses—Sight, Hearing, Smell, Taste, and Touch—provides humans with superior environmental perception and learning skills. Adapted from the human perceptual system, multimodal machine learning tries to incorporate different forms of input, such as image, audio, and text, and determine their fundamental connections through joint modeling. As one of the future development forms of artificial intelligence, it is necessary to summarize the progress of multimodal machine learning. In this paper, we start with the form of a multimodal combination and provide a comprehensive survey of the emerging subject of multimodal machine learning, covering representative research approaches, the most recent advancements, and their applications. Specifically, this paper analyzes the relationship between different modalities in detail and sorts out the key issues in multimodal research from the application scenarios. Besides, we thoroughly reviewed state-of-the-art methods and datasets covered in multimodal learning research. We then identify the substantial challenges and potential developing directions in this field. Finally, given the comprehensive nature of this survey, both modality-specific and task-specific researchers can benefit from this survey and advance the field.
五种基本感官(视觉、听觉、嗅觉、味觉和触觉)的多模态相互作用为人类提供了优越的环境感知和学习技能。多模态机器学习改编自人类感知系统,试图将不同形式的输入,如图像、音频和文本,并通过联合建模确定它们的基本联系。作为人工智能的未来发展形式之一,有必要对多模态机器学习的进展进行总结。在本文中,我们从多模态组合的形式开始,对多模态机器学习的新兴主题进行了全面的调查,涵盖了代表性的研究方法、最新进展及其应用。具体而言,本文详细分析了不同模态之间的关系,并从应用场景上梳理了多模态研究中的关键问题。此外,我们全面回顾了多模态学习研究中最先进的方法和数据集。然后,我们确定了该领域的重大挑战和潜在的发展方向。最后,鉴于这项调查的综合性,特定模式和特定任务的研究人员都可以从这项调查中受益,并推动该领域的发展。
{"title":"A Survey of Multimodal Learning: Methods, Applications, and Future","authors":"Yuan Yuan, Zhaojian Li, Bin Zhao","doi":"10.1145/3713070","DOIUrl":"https://doi.org/10.1145/3713070","url":null,"abstract":"The multimodal interplay of the five fundamental senses—Sight, Hearing, Smell, Taste, and Touch—provides humans with superior environmental perception and learning skills. Adapted from the human perceptual system, multimodal machine learning tries to incorporate different forms of input, such as image, audio, and text, and determine their fundamental connections through joint modeling. As one of the future development forms of artificial intelligence, it is necessary to summarize the progress of multimodal machine learning. In this paper, we start with the form of a multimodal combination and provide a comprehensive survey of the emerging subject of multimodal machine learning, covering representative research approaches, the most recent advancements, and their applications. Specifically, this paper analyzes the relationship between different modalities in detail and sorts out the key issues in multimodal research from the application scenarios. Besides, we thoroughly reviewed state-of-the-art methods and datasets covered in multimodal learning research. We then identify the substantial challenges and potential developing directions in this field. Finally, given the comprehensive nature of this survey, both modality-specific and task-specific researchers can benefit from this survey and advance the field.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"205 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142989095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Getting the Data in Shape for Your Process Mining Analysis: An In-Depth Analysis of the Pre-Analysis Stage 为您的流程挖掘分析获取数据:对预分析阶段的深入分析
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-18 DOI: 10.1145/3712587
Shameer K. Pradhan, Mieke Jans, Niels Martin
Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods that are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.
流程挖掘使组织能够分析存储在其信息系统中的数据,并获得有关其业务流程的见解。但是,需要将原始数据转换为可以提供给流程挖掘算法的格式。可以对原始数据执行各种预分析活动,例如去除缺陷或更改粒度级别。虽然分析前活动在过程挖掘中起着至关重要的作用,但目前对其范围和审查程度的概述有限。本研究对流程采矿项目的预分析活动进行了系统的文献综述。为了更好地理解这一阶段及其研究现状,我们探讨了哪些活动构成了预分析阶段,它们的目标,应用研究方法,拟议的研究成果以及用于评估研究成果的数据。我们确定了15个预分析活动和概念,例如,数据提取、生成和清理。我们还发现,设计科学研究是方法论和方法,是以往研究的主要研究成果。我们也意识到,大多数情况下,所提出的结果仅使用实际数据进行评估。这项研究表明,对预分析的研究是一个日益增长的领域感兴趣的过程采矿。
{"title":"Getting the Data in Shape for Your Process Mining Analysis: An In-Depth Analysis of the Pre-Analysis Stage","authors":"Shameer K. Pradhan, Mieke Jans, Niels Martin","doi":"10.1145/3712587","DOIUrl":"https://doi.org/10.1145/3712587","url":null,"abstract":"Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods that are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"15 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142989528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decoding Fake News and Hate Speech: A Survey of Explainable AI Techniques 解码假新闻和仇恨言论:可解释人工智能技术概览
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-17 DOI: 10.1145/3711123
Mikel Ngueajio, Saurav Aryal, Marcellin Atemkeng, Gloria Washington, Danda Rawat
This survey emphasizes the significance of Explainable AI (XAI) techniques in detecting hateful speech and misinformation/Fake news. It explores recent trends in detecting these phenomena, highlighting current research that reveals a synergistic relationship between them. Additionally, it presents recent trends in the use of XAI methods to mitigate the occurrences of hateful land Fake contents in conversations. The survey reviews state-of-the-art XAI approaches, algorithms, modeling datasets, as well as the evaluation metrics leveraged for assessing model interpretability, and thus provides a comprehensive summary table of the literature surveyed and relevant datasets. It concludes with an overview of key observations, offering insights into the prominent model explainability methods used in hate speech and misinformation detection. The research strengths, limitations are also presented, as well as perspectives and suggestions for future directions in this research domain.
这项调查强调了可解释人工智能(XAI)技术在检测仇恨言论和错误信息/假新闻方面的重要性。它探讨了检测这些现象的最新趋势,强调了揭示它们之间协同关系的当前研究。此外,它还介绍了使用XAI方法来减少对话中可恨的土地虚假内容发生的最新趋势。该调查回顾了最先进的XAI方法、算法、建模数据集,以及用于评估模型可解释性的评估指标,从而提供了调查文献和相关数据集的综合汇总表。最后概述了关键观察结果,提供了对仇恨言论和错误信息检测中使用的突出模型可解释性方法的见解。最后提出了本研究的优势和不足,并对未来的研究方向提出了展望和建议。
{"title":"Decoding Fake News and Hate Speech: A Survey of Explainable AI Techniques","authors":"Mikel Ngueajio, Saurav Aryal, Marcellin Atemkeng, Gloria Washington, Danda Rawat","doi":"10.1145/3711123","DOIUrl":"https://doi.org/10.1145/3711123","url":null,"abstract":"This survey emphasizes the significance of Explainable AI (XAI) techniques in detecting hateful speech and misinformation/Fake news. It explores recent trends in detecting these phenomena, highlighting current research that reveals a synergistic relationship between them. Additionally, it presents recent trends in the use of XAI methods to mitigate the occurrences of hateful land Fake contents in conversations. The survey reviews state-of-the-art XAI approaches, algorithms, modeling datasets, as well as the evaluation metrics leveraged for assessing model interpretability, and thus provides a comprehensive summary table of the literature surveyed and relevant datasets. It concludes with an overview of key observations, offering insights into the prominent model explainability methods used in hate speech and misinformation detection. The research strengths, limitations are also presented, as well as perspectives and suggestions for future directions in this research domain.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"30 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142989096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols 密集视频字幕:技术,数据集和评估协议的调查
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2025-01-14 DOI: 10.1145/3712059
Iqra Qasim, Alexander Horsch, Dilip Prasad
Untrimmed videos have interrelated events, dependencies, context, overlapping events, object-object interactions, domain specificity, and other semantics that are worth highlighting while describing a video in natural language. Owing to such a vast diversity, a single sentence can only correctly describe a portion of the video. Dense Video Captioning (DVC) aims to detect and describe different events in a given video. The term DVC originated in the 2017 ActivityNet challenge, after which considerable effort has been made to address the challenge. Dense Video Captioning is divided into three sub-tasks: (1) Video Feature Extraction (VFE), (2) Temporal Event Localization (TEL), and (3) Dense Caption Generation (DCG). In this survey, we discuss all the studies that claim to perform DVC along with its sub-tasks and summarize their results. We also discuss all the datasets that have been used for DVC. Lastly, current challenges in the field are highlighted along with observatory remarks and future trends in the field.
未经剪辑的视频具有相互关联的事件、依赖性、上下文、重叠事件、对象与对象之间的交互、领域特殊性以及其他语义,这些都是在用自然语言描述视频时值得强调的。由于存在如此巨大的多样性,一句话只能正确描述视频的一部分内容。密集视频字幕(DVC)旨在检测和描述给定视频中的不同事件。DVC 一词起源于 2017 年的 ActivityNet 挑战赛,之后人们为应对这一挑战做出了大量努力。密集视频字幕制作分为三个子任务:(1)视频特征提取(VFE);(2)时态事件定位(TEL);(3)密集字幕生成(DCG)。在本调查报告中,我们将讨论所有声称可以执行 DVC 及其子任务的研究,并总结其结果。我们还讨论了用于 DVC 的所有数据集。最后,我们强调了该领域当前面临的挑战,以及该领域的观察评论和未来趋势。
{"title":"Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols","authors":"Iqra Qasim, Alexander Horsch, Dilip Prasad","doi":"10.1145/3712059","DOIUrl":"https://doi.org/10.1145/3712059","url":null,"abstract":"Untrimmed videos have interrelated events, dependencies, context, overlapping events, object-object interactions, domain specificity, and other semantics that are worth highlighting while describing a video in natural language. Owing to such a vast diversity, a single sentence can only correctly describe a portion of the video. Dense Video Captioning (DVC) aims to detect and describe different events in a given video. The term DVC originated in the 2017 ActivityNet challenge, after which considerable effort has been made to address the challenge. Dense Video Captioning is divided into three sub-tasks: (1) Video Feature Extraction (VFE), (2) Temporal Event Localization (TEL), and (3) Dense Caption Generation (DCG). In this survey, we discuss all the studies that claim to perform DVC along with its sub-tasks and summarize their results. We also discuss all the datasets that have been used for DVC. Lastly, current challenges in the field are highlighted along with observatory remarks and future trends in the field.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"22 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Computing Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1