首页 > 最新文献

ACM Computing Surveys最新文献

英文 中文
Bias-Free? An Empirical Study on Ethnicity, Gender, and Age Fairness in Deepfake Detection 没有偏见吗?深度造假检测中种族、性别和年龄公平性的实证研究
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-09 DOI: 10.1145/3796544
Aditi Panda, Tanusree Ghosh, Tushar Choudhary, Ruchira Naskar
In this study, we evaluate potential demographic bias in state-of-the-art deepfake image detection models across three key attributes: age, ethnicity, and gender. Unlike prior works that retrain detectors or analyse forensic manipulations, we systematically assess multiple pretrained checkpoints of leading deepfake detectors, each trained on different datasets, to ensure an unbiased evaluation framework. Our experiments employ synthetic images generated by recent diffusion and autoregressive models, alongside real images from balanced datasets, to measure subgroup-specific detection performance. Results reveal no systematic bias across demographic categories—variations in accuracy and precision remain within small statistical margins across all detectors and checkpoints. We further provide a taxonomy of image generative models, highlighting their evolution from pixel-space to latent-space diffusion architectures, to contextualize the diversity of synthetic data used in our evaluation. Overall, our findings suggest that modern deepfake image detectors, when tested in a cross-demographic setting using pretrained checkpoints, exhibit robust and fair performance across age, ethnicity, and gender.
在本研究中,我们评估了最先进的深度假图像检测模型中三个关键属性的潜在人口统计学偏差:年龄、种族和性别。与之前重新训练检测器或分析法医操作的工作不同,我们系统地评估了领先深度假检测器的多个预训练检查点,每个检查点都在不同的数据集上进行了训练,以确保公正的评估框架。我们的实验使用由最近的扩散和自回归模型生成的合成图像,以及来自平衡数据集的真实图像,来测量亚群特定的检测性能。结果显示,在人口统计类别中没有系统性偏差——在所有检测器和检查点中,准确度和精度的变化仍然在很小的统计范围内。我们进一步提供了图像生成模型的分类,强调了它们从像素空间到潜在空间扩散架构的演变,以将我们评估中使用的合成数据的多样性背景化。总的来说,我们的研究结果表明,当使用预训练的检查点在跨人口统计环境中进行测试时,现代深度假图像检测器在年龄、种族和性别方面都表现出稳健和公平的表现。
{"title":"Bias-Free? An Empirical Study on Ethnicity, Gender, and Age Fairness in Deepfake Detection","authors":"Aditi Panda, Tanusree Ghosh, Tushar Choudhary, Ruchira Naskar","doi":"10.1145/3796544","DOIUrl":"https://doi.org/10.1145/3796544","url":null,"abstract":"In this study, we evaluate potential demographic bias in state-of-the-art deepfake image detection models across three key attributes: age, ethnicity, and gender. Unlike prior works that retrain detectors or analyse forensic manipulations, we systematically assess multiple pretrained checkpoints of leading deepfake detectors, each trained on different datasets, to ensure an unbiased evaluation framework. Our experiments employ synthetic images generated by recent diffusion and autoregressive models, alongside real images from balanced datasets, to measure subgroup-specific detection performance. Results reveal no systematic bias across demographic categories—variations in accuracy and precision remain within small statistical margins across all detectors and checkpoints. We further provide a taxonomy of image generative models, highlighting their evolution from pixel-space to latent-space diffusion architectures, to contextualize the diversity of synthetic data used in our evaluation. Overall, our findings suggest that modern deepfake image detectors, when tested in a cross-demographic setting using pretrained checkpoints, exhibit robust and fair performance across age, ethnicity, and gender.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"40 4 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Review of Information Uncertainty Modelling in Domain Ontologies 领域本体中信息不确定性建模研究综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-07 DOI: 10.1145/3794841
Deemah Alomair, Ridha Khedri, Wendy MacCaull
Domain ontologies are essential for representing and reasoning about knowledge, yet addressing information uncertainty within them remains challenging. This review surveys approaches to modelling information uncertainty in domain ontologies from 2010 to 2024. It categorizes modelling formalisms, identifies information uncertainty types, and analyzes how information uncertainty is integrated into ontology components. It reviews reasoning techniques and emerging methods, including Machine Learning and Natural Language Processing. The review examines languages, tools, and evaluation strategies. The purpose is to map the landscape of information uncertainty modelling in domain ontologies, highlight research gaps and trends, and provide structured guidance for selecting suitable approaches.
领域本体对于知识的表示和推理至关重要,但在其中处理信息不确定性仍然具有挑战性。本文综述了2010年至2024年领域本体中信息不确定性建模的方法。对建模形式进行了分类,识别了信息不确定性类型,并分析了信息不确定性如何集成到本体组件中。它回顾了推理技术和新兴方法,包括机器学习和自然语言处理。审查审查语言、工具和评估策略。目的是绘制领域本体中信息不确定性建模的景观,突出研究差距和趋势,并为选择合适的方法提供结构化指导。
{"title":"A Comprehensive Review of Information Uncertainty Modelling in Domain Ontologies","authors":"Deemah Alomair, Ridha Khedri, Wendy MacCaull","doi":"10.1145/3794841","DOIUrl":"https://doi.org/10.1145/3794841","url":null,"abstract":"Domain ontologies are essential for representing and reasoning about knowledge, yet addressing information uncertainty within them remains challenging. This review surveys approaches to modelling information uncertainty in domain ontologies from 2010 to 2024. It categorizes modelling formalisms, identifies information uncertainty types, and analyzes how information uncertainty is integrated into ontology components. It reviews reasoning techniques and emerging methods, including Machine Learning and Natural Language Processing. The review examines languages, tools, and evaluation strategies. The purpose is to map the landscape of information uncertainty modelling in domain ontologies, highlight research gaps and trends, and provide structured guidance for selecting suitable approaches.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"41 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146135118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalizability of Large Language Model-Based Agents: A Comprehensive Survey 基于大型语言模型的智能体的可泛化性综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-07 DOI: 10.1145/3794858
Minxing Zhang, Yi Yang, Roy Xie, Bhuwan Dhingra, Shuyan Zhou, Jian Pei
Large Language Model (LLM)-based agents have recently emerged as a new paradigm that extends the capabilities of LLMs beyond text generation to dynamic interaction with external environments. A critical challenge lies in ensuring their generalizability – the ability to maintain consistently high performance across varied instructions, tasks, environments, and domains, especially those different from the agent’s fine-tuning data. Despite growing interest, the concept of generalizability in LLM-based agents remains underdefined, and systematic approaches to measure and improve it are lacking. We provide the first comprehensive review of generalizability in LLM-based agents. We begin by clarifying the definition and boundaries of agent generalizability. We then review existing benchmarks. Next, we categorize strategies for improving generalizability into three groups: methods targeting the backbone LLM, targeting agent components, and targeting their interactions. Furthermore, we introduce the distinction between generalizable frameworks and generalizable agents and outline how generalizable frameworks can be translated into agent-level generalizability. Finally, we identify future directions, including the development of standardized evaluation frameworks, variance- and cost-based metrics, and hybrid approaches that integrate methodological innovations with agent architecture-level designs. We aim to establish a foundation for principled research on building LLM-based agents that generalize reliably across diverse real-world applications.
基于大型语言模型(LLM)的代理最近作为一种新的范例出现,它将LLM的能力从文本生成扩展到与外部环境的动态交互。一个关键的挑战在于确保它们的泛化性——能够在不同的指令、任务、环境和领域(特别是那些与代理的微调数据不同的领域)中保持一致的高性能。尽管人们越来越感兴趣,但在基于llm的代理中,泛化的概念仍然没有得到充分的定义,并且缺乏测量和改进它的系统方法。我们首次全面回顾了基于法学硕士的药物的普遍性。我们首先澄清智能体可泛化性的定义和边界。然后我们审查现有的基准。接下来,我们将提高泛化性的策略分为三组:针对主干LLM的方法,针对代理组件的方法,以及针对它们之间相互作用的方法。此外,我们还介绍了可一般化框架和可一般化代理之间的区别,并概述了如何将可一般化框架转换为代理级的一般化。最后,我们确定了未来的方向,包括标准化评估框架的发展,基于方差和成本的度量,以及将方法创新与代理体系结构级设计相结合的混合方法。我们的目标是为构建基于llm的代理的原则研究奠定基础,这些代理可以在不同的实际应用中可靠地进行推广。
{"title":"Generalizability of Large Language Model-Based Agents: A Comprehensive Survey","authors":"Minxing Zhang, Yi Yang, Roy Xie, Bhuwan Dhingra, Shuyan Zhou, Jian Pei","doi":"10.1145/3794858","DOIUrl":"https://doi.org/10.1145/3794858","url":null,"abstract":"Large Language Model (LLM)-based agents have recently emerged as a new paradigm that extends the capabilities of LLMs beyond text generation to dynamic interaction with external environments. A critical challenge lies in ensuring their <jats:italic toggle=\"yes\">generalizability</jats:italic> – the ability to maintain consistently high performance across varied instructions, tasks, environments, and domains, especially those different from the agent’s fine-tuning data. Despite growing interest, the concept of generalizability in LLM-based agents remains underdefined, and systematic approaches to measure and improve it are lacking. We provide the first comprehensive review of generalizability in LLM-based agents. We begin by clarifying the definition and boundaries of agent generalizability. We then review existing benchmarks. Next, we categorize strategies for improving generalizability into three groups: methods targeting the backbone LLM, targeting agent components, and targeting their interactions. Furthermore, we introduce the distinction between <jats:italic toggle=\"yes\">generalizable frameworks</jats:italic> and <jats:italic toggle=\"yes\">generalizable agents</jats:italic> and outline how generalizable frameworks can be translated into agent-level generalizability. Finally, we identify future directions, including the development of standardized evaluation frameworks, variance- and cost-based metrics, and hybrid approaches that integrate methodological innovations with agent architecture-level designs. We aim to establish a foundation for principled research on building LLM-based agents that generalize reliably across diverse real-world applications.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"244 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Meta-Analysis of Music Emotion Recognition Studies 音乐情绪识别研究的元分析
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-02-07 DOI: 10.1145/3796518
Tuomas Eerola, Cameron Anderson
This meta-analysis examines music emotion recognition (MER) models published between 2014 and 2024, focusing on predictions of valence, arousal, and categorical emotions. A total of 553 studies were identified, of which 96 full-text articles were assessed, resulting in a final review of 34 studies. These studies reported 290 models, including 86 for emotion classification and 204 for regression. Using the best-performing model from each study, we found that valence and arousal were predicted with reasonable accuracy (r = 0.67 and r = 0.81, respectively), while classification models achieved an accuracy of 0.87 as measured with Matthews correlation coefficient. Across modelling approaches, linear and tree-based methods generally outperformed neural networks in regression tasks, whereas neural networks and support vector machines (SVMs) showed highest performance in classification tasks. We highlight key recommendations for future MER research, emphasizing the need for greater transparency, feature validation, and standardized reporting to improve comparability across studies.
本荟萃分析考察了2014年至2024年间发表的音乐情绪识别(MER)模型,重点关注价态、唤醒和分类情绪的预测。总共确定了553项研究,对其中96篇全文文章进行了评估,最终对34项研究进行了审查。这些研究报告了290个模型,其中情绪分类模型86个,回归模型204个。使用各研究中表现最好的模型,我们发现效价和唤醒的预测准确率合理(r = 0.67和r = 0.81),而分类模型的马修斯相关系数测量的准确率为0.87。在建模方法中,线性和基于树的方法在回归任务中通常优于神经网络,而神经网络和支持向量机(svm)在分类任务中表现出最高的性能。我们强调了未来MER研究的关键建议,强调需要更大的透明度、特征验证和标准化报告,以提高研究之间的可比性。
{"title":"A Meta-Analysis of Music Emotion Recognition Studies","authors":"Tuomas Eerola, Cameron Anderson","doi":"10.1145/3796518","DOIUrl":"https://doi.org/10.1145/3796518","url":null,"abstract":"This meta-analysis examines music emotion recognition (MER) models published between 2014 and 2024, focusing on predictions of valence, arousal, and categorical emotions. A total of 553 studies were identified, of which 96 full-text articles were assessed, resulting in a final review of 34 studies. These studies reported 290 models, including 86 for emotion classification and 204 for regression. Using the best-performing model from each study, we found that valence and arousal were predicted with reasonable accuracy (r = 0.67 and r = 0.81, respectively), while classification models achieved an accuracy of 0.87 as measured with Matthews correlation coefficient. Across modelling approaches, linear and tree-based methods generally outperformed neural networks in regression tasks, whereas neural networks and support vector machines (SVMs) showed highest performance in classification tasks. We highlight key recommendations for future MER research, emphasizing the need for greater transparency, feature validation, and standardized reporting to improve comparability across studies.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"72 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Entity Linking with Wikidata: A Systematic Literature Review 维基数据实体链接:系统文献综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-31 DOI: 10.1145/3795134
Philipp Scharpf, Corinna Breitinger, Andreas Spitz, Norman Meuschke, André Greiner-Petter, Moritz Schubotz, Bela Gipp
This article provides a comprehensive systematic review of the literature on entity linking using Wikidata as the grounding knowledge base. Our review extends the scope of previous studies from two to eight dimensions of entity linking, which we classify into the following categories: definitions, tasks, types, domains, approaches, datasets, applications, and challenges. We find that datasets primarily address question-answering and news domains but underutilize Wikidata’s capabilities for hyper-relations, multilingualism, and time dependence. The research gaps we identify include the need for more robust datasets, hybrid methods combining rule-based and learning-based approaches, and improved handling of ambiguity, sparse entity types, data noise, and knowledge graph evolution.
本文以维基数据为基础知识库,对实体链接的相关文献进行了系统的综述。我们的综述将先前研究的范围从实体链接的两个维度扩展到八个维度,我们将其分为以下几类:定义、任务、类型、领域、方法、数据集、应用和挑战。我们发现数据集主要用于问答和新闻领域,但没有充分利用维基数据在超关系、多语言和时间依赖性方面的能力。我们发现的研究差距包括需要更健壮的数据集,结合基于规则和基于学习的方法的混合方法,以及改进对歧义、稀疏实体类型、数据噪声和知识图进化的处理。
{"title":"Entity Linking with Wikidata: A Systematic Literature Review","authors":"Philipp Scharpf, Corinna Breitinger, Andreas Spitz, Norman Meuschke, André Greiner-Petter, Moritz Schubotz, Bela Gipp","doi":"10.1145/3795134","DOIUrl":"https://doi.org/10.1145/3795134","url":null,"abstract":"This article provides a comprehensive systematic review of the literature on entity linking using Wikidata as the grounding knowledge base. Our review extends the scope of previous studies from two to eight dimensions of entity linking, which we classify into the following categories: definitions, tasks, types, domains, approaches, datasets, applications, and challenges. We find that datasets primarily address question-answering and news domains but underutilize Wikidata’s capabilities for hyper-relations, multilingualism, and time dependence. The research gaps we identify include the need for more robust datasets, hybrid methods combining rule-based and learning-based approaches, and improved handling of ambiguity, sparse entity types, data noise, and knowledge graph evolution.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"9 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146095841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Responsible AI Question Bank for Risk Assessment 负责风险评估的AI题库
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-29 DOI: 10.1145/3790096
Sung Une Lee, Harsha Perera, Yue Liu, Boming Xia, Qinghua Lu, Liming Zhu, Olivier Salvado, Jon Whittle
The rapid growth of AI underscores the need for responsible AI (RAI) practices. While many RAI checklists and frameworks exist, practitioners still struggle with how to use them in practice across roles and stages. We introduce the RAI Question Bank, a role- and lifecycle-tagged, evidence-oriented question set that simplifies interaction for executives, managers, and developers while preserving comprehensive coverage mapped to leading frameworks and regulations (e.g., EU AI Act). With comprehensive taxonomy and linkage between lower-level questions and higher-level themes, the Question Bank facilitates cohesive assessments. Two case studies show how it surfaces risks, prioritizes effort, and supports policy alignment.
人工智能的快速发展凸显了对负责任的人工智能(RAI)实践的需求。虽然存在许多RAI检查表和框架,但从业者仍然在努力解决如何在跨角色和阶段的实践中使用它们的问题。我们介绍了RAI题库,这是一个角色和生命周期标记的、面向证据的问题集,它简化了高管、经理和开发人员的交互,同时保留了对领先框架和法规(例如,EU AI Act)的全面覆盖。通过对较低层次问题和较高层次主题的综合分类和联系,题库促进了内聚性评估。两个案例研究展示了它是如何暴露风险、确定工作的优先级以及支持政策一致性的。
{"title":"Responsible AI Question Bank for Risk Assessment","authors":"Sung Une Lee, Harsha Perera, Yue Liu, Boming Xia, Qinghua Lu, Liming Zhu, Olivier Salvado, Jon Whittle","doi":"10.1145/3790096","DOIUrl":"https://doi.org/10.1145/3790096","url":null,"abstract":"The rapid growth of AI underscores the need for responsible AI (RAI) practices. While many RAI checklists and frameworks exist, practitioners still struggle with how to use them in practice across roles and stages. We introduce the RAI Question Bank, a role- and lifecycle-tagged, evidence-oriented question set that simplifies interaction for executives, managers, and developers while preserving comprehensive coverage mapped to leading frameworks and regulations (e.g., EU AI Act). With comprehensive taxonomy and linkage between lower-level questions and higher-level themes, the Question Bank facilitates cohesive assessments. Two case studies show how it surfaces risks, prioritizes effort, and supports policy alignment.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"2 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146072563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Transparent Time Series Analysis: Exploring Methods and Enhancing Interpretability 走向透明的时间序列分析:探索方法与提高可解释性
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-29 DOI: 10.1145/3794839
Youngjin Park, Anh Tong, Sehyun Lee, Jihyeon Seong, Qin Xie, Jaesik Choi
This paper presents a comprehensive cross-task analysis of time series methodologies, revealing fundamental connections that are often obscured by task-specific perspectives. Our contributions are fivefold. First, we introduce seven priority properties, along with exogenous integration, that characterize methodologies independent of application domain, enabling systematic comparison across traditional and modern approaches. Second, we classify neural architectures by transparency levels determined by two characteristics: parameter time-invariance and the explicitness of mathematical formulations. Locally time-invariant operations enable mechanistic understanding, but globally time-varying operations pose fundamental challenges to achieving it. Third, our hierarchical taxonomy guides the selection of methodologies. Fourth, we comparatively evaluate explanation methods by quantifying how closely they recover transparency, measuring explanation richness via breadth (granularity) and depth (mechanistic understanding): pointwise methods offer lower richness, component-level methods achieve medium richness, and concept-based methods achieve higher richness, sometimes at the cost of generalization. Finally, we identify an ongoing challenge from the absence of ground truth for temporal components and outline future research directions for time-varying modeling explanations. This survey provides methodological insights and practical frameworks in time series analysis.
本文介绍了时间序列方法的全面跨任务分析,揭示了通常被特定任务的观点所掩盖的基本联系。我们的贡献是五倍。首先,我们介绍了七个优先属性,以及外生集成,这些属性表征了独立于应用领域的方法,从而能够在传统方法和现代方法之间进行系统比较。其次,我们根据参数时不变性和数学公式的显式性两个特征确定的透明度级别对神经结构进行分类。局部时不变操作可以实现机械理解,但全局时不变操作对实现这一目标提出了根本性的挑战。第三,我们的分层分类法指导了方法的选择。第四,我们通过量化解释方法恢复透明度的程度来比较评估解释方法,通过广度(粒度)和深度(机制理解)来衡量解释丰富度:点式方法提供较低的丰富度,组件级方法实现中等丰富度,基于概念的方法实现较高的丰富度,有时以泛化为代价。最后,我们确定了时间分量缺乏基础真值的持续挑战,并概述了时变建模解释的未来研究方向。这项调查提供了时间序列分析的方法见解和实用框架。
{"title":"Towards Transparent Time Series Analysis: Exploring Methods and Enhancing Interpretability","authors":"Youngjin Park, Anh Tong, Sehyun Lee, Jihyeon Seong, Qin Xie, Jaesik Choi","doi":"10.1145/3794839","DOIUrl":"https://doi.org/10.1145/3794839","url":null,"abstract":"This paper presents a comprehensive cross-task analysis of time series methodologies, revealing fundamental connections that are often obscured by task-specific perspectives. Our contributions are fivefold. First, we introduce seven priority properties, along with exogenous integration, that characterize methodologies independent of application domain, enabling systematic comparison across traditional and modern approaches. Second, we classify neural architectures by transparency levels determined by two characteristics: parameter time-invariance and the explicitness of mathematical formulations. Locally time-invariant operations enable mechanistic understanding, but globally time-varying operations pose fundamental challenges to achieving it. Third, our hierarchical taxonomy guides the selection of methodologies. Fourth, we comparatively evaluate explanation methods by quantifying how closely they recover transparency, measuring explanation richness via breadth (granularity) and depth (mechanistic understanding): pointwise methods offer lower richness, component-level methods achieve medium richness, and concept-based methods achieve higher richness, sometimes at the cost of generalization. Finally, we identify an ongoing challenge from the absence of ground truth for temporal components and outline future research directions for time-varying modeling explanations. This survey provides methodological insights and practical frameworks in time series analysis.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"30 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146072562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal Inference with Complex Treatments: A Survey 复杂处理的因果推理:综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-27 DOI: 10.1145/3789499
Yingrong Wang, Haoxuan Li, Minqin Zhu, Anpeng Wu, Baohong Li, Keting Yin, Ruoxuan Xiong, Fei Wu, Kun Kuang
Causal inference plays an important role in explanatory analysis and decision-making across a wide range of fields, including statistics, marketing, healthcare, and education. Its core objective is to estimate treatment effects and inform intervention policies. Most existing work focuses on the binary treatment setting, where each unit is assigned to either treatment or control. In practice, however, treatments are often more complex, encompassing multi-valued, continuous, or bundle interventions. We refer to such settings as complex treatments. In this paper, we provide a systematic and comprehensive survey of causal inference methods for complex treatments. We first revisit the problem formulation, core assumptions, and their possible variations under different settings. We sequentially review the representative methods for multi-valued, continuous, and bundle treatments. Within each setting, we organize the methods into two broad categories: those that rely on the unconfoundedness assumption and those that address violations of this assumption. We further discuss the intrinsic relationships among these methods and the assumption verification. Finally, we summarize available benchmark datasets and open-source codes, and outline several directions for future research.
因果推理在包括统计、市场营销、医疗保健和教育在内的广泛领域的解释分析和决策中起着重要作用。其核心目标是评估治疗效果并为干预政策提供信息。大多数现有的工作侧重于二元治疗设置,其中每个单元被分配到治疗或控制。然而,实际上,治疗往往更复杂,包括多值、连续或捆绑干预。我们把这种情况称为复杂治疗。在本文中,我们提供了一个系统的和全面的调查因果推理方法的复杂治疗。我们首先回顾问题的表述,核心假设,以及它们在不同环境下的可能变化。我们依次回顾了多值处理、连续处理和束处理的代表性方法。在每个设置中,我们将方法分为两大类:那些依赖于非混淆假设的方法和那些解决违反该假设的方法。我们进一步讨论了这些方法之间的内在联系和假设的验证。最后,我们总结了现有的基准数据集和开源代码,并概述了未来研究的几个方向。
{"title":"Causal Inference with Complex Treatments: A Survey","authors":"Yingrong Wang, Haoxuan Li, Minqin Zhu, Anpeng Wu, Baohong Li, Keting Yin, Ruoxuan Xiong, Fei Wu, Kun Kuang","doi":"10.1145/3789499","DOIUrl":"https://doi.org/10.1145/3789499","url":null,"abstract":"Causal inference plays an important role in explanatory analysis and decision-making across a wide range of fields, including statistics, marketing, healthcare, and education. Its core objective is to estimate treatment effects and inform intervention policies. Most existing work focuses on the binary treatment setting, where each unit is assigned to either treatment or control. In practice, however, treatments are often more complex, encompassing multi-valued, continuous, or bundle interventions. We refer to such settings as complex treatments. In this paper, we provide a systematic and comprehensive survey of causal inference methods for complex treatments. We first revisit the problem formulation, core assumptions, and their possible variations under different settings. We sequentially review the representative methods for multi-valued, continuous, and bundle treatments. Within each setting, we organize the methods into two broad categories: those that rely on the <jats:italic toggle=\"yes\">unconfoundedness assumption</jats:italic> and those that address violations of this assumption. We further discuss the intrinsic relationships among these methods and the assumption verification. Finally, we summarize available benchmark datasets and open-source codes, and outline several directions for future research.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"178 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146056017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Confidential Computing on Heterogeneous CPU-GPU Systems: Survey and Future Directions 异构CPU-GPU系统的机密计算:综述与未来方向
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-26 DOI: 10.1145/3793532
Qifan Wang, David Oswald
In recent years, the widespread informatization and rapid data explosion have increased the demand for high-performance heterogeneous systems that integrate multiple computing cores such as CPUs, Graphics Processing Units (GPUs), Application Specific Integrated Circuits (ASICs), and Field Programmable Gate Arrays (FPGAs). The combination of CPU and GPU is particularly popular due to its versatility. However, these heterogeneous systems face significant security and privacy risks. Advances in privacy-preserving techniques, especially hardware-based Trusted Execution Environments (TEEs), offer effective protection for GPU applications. Nonetheless, the potential security risks involved in extending TEEs to GPUs in heterogeneous systems remain uncertain and need further investigation. To investigate these risks in depth, we study the existing popular GPU TEE designs and summarize and compare their key implications. Additionally, we review existing powerful attacks on GPUs and traditional TEEs deployed on CPUs, along with the efforts to mitigate these threats. We identify potential attack surfaces introduced by GPU TEEs and provide insights into key considerations for designing secure GPU TEEs. This survey is timely as new TEEs for heterogeneous systems, particularly GPUs, are being developed, highlighting the need to understand potential security threats and build both efficient and secure systems.
近年来,随着信息化的广泛发展和数据的快速爆炸,对cpu、图形处理器(gpu)、专用集成电路(asic)、现场可编程门阵列(fpga)等多计算核心集成的高性能异构系统的需求不断增加。由于其通用性,CPU和GPU的组合特别受欢迎。然而,这些异构系统面临着重大的安全和隐私风险。隐私保护技术的进步,特别是基于硬件的可信执行环境(tee),为GPU应用程序提供了有效的保护。尽管如此,将tee扩展到异构系统中的gpu所涉及的潜在安全风险仍然不确定,需要进一步调查。为了深入研究这些风险,我们研究了现有流行的GPU TEE设计,并总结和比较了它们的关键含义。此外,我们还回顾了针对gpu和部署在cpu上的传统tee的现有强大攻击,以及减轻这些威胁的努力。我们确定了GPU tee引入的潜在攻击面,并提供了设计安全GPU tee的关键考虑因素的见解。这项调查是及时的,因为异构系统(特别是gpu)的新tee正在开发中,强调需要了解潜在的安全威胁并构建高效和安全的系统。
{"title":"Confidential Computing on Heterogeneous CPU-GPU Systems: Survey and Future Directions","authors":"Qifan Wang, David Oswald","doi":"10.1145/3793532","DOIUrl":"https://doi.org/10.1145/3793532","url":null,"abstract":"In recent years, the widespread informatization and rapid data explosion have increased the demand for high-performance heterogeneous systems that integrate multiple computing cores such as CPUs, Graphics Processing Units (GPUs), Application Specific Integrated Circuits (ASICs), and Field Programmable Gate Arrays (FPGAs). The combination of CPU and GPU is particularly popular due to its versatility. However, these heterogeneous systems face significant security and privacy risks. Advances in privacy-preserving techniques, especially hardware-based Trusted Execution Environments (TEEs), offer effective protection for GPU applications. Nonetheless, the potential security risks involved in extending TEEs to GPUs in heterogeneous systems remain uncertain and need further investigation. To investigate these risks in depth, we study the existing popular GPU TEE designs and summarize and compare their key implications. Additionally, we review existing powerful attacks on GPUs and traditional TEEs deployed on CPUs, along with the efforts to mitigate these threats. We identify potential attack surfaces introduced by GPU TEEs and provide insights into key considerations for designing secure GPU TEEs. This survey is timely as new TEEs for heterogeneous systems, particularly GPUs, are being developed, highlighting the need to understand potential security threats and build both efficient and secure systems.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"41 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146048426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Systematic Literature Review of Healthcare Embedded Systems Using AI-based Biosignal Analysis 基于人工智能的生物信号分析的医疗嵌入式系统的系统文献综述
IF 16.6 1区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2026-01-26 DOI: 10.1145/3793669
Sumair Aziz, Girija Chetty, Roland Goecke, Raul Fernandez-Rojas
Healthcare Embedded Systems (HES) use biosensors to capture physiological data, analyse it with advanced algorithms, and provide timely alerts during emergencies. These systems enhance healthcare delivery by supporting diagnosis, early symptom detection, and disease prediction. Despite extensive research on data analysis techniques in healthcare, selecting real-time methods for specific embedded hardware remains challenging. This review aims to summarise and synthesise existing literature to: (a) identify the healthcare challenges addressed by HES and the types of biosignals employed, (b) explore the embedded platforms utilised for implementing HES, and (c) examine the data analysis techniques used for real-time HES applications. A systematic search across three electronic databases (2015-2024), identified 50 relevant studies. These studies span various application domains, biosensing modalities, feature extraction methods, and machine learning and deep learning techniques. Raspberry Pi single-board computers emerged as the most popular embedded platform for implementing AI-based HES. Deep learning, especially convolutional neural networks, dominated, with cardiac health as the primary focus. While the reviewed studies demonstrate promising results, they are often constrained by specific experimental contexts. This review offers a comprehensive overview of real-time data analysis in HES and highlights key opportunities for future research to advance the field.
医疗保健嵌入式系统(HES)使用生物传感器捕获生理数据,用高级算法进行分析,并在紧急情况下提供及时警报。这些系统通过支持诊断、早期症状检测和疾病预测来增强医疗保健服务。尽管对医疗保健中的数据分析技术进行了广泛的研究,但为特定的嵌入式硬件选择实时方法仍然具有挑战性。本综述旨在总结和综合现有文献,以:(a)确定HES解决的医疗挑战和所采用的生物信号类型,(b)探索用于实施HES的嵌入式平台,以及(c)检查用于实时HES应用的数据分析技术。通过对三个电子数据库(2015-2024)的系统搜索,确定了50项相关研究。这些研究跨越了不同的应用领域,生物传感模式,特征提取方法,机器学习和深度学习技术。树莓派单板计算机成为实现基于人工智能的HES的最流行的嵌入式平台。深度学习,尤其是卷积神经网络占主导地位,心脏健康是主要焦点。虽然审查的研究显示出有希望的结果,但它们往往受到特定实验背景的限制。这篇综述全面概述了HES中的实时数据分析,并强调了未来研究推进该领域的关键机会。
{"title":"A Systematic Literature Review of Healthcare Embedded Systems Using AI-based Biosignal Analysis","authors":"Sumair Aziz, Girija Chetty, Roland Goecke, Raul Fernandez-Rojas","doi":"10.1145/3793669","DOIUrl":"https://doi.org/10.1145/3793669","url":null,"abstract":"Healthcare Embedded Systems (HES) use biosensors to capture physiological data, analyse it with advanced algorithms, and provide timely alerts during emergencies. These systems enhance healthcare delivery by supporting diagnosis, early symptom detection, and disease prediction. Despite extensive research on data analysis techniques in healthcare, selecting real-time methods for specific embedded hardware remains challenging. This review aims to summarise and synthesise existing literature to: (a) identify the healthcare challenges addressed by HES and the types of biosignals employed, (b) explore the embedded platforms utilised for implementing HES, and (c) examine the data analysis techniques used for real-time HES applications. A systematic search across three electronic databases (2015-2024), identified 50 relevant studies. These studies span various application domains, biosensing modalities, feature extraction methods, and machine learning and deep learning techniques. Raspberry Pi single-board computers emerged as the most popular embedded platform for implementing AI-based HES. Deep learning, especially convolutional neural networks, dominated, with cardiac health as the primary focus. While the reviewed studies demonstrate promising results, they are often constrained by specific experimental contexts. This review offers a comprehensive overview of real-time data analysis in HES and highlights key opportunities for future research to advance the field.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"5 1","pages":""},"PeriodicalIF":16.6,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146048424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Computing Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1