首页 > 最新文献

Ai Magazine最新文献

英文 中文
Effective knowledge representation and utilization for sustainable collaborative learning across heterogeneous systems 有效的知识表示和利用,促进跨异构系统的可持续协作学习
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12193
Trong Nghia Hoang

The increasingly decentralized and private nature of data in our digital society has motivated the development of collaborative intelligent systems that enable knowledge aggregation among data owners. However, collaborative learning has only been investigated in simple settings. For example, clients are often assumed to train solution models de novo, disregarding all prior expertise. The learned model is typically represented in task-specific forms that are not generalizable to unseen, emerging scenarios. Finally, a universal model representation is enforced among collaborators, ignoring their local compute constraints or input representations. These limitations hampers the practicality of prior collaborative systems in learning scenarios with limited task data that demand constant knowledge adaptation and transfer across information silos, tasks, and learning models, as well as the utilization of prior solution expertise. Furthermore, prior collaborative learning frameworks are not sustainable on a macro scale where participants desire fairness allocation of benefits (e.g., access to the combined model) based on their costs of participation (e.g., overhead of model sharing and training synchronization, risk of information breaches, etc.). This necessitates a new perspective of collaborative learning where the server not only aggregates but also conducts valuation of the participant's contribution, and distribute aggregated information to individuals in commensurate to their contribution. To substantiate the above vision, we propose a new research agenda on developing effective and sustainable collaborative learning frameworks across heterogeneous systems, featuring three novel computational capabilities on knowledge organization: model expression, comprehension, and valuation.

在我们的数字社会中,数据的分散性和私密性越来越强,这促使人们开发能够在数据所有者之间聚合知识的协作智能系统。然而,人们只在简单的环境中研究过协作学习。例如,通常假设客户从头开始训练解决方案模型,而不考虑所有先前的专业知识。学习到的模型通常以特定任务的形式表示,无法推广到未见过的新兴场景中。最后,合作者之间强制使用通用模型表示法,忽略了他们的本地计算约束或输入表示法。这些局限性妨碍了先前的协作系统在任务数据有限的学习场景中的实用性,因为这种场景需要在信息孤岛、任务和学习模型之间不断进行知识调整和转移,并需要利用先前的解决方案专长。此外,先前的协作学习框架在宏观上是不可持续的,因为参与者希望根据他们的参与成本(如模型共享和培训同步的开销、信息泄露的风险等)公平分配利益(如访问组合模型)。这就需要一种新的协作学习视角,即服务器不仅要汇总信息,还要对参与者的贡献进行评估,并将汇总信息按贡献分配给个人。为了证实上述愿景,我们提出了一个新的研究议程,即在异构系统中开发有效、可持续的协作学习框架,其中包括三种新的知识组织计算能力:模型表达、理解和评估。
{"title":"Effective knowledge representation and utilization for sustainable collaborative learning across heterogeneous systems","authors":"Trong Nghia Hoang","doi":"10.1002/aaai.12193","DOIUrl":"https://doi.org/10.1002/aaai.12193","url":null,"abstract":"<p>The increasingly decentralized and private nature of data in our digital society has motivated the development of collaborative intelligent systems that enable knowledge aggregation among data owners. However, collaborative learning has only been investigated in simple settings. For example, clients are often assumed to train solution models <i>de novo</i>, disregarding all prior expertise. The learned model is typically represented in task-specific forms that are not generalizable to unseen, emerging scenarios. Finally, a universal model representation is enforced among collaborators, ignoring their local compute constraints or input representations. These limitations hampers the practicality of prior collaborative systems in learning scenarios with limited task data that demand constant knowledge adaptation and transfer across information silos, tasks, and learning models, as well as the utilization of prior solution expertise. Furthermore, prior collaborative learning frameworks are not sustainable on a macro scale where participants desire fairness allocation of benefits (e.g., access to the combined model) based on their costs of participation (e.g., overhead of model sharing and training synchronization, risk of information breaches, etc.). This necessitates a new perspective of collaborative learning where the server not only aggregates but also conducts valuation of the participant's contribution, and distribute aggregated information to individuals in commensurate to their contribution. To substantiate the above vision, we propose a new research agenda on developing effective and sustainable collaborative learning frameworks across heterogeneous systems, featuring three novel computational capabilities on knowledge organization: model expression, comprehension, and valuation.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"404-410"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12193","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fair and optimal prediction via post-processing 通过后处理实现公平和优化预测
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12191
Han Zhao

With the development of machine learning algorithms and the increasing computational resources available, artificial intelligence has achieved great success in many application domains. However, the success of machine learning has also raised concerns about the fairness of the learned models. For instance, the learned models can perpetuate and even exacerbate the potential bias and discrimination in the training data. This issue has become a major obstacle to the deployment of machine learning systems in high-stakes domains, for example, criminal judgment, medical testing, online advertising, hiring process, and so forth. To mitigate the potential bias exhibited by machine learning models, fairness criteria can be integrated into the training process to ensure fair treatment across all demographics, but it often comes at the expense of model performance. Understanding such tradeoffs, therefore, is crucial to the design of optimal and fair algorithms. My research focuses on characterizing the inherent tradeoff between fairness and accuracy in machine learning, and developing algorithms that can achieve both fairness and optimality. In this article, I will discuss our recent work on designing post-processing algorithms for fair classification, which can be applied to a wide range of fairness criteria, including statistical parity, equal opportunity, and equalized odds, under both attribute-aware and attribute-blind settings, and is particularly suited to large-scale foundation models where retraining is expensive or even infeasible. I will also discuss the connections between our work and other related research on trustworthy machine learning, including the connections between algorithmic fairness and differential privacy as well as adversarial robustness.

随着机器学习算法的发展和可用计算资源的不断增加,人工智能在许多应用领域取得了巨大成功。然而,机器学习的成功也引发了人们对所学模型公平性的担忧。例如,学习到的模型可能会延续甚至加剧训练数据中潜在的偏见和歧视。这一问题已成为机器学习系统在刑事判决、医疗测试、在线广告、招聘流程等高风险领域部署的主要障碍。为了减轻机器学习模型可能表现出的偏差,可以将公平性标准集成到训练过程中,以确保公平对待所有人群,但这往往以牺牲模型性能为代价。因此,了解这种权衡对于设计最佳公平算法至关重要。我的研究重点是描述机器学习中公平性和准确性之间固有的权衡,并开发能同时实现公平性和最优性的算法。在本文中,我将讨论我们最近在设计公平分类的后处理算法方面所做的工作,该算法可在属性感知和属性盲设置下应用于广泛的公平标准,包括统计均等、机会均等和赔率均等,尤其适用于重新训练成本高昂甚至不可行的大规模基础模型。我还将讨论我们的工作与其他可信机器学习相关研究之间的联系,包括算法公平性与差异隐私以及对抗鲁棒性之间的联系。
{"title":"Fair and optimal prediction via post-processing","authors":"Han Zhao","doi":"10.1002/aaai.12191","DOIUrl":"https://doi.org/10.1002/aaai.12191","url":null,"abstract":"<p>With the development of machine learning algorithms and the increasing computational resources available, artificial intelligence has achieved great success in many application domains. However, the success of machine learning has also raised concerns about the <i>fairness</i> of the learned models. For instance, the learned models can perpetuate and even exacerbate the potential bias and discrimination in the training data. This issue has become a major obstacle to the deployment of machine learning systems in high-stakes domains, for example, criminal judgment, medical testing, online advertising, hiring process, and so forth. To mitigate the potential bias exhibited by machine learning models, fairness criteria can be integrated into the training process to ensure fair treatment across all demographics, but it often comes at the expense of model performance. Understanding such tradeoffs, therefore, is crucial to the design of optimal and fair algorithms. My research focuses on characterizing the inherent tradeoff between fairness and accuracy in machine learning, and developing algorithms that can achieve both fairness and optimality. In this article, I will discuss our recent work on designing post-processing algorithms for fair classification, which can be applied to a wide range of fairness criteria, including statistical parity, equal opportunity, and equalized odds, under both attribute-aware and attribute-blind settings, and is particularly suited to large-scale foundation models where retraining is expensive or even infeasible. I will also discuss the connections between our work and other related research on trustworthy machine learning, including the connections between algorithmic fairness and differential privacy as well as adversarial robustness.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"411-418"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12191","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient and robust sequential decision making algorithms 高效稳健的顺序决策算法
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12186
Pan Xu

Sequential decision-making involves making informed decisions based on continuous interactions with a complex environment. This process is ubiquitous in various applications, including recommendation systems and clinical treatment design. My research has concentrated on addressing two pivotal challenges in sequential decision-making: (1) How can we design algorithms that efficiently learn the optimal decision strategy with minimal interactions and limited sample data? (2) How can we ensure robustness in decision-making algorithms when faced with distributional shifts due to environmental changes and the sim-to-real gap? This paper summarizes and expands upon the talk I presented at the AAAI 2024 New Faculty Highlights program, detailing how my research aims to tackle these challenges.

顺序决策涉及根据与复杂环境的持续互动做出明智的决定。这一过程在各种应用中无处不在,包括推荐系统和临床治疗设计。我的研究集中于解决顺序决策中的两个关键挑战:(1) 我们如何设计算法,在最小的交互和有限的样本数据中高效地学习最优决策策略?(2) 面对环境变化和模拟与实际差距造成的分布变化,我们如何确保决策算法的稳健性?本文总结并扩展了我在 AAAI 2024 新教师亮点计划中的演讲,详细介绍了我的研究如何应对这些挑战。
{"title":"Efficient and robust sequential decision making algorithms","authors":"Pan Xu","doi":"10.1002/aaai.12186","DOIUrl":"https://doi.org/10.1002/aaai.12186","url":null,"abstract":"<p>Sequential decision-making involves making informed decisions based on continuous interactions with a complex environment. This process is ubiquitous in various applications, including recommendation systems and clinical treatment design. My research has concentrated on addressing two pivotal challenges in sequential decision-making: (1) How can we design algorithms that efficiently learn the optimal decision strategy with minimal interactions and limited sample data? (2) How can we ensure robustness in decision-making algorithms when faced with distributional shifts due to environmental changes and the sim-to-real gap? This paper summarizes and expands upon the talk I presented at the AAAI 2024 New Faculty Highlights program, detailing how my research aims to tackle these challenges.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"376-385"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12186","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI fairness in practice: Paradigm, challenges, and prospects 实践中的人工智能公平性:范式、挑战和前景
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12189
Wenbin Zhang

Understanding and correcting algorithmic bias in artificial intelligence (AI) has become increasingly important, leading to a surge in research on AI fairness within both the AI community and broader society. Traditionally, this research operates within the constrained supervised learning paradigm, assuming the presence of class labels, independent and identically distributed (IID) data, and batch-based learning necessitating the simultaneous availability of all training data. However, in practice, class labels may be absent due to censoring, data is often represented using non-IID graph structures that capture connections among individual units, and data can arrive and evolve over time. These prevalent real-world data representations limit the applicability of existing fairness literature, which typically addresses fairness in static and tabular supervised learning settings. This paper reviews recent advances in AI fairness aimed at bridging these gaps for practical deployment in real-world scenarios. Additionally, opportunities are envisioned by highlighting the limitations and significant potential for real applications.

了解和纠正人工智能(AI)中的算法偏差已变得越来越重要,这导致人工智能界和更广泛的社会对人工智能公平性的研究激增。传统上,这项研究是在受限的监督学习范式下进行的,假定存在类标签、独立且同分布(IID)的数据,以及基于批量的学习(必须同时提供所有训练数据)。然而,在实践中,由于删减的原因,类标签可能不存在,数据通常使用非独立且同分布(IID)的图结构来表示,以捕捉单个单元之间的联系,而且数据可能随着时间的推移而到达和演变。这些现实世界中普遍存在的数据表示方式限制了现有公平性文献的适用性,因为这些文献通常涉及静态和表格监督学习环境中的公平性问题。本文回顾了人工智能公平性方面的最新进展,旨在缩小这些差距,以便在现实世界场景中进行实际部署。此外,本文还通过强调实际应用的局限性和巨大潜力,展望了各种机遇。
{"title":"AI fairness in practice: Paradigm, challenges, and prospects","authors":"Wenbin Zhang","doi":"10.1002/aaai.12189","DOIUrl":"https://doi.org/10.1002/aaai.12189","url":null,"abstract":"<p>Understanding and correcting algorithmic bias in artificial intelligence (AI) has become increasingly important, leading to a surge in research on AI fairness within both the AI community and broader society. Traditionally, this research operates within the constrained supervised learning paradigm, assuming the presence of class labels, independent and identically distributed (IID) data, and batch-based learning necessitating the simultaneous availability of all training data. However, in practice, class labels may be absent due to censoring, data is often represented using non-IID graph structures that capture connections among individual units, and data can arrive and evolve over time. These prevalent real-world data representations limit the applicability of existing fairness literature, which typically addresses fairness in static and tabular supervised learning settings. This paper reviews recent advances in AI fairness aimed at bridging these gaps for practical deployment in real-world scenarios. Additionally, opportunities are envisioned by highlighting the limitations and significant potential for real applications.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"386-395"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12189","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards smooth mobile robot deployments in dynamic human environments 在动态人类环境中顺利部署移动机器人
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12192
Christoforos Mavrogiannis

Recently, there has been great interest in deploying autonomous mobile robots in airports, malls, and hospitals to complete a range of tasks such as delivery, cleaning, and patrolling. The rich context of these environments gives rise to highly unstructured motion that is challenging for robots to anticipate and adapt to. This results in uncomfortable and unsafe human–robot encounters, poor robot performance, and even catastrophic failures that hinder robot acceptance. Such observations have motivated my work on social robot navigation, the problem of enabling robots to navigate in human environments while accounting for human safety and comfort. In this article, I highlight prior work on expanding the classical autonomy stack with mathematical models and algorithms designed to contribute towards smoother mobile robot deployments in complex environments.

最近,人们对在机场、商场和医院部署自主移动机器人以完成一系列任务(如送货、清洁和巡逻)产生了浓厚的兴趣。这些环境的丰富背景导致了高度非结构化的运动,对机器人的预测和适应能力提出了挑战。这就导致了不舒适和不安全的人机交互、糟糕的机器人性能,甚至是阻碍机器人被接受的灾难性故障。这些观察结果激发了我在社交机器人导航方面的研究,即如何让机器人在人类环境中导航,同时考虑到人类的安全和舒适。在这篇文章中,我将重点介绍之前的工作,即利用数学模型和算法扩展经典的自主性堆栈,以促进移动机器人在复杂环境中更顺利地部署。
{"title":"Towards smooth mobile robot deployments in dynamic human environments","authors":"Christoforos Mavrogiannis","doi":"10.1002/aaai.12192","DOIUrl":"https://doi.org/10.1002/aaai.12192","url":null,"abstract":"<p>Recently, there has been great interest in deploying autonomous mobile robots in airports, malls, and hospitals to complete a range of tasks such as delivery, cleaning, and patrolling. The rich context of these environments gives rise to highly unstructured motion that is challenging for robots to anticipate and adapt to. This results in uncomfortable and unsafe human–robot encounters, poor robot performance, and even catastrophic failures that hinder robot acceptance. Such observations have motivated my work on social robot navigation, the problem of enabling robots to navigate in human environments while accounting for human safety and comfort. In this article, I highlight prior work on expanding the classical autonomy stack with mathematical models and algorithms designed to contribute towards smoother mobile robot deployments in complex environments.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"419-428"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12192","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward the confident deployment of real-world reinforcement learning agents 自信地部署现实世界中的强化学习代理
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12190
Josiah P. Hanna

Intelligent learning agents must be able to learn from experience so as to accomplish tasks that require more ability than could be initially programmed. Reinforcement learning (RL) has emerged as a potentially powerful class of solution methods to create agents that learn from trial-and-error interaction with the world. Despite many prominent success stories, a number of challenges often stand between the use of RL in real-world problems. As part of the AAAI New Faculty Highlight Program, in this article, I will describe the work that my group is doing at the University of Wisconsin—Madison with the intent to remove barriers to the use of RL in practice. Specifically, I will describe recent work that aims to give practitioners confidence in learned behaviors, methods to increase the data efficiency of RL, and work on “challenge” domains that stress RL algorithms beyond current testbeds.

智能学习代理必须能够从经验中学习,从而完成需要比最初编程能力更强的任务。强化学习(RL)已成为一类潜在的强大解决方法,用于创建从与世界的试错互动中学习的代理。尽管有许多突出的成功案例,但在现实世界的问题中使用强化学习往往面临着许多挑战。作为 AAAI 新教师亮点计划的一部分,我将在本文中介绍我所在的威斯康星大学麦迪逊分校的研究小组正在开展的工作,目的是消除在实践中使用 RL 的障碍。具体来说,我将介绍最近的工作,这些工作旨在让实践者对学习到的行为有信心、提高 RL 数据效率的方法,以及在 "挑战 "领域的工作,这些领域对 RL 算法的压力超出了当前的测试平台。
{"title":"Toward the confident deployment of real-world reinforcement learning agents","authors":"Josiah P. Hanna","doi":"10.1002/aaai.12190","DOIUrl":"https://doi.org/10.1002/aaai.12190","url":null,"abstract":"<p>Intelligent learning agents must be able to learn from experience so as to accomplish tasks that require more ability than could be initially programmed. Reinforcement learning (RL) has emerged as a potentially powerful class of solution methods to create agents that learn from trial-and-error interaction with the world. Despite many prominent success stories, a number of challenges often stand between the use of RL in real-world problems. As part of the AAAI New Faculty Highlight Program, in this article, I will describe the work that my group is doing at the University of Wisconsin—Madison with the intent to remove barriers to the use of RL in practice. Specifically, I will describe recent work that aims to give practitioners confidence in learned behaviors, methods to increase the data efficiency of RL, and work on “challenge” domains that stress RL algorithms beyond current testbeds.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"396-403"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12190","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards robust visual understanding: A paradigm shift in computer vision from recognition to reasoning 实现稳健的视觉理解:计算机视觉从识别到推理的范式转变
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-22 DOI: 10.1002/aaai.12194
Tejas Gokhale

Models that learn from data are widely and rapidly being deployed today for real-world use, but they suffer from unforeseen failures that limit their reliability. These failures often have several causes such as distribution shift; adversarial attacks; calibration errors; scarcity of data and/or ground-truth labels; noisy, corrupted, or partial data; and limitations of evaluation metrics. But many failures also occur because many modern AI tasks require reasoning beyond pattern matching and such reasoning abilities are difficult to formulate as data-based input–output function fitting. The reliability problem has become increasingly important under the new paradigm of semantic “multimodal” learning. In this article, I will discuss findings from our work to provide avenues for the development of robust and reliable computer vision systems, particularly by leveraging the interactions between vision and language. This article expands upon the invited talk at AAAI 2024 and covers three thematic areas: robustness of visual recognition systems, open-domain reliability for visual reasoning, and challenges and opportunities associated with generative models in vision.

如今,从数据中学习的模型正被广泛、快速地部署到现实世界中使用,但这些模型会出现不可预见的故障,从而限制了其可靠性。这些故障通常有几个原因,如分布偏移;对抗性攻击;校准错误;数据和/或地面实况标签稀缺;数据嘈杂、损坏或不完整;以及评估指标的局限性。但是,许多失败的原因还在于,许多现代人工智能任务需要进行模式匹配之外的推理,而这种推理能力很难表述为基于数据的输入输出函数拟合。在语义 "多模态 "学习的新范式下,可靠性问题变得越来越重要。在本文中,我将讨论我们的研究成果,为开发稳健可靠的计算机视觉系统提供途径,特别是通过利用视觉与语言之间的互动。本文是对 2024 年 AAAI 大会特邀演讲的进一步阐述,涵盖三个主题领域:视觉识别系统的鲁棒性、视觉推理的开放域可靠性以及与视觉生成模型相关的挑战和机遇。
{"title":"Towards robust visual understanding: A paradigm shift in computer vision from recognition to reasoning","authors":"Tejas Gokhale","doi":"10.1002/aaai.12194","DOIUrl":"https://doi.org/10.1002/aaai.12194","url":null,"abstract":"<p>Models that learn from data are widely and rapidly being deployed today for real-world use, but they suffer from unforeseen failures that limit their reliability. These failures often have several causes such as distribution shift; adversarial attacks; calibration errors; scarcity of data and/or ground-truth labels; noisy, corrupted, or partial data; and limitations of evaluation metrics. But many failures also occur because many modern AI tasks require reasoning beyond pattern matching and such reasoning abilities are difficult to formulate as data-based input–output function fitting. The reliability problem has become increasingly important under the new paradigm of semantic “multimodal” learning. In this article, I will discuss findings from our work to provide avenues for the development of robust and reliable computer vision systems, particularly by leveraging the interactions between vision and language. This article expands upon the invited talk at AAAI 2024 and covers three thematic areas: robustness of visual recognition systems, open-domain reliability for visual reasoning, and challenges and opportunities associated with generative models in vision.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"429-435"},"PeriodicalIF":2.5,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12194","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Better environments for better AI 更好的环境造就更好的人工智能
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-07 DOI: 10.1002/aaai.12187
Sarah Keren

Most AI research focuses exclusively on the AI agent itself, that is, given some input, what are the improvements to the agent's reasoning that will yield the best possible output? In my research, I take a novel approach to increasing the capabilities of AI agents via the use of AI to design the environments in which they are intended to act. My methods identify the inherent capabilities and limitations of AI agents and find the best way to modify their environment in order to maximize performance. With this agenda in mind, I describe here several research projects that vary in their objective, in the AI methodologies that are applied for finding optimal designs, and in the real-world applications to which they correspond. I also discuss how the different projects fit within my overarching objective of using AI to promote effective multi-agent collaboration and to enhance the way robots and machines interact with humans.

大多数人工智能研究只关注人工智能代理本身,也就是说,在给定输入的情况下,如何改进代理的推理才能产生最佳输出?在我的研究中,我采用了一种新颖的方法,通过使用人工智能来设计人工智能代理的行动环境,从而提高人工智能代理的能力。我的方法可以识别人工智能代理的固有能力和局限性,并找到修改其环境的最佳方法,从而最大限度地提高性能。考虑到这一议程,我在此介绍几个研究项目,这些项目在目标、用于寻找最佳设计的人工智能方法以及它们所对应的现实世界应用方面各不相同。我还将讨论不同的项目如何与我的总体目标相吻合,即利用人工智能促进有效的多机器人协作,并增强机器人和机器与人类的互动方式。
{"title":"Better environments for better AI","authors":"Sarah Keren","doi":"10.1002/aaai.12187","DOIUrl":"https://doi.org/10.1002/aaai.12187","url":null,"abstract":"<p>Most AI research focuses exclusively on the AI agent itself, that is, given some input, what are the improvements to the agent's reasoning that will yield the best possible output? In my research, I take a novel approach to increasing the capabilities of AI agents via the use of AI to design the environments in which they are intended to act. My methods identify the inherent capabilities and limitations of AI agents and find the best way to modify their environment in order to maximize performance. With this agenda in mind, I describe here several research projects that vary in their objective, in the AI methodologies that are applied for finding optimal designs, and in the real-world applications to which they correspond. I also discuss how the different projects fit within my overarching objective of using AI to promote effective multi-agent collaboration and to enhance the way robots and machines interact with humans.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"369-375"},"PeriodicalIF":2.5,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12187","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combating misinformation in the age of LLMs: Opportunities and challenges 打击法律硕士时代的错误信息:机遇与挑战
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-01 DOI: 10.1002/aaai.12188
Canyu Chen, Kai Shu

Misinformation such as fake news and rumors is a serious threat for information ecosystems and public trust. The emergence of large language models (LLMs) has great potential to reshape the landscape of combating misinformation. Generally, LLMs can be a double-edged sword in the fight. On the one hand, LLMs bring promising opportunities for combating misinformation due to their profound world knowledge and strong reasoning abilities. Thus, one emerging question is: can we utilize LLMs to combat misinformation? On the other hand, the critical challenge is that LLMs can be easily leveraged to generate deceptive misinformation at scale. Then, another important question is: how to combat LLM-generated misinformation? In this paper, we first systematically review the history of combating misinformation before the advent of LLMs. Then we illustrate the current efforts and present an outlook for these two fundamental questions, respectively. The goal of this survey paper is to facilitate the progress of utilizing LLMs for fighting misinformation and call for interdisciplinary efforts from different stakeholders for combating LLM-generated misinformation.

假新闻和谣言等虚假信息严重威胁着信息生态系统和公众信任。大型语言模型(LLMs)的出现极有可能重塑打击虚假信息的格局。一般来说,LLMs 在这场斗争中可能是一把双刃剑。一方面,LLMs 凭借其深厚的世界知识和强大的推理能力,为打击虚假信息带来了大有可为的机会;另一方面,LLMs 也有可能成为虚假信息领域的 "新宠儿"。因此,一个新出现的问题是:我们能否利用 LLM 来打击错误信息?另一方面,关键的挑战在于 LLMs 很容易被用来大规模生成欺骗性的错误信息。那么,另一个重要问题是:如何打击 LLM 生成的错误信息?在本文中,我们首先系统回顾了在 LLM 出现之前打击误导信息的历史。然后,我们分别阐述了当前的努力,并对这两个基本问题进行了展望。本调查报告的目的是促进利用 LLMs 打击误导信息的进展,并呼吁不同利益相关者为打击 LLM 生成的误导信息做出跨学科努力。
{"title":"Combating misinformation in the age of LLMs: Opportunities and challenges","authors":"Canyu Chen,&nbsp;Kai Shu","doi":"10.1002/aaai.12188","DOIUrl":"https://doi.org/10.1002/aaai.12188","url":null,"abstract":"<p>Misinformation such as fake news and rumors is a serious threat for information ecosystems and public trust. The emergence of large language models (LLMs) has great potential to reshape the landscape of combating misinformation. Generally, LLMs can be a double-edged sword in the fight. On the one hand, LLMs bring promising opportunities for combating misinformation due to their profound world knowledge and strong reasoning abilities. Thus, one emerging question is: <i>can we utilize LLMs to combat misinformation?</i> On the other hand, the critical challenge is that LLMs can be easily leveraged to generate deceptive misinformation at scale. Then, another important question is: <i>how to combat LLM-generated misinformation?</i> In this paper, we first systematically review the history of combating misinformation before the advent of LLMs. Then we illustrate the current efforts and present an outlook for these two fundamental questions, respectively. The goal of this survey paper is to facilitate the progress of utilizing LLMs for fighting misinformation and call for interdisciplinary efforts from different stakeholders for combating LLM-generated misinformation.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"354-368"},"PeriodicalIF":2.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12188","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Food information engineering 食品信息工程
IF 2.5 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-31 DOI: 10.1002/aaai.12185
Azanzi Jiomekong, Allard Oelen, Soren Auer, Lorenz Anna-Lena, Vogt Lars

Food information engineering relies on statistical and AI techniques (e.g., symbolic, connectionist, and neurosymbolic AI) for collecting, storing, processing, diffusing, and putting food information in a form exploitable by humans and machines. Food information is collected manually and automatically. Once collected, food information is organized using tabular data representation schema, symbolic, connectionist or neurosymbolic AI techniques. Once collected, processed, and stored, food information is diffused to different stakeholders using appropriate formats. Even if neurosymbolic AI has shown promising results in many domains, we found that this approach is rarely used in the domain of food information engineering. This paper aims to serve as a good reference for food information engineering researchers. Unlike existing reviews on the subject, we cover all the aspects of food information engineering and we linked the paper to online resources built using Open Research Knowledge Graph. These resources are composed of templates, comparison tables of research contributions and smart reviews. All these resources are organized in the “Food Information Engineering” observatory and will be continually updated with new research contributions.

食品信息工程依靠统计和人工智能技术(如符号、联结主义和神经符号人工智能)来收集、存储、处理、传播食品信息,并将其转化为人类和机器都能利用的形式。食物信息可通过人工或自动方式收集。收集后,使用表格数据表示模式、符号、联结主义或神经符号人工智能技术对食物信息进行组织。一旦收集、处理和存储完毕,食品信息就会以适当的格式传播给不同的利益相关者。尽管神经符号人工智能在许多领域都取得了可喜的成果,但我们发现这种方法在食品信息工程领域却鲜有应用。本文旨在为食品信息工程研究人员提供一个良好的参考。与现有的相关综述不同,我们涵盖了食品信息工程的所有方面,并将本文与使用开放研究知识图谱构建的在线资源相链接。这些资源由模板、研究成果对照表和智能评论组成。所有这些资源都组织在 "食品信息工程 "观察站中,并将根据新的研究成果不断更新。
{"title":"Food information engineering","authors":"Azanzi Jiomekong,&nbsp;Allard Oelen,&nbsp;Soren Auer,&nbsp;Lorenz Anna-Lena,&nbsp;Vogt Lars","doi":"10.1002/aaai.12185","DOIUrl":"https://doi.org/10.1002/aaai.12185","url":null,"abstract":"<p>Food information engineering relies on statistical and AI techniques (e.g., symbolic, connectionist, and neurosymbolic AI) for collecting, storing, processing, diffusing, and putting food information in a form exploitable by humans and machines. Food information is collected manually and automatically. Once collected, food information is organized using tabular data representation schema, symbolic, connectionist or neurosymbolic AI techniques. Once collected, processed, and stored, food information is diffused to different stakeholders using appropriate formats. Even if neurosymbolic AI has shown promising results in many domains, we found that this approach is rarely used in the domain of food information engineering. This paper aims to serve as a good reference for food information engineering researchers. Unlike existing reviews on the subject, we cover all the aspects of food information engineering and we linked the paper to online resources built using Open Research Knowledge Graph. These resources are composed of templates, comparison tables of research contributions and smart reviews. All these resources are organized in the “Food Information Engineering” observatory and will be continually updated with new research contributions.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 3","pages":"338-353"},"PeriodicalIF":2.5,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12185","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142324669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Ai Magazine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1