首页 > 最新文献

Proceedings of the AAAI Symposium Series最新文献

英文 中文
Responsible Integration of Large Language Models (LLMs) in Navy Operational Plan Generation 负责将大型语言模型 (LLM) 整合到海军作战计划生成中
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31179
Simon Kapiamba, H. Fouad, Ira S. Moskowitz
This paper outlines an approach for assessing and quantifyingthe risks associated with integrating Large Language Models(LLMs) in generating naval operational plans. It aims to explorethe potential benefits and challenges of LLMs in thiscontext and to suggest a methodology for a comprehensiverisk assessment framework.
本文概述了一种评估和量化与集成大型语言模型(LLMs)以生成海军作战计划相关的风险的方法。本文旨在探讨大型语言模型在此背景下的潜在优势和挑战,并为全面风险评估框架提出方法建议。
{"title":"Responsible Integration of Large Language Models (LLMs) in Navy Operational Plan Generation","authors":"Simon Kapiamba, H. Fouad, Ira S. Moskowitz","doi":"10.1609/aaaiss.v3i1.31179","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31179","url":null,"abstract":"This paper outlines an approach for assessing and quantifying\u0000the risks associated with integrating Large Language Models\u0000(LLMs) in generating naval operational plans. It aims to explore\u0000the potential benefits and challenges of LLMs in this\u0000context and to suggest a methodology for a comprehensive\u0000risk assessment framework.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"18 16","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Fast and Slow: A Redux of Levels of Learning in General Autonomous Intelligent Agents 学习的快与慢:通用自主智能代理的学习水平再论
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31279
Shiwali Mohan, John E. Laird
Autonomous intelligent agents, including humans, operate in a complex, dynamic environment that necessitates continuous learning. We revisit our thesis that proposes that learning in human-like agents can be categorized into two levels: Level 1 (L1) involving innate and automatic learning mechanisms, while Level 2 (L2) comprises deliberate strategies controlled by the agent. Our thesis draws from our experiences in building artificial agents with complex learning behaviors, such as interactive task learning and open-world learning.
包括人类在内的自主智能代理在复杂多变的环境中运行,需要不断学习。我们重温了我们的论文,该论文提出类人代理的学习可分为两个层次:第一级(L1)涉及与生俱来的自动学习机制,而第二级(L2)则包括由代理控制的深思熟虑的策略。我们的论文借鉴了我们在构建具有复杂学习行为(如交互式任务学习和开放世界学习)的人工代理方面的经验。
{"title":"Learning Fast and Slow: A Redux of Levels of Learning in General Autonomous Intelligent Agents","authors":"Shiwali Mohan, John E. Laird","doi":"10.1609/aaaiss.v3i1.31279","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31279","url":null,"abstract":"Autonomous intelligent agents, including humans, operate in a complex, dynamic environment that necessitates continuous learning. We revisit our thesis that proposes that learning in human-like agents can be categorized into two levels: Level 1 (L1) involving innate and automatic learning mechanisms, while Level 2 (L2) comprises deliberate strategies controlled by the agent. Our thesis draws from our experiences in building artificial agents with complex learning behaviors, such as interactive task learning and open-world learning.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"31 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accounting for Human Engagement Behavior to Enhance AI-Assisted Decision Making 考虑人类参与行为,加强人工智能辅助决策
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31184
Ming Yin
Artificial intelligence (AI) technologies have been increasingly integrated into human workflows. For example, the usage of AI-based decision aids in human decision-making processes has resulted in a new paradigm of AI-assisted decision making---that is, the AI-based decision aid provides a decision recommendation to the human decision makers, while humans make the final decision. The increasing prevalence of human-AI collaborative decision making highlights the need to understand how humans engage with the AI-based decision aid in these decision-making processes, and how to promote the effectiveness of the human-AI team in decision making. In this talk, I'll discuss a few examples illustrating that when AI is used to assist humans---both an individual decision maker or a group of decision makers---in decision making, people's engagement with the AI assistance is largely subject to their heuristics and biases, rather than careful deliberation of the respective strengths and limitations of AI and themselves. I'll then describe how to enhance AI-assisted decision making by accounting for human engagement behavior in the designs of AI-based decision aids. For example, AI recommendations can be presented to decision makers in a way that promotes their appropriate trust and reliance on AI by leveraging or mitigating human biases, informed by the analysis of human competence in decision making. Alternatively, AI-assisted decision making can be improved by developing AI models that can anticipate and adapt to the engagement behavior of human decision makers.
人工智能(AI)技术日益融入人类的工作流程。例如,在人类决策过程中使用基于人工智能的决策辅助工具已经形成了一种新的人工智能辅助决策模式--即基于人工智能的决策辅助工具向人类决策者提供决策建议,而人类则做出最终决策。人类与人工智能协作决策的日益普遍,凸显了人们需要了解在这些决策过程中人类如何与基于人工智能的决策辅助工具互动,以及如何提高人类与人工智能团队在决策中的效率。在本讲座中,我将讨论几个例子,说明当人工智能被用于辅助人类--无论是单个决策者还是群体决策者--进行决策时,人们对人工智能辅助的参与在很大程度上取决于他们的启发式思维和偏见,而不是仔细斟酌人工智能和他们各自的优势和局限性。接下来,我将介绍如何在设计人工智能辅助决策时考虑到人类的参与行为,从而增强人工智能辅助决策的效果。例如,通过分析人类在决策方面的能力,可以利用或减轻人类的偏见,以促进决策者对人工智能的适当信任和依赖的方式向决策者提出人工智能建议。另外,还可以通过开发能够预测和适应人类决策者参与行为的人工智能模型来改进人工智能辅助决策。
{"title":"Accounting for Human Engagement Behavior to Enhance AI-Assisted Decision Making","authors":"Ming Yin","doi":"10.1609/aaaiss.v3i1.31184","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31184","url":null,"abstract":"Artificial intelligence (AI) technologies have been increasingly integrated into human workflows. For example, the usage of AI-based decision aids in human decision-making processes has resulted in a new paradigm of AI-assisted decision making---that is, the AI-based decision aid provides a decision recommendation to the human decision makers, while humans make the final decision. The increasing prevalence of human-AI collaborative decision making highlights the need to understand how humans engage with the AI-based decision aid in these decision-making processes, and how to promote the effectiveness of the human-AI team in decision making. In this talk, I'll discuss a few examples illustrating that when AI is used to assist humans---both an individual decision maker or a group of decision makers---in decision making, people's engagement with the AI assistance is largely subject to their heuristics and biases, rather than careful deliberation of the respective strengths and limitations of AI and themselves. I'll then describe how to enhance AI-assisted decision making by accounting for human engagement behavior in the designs of AI-based decision aids. For example, AI recommendations can be presented to decision makers in a way that promotes their appropriate trust and reliance on AI by leveraging or mitigating human biases, informed by the analysis of human competence in decision making. Alternatively, AI-assisted decision making can be improved by developing AI models that can anticipate and adapt to the engagement behavior of human decision makers.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"11 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI for Social Good Education at Hispanic Serving Institutions 西语裔服务机构的人工智能社会公益教育
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31259
Yu Chen, Gabriel Granco, Yunfei Hou, Heather Macias, Frank A. Gomez
This project aims to broaden AI education by developing and studying the efficacy of innovative learning practices and resources for AI education for social good. We have developed three AI learning modules for students to: 1) identify social issues that align with the SDGs in their community (e.g., poverty, hunger, quality education); 2) learn AI through hands-on labs and business applications; and 3) create AI-powered solutions in teams to address social is-sues they have identified. Student teams are expected to situate AI learning in their communities and contribute to their communities. Students then use the modules to en-gage in an interdisciplinary approach, facilitating AI learn-ing for social good in informational sciences and technology, geography, and computer science at three CSU HSIs (San Jose State University, Cal Poly Pomona and CSU San Bernardino). Finally, we aim to evaluate the efficacy and impact of the proposed AI teaching methods and activities in terms of learning outcomes, student experience, student engagement, and equity.
本项目旨在通过开发和研究人工智能教育创新学习实践和资源的功效,扩大人工智能教育的社会公益性。我们为学生开发了三个人工智能学习模块,以便1)确定其所在社区与可持续发展目标相一致的社会问题(如贫困、饥饿、优质教育);2)通过实践实验室和商业应用学习人工智能;3)以团队形式创建人工智能驱动的解决方案,以解决他们所确定的社会问题。学生团队应将人工智能学习融入其所在社区,并为社区做出贡献。然后,学生利用这些模块参与跨学科方法,在三所 CSU HSI(圣何塞州立大学、加州理工波莫纳分校和 CSU 圣贝纳迪诺分校)的信息科学与技术、地理学和计算机科学领域促进人工智能学习,以造福社会。最后,我们将从学习成果、学生体验、学生参与度和公平性等方面评估所提出的人工智能教学方法和活动的效果和影响。
{"title":"AI for Social Good Education at Hispanic Serving Institutions","authors":"Yu Chen, Gabriel Granco, Yunfei Hou, Heather Macias, Frank A. Gomez","doi":"10.1609/aaaiss.v3i1.31259","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31259","url":null,"abstract":"This project aims to broaden AI education by developing and studying the efficacy of innovative learning practices and resources for AI education for social good. We have developed three AI learning modules for students to: 1) identify social issues that align with the SDGs in their community (e.g., poverty, hunger, quality education); 2) learn AI through hands-on labs and business applications; and 3) create AI-powered solutions in teams to address social is-sues they have identified. Student teams are expected to situate AI learning in their communities and contribute to their communities. Students then use the modules to en-gage in an interdisciplinary approach, facilitating AI learn-ing for social good in informational sciences and technology, geography, and computer science at three CSU HSIs (San Jose State University, Cal Poly Pomona and CSU San Bernardino). Finally, we aim to evaluate the efficacy and impact of the proposed AI teaching methods and activities in terms of learning outcomes, student experience, student engagement, and equity.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"28 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141119091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Framework for Federated Learning and Edge Deployment of Real-Time Reinforcement Learning Decision Engine on Software Defined Radio 软件无线电实时强化学习决策引擎的联合学习和边缘部署框架
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31218
Jithin Jagannath
Machine learning promises to empower dynamic resource allocation requirements of Next Generation (NextG) wireless networks including 6G and tactical networks. Recently, we have seen the impact machine learning can make on various aspects of wireless networks. Yet, in most cases, the progress has been limited to simulations and/or relies on large processing units to run the decision engines as opposed to deploying it on the radio at the edge. While relying on simulations for rapid and efficient training of deep reinforcement learning (DRL) may be necessary, it is key to mitigate the sim-real gap while trying to improve the generalization capability. To mitigate these challenges, we developed the Marconi-Rosenblatt Framework for Intelligent Networks (MR-iNet Gym), an open-source architecture designed for accelerating the deployment of novel DRL for NextG wireless networks. To demonstrate its impact, we tackled the problem of distributed frequency and power allocation while emphasizing the generalization capability of DRL decision engine. The end-to-end solution was implemented on the GPU-embedded software-defined radio and validated using over-the-air evaluation. To the best of our knowledge, these were the first instances that established the feasibility of deploying DRL for optimized distributed resource allocation for next-generation of GPU-embedded radios.
机器学习有望满足下一代(NextG)无线网络(包括 6G 和战术网络)的动态资源分配要求。最近,我们看到了机器学习对无线网络各个方面的影响。然而,在大多数情况下,进展仅限于模拟和/或依赖大型处理单元来运行决策引擎,而不是将其部署在边缘的无线电上。虽然依靠仿真来快速高效地训练深度强化学习(DRL)可能是必要的,但关键是要在努力提高泛化能力的同时缩小仿真与真实之间的差距。为了缓解这些挑战,我们开发了马可尼-罗森布拉特智能网络框架(MR-iNet Gym),这是一个开源架构,旨在加速部署适用于 NextG 无线网络的新型 DRL。为了证明其影响力,我们在强调 DRL 决策引擎的泛化能力的同时,解决了分布式频率和功率分配问题。端到端解决方案是在嵌入 GPU 的软件定义无线电上实现的,并通过空中评估进行了验证。据我们所知,这些是为下一代 GPU 嵌入式无线电优化分布式资源分配部署 DRL 的可行性的首个实例。
{"title":"Framework for Federated Learning and Edge Deployment of Real-Time Reinforcement Learning Decision Engine on Software Defined Radio","authors":"Jithin Jagannath","doi":"10.1609/aaaiss.v3i1.31218","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31218","url":null,"abstract":"Machine learning promises to empower dynamic resource allocation requirements of Next Generation (NextG) wireless networks including 6G and tactical networks. Recently, we have seen the impact machine learning can make on various aspects of wireless networks. Yet, in most cases, the progress has been limited to simulations and/or relies on large processing units to run the decision engines as opposed to deploying it on the radio at the edge. While relying on simulations for rapid and efficient training of deep reinforcement learning (DRL) may be necessary, it is key to mitigate the sim-real gap while trying to improve the generalization capability. To mitigate these challenges, we developed the Marconi-Rosenblatt Framework for Intelligent Networks (MR-iNet Gym), an open-source architecture designed for accelerating the deployment of novel DRL for NextG wireless networks. To demonstrate its impact, we tackled the problem of distributed frequency and power allocation while emphasizing the generalization capability of DRL decision engine. The end-to-end solution was implemented on the GPU-embedded software-defined radio and validated using over-the-air evaluation. To the best of our knowledge, these were the first instances that established the feasibility of deploying DRL for optimized distributed resource allocation for next-generation of GPU-embedded radios.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"76 21","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perception-Dominant Control Types for Human/Machine Systems 人机系统的感知主导控制类型
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31177
Ted Goranson
We explore a novel approach to complex domain modelling by emphasising primitives based on perception. The usual approach either focuses on actors or cognition associated with tokens that convey information. In related research, we have examined using effects and/or outcomes as primitives, and influences as the generator of those outcomes via categoric functors. That approach (influences, effects) has advantages: it leverages what is known and supports the expanded logics we use, where we want to anticipate and engineer possible futures. But it has weaknesses when placed in a dynamic human-machine system where what is perceived or assumed matters more than what is known. The work reported here builds on previous advances in type specification and reasoning to ‘move the primitives forward’ more toward situation encounter and away from situation understanding. The goal is in the context of shared human-machine systems where:• reaction times are shorter than the traditional ingestion/comprehension/response loop can support;• situations that are too complex or dynamic for current comprehension by any means;• there simply is insufficient knowledge about governing situations for the comprehension model to support action; and/or,• the many machine/human and system/system interfaces that are incapable of conveying the needed insights; that is, the communication channels choke the information or influence flows. While the approach is motivated by the above unfriendly conditions, we expect significant benefits. We will explore these but engineer toward a federated decision paradigm where decisions by local human, machine or synthesis are not whole-situation-aware, but that collectively ‘swarm’ locally across the larger system to be more effective, ‘wiser’ than a convention paradigm may produce. The supposed implementation strategy will be through extending an existing ‘playbooks as code’ project whose goals are to advise on local action by modelling and gaming complex system dynamics. A sponsoring context is ‘grey zone’ competition that avoids armed conflict, but that can segue to a mixed system course of action advisory. The general context is a costly ‘blue swan’ risk in large commercial and government enterprises. The method will focus on patterns and relationships in synthetic categories used to model type transitions within topological models of system influence. One may say this is applied intuitionistic type theory, following mechanisms generally described by synthetic differential geometry. In this context, the motivating supposition of this study is that information-carrying influence channels are best modelled in our challenging domain as perceived types rather than understood types.
通过强调基于感知的基元,我们探索了一种复杂领域建模的新方法。通常的方法要么侧重于行动者,要么侧重于与传递信息的标记相关的认知。在相关研究中,我们探讨了使用效果和/或结果作为基元,并通过分类函数使用影响作为这些结果的生成器。 这种方法(影响、结果)有其优点:它充分利用了已知的信息,并支持我们使用的扩展逻辑,在这种逻辑中,我们希望预测和设计可能的未来。但在动态人机系统中,感知或假设的东西比已知的东西更重要,因此这种方法也有弱点。本文所报告的工作建立在之前在类型规范和推理方面所取得的进展基础之上,目的是 "向前推进基元",使其更多地与情境相遇,而不是理解情境。 我们的目标是在人机共用系统中:- 反应时间短于传统的摄取/理解/响应循环所能支持的时间;- 情境过于复杂或动态,目前的理解手段无法应对;- 对于理解模型而言,有关管理情境的知识根本不足以支持行动;和/或- 许多机器/人和系统/系统界面无法传递所需的洞察力;也就是说,通信渠道阻塞了信息流或影响流。虽然这种方法是基于上述不友好的条件,但我们期望它能带来显著的益处。我们将探索这些益处,但会朝着联合决策范式的方向努力。在这种范式中,本地人、机器或合成物的决策并不具有整体情境意识,但它们会在更大的系统中集体 "蜂拥",从而比传统范式更有效、更 "明智"。所谓的实施策略是通过扩展现有的 "代码游戏 "项目,其目标是通过对复杂的系统动态进行建模和博弈,为本地行动提供建议。项目的发起背景是避免武装冲突的 "灰色地带 "竞争,但也可以过渡到混合系统行动方案咨询。一般背景是大型商业和政府企业中代价高昂的 "蓝天鹅 "风险。该方法将侧重于合成类别中的模式和关系,用于在系统影响的拓扑模型中模拟类型转换。可以说,这是应用直观类型理论,遵循合成微分几何学所描述的一般机制。在这种情况下,本研究的动机假设是,在我们所面临的挑战领域中,最好将携带信息的影响渠道建模为感知类型,而不是理解类型。
{"title":"Perception-Dominant Control Types for Human/Machine Systems","authors":"Ted Goranson","doi":"10.1609/aaaiss.v3i1.31177","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31177","url":null,"abstract":"We explore a novel approach to complex domain modelling by emphasising primitives based on perception. The usual approach either focuses on actors or cognition associated with tokens that convey information. In related research, we have examined using effects and/or outcomes as primitives, and influences as the generator of those outcomes via categoric functors. \u0000 That approach (influences, effects) has advantages: it leverages what is known and supports the expanded logics we use, where we want to anticipate and engineer possible futures. But it has weaknesses when placed in a dynamic human-machine system where what is perceived or assumed matters more than what is known. The work reported here builds on previous advances in type specification and reasoning to ‘move the primitives forward’ more toward situation encounter and away from situation understanding. \u0000 The goal is in the context of shared human-machine systems where:\u0000• reaction times are shorter than the traditional ingestion/comprehension/response loop can support;\u0000• situations that are too complex or dynamic for current comprehension by any means;\u0000• there simply is insufficient knowledge about governing situations for the comprehension model to support action; and/or,\u0000• the many machine/human and system/system interfaces that are incapable of conveying the needed insights; that is, the communication channels choke the information or influence flows.\u0000 While the approach is motivated by the above unfriendly conditions, we expect significant benefits. We will explore these but engineer toward a federated decision paradigm where decisions by local human, machine or synthesis are not whole-situation-aware, but that collectively ‘swarm’ locally across the larger system to be more effective, ‘wiser’ than a convention paradigm may produce.\u0000 The supposed implementation strategy will be through extending an existing ‘playbooks as code’ project whose goals are to advise on local action by modelling and gaming complex system dynamics. A sponsoring context is ‘grey zone’ competition that avoids armed conflict, but that can segue to a mixed system course of action advisory. The general context is a costly ‘blue swan’ risk in large commercial and government enterprises.\u0000 The method will focus on patterns and relationships in synthetic categories used to model type transitions within topological models of system influence. One may say this is applied intuitionistic type theory, following mechanisms generally described by synthetic differential geometry. In this context, the motivating supposition of this study is that information-carrying influence channels are best modelled in our challenging domain as perceived types rather than understood types.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"100 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rule-Based Explanations of Machine Learning Classifiers Using Knowledge Graphs 使用知识图谱对机器学习分类器进行基于规则的解释
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31200
Orfeas Menis Mastromichalakis, Edmund Dervakos, A. Chortaras, G. Stamou
The use of symbolic knowledge representation and reasoning as a way to resolve the lack of transparency of machine learning classifiers is a research area that has lately gained a lot of traction. In this work, we use knowledge graphs as the underlying framework providing the terminology for representing explanations for the operation of a machine learning classifier escaping the constraints of using the features of raw data as a means to express the explanations, providing a promising solution to the problem of the understandability of explanations. In particular, given a description of the application domain of the classifier in the form of a knowledge graph, we introduce a novel theoretical framework for representing explanations of its operation, in the form of query-based rules expressed in the terminology of the knowledge graph. This allows for explaining opaque black-box classifiers, using terminology and information that is independent of the features of the classifier and its domain of application, leading to more understandable explanations but also allowing the creation of different levels of explanations according to the final end-user.
使用符号化知识表示和推理来解决机器学习分类器缺乏透明度的问题,是近来备受关注的一个研究领域。在这项工作中,我们使用知识图谱作为底层框架,为机器学习分类器的操作解释提供了术语表达,摆脱了使用原始数据特征作为解释表达手段的限制,为解释的可理解性问题提供了一个很有前景的解决方案。特别是,在以知识图谱的形式描述分类器应用领域的情况下,我们引入了一个新颖的理论框架,以知识图谱术语表达的基于查询的规则的形式来表示分类器操作的解释。这样就可以使用独立于分类器特征及其应用领域的术语和信息来解释不透明的黑盒子分类器,从而获得更易于理解的解释,而且还可以根据最终用户的需求创建不同层次的解释。
{"title":"Rule-Based Explanations of Machine Learning Classifiers Using Knowledge Graphs","authors":"Orfeas Menis Mastromichalakis, Edmund Dervakos, A. Chortaras, G. Stamou","doi":"10.1609/aaaiss.v3i1.31200","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31200","url":null,"abstract":"The use of symbolic knowledge representation and reasoning as a way to resolve the lack of transparency of machine learning classifiers is a research area that has lately gained a lot of traction. In this work, we use knowledge graphs as the underlying framework providing the terminology for representing explanations for the operation of a machine learning classifier escaping the constraints of using the features of raw data as a means to express the explanations, providing a promising solution to the problem of the understandability of explanations. In particular, given a description of the application domain of the classifier in the form of a knowledge graph, we introduce a novel theoretical framework for representing explanations of its operation, in the form of query-based rules expressed in the terminology of the knowledge graph. This allows for explaining opaque black-box classifiers, using terminology and information that is independent of the features of the classifier and its domain of application, leading to more understandable explanations but also allowing the creation of different levels of explanations according to the final end-user.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"22 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Can LLMs Answer Investment Banking Questions? Using Domain-Tuned Functions to Improve LLM Performance on Knowledge-Intensive Analytical Tasks 法律硕士能否回答投资银行问题?使用领域调整函数提高法律硕士在知识密集型分析任务中的表现
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31191
Nicholas Harvel, F. B. Haiek, Anupriya Ankolekar, David James Brunner
Large Language Models (LLMs) can increase the productivity of general-purpose knowledge work, but accuracy is a concern, especially in professional settings requiring domain-specific knowledge and reasoning. To evaluate the suitability of LLMs for such work, we developed a benchmark of 16 analytical tasks representative of the investment banking industry. We evaluated LLM performance without special prompting, with relevant information provided in the prompt, and as part of a system giving the LLM access to domain-tuned functions for information retrieval and planning. Without access to functions, state-of-the-art LLMs performed poorly, completing two or fewer tasks correctly. Access to appropriate domain-tuned functions yielded dramatically better results, although performance was highly sensitive to the design of the functions and the structure of the information they returned. The most effective designs yielded correct answers on 12 out of 16 tasks. Our results suggest that domain-specific functions and information structures, by empowering LLMs with relevant domain knowledge and enabling them to reason in domain-appropriate ways, may be a powerful means of adapting LLMs for use in demanding professional settings.
大型语言模型(LLM)可以提高通用知识工作的效率,但准确性却令人担忧,尤其是在需要特定领域知识和推理的专业环境中。为了评估大型语言模型在此类工作中的适用性,我们开发了一个包含 16 项具有代表性的投资银行业分析任务的基准。我们对 LLM 的性能进行了评估,包括没有特殊提示的情况下、在提示中提供相关信息的情况下,以及作为系统的一部分让 LLM 访问用于信息检索和规划的领域调整功能的情况下。在没有使用这些功能的情况下,最先进的 LLM 表现不佳,只能正确完成两项或更少任务。使用适当的领域调整函数后,结果大为改观,尽管性能对函数的设计及其返回信息的结构非常敏感。最有效的设计在 16 个任务中的 12 个任务中获得了正确答案。我们的研究结果表明,针对特定领域的函数和信息结构,通过赋予 LLMs 相关领域的知识,使他们能够以适合该领域的方式进行推理,可能是使 LLMs 适应高要求专业环境的有力手段。
{"title":"Can LLMs Answer Investment Banking Questions? Using Domain-Tuned Functions to Improve LLM Performance on Knowledge-Intensive Analytical Tasks","authors":"Nicholas Harvel, F. B. Haiek, Anupriya Ankolekar, David James Brunner","doi":"10.1609/aaaiss.v3i1.31191","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31191","url":null,"abstract":"Large Language Models (LLMs) can increase the productivity of general-purpose knowledge work, but accuracy is a concern, especially in professional settings requiring domain-specific knowledge and reasoning. To evaluate the suitability of LLMs for such work, we developed a benchmark of 16 analytical tasks representative of the investment banking industry. We evaluated LLM performance without special prompting, with relevant information provided in the prompt, and as part of a system giving the LLM access to domain-tuned functions for information retrieval and planning. Without access to functions, state-of-the-art LLMs performed poorly, completing two or fewer tasks correctly. Access to appropriate domain-tuned functions yielded dramatically better results, although performance was highly sensitive to the design of the functions and the structure of the information they returned. The most effective designs yielded correct answers on 12 out of 16 tasks. Our results suggest that domain-specific functions and information structures, by empowering LLMs with relevant domain knowledge and enabling them to reason in domain-appropriate ways, may be a powerful means of adapting LLMs for use in demanding professional settings.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"96 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ethical Considerations of Generative AI: A Survey Exploring the Role of Decision Makers in the Loop 生成式人工智能的伦理考量:探索环路中决策者角色的调查
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31243
Yohn Jairo Parra Bautista, Carlos Theran, Richard A. Aló
We explore the foresighted concerns that Norbert Wiener voiced in 1960 about the potential of machines to learn and create strategies that could not be anticipated, drawing parallels to the fable "The Sorcerer's Apprentice" by Goethe. The progress in artificial intelligence (AI) has brought these worries back to the forefront, as shown by a survey AI Impacts conducted in 2022 with more than 700 machine learning researchers. This survey found a five percentage probability that advanced AI might cause "extremely adverse" outcomes, including the possibility of human extinction. Importantly, the introduction of OpenAI's ChatGPT, powered by GPT-4, has led to a surge in entrepreneurial activities, highlighting the ease of use of large language models (LLMs).AI's potential for adverse outcomes, such as military control and unregulated AI races, is explored alongside concerns about AI's role in governance, healthcare, media portrayal, and surpassing human intelligence. Given their transformative impact on content creation, the prominence of generative AI tools such as ChatGPT is noted. The societal assessment of Artificial Intelligence (AI) has grown increasingly intricate and pressing in tandem with the rapid evolution of this technology. As AI continues to advance at a swift pace, the need to comprehensively evaluate its societal implications has become more complex and urgent, necessitating a thorough examination of its potential impact on various domains such as governance, healthcare, media portrayal, and surpassing human intelligence. This assessment is crucial in addressing ethical concerns related to bias, data misuse, technical limitations, and transparency gaps, and in integrating ethical and legal principles throughout AI algorithm lifecycles to ensure alignment with societal well-being. Furthermore, the urgency of addressing the societal implications of AI is underscored by the need for healthcare workforce upskilling and ethical considerations in the era of AI-assisted medicine, emphasizing the critical importance of integrating societal well-being into the development and deployment of AI technologies. Our study entails an examination of the ethical quandaries and obstacles presented when developing methods to evaluate and predict the broader societal impacts of AI on decision-making processes involving the generating of images, videos, and textual content.
我们探讨了诺伯特-维纳(Norbert Wiener)在1960年对机器学习潜力和创造无法预料的战略所表达的前瞻性担忧,并将其与歌德的寓言故事《巫师的学徒》相提并论。2022 年,AI Impacts 对 700 多名机器学习研究人员进行了一项调查。这项调查发现,先进的人工智能可能造成 "极其不利 "结果的概率为 5%,其中包括人类灭绝的可能性。重要的是,由GPT-4驱动的OpenAI的ChatGPT的推出导致了创业活动的激增,凸显了大型语言模型(LLMs)的易用性。在探讨人工智能可能导致的不利结果(如军事控制和不受监管的人工智能竞赛)的同时,还探讨了人们对人工智能在治理、医疗保健、媒体描绘以及超越人类智能方面的作用的担忧。鉴于其对内容创作的变革性影响,ChatGPT 等生成式人工智能工具的重要性也得到了关注。随着人工智能(AI)技术的快速发展,社会对该技术的评估也变得越来越复杂和紧迫。随着人工智能的持续快速发展,全面评估其社会影响的需求也变得更加复杂和迫切,这就要求对其在治理、医疗保健、媒体形象以及超越人类智能等各个领域的潜在影响进行彻底审查。这种评估对于解决与偏见、数据滥用、技术限制和透明度差距有关的伦理问题,以及在整个人工智能算法生命周期中整合伦理和法律原则以确保与社会福祉保持一致至关重要。此外,在人工智能辅助医疗时代,医疗保健人员需要提高技能和考虑伦理问题,这也凸显了解决人工智能社会影响的紧迫性,强调了将社会福祉纳入人工智能技术开发和部署的极端重要性。我们的研究需要审查在开发评估和预测人工智能对涉及生成图像、视频和文本内容的决策过程的广泛社会影响的方法时所遇到的伦理窘境和障碍。
{"title":"Ethical Considerations of Generative AI: A Survey Exploring the Role of Decision Makers in the Loop","authors":"Yohn Jairo Parra Bautista, Carlos Theran, Richard A. Aló","doi":"10.1609/aaaiss.v3i1.31243","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31243","url":null,"abstract":"We explore the foresighted concerns that Norbert Wiener voiced in 1960 about the potential of machines to learn and create strategies that could not be anticipated, drawing parallels to the fable \"The Sorcerer's Apprentice\" by Goethe. The progress in artificial intelligence (AI) has brought these worries back to the forefront, as shown by a survey AI Impacts conducted in 2022 with more than 700 machine learning researchers. This survey found a five percentage probability that advanced AI might cause \"extremely adverse\" outcomes, including the possibility of human extinction. Importantly, the introduction of OpenAI's ChatGPT, powered by GPT-4, has led to a surge in entrepreneurial activities, highlighting the ease of use of large language models (LLMs).AI's potential for adverse outcomes, such as military control and unregulated AI races, is explored alongside concerns about AI's role in governance, healthcare, media portrayal, and surpassing human intelligence. Given their transformative impact on content creation, the prominence of generative AI tools such as ChatGPT is noted. The societal assessment of Artificial Intelligence (AI) has grown increasingly intricate and pressing in tandem with the rapid evolution of this technology. As AI continues to advance at a swift pace, the need to comprehensively evaluate its societal implications has become more complex and urgent, necessitating a thorough examination of its potential impact on various domains such as governance, healthcare, media portrayal, and surpassing human intelligence. This assessment is crucial in addressing ethical concerns related to bias, data misuse, technical limitations, and transparency gaps, and in integrating ethical and legal principles throughout AI algorithm lifecycles to ensure alignment with societal well-being. Furthermore, the urgency of addressing the societal implications of AI is underscored by the need for healthcare workforce upskilling and ethical considerations in the era of AI-assisted medicine, emphasizing the critical importance of integrating societal well-being into the development and deployment of AI technologies. Our study entails an examination of the ethical quandaries and obstacles presented when developing methods to evaluate and predict the broader societal impacts of AI on decision-making processes involving the generating of images, videos, and textual content.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"85 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalized Image Generation Through Swiping 通过轻扫生成个性化图像
Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31238
Yuto Nakashima
Generating preferred images from GANs is a challenging task due to the high-dimensional nature of latent space. In this study, we propose a novel approach that uses simple user-swipe interactions to generate preferred images from users. To effectively explore the latent space with only swipe interactions, we apply principal component analysis to the latent space of StyleGAN, creating meaningful subspaces. Additionally, we use a multi-armed bandit algorithm to decide which dimensions to explore, focusing on the user's preferences. Our experiments show that our method is more efficient in generating preferred images than the baseline.
由于潜在空间的高维特性,从 GAN 生成首选图像是一项具有挑战性的任务。在本研究中,我们提出了一种新方法,利用简单的用户滑动交互从用户生成首选图片。为了有效地利用刷卡交互探索潜在空间,我们对 StyleGAN 的潜在空间进行了主成分分析,从而创建了有意义的子空间。此外,我们还使用多臂匪徒算法来决定探索哪些维度,重点关注用户的偏好。实验表明,我们的方法在生成首选图片方面比基线方法更有效。
{"title":"Personalized Image Generation Through Swiping","authors":"Yuto Nakashima","doi":"10.1609/aaaiss.v3i1.31238","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31238","url":null,"abstract":"Generating preferred images from GANs is a challenging task due to the high-dimensional nature of latent space. In this study, we propose a novel approach that uses simple user-swipe interactions to generate preferred images from users. To effectively explore the latent space with only swipe interactions, we apply principal component analysis to the latent space of StyleGAN, creating meaningful subspaces. Additionally, we use a multi-armed bandit algorithm to decide which dimensions to explore, focusing on the user's preferences. Our experiments show that our method is more efficient in generating preferred images than the baseline.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"7 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the AAAI Symposium Series
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1