Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31249
Melanie Swan, Takashi Kido, Eric Roland, Renato P. Dos Santos
Health Agents are introduced as a personalized AI health advisor overlay for continuous health monitoring (e.g., 1000x/minute) via medical-grade smartwatches and wearables, enabling “healthcare by app” instead of “sickcare by appointment.” Individuals can customize the level of detail in the information they view. Health Agents “speak” natural language to humans and formal language to the computational infrastructure, possibly outputting the mathematics of personalized homeostatic health as part of their reinforcement learning agent behavior. As an AI health interface, the agent facilitates the management of precision medicine as a service. Healthy longevity is a high-profile area characterized by the increasing acceptance of medical intervention, longevity biotech venture capital investment, and global priority, as 2 billion people will be over 65 by 2050. Aging hallmarks, biomarkers, and clocks provide quantitative measures for intervention. Leading interventions include metformin, rapamycin, spermidine, NAD+/sirtuins, alpha-ketoglutarate, and taurine. AI-driven digital biology, longevity medicine, and Web3 personalized healthcare come together in the idea of Health Agents. This Web3 genAI tool for automated health management, specifically via digital-biological twins and pathway2vec approaches, demonstrates human-AI intelligence amplification and works toward healthy longevity for global well-being.
{"title":"AI Health Agents: Pathway2vec, ReflectE, Category Theory, and Longevity","authors":"Melanie Swan, Takashi Kido, Eric Roland, Renato P. Dos Santos","doi":"10.1609/aaaiss.v3i1.31249","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31249","url":null,"abstract":"Health Agents are introduced as the concept of a personalized AI health advisor overlay for continuous health monitoring (e.g. 1000x/minute) medical-grade smartwatches and wearables for “healthcare by app” instead of “sickcare by appointment.” Individuals can customize the level of detail in the information they view. Health Agents “speak” natural language to humans and formal language to the computational infrastructure, possibly outputting the mathematics of personalized homeostatic health as part of their reinforcement learning agent behavior. As an AI health interface, the agent facilitates the management of precision medicine as a service. Healthy longevity is a high-profile area characterized by the increasing acceptance of medical intervention, longevity biotech venture capital investment, and global priority as 2 billion people will be over 65 in 2050. Aging hallmarks, biomarkers, and clocks provide a quantitative measure for intervention. Some of the leading interventions include metformin, rapamycin, spermidine, NAD+/sirtuins, alpha-ketoglutarate, and taurine. AI-driven digital biology, longevity medicine, and Web3 personalized healthcare come together in the idea of Health Agents. This Web3 genAI tool for automated health management, specifically via digital-biological twins and pathway2vec approaches, demonstrates human-AI intelligence amplification and works towards healthy longevity for global well-being.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"33 17","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31206
J. Scheuerman, Dina M. Acklin
Many behavioral science studies produce large amounts of unstructured data that are costly to code and analyze, requiring multiple reviewers to agree on systematically chosen concepts and themes in order to categorize responses. Large language models (LLMs) have the potential to support this work, demonstrating capabilities for categorizing, summarizing, and otherwise organizing unstructured data. In this paper, we note that although LLMs have the potential to save time and resources when coding qualitative data, the implications for behavioral science research are not yet well understood. Model bias and inaccuracies, reliability, and lack of domain knowledge all necessitate continued human guidance. New methods and interfaces must be developed to enable behavioral science researchers to efficiently and systematically categorize unstructured data together with LLMs. We propose a framework for incorporating human feedback into an annotation workflow, leveraging interactive machine learning to provide oversight while improving a language model's predictions over time.
{"title":"A Framework for Enhancing Behavioral Science Research with Human-Guided Language Models","authors":"J. Scheuerman, Dina M. Acklin","doi":"10.1609/aaaiss.v3i1.31206","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31206","url":null,"abstract":"Many behavioral science studies result in large amounts of unstructured data sets that are costly to code and analyze, requiring multiple reviewers to agree on systematically chosen concepts and themes to categorize responses. Large language models (LLMs) have potential to support this work, demonstrating capabilities for categorizing, summarizing, and otherwise organizing unstructured data. In this paper, we consider that although LLMs have the potential to save time and resources performing coding on qualitative data, the implications for behavioral science research are not yet well understood. Model bias and inaccuracies, reliability, and lack of domain knowledge all necessitate continued human guidance. New methods and interfaces must be developed to enable behavioral science researchers to efficiently and systematically categorize unstructured data together with LLMs. We propose a framework for incorporating human feedback into an annotation workflow, leveraging interactive machine learning to provide oversight while improving a language model's predictions over time.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"90 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31230
Christina Alexandris
It is widely accepted that processing very large amounts of data with state-of-the-art Natural Language Processing (NLP) practices (i.e., Machine Learning (ML) and language-agnostic approaches) has resulted in a dramatic improvement in the speed and efficiency of systems and applications. However, these developments are accompanied by several challenges and difficulties that have been voiced in recent years. Specifically, in regard to NLP, the evident improvement in the speed and efficiency of systems and applications with GenAI also entails aspects that may be problematic, especially where particular text types, languages and/or user groups are concerned.

State-of-the-art NLP approaches with automated processing of vast amounts of data in GenAI are related to the observed problematic Aspects 1-7, namely (1) Underrepresentation and (2) Standardization. These result in (3) Barriers in Text Understanding, (4) Discouragement of HCI Usage for Special Text Types and/or User Groups, (5) Barriers in Accessing Information, (6) Likelihood of Errors and False Assumptions, and (7) Difficulties in Error Detection and Recovery. Additional problems are typical cases such as less-resourced languages (A), less experienced users (B), and less agile users (C).

A hybrid approach involving the re-introduction and integration of traditional concepts into state-of-the-art processing approaches, whether automatic or interactive, concerns the following targets: (i) making more types of information accessible to more types of recipients and user groups, (ii) making more types of services accessible and user-friendly to more types of user groups, and (iii) making more types of feelings, opinions, voices and reactions from more types of user groups visible.

Specifically, in the above-presented cases, traditional and classical theories, principles and models are re-introduced and can be integrated into state-of-the-art data-driven approaches involving Machine Learning and neural networks, functioning as training data and seed data in Natural Language Processing applications where user requirements and customization are of particular interest and importance. A hybrid approach may be considered a compromise between speed and correctness / user-friendliness in (types of) NLP applications where achieving this balance plays a crucial role. In other words, the hybrid approach and the examples presented here aim to prevent mechanisms from adopting human biases, ensuring fairness, socially responsible outcomes, and responsible Social Media. They also aim to customize content for different linguistic and cultural groups, ensuring equitable information distribution.

Here, we present characteristic examples with cases employing the re-introduction of four typical types of traditional concepts concerning classical theories, principles and models. These four typical classical theories, principl
{"title":"GenAI and Socially Responsible AI in Natural Language Processing Applications: A Linguistic Perspective","authors":"Christina Alexandris","doi":"10.1609/aaaiss.v3i1.31230","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31230","url":null,"abstract":"It is a widely-accepted fact that the processing of very large amounts of data with state-of-the-art Natural Language Processing (NLP) practices (i.e. Machine Learning –ML, language agnostic approaches) has resulted to a dramatic improvement in the speed and efficiency of systems and applications. However, these developments are accompanied with several challenges and difficulties that have been voiced within the last years. Specifically, in regard to NLP, evident improvement in the speed and efficiency of systems and applications with GenAI also entails some aspects that may be problematic, especially when particular text types, languages and/or user groups are concerned.\u0000State-of-the-art NLP approaches with automated processing of vast amounts of data in GenAI are related to observed problematic Aspects 1-7, namely: (1) Underrepresentation, (2) Standardization. These result to (3) Barriers in Text Understanding, (4) Discouragement of HCI Usage for Special Text Types and/or User Groups, (5) Barriers in Accessing Information, (6) Likelihood of Errors and False Assumptions and (7) Difficulties in Error Detection and Recovery. An additional problem are typical cases, such as less-resourced languages (A), less experienced users (B) and less agile users (C). \u0000A hybrid approach involving the re-introduction and integration of traditional concepts in state-of-the-art processing approaches, whether they are automatic or interactive, concerns the following targets:\u0000i), (ii) and (iii): Making more types of information accessible to more types of recipients and user groups (i), Making more types of services accessible and user-friendly to more types of user groups (ii), Making more types of feelings, opinions, voices and reactions visible from more types of user groups (iii)\u0000Specifically, in the above-presented cases traditional and classical theories, principles and models are re-introduced and can be integrated into state-of-the art data-driven approaches involving Machine Learning and neural networks, functioning as training data and seed data in Natural Language Processing applications where user requirements and customization are of particular interest and importance. A hybrid approach may be considered a compromise between speed and correctness / userfriendliness in (types of) NLP applications where the achievement of this balance plays a crucial role. In other words, a hybrid approach and the examples presented here target to prevent mechanisms from adopting human biases, ensuring fairness and socially responsible outcome and responsible Social Media. A hybrid approach and the examples presented here also target to customizing content to different linguistic and cultural groups, ensuring equitable information distribution. \u0000Here, we present characteristic examples with cases employing the re-introduction of four typical types of traditional concepts concerning classical theories, principles and models. 
These four typical classical theories, principl","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"71 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141123250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31288
Jonathan C.H. Tong, Yung-Fong Hsu, C. Liau
This short paper is a status report on a project in progress. We aim to model human-like agents' decision-making behavior under risk with a neural-symbolic approach. Our model integrates the learning, reasoning, and emotional aspects of an agent and takes dual-process thinking into consideration when the agent makes a decision. The model construction is based on real behavioral and brain-imaging data collected in a lottery gambling experiment. We present the model architecture, including its main modules and the interactions between them.
{"title":"An Exploring Study on Building Affective Artificial Intelligence by Neural-Symbolic Computing (Extended Abstract)","authors":"Jonathan C.H. Tong, Yung-Fong Hsu, C. Liau","doi":"10.1609/aaaiss.v3i1.31288","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31288","url":null,"abstract":"This short paper is the status report of a project in progress. We aim to model human-like agents' decision-making behaviors under risks with neural-symbolic approach. Our model integrates the learning, reasoning, and emotional aspects of an agent and takes the dual process thinking into consideration when the agent is making a decision. The model construction is based on real behavioral and brain imaging data collected in a lottery gambling experiment. We present the model architecture including its main modules and the interactions between them.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"61 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31229
Lun Wang
In this talk, we will discuss how to make federated learning secure for the server and private for the clients simultaneously. Most prior efforts fall into one of two categories. At one end of the spectrum, some work uses techniques like secure aggregation to hide the individual clients’ updates and only reveal the aggregated global update to a malicious server that strives to infer the clients’ private data from their updates. At the other end of the spectrum, some work uses Byzantine-robust FL protocols to suppress the influence of malicious clients’ updates. We present a protocol that offers a bidirectional defense, simultaneously combating a malicious centralized server and Byzantine malicious clients. Our protocol also improves the dimension dependence and achieves a near-optimal statistical rate in strongly convex cases.
{"title":"Reconciling Privacy and Byzantine-robustness in Federated Learning","authors":"Lun Wang","doi":"10.1609/aaaiss.v3i1.31229","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31229","url":null,"abstract":"In this talk, we will discuss how to make federated learning\u0000secure for the server and private for the clients simultaneously.\u0000Most prior efforts fall into either of the two categories.\u0000At one end of the spectrum, some work uses techniques\u0000like secure aggregation to hide the individual client’s\u0000updates and only reveal the aggregated global update to a\u0000malicious server that strives to infer the clients’ privacy from\u0000their updates. At the other end of the spectrum, some work\u0000uses Byzantine-robust FL protocols to suppress the influence\u0000of malicious clients’ updates. We present a protocol that offers\u0000bidirectional defense to simultaneously combat against\u0000the malicious centralized server and Byzantine malicious\u0000clients. Our protocol also improves the dimension dependence\u0000and achieve a near-optimal statistical rate for strongly\u0000convex cases.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"60 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141123417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31265
Brian Plancher, Sebastian Buttrich, Jeremy Ellis, Neena Goveas, Laila Kazimierski, Jesus Lopez Sotelo, Milan Lukic, Diego Mendez, Rosdiadee Nordin, Andres Oliva Trevisan, Massimo Pavan, Manuel Roveri, Marcus Rüb, Jackline Tum, Marian Verhelst, Salah Abdeljabar, Segun Adebayo, Thomas Amberg, H. Aworinde, José Bagur, Gregg Barrett, Nabil Benamar, Bharat Chaudhari, Ronald Criollo, David Cuartielles, J. A. Ferreira Filho, Solomon Gizaw, Evgeni Gousev, Alessandro Grande, Shawn Hymel, Peter Ing, Prashant Manandhar, Pietro Manzoni, Boris Murmann, Eric Pan, R. Paskauskas, Ermanno Pietrosemoli, Tales Pimenta, Marcelo Rovai, Marco Zennaro, Vijay Janapa Reddi
Embedded machine learning (ML) on low-power devices, also known as "TinyML," enables intelligent applications on accessible hardware and fosters collaboration across disciplines to solve real-world problems. Its interdisciplinary and practical nature makes embedded ML education appealing, but barriers remain that limit its accessibility, especially in developing countries. Challenges include limited open-source software, courseware, models, and datasets that can be used with globally accessible heterogeneous hardware. Our vision is that with concerted effort and partnerships between industry and academia, we can overcome such challenges and enable embedded ML education to empower developers and researchers worldwide to build locally relevant AI solutions on low-cost hardware, increasing diversity and sustainability in the field. Towards this aim, we document efforts made by the TinyML4D community to scale embedded ML education globally through open-source curricula and introductory workshops co-created by international educators. We conclude with calls to action to further develop modular and inclusive resources and transform embedded ML into a truly global gateway to embedded AI skills development.
{"title":"TinyML4D: Scaling Embedded Machine Learning Education in the Developing World","authors":"Brian Plancher, Sebastian Buttrich, Jeremy Ellis, Neena Goveas, Laila Kazimierski, Jesus Lopez Sotelo, Milan Lukic, Diego Mendez, Rosdiadee Nordin, Andres Oliva Trevisan, Massimo Pavan, Manuel Roveri, Marcus Rüb, Jackline Tum, Marian Verhelst, Salah Abdeljabar, Segun Adebayo, Thomas Amberg, H. Aworinde, José Bagur, Gregg Barrett, Nabil Benamar, Bharat Chaudhari, Ronald Criollo, David Cuartielles, J. A. Ferreira Filho, Solomon Gizaw, Evgeni Gousev, Alessandro Grande, Shawn Hymel, Peter Ing, Prashant Manandhar, Pietro Manzoni, Boris Murmann, Eric Pan, R. Paskauskas, Ermanno Pietrosemoli, Tales Pimenta, Marcelo Rovai, Marco Zennaro, Vijay Janapa Reddi","doi":"10.1609/aaaiss.v3i1.31265","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31265","url":null,"abstract":"Embedded machine learning (ML) on low-power devices, also known as \"TinyML,\" enables intelligent applications on accessible hardware and fosters collaboration across disciplines to solve real-world problems. Its interdisciplinary and practical nature makes embedded ML education appealing, but barriers remain that limit its accessibility, especially in developing countries. Challenges include limited open-source software, courseware, models, and datasets that can be used with globally accessible heterogeneous hardware. Our vision is that with concerted effort and partnerships between industry and academia, we can overcome such challenges and enable embedded ML education to empower developers and researchers worldwide to build locally relevant AI solutions on low-cost hardware, increasing diversity and sustainability in the field. Towards this aim, we document efforts made by the TinyML4D community to scale embedded ML education globally through open-source curricula and introductory workshops co-created by international educators. We conclude with calls to action to further develop modular and inclusive resources and transform embedded ML into a truly global gateway to embedded AI skills development.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"41 22","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31289
Daniel Weitekamp
Human brains have many differently functioning regions that play specialized roles in learning. By contrast, methods for training artificial neural networks, such as reinforcement learning, typically learn exclusively via a single mechanism: gradient descent. This raises the question: might human learners' advantage in learning efficiency over deep learning be attributed to the interplay between multiple specialized mechanisms of learning? In this work we review a series of simulated learner systems built with the aim of modeling human students' inductive learning as they practice STEM procedural tasks. In comparison to modern deep-learning-based methods, which train on thousands to millions of examples to acquire passing performance, these simulated learners match human performance curves, achieving passing levels of performance within about a dozen practice opportunities. We investigate this impressive learning efficiency via an ablation analysis. Beginning with end-to-end reinforcement learning (1-mechanism), we decompose learning systems incrementally to construct the 3-mechanism inductive learning characteristic of prior simulated learners such as Sierra, SimStudent, and the Apprentice Learner Architecture. Our analysis shows that learning decomposition plays a significant role in achieving data-efficient learning on par with human learners, a greater role even than simple distinctions between symbolic and subsymbolic learning. Finally, we highlight how this breakdown of learning mechanisms can flexibly incorporate diverse forms of natural language and interface-grounded instruction, and we discuss opportunities for using these flexible learning capabilities in interactive task learning systems that learn directly from a user's natural instruction.
{"title":"Decomposed Inductive Procedure Learning: Learning Academic Tasks with Human-Like Data Efficiency","authors":"Daniel Weitekamp","doi":"10.1609/aaaiss.v3i1.31289","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31289","url":null,"abstract":"Human brains have many differently functioning regions which play specialized roles in learning. By contrast, methods for training artificial neural networks, such as reinforcement-learning, typically learn exclusively via a single mechanism: gradient descent. This raises the question: might human learners’ advantage in learning efficiency over deep-learning be attributed to the interplay between multiple specialized mechanisms of learning? In this work we review a series of simulated learner systems which have been built with the aim of modeling human student’s inductive learning as they practice STEM procedural tasks. By comparison to modern deep-learning based methods which train on thousands to millions of examples to acquire passing performance capabilities, these simulated learners match human performance curves---achieving passing levels of performance within about a dozen practice opportunities. We investigate this impressive learning efficiency via an ablation analysis. Beginning with end-to-end reinforcement learning (1-mechanism), we decompose learning systems incrementally to construct the 3-mechanism inductive learning characteristic of prior simulated learners such as Sierra, SimStudent and the Apprentice Learner Architecture. Our analysis shows that learning decomposition plays a significant role in achieving data-efficient learning on par with human learners---a greater role even than simple distinctions between symbolic/subsymbolic learning. Finally we highlight how this breakdown in learning mechanisms can flexibly incorporate diverse forms of natural language and interface grounded instruction, and discuss opportunities for using these flexible learning capabilities in interactive task learning systems that learn directly from a user’s natural instruction.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"72 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31257
Takahiro Yonekawa, Hiroko Yamano, Ichiro Sakata
Interactive generative AI can be used in software programming to generate code of sufficient quality. Software developers can utilize the output code of generative AI as well as website resources from search engine results. In this research, we present a framework for defining states of programming activity and for capturing the actions of developers as a time series. We also describe a scheme for analyzing the thought processes of software developers by using a graph structure to describe state transitions. By applying these means, we show that it is feasible to analyze the effects of changes in the development environment on programming activities.
{"title":"An Analysis Method for the Impact of GenAI Code Suggestions on Software Engineers’ Thought Processes","authors":"Takahiro Yonekawa, Hiroko Yamano, Ichiro Sakata","doi":"10.1609/aaaiss.v3i1.31257","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31257","url":null,"abstract":"Interactive generative AI can be used in software programming to generate sufficient quality of code. Software developers can utilize the output code of generative AI as well as website resources from search engine results. In this research, we present a framework for defining states of programming activity and for capturing the actions of developers in a time series.\u0000We also describe a scheme for analyzing the thought process of software developers by using a graph structure to describe state transitions. By applying these means, we showed that it is feasible to analyze the effects of changes in the development environment on programming activities.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"2 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31235
Kazuhiro Ito, Junko Hayashi, Shoko Wakamiya, Masae Manabe, Yasushi Watanabe, Masataka Nakayama, Yukiko Uchida, E. Aramaki
Although well-being is helpful for measuring the state of society from various perspectives, past research has been limited in that (1) questionnaire surveys make it difficult to target a large number of people, and (2) the major indices focus on individual factors and do not incorporate group factors. To tackle these issues, we collected daily reports from company employees that included text, their individual subjective well-being, and team subjective well-being. Using the collected data, we constructed a well-being estimation model based on a Large Language Model and examined an indicator called the "sharedness index", a state of the team that influences individual well-being, measured using both score- and text-based methods.
{"title":"Engineering Approach to Explore Language Reflecting Well-Being","authors":"Kazuhiro Ito, Junko Hayashi, Shoko Wakamiya, Masae Manabe, Yasushi Watanabe, Masataka Nakayama, Yukiko Uchida, E. Aramaki","doi":"10.1609/aaaiss.v3i1.31235","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31235","url":null,"abstract":"Although well-being is helpful in measuring the state of society from various perspectives, past research has been limited to (1) questionnaire surveys, which make it difficult to target a large number of people, and (2) the major indices focus on individual factors and do not incorporate group factors. To tackle these issues, we collected daily reports from the company employees that included text, their individual subjective well-being, and team subjective well-being. By using the collected data, we constructed a well-being estimation model based on the Large Language Model and examined an indicator called ``sharedness index'', as a state of the team that influences an individual well-being, measured using both score- and text-based methods.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"67 14","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31240
Mark Nuppnau, Khalid Kattan, R. G. Reynolds
This study explores the integration of cultural algorithms (CA) with the Policy Gradients with Parameter-Based Exploration (PGPE) algorithm for the task of MNIST handwritten digit classification within the EvoJAX framework. The PGPE algorithm is enhanced by incorporating a belief space, consisting of Domain, Situational, and History knowledge sources (KSs), to guide the search process and improve convergence speed. The PGPE algorithm, implemented within the EvoJAX framework, can efficiently find an optimal parameter-space policy for the MNIST task. However, increasing the complexity of the task and policy space, such as with the CheXpert dataset and DenseNet, requires a more sophisticated approach to efficiently navigate the search space. We introduce CA-PGPE, a novel approach that integrates CA with PGPE to guide the search process and improve convergence speed. Future work will focus on incorporating exploratory knowledge sources and on evaluating the enhanced CA-PGPE algorithm on more complex datasets and model architectures, such as CIFAR-10 and CheXpert with DenseNet.
{"title":"Cultural Algorithm Guided Policy Gradient with Parameter Exploration","authors":"Mark Nuppnau, Khalid Kattan, R. G. Reynolds","doi":"10.1609/aaaiss.v3i1.31240","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31240","url":null,"abstract":"This study explores the integration of cultural algorithms (CA) with the Policy Gradients with Parameter-Based Exploration (PGPE) algorithm for the task of MNIST hand-written digit classification within the EvoJAX framework. The PGPE algorithm is enhanced by incorporating a belief space, consisting on Domain, Situational, and History knowledge sources (KS), to guide the search process and improve convergence speed. The PGPE algorithm, implemented within the EvoJAX framework, can efficiently find an optimal parameter-space policy for the MNIST task. However, increasing the complexity of the task and policy space, such as the CheXpert dataset and DenseNet, requires a more sophisticated approach to efficiently navigate the search space. We introduce CA-PGPE, a novel approach that integrates CA with PGPE to guide the search process and improve convergence speed. Future work will focus on incorporating exploratory knowledge sources and evaluate the enhanced CA-PGPE algorithm on more complex datasets and model architectures, such as CIFAR-10 and CheXpert with DenseNet.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"26 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}