Proceedings of the AAAI Symposium Series最新文献_第5页

Exploring Alternative Approaches to Language Modeling for Learning from Data and Knowledge 探索从数据和知识中学习的语言建模替代方法

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31211

Yuxin Zi, Kaushik Roy, Vignesh Narayanan, Amit Sheth

Despite their extensive application in language understanding tasks, large language models (LLMs) still encounter challenges including hallucinations - occasional fabrication of information - and alignment issues - lack of associations with human-curated world models (e.g., intuitive physics or common-sense knowledge). Moreover, the black-box nature of LLMs presents significant obstacles in training them effectively to achieve desired behaviors. In particular, modifying the concept embedding spaces of LLMs can be highly intractable. This process involves analyzing the implicit impact of such adjustments on the myriad parameters within LLMs and the resulting inductive biases. We propose a novel architecture that wraps powerful function approximation architectures within an outer, interpretable read-out layer. This read-out layer can be scrutinized to explicitly observe the effects of concept modeling during the training of the LLM. Our method stands in contrast with gradient-based implicit mechanisms, which depend solely on adjustments to the LLM parameters and thus evade scrutiny. By conducting extensive experiments across both generative and discriminative language modeling tasks, we evaluate the capabilities of our proposed architecture relative to state-of-the-art LLMs of similar sizes. Additionally, we offer a qualitative examination of the interpretable read-out layer and visualize the concepts it captures. The results demonstrate the potential of our approach for effectively controlling LLM hallucinations and enhancing the alignment with human expectations.

尽管大型语言模型（LLMs）在语言理解任务中得到了广泛应用，但它们仍然面临着各种挑战，其中包括幻觉--偶尔编造信息，以及对齐问题--与人类设定的世界模型（如直观物理或常识性知识）缺乏关联。此外，LLMs 的黑箱性质也给有效训练它们以实现理想行为带来了巨大障碍。特别是，修改 LLM 的概念嵌入空间可能非常困难。这一过程涉及分析此类调整对 LLM 内无数参数的隐含影响以及由此产生的归纳偏差。我们提出了一种新颖的架构，将功能强大的函数近似架构封装在一个可解释的外层读出层中。在 LLM 的训练过程中，可以通过仔细检查读出层来明确观察概念建模的效果。我们的方法与基于梯度的隐式机制形成了鲜明对比，后者完全依赖于对 LLM 参数的调整，因此无法进行仔细检查。通过在生成性和鉴别性语言建模任务中进行广泛的实验，我们评估了我们提出的架构相对于类似规模的最先进 LLM 的能力。此外，我们还对可解释读出层进行了定性检查，并对其捕捉的概念进行了可视化。结果表明，我们的方法具有有效控制 LLM 幻觉和增强与人类期望一致性的潜力。

{"title":"Exploring Alternative Approaches to Language Modeling for Learning from Data and Knowledge","authors":"Yuxin Zi, Kaushik Roy, Vignesh Narayanan, Amit Sheth","doi":"10.1609/aaaiss.v3i1.31211","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31211","url":null,"abstract":"Despite their extensive application in language understanding tasks, large language models (LLMs) still encounter challenges including hallucinations - occasional fabrication of information - and alignment issues - lack of associations with human-curated world models (e.g., intuitive physics or common-sense knowledge). Moreover, the black-box nature of LLMs presents significant obstacles in training them effectively to achieve desired behaviors. In particular, modifying the concept embedding spaces of LLMs can be highly intractable. This process involves analyzing the implicit impact of such adjustments on the myriad parameters within LLMs and the resulting inductive biases. We propose a novel architecture that wraps powerful function approximation architectures within an outer, interpretable read-out layer. This read-out layer can be scrutinized to explicitly observe the effects of concept modeling during the training of the LLM. Our method stands in contrast with gradient-based implicit mechanisms, which depend solely on adjustments to the LLM parameters and thus evade scrutiny. By conducting extensive experiments across both generative and discriminative language modeling tasks, we evaluate the capabilities of our proposed architecture relative to state-of-the-art LLMs of similar sizes. Additionally, we offer a qualitative examination of the interpretable read-out layer and visualize the concepts it captures. The results demonstrate the potential of our approach for effectively controlling LLM hallucinations and enhancing the alignment with human expectations.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"90 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Task-driven Risk-bounded Hierarchical Reinforcement Learning Based on Iterative Refinement 基于迭代改进的任务驱动型风险约束分层强化学习

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31281

Viraj Parimi, Sungkweon Hong, Brian Williams

Deep Reinforcement Learning (DRL) has garnered substantial acclaim for its versatility and widespread applications across diverse domains. Aligned with human-like learning, DRL is grounded in the fundamental principle of learning from interaction, wherein agents dynamically adjust behavior based on environmental feedback in the form of rewards. This iterative trial-and-error process, mirroring human learning, underscores the importance of observation, experimentation, and feedback in shaping understanding and behavior. DRL agents, trained to navigate complex surroundings, refine their knowledge through hierarchical and abstract representations, empowered by deep neural networks. These representations enable efficient handling of long-horizon tasks and flexible adaptation to novel situations, akin to the human ability to construct mental models for comprehending complex concepts and predicting outcomes. Hence, abstract representation building emerges as a critical aspect in the learning processes of both artificial agents and human learners, particularly in long-horizon tasks.Furthermore, human decision-making, deeply rooted in evolutionary history, exhibits a remarkable capacity to balance the tradeoff between risk and cost across various domains. This cognitive process involves assessing potential negative consequences, evaluating factors such as the likelihood of adverse outcomes, severity of potential harm, and overall uncertainty. Humans intuitively gauge inherent risks and adeptly weigh associated costs, extending beyond monetary expenses to include time, effort, and opportunity costs. The nuanced ability of humans to consider the tradeoff between risk and cost highlights the complexity and adaptability of human decision-making, a skill lacking in typical DRL agents. Principles like these derived from human-like learning present an avenue for inspiring advancements in DRL, fostering the development of more adaptive and intelligent artificial agents.Motivated by these observations and focusing on practical challenges in robotics, our efforts target risk-aware stochastic sequential decision-making problem which is crucial for tasks with extended time frames and varied strategies. A novel integration of model-based conditional planning with DRL is proposed, inspired by hierarchical techniques. This approach breaks down complex tasks into manageable subtasks(motion primitives), ensuring safety constraints and informed decision-making. Unlike existing methods, our approach addresses motion primitive improvement iteratively, employing diverse prioritization functions to guide the search process effectively. This risk-bounded planning algorithm seamlessly integrates conditional planning and motion primitive learning, prioritizing computational efforts for enhanced efficiency within specified time limits.

深度强化学习（DRL）因其多功能性和在不同领域的广泛应用而备受赞誉。DRL 与类人学习相一致，立足于从互动中学习的基本原则，即代理根据奖励形式的环境反馈动态调整行为。这种迭代试错过程与人类的学习过程如出一辙，强调了观察、实验和反馈在形成理解和行为方面的重要性。接受过复杂环境导航训练的 DRL 代理，在深度神经网络的支持下，通过分层和抽象的表征来完善自己的知识。这些表征能够高效处理长远任务，灵活适应新情况，类似于人类构建心智模型以理解复杂概念和预测结果的能力。因此，抽象表征的构建成为人工代理和人类学习者学习过程中的一个关键环节，尤其是在长视距任务中。此外，人类决策深深植根于进化史，在平衡各领域风险与成本之间的权衡方面表现出非凡的能力。这一认知过程包括评估潜在的负面后果，评价不利结果的可能性、潜在伤害的严重程度以及总体不确定性等因素。人类能够直观地衡量固有风险，并善于权衡相关成本，这些成本不仅包括金钱支出，还包括时间、精力和机会成本。人类考虑风险与成本之间权衡的细致能力，凸显了人类决策的复杂性和适应性，而这正是典型的 DRL 代理所缺乏的技能。从类似人类的学习中得出的这些原则为激励 DRL 的进步提供了一条途径，促进了更具适应性和智能性的人工代理的发展。受这些观察结果的启发，并着眼于机器人技术中的实际挑战，我们的工作以风险意识随机顺序决策问题为目标，这对于具有较长时限和多种策略的任务至关重要。受分层技术的启发，我们提出了一种基于模型的条件规划与 DRL 的新型集成方法。这种方法将复杂任务分解为易于管理的子任务（运动基元），确保安全约束和知情决策。与现有方法不同的是，我们的方法通过迭代改进运动基元，采用不同的优先级函数来有效指导搜索过程。这种有风险限制的规划算法将条件规划和运动基元学习完美地结合在一起，在规定的时间限制内对计算工作进行优先排序，以提高效率。

{"title":"Task-driven Risk-bounded Hierarchical Reinforcement Learning Based on Iterative Refinement","authors":"Viraj Parimi, Sungkweon Hong, Brian Williams","doi":"10.1609/aaaiss.v3i1.31281","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31281","url":null,"abstract":"Deep Reinforcement Learning (DRL) has garnered substantial acclaim for its versatility and widespread applications across diverse domains. Aligned with human-like learning, DRL is grounded in the fundamental principle of learning from interaction, wherein agents dynamically adjust behavior based on environmental feedback in the form of rewards. This iterative trial-and-error process, mirroring human learning, underscores the importance of observation, experimentation, and feedback in shaping understanding and behavior. DRL agents, trained to navigate complex surroundings, refine their knowledge through hierarchical and abstract representations, empowered by deep neural networks. These representations enable efficient handling of long-horizon tasks and flexible adaptation to novel situations, akin to the human ability to construct mental models for comprehending complex concepts and predicting outcomes. Hence, abstract representation building emerges as a critical aspect in the learning processes of both artificial agents and human learners, particularly in long-horizon tasks.\u0000\u0000Furthermore, human decision-making, deeply rooted in evolutionary history, exhibits a remarkable capacity to balance the tradeoff between risk and cost across various domains. This cognitive process involves assessing potential negative consequences, evaluating factors such as the likelihood of adverse outcomes, severity of potential harm, and overall uncertainty. Humans intuitively gauge inherent risks and adeptly weigh associated costs, extending beyond monetary expenses to include time, effort, and opportunity costs. The nuanced ability of humans to consider the tradeoff between risk and cost highlights the complexity and adaptability of human decision-making, a skill lacking in typical DRL agents. Principles like these derived from human-like learning present an avenue for inspiring advancements in DRL, fostering the development of more adaptive and intelligent artificial agents.\u0000\u0000Motivated by these observations and focusing on practical challenges in robotics, our efforts target risk-aware stochastic sequential decision-making problem which is crucial for tasks with extended time frames and varied strategies. A novel integration of model-based conditional planning with DRL is proposed, inspired by hierarchical techniques. This approach breaks down complex tasks into manageable subtasks(motion primitives), ensuring safety constraints and informed decision-making. Unlike existing methods, our approach addresses motion primitive improvement iteratively, employing diverse prioritization functions to guide the search process effectively. This risk-bounded planning algorithm seamlessly integrates conditional planning and motion primitive learning, prioritizing computational efforts for enhanced efficiency within specified time limits.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"13 18","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141119776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Building Communication Efficient Asynchronous Peer-to-Peer Federated LLMs with Blockchain 利用区块链构建通信效率高的异步点对点联合 LLM

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31212

Sree Bhargavi Balija, Amitash Nanda, Debashis Sahoo

Large language models (LLM) have gathered attention with the advent of ChatGPT. However, developing personalized LLM models faces challenges in real-world applications due to data scarcity and privacy concerns. Federated learning addresses these issues, providing collaborative training while preserving the client’s data. Although it has made significant progress, federated learning still faces ongoing challenges, such as communication efficiency, heterogeneous data, and privacy-preserving methods. This paper presents a novel, fully decentralized federated learning framework for LLMs to address these challenges. We utilize different blockchain-federated LLM (BC-FL) algorithms, effectively balancing the trade-off between latency and accuracy in a decentralized-federated learning environment. Additionally, we address the challenge of communication overhead in peer-to-peer networks by optimizing the path for weight transfer and mitigating node anomalies. We conducted experiments to evaluate memory usage and latency in server and serverless environments. Our results demonstrate a decrease in latency by 5X and a 13% increase in accuracy for serverless cases. Comparisons between synchronous and asynchronous scenarios revealed a 76% reduction in information passing time for the latter. The PageRank method is most efficient in eliminating anomalous nodes for better performance of the global federated LLM model. The code is available on GitHub (https://github.com/Sreebhargavibalijaa/Federated_finetuning_LLM-s_p2p_environment)

随着 ChatGPT 的出现，大型语言模型（LLM）备受关注。然而，由于数据稀缺和隐私问题，开发个性化 LLM 模型在实际应用中面临挑战。联盟学习可以解决这些问题，在保留客户数据的同时提供协作训练。尽管联合学习已经取得了重大进展，但它仍然面临着持续的挑战，如通信效率、异构数据和隐私保护方法。本文提出了一种新颖的、完全去中心化的 LLM 联合学习框架，以应对这些挑战。我们利用不同的区块链联合 LLM（BC-FL）算法，有效地平衡了去中心化联合学习环境中延迟和准确性之间的权衡。此外，我们还通过优化权重传输路径和缓解节点异常来应对点对点网络中的通信开销挑战。我们进行了实验，以评估服务器和无服务器环境中的内存使用情况和延迟。结果表明，在无服务器情况下，延迟降低了 5 倍，准确率提高了 13%。同步和异步场景之间的比较显示，后者的信息传递时间减少了 76%。PageRank 方法能最有效地消除异常节点，从而提高全局联合 LLM 模型的性能。代码可在 GitHub 上获取 (https://github.com/Sreebhargavibalijaa/Federated_finetuning_LLM-s_p2p_environment)

{"title":"Building Communication Efficient Asynchronous Peer-to-Peer Federated LLMs with Blockchain","authors":"Sree Bhargavi Balija, Amitash Nanda, Debashis Sahoo","doi":"10.1609/aaaiss.v3i1.31212","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31212","url":null,"abstract":"Large language models (LLM) have gathered attention with the advent of ChatGPT. However, developing personalized LLM models faces challenges in real-world applications due to data scarcity and privacy concerns. Federated learning addresses these issues, providing collaborative training while preserving the client’s data. Although it has made significant progress, federated learning still faces ongoing challenges, such as communication efficiency, heterogeneous data, and privacy-preserving methods. This paper presents a novel, fully decentralized federated learning framework for LLMs to address these challenges. We utilize different blockchain-federated LLM (BC-FL) algorithms, effectively balancing the trade-off between latency and accuracy in a decentralized-federated learning environment. Additionally, we address the challenge of communication overhead in peer-to-peer networks by optimizing the path for weight transfer and mitigating node anomalies. We conducted experiments to evaluate memory usage and latency in server and serverless environments. Our results demonstrate a decrease in latency by 5X and a 13% increase in accuracy for serverless cases. Comparisons between synchronous and asynchronous scenarios revealed a 76% reduction in information passing time for the latter. The PageRank method is most efficient in eliminating anomalous nodes for better performance of the global federated LLM model. The code is available on GitHub (https://github.com/Sreebhargavibalijaa/Federated_finetuning_LLM-s_p2p_environment)","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"7 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141119356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Confluence of Random Walks, Interacting Particle Systems, and Distributed Machine Learning: Federated Learning through Crawling over Networks 随机漫步、相互作用粒子系统和分布式机器学习的融合：通过网络爬行进行联盟学习

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31216

Seyyedali Hosseinalipour

In this work, we aim to unveil a new class of intermediate FL architectures between centralized and decentralized schemes called “FedCrawl.” FedCrawl takes advantage of benefits of D2D communications similar to decentralized schemes; however, it uses them in a nuanced way. FedCrawl is inspired by web crawlers, which effectively explore the websites to find updated/new content posted on the internet. The cornerstone of FedCrawl is its innovative conceptualization of neural networks (NNs) or other used ML models as autonomous entities, called random walkers, with the capability to move or jump across nodes in the network through peer-to-peer (P2P) or device-to-device (D2D) connections. We introduce five research aspects to study the nuanced intricacies governing random walker behavior in these environments. The first research aspect addresses the interplay between network topology and data distribution, emphasizing the importance of considering both factors for designing efficient random walks in FedCrawl. The second research aspect explores the applicability of node importance metrics inoptimizing random walker paths for FedCrawl. We propose a dynamic perception-aware design, discussed in the third research aspect, where transition matrices adapt to the evolving state of random walkers, balancing exploration and exploitation. The fourth research aspect introduces innovative features like skipping, memory look-back, and caching/trailing to enhance random walker performance. Lastly, the fifth research aspect delves into the dynamics of multiple random walkers in networked environments, introducing the concept of multi-pole random walkers. Complementing these five research aspects, we present five conjectures, each introducing novel perspectives and methodologies in the domain of decentralized learning. These conjectures encompass areas such as temperature-based characterization of random walkers and network nodes, dynamic transition matrices, non-Markovian processes, and an evolutionary framework for random walker patterns.

在这项工作中，我们旨在揭示一种介于集中式和分散式方案之间的新型中间 FL 架构，即 "FedCrawl"。FedCrawl 利用了与分散式方案类似的 D2D 通信优势，但它以一种细致入微的方式使用这些优势。FedCrawl 受到网络爬虫的启发，网络爬虫可以有效地探索网站，找到互联网上发布的更新/新内容。FedCrawl 的基石是将神经网络（NN）或其他使用过的 ML 模型创新性地概念化为自主实体（称为随机漫步者），它们能够通过点对点（P2P）或设备对设备（D2D）连接在网络节点间移动或跳跃。我们介绍了五个研究方面，以研究这些环境中支配随机漫步者行为的微妙复杂性。第一个研究方面涉及网络拓扑和数据分布之间的相互作用，强调在 FedCrawl 中设计高效随机行走时考虑这两个因素的重要性。第二个研究方面探讨了节点重要性度量在优化 FedCrawl 随机行走路径中的适用性。我们提出了一种动态感知设计，过渡矩阵可适应随机漫步者不断变化的状态，在探索和利用之间取得平衡。第四个研究方面引入了跳转、内存回溯和缓存/跟踪等创新功能，以提高随机行走器的性能。最后，第五个研究方面深入研究了网络环境中多个随机行走器的动态，引入了多极随机行走器的概念。作为这五个研究方面的补充，我们提出了五个猜想，每个猜想都在分散学习领域引入了新的视角和方法。这些猜想涵盖的领域包括：基于温度的随机漫步者和网络节点特征描述、动态转换矩阵、非马尔可夫过程以及随机漫步者模式的进化框架。

{"title":"Confluence of Random Walks, Interacting Particle Systems, and Distributed Machine Learning: Federated Learning through Crawling over Networks","authors":"Seyyedali Hosseinalipour","doi":"10.1609/aaaiss.v3i1.31216","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31216","url":null,"abstract":"In this work, we aim to unveil a new class of intermediate FL architectures between centralized and decentralized schemes called “FedCrawl.” FedCrawl takes advantage of benefits of D2D communications similar to decentralized schemes; however, it uses them in a nuanced way. FedCrawl is inspired by web crawlers, which effectively explore the websites to find updated/new content posted on the internet. The cornerstone of FedCrawl is its innovative conceptualization of neural networks (NNs) or other used ML models as autonomous entities, called random walkers, with the capability to move or jump across nodes in the network through peer-to-peer (P2P) or device-to-device (D2D) connections. We introduce five research aspects to study the nuanced intricacies governing random walker behavior in these environments. The first research aspect addresses the interplay between network topology and data distribution, emphasizing the importance of considering both factors for designing efficient random walks in FedCrawl. The second research aspect explores the applicability of node importance metrics in\u0000optimizing random walker paths for FedCrawl. We propose a dynamic perception-aware design, discussed in the third research aspect, where transition matrices adapt to the evolving state of random walkers, balancing exploration and exploitation. The fourth research aspect introduces innovative features like skipping, memory look-back, and caching/trailing to enhance random walker performance. Lastly, the fifth research aspect delves into the dynamics of multiple random walkers in networked environments, introducing the concept of multi-pole random walkers. Complementing these five research aspects, we present five conjectures, each introducing novel perspectives and methodologies in the domain of decentralized learning. These conjectures encompass areas such as temperature-based characterization of random walkers and network nodes, dynamic transition matrices, non-Markovian processes, and an evolutionary framework for random walker patterns.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"3 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adaptive Federated Learning for Automatic Modulation Classification Under Class and Noise Imbalance 类别和噪声失衡情况下自动调制分类的自适应联合学习

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31223

J. A. Sanchez Viloria, Dimitris Stripelis, Panos P. Markopoulos, G. Sklivanitis, D. Pados

The ability to rapidly understand and label the radio spectrum in an autonomous way is key for monitoring spectrum interference, spectrum utilization efficiency, protecting passive users, monitoring and enforcing compliance with regulations, detecting faulty radios, dynamic spectrum access, opportunistic mesh networking, and numerous NextG regulatory and defense applications. We consider the problem of automatic modulation classification (AMC) by a distributed network of wireless sensors that monitor the spectrum for signal transmissions of interest over a large deployment area. Each sensor receives signals under a specific channel condition depending on its location and trains an individual model of a deep neural network (DNN) accordingly to classify signals. To improve modulation classification accuracy, we consider federated learning (FL) where each individual sensor shares its trained model with a centralized controller, which, after aggregation, initializes its model for the next round of training. Without exchanging any spectrum data (such as in cooperative spectrum sensing), this process is repeated over time. A common DNN is built across the net- work while preserving the privacy associated with signals collected at different locations. Given their distributed nature, the statistics of the data across these sensors are likely to differ significantly. We propose the use of adaptive federated learning for AMC. Specifically, we use FEDADAM -an algorithm using Adam for server optimization – and ex- amine how it compares to the FEDAVG algorithm -one of the standard FL algorithms, which averages client parameters after some local iterations, in particular in challenging scenarios that include class imbalance and/or noise-level imbalance across the network. Our extensive numerical studies over 11 standard modulation classes corroborate the merit of adaptive FL, outperforming its standard alternatives in various challenging cases and for various network sizes.

以自主方式快速了解和标记无线电频谱的能力是监控频谱干扰、提高频谱利用效率、保护无源用户、监控和强制遵守法规、检测故障无线电、动态频谱接入、机会网状网络以及众多 NextG 监管和防御应用的关键。我们考虑的是无线传感器分布式网络的自动调制分类（AMC）问题，该网络在一个大的部署区域内监控频谱中感兴趣的信号传输。每个传感器根据其位置接收特定信道条件下的信号，并相应地训练深度神经网络（DNN）的单个模型来对信号进行分类。为了提高调制分类的准确性，我们考虑了联合学习（FL），即每个单独的传感器与中央控制器共享其训练好的模型，中央控制器在汇总后初始化其模型，用于下一轮训练。在不交换任何频谱数据的情况下（如在合作频谱感知中），这一过程会随着时间的推移不断重复。在整个网络中建立共同的 DNN，同时保护在不同地点收集的信号的隐私。鉴于其分布式性质，这些传感器的数据统计很可能存在很大差异。我们建议在 AMC 中使用自适应联合学习。具体来说，我们使用了 FEDADAM 算法（一种使用亚当进行服务器优化的算法），并对其与 FEDAVG 算法（标准 FL 算法之一，该算法在局部迭代后平均客户端参数）进行了比较，尤其是在包括整个网络中的类不平衡和/或噪声级不平衡的挑战性场景中。我们对 11 种标准调制类别进行了广泛的数值研究，证实了自适应 FL 算法的优点，在各种具有挑战性的情况下和各种网络规模下，它都优于其标准替代算法。

{"title":"Adaptive Federated Learning for Automatic Modulation Classification Under Class and Noise Imbalance","authors":"J. A. Sanchez Viloria, Dimitris Stripelis, Panos P. Markopoulos, G. Sklivanitis, D. Pados","doi":"10.1609/aaaiss.v3i1.31223","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31223","url":null,"abstract":"The ability to rapidly understand and label the radio spectrum in an autonomous way is key for monitoring spectrum interference, spectrum utilization efficiency, protecting passive users, monitoring and enforcing compliance with regulations, detecting faulty radios, dynamic spectrum access, opportunistic mesh networking, and numerous NextG regulatory and defense applications. We consider the problem of automatic modulation classification (AMC) by a distributed network of wireless sensors that monitor the spectrum for signal transmissions of interest over a large deployment area. Each sensor receives signals under a specific channel condition depending on its location and trains an individual model of a deep neural network (DNN) accordingly to classify signals. To improve modulation classification accuracy, we consider federated learning (FL) where each individual sensor shares its trained model with a centralized controller, which, after aggregation, initializes its model for the next round of training. Without exchanging any spectrum data (such as in cooperative spectrum sensing), this process is repeated over time. A common DNN is built across the net- work while preserving the privacy associated with signals collected at different locations. Given their distributed nature, the statistics of the data across these sensors are likely to differ significantly. We propose the use of adaptive federated learning for AMC. Specifically, we use FEDADAM -an algorithm using Adam for server optimization – and ex- amine how it compares to the FEDAVG algorithm -one of the standard FL algorithms, which averages client parameters after some local iterations, in particular in challenging scenarios that include class imbalance and/or noise-level imbalance across the network. Our extensive numerical studies over 11 standard modulation classes corroborate the merit of adaptive FL, outperforming its standard alternatives in various challenging cases and for various network sizes.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Implications of Identity in AI: Creators, Creations, and Consequences 人工智能中身份的含义：创造者、创造物和后果

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31268

Sri Yash Tadimalla, Mary Lou Maher

The field of Artificial Intelligence (AI) is rapidly advancing, with significant potential to transform society. However, it faces a notable challenge: lack of diversity, a longstanding issue in STEM fields. In this context, this position paper examines the intersection of AI and identity as a pathway to understanding biases, inequalities, and ethical considerations in AI development and deployment. We present a multifaceted definition of AI identity, which encompasses its creators, applications, and their broader impacts. Understanding AI's identity involves analyzing the diverse individuals involved in AI's development, the technologies produced, and the social, ethical, and psychological implications. After exploring the AI identity ecosystem and its societal dynamics, We propose a framework that highlights the need for diversity in AI across three dimensions: Creators, Creations, and Consequences through the lens of identity. This paper presents a research framework for examining the implications and changes needed to foster a more inclusive and responsible AI ecosystem through the lens of identity.

人工智能（AI）领域发展迅速，具有改变社会的巨大潜力。然而，它面临着一个显著的挑战：缺乏多样性，这是 STEM 领域的一个长期问题。在此背景下，本立场文件探讨了人工智能与身份的交叉点，以此作为理解人工智能开发和部署中的偏见、不平等和伦理因素的途径。我们提出了人工智能身份的多层面定义，其中包括其创造者、应用及其更广泛的影响。理解人工智能的身份涉及分析参与人工智能开发的不同个体、所产生的技术以及社会、伦理和心理影响。在探索了人工智能身份生态系统及其社会动态之后，我们提出了一个框架，从三个方面强调了人工智能多样性的必要性：通过身份的视角，我们提出了一个框架，强调了人工智能在三个维度上的多样性需求：创造者、创造物和后果。本文提出了一个研究框架，通过身份的视角来研究促进更具包容性和责任感的人工智能生态系统所需的影响和变革。

引用次数: 0

What Can Computers Do Now? Dreyfus Revisited for the Third Wave of Artificial Intelligence 计算机现在能做什么？德雷福斯再论人工智能的第三次浪潮

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31207

Ben Schuering, Thomas Schmid

In recent years, artificial intelligence (AI) has seen significant advances that have in fact exceeded even optimistic prognoses. Using data-driven AI, namely deep learning techniques, it has been demonstrated that computers may now be equipped with abilities of remarkable scope and quality, such as solving image and text processing tasks at human level. Large language models, in particular, have sparked debates regarding opportunities and challenges of this rapidly developing area. Will remaining fundamental challenges of data-driven AI, such as factual or logical mistakes, be overcome for good if complemented and hybridized with symbolic AI techniques, such as knowledge representation and reasoning? Will systems of artificial general intelligence (AGI) emerge from this, possessing common sense and in fact completing the decades-old quest for AI that motivated the raise of the field in the 1950s? In the light of these questions, we review the likewise, decades-old philosophical debate about capabilities and limitations of computers from a hybrid AI point of view. Here, we discuss how hybrid AI is coming closer to disproving Hubert Dreyfus’ famous statements regarding what computers can not do. At the same time, we shed light on a lesser discussed challenge for hybrid AI: the possibility that its developers might be its biggest limiters.

近年来，人工智能（AI）取得了长足的进步，甚至超过了乐观的预测。利用数据驱动的人工智能（即深度学习技术），人们已经证明，计算机现在可以具备非凡的能力和质量，例如解决人类水平的图像和文本处理任务。大型语言模型尤其引发了有关这一快速发展领域的机遇与挑战的讨论。如果与符号人工智能技术（如知识表示和推理）相辅相成，数据驱动型人工智能的其余基本挑战（如事实或逻辑错误）是否会被彻底克服？人工通用智能（AGI）系统是否能从中脱颖而出，拥有常识，并在事实上完成数十年来对人工智能的追求，而这种追求正是 20 世纪 50 年代人工智能领域兴起的动力？鉴于这些问题，我们从混合人工智能的角度回顾了几十年来关于计算机能力和局限性的哲学争论。在此，我们将讨论混合人工智能如何越来越接近于推翻休伯特-德雷福斯（Hubert Dreyfus）关于计算机不能做什么的著名论断。与此同时，我们还揭示了混合人工智能面临的一个较少讨论的挑战：其开发者可能是其最大的限制者。

{"title":"What Can Computers Do Now? Dreyfus Revisited for the Third Wave of Artificial Intelligence","authors":"Ben Schuering, Thomas Schmid","doi":"10.1609/aaaiss.v3i1.31207","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31207","url":null,"abstract":"In recent years, artificial intelligence (AI) has seen significant advances that have in fact exceeded even optimistic prognoses. Using data-driven AI, namely deep learning techniques, it has been demonstrated that computers may now be equipped with abilities of remarkable scope and quality, such as solving image and text processing tasks at human level. Large language models, in particular, have sparked debates regarding opportunities and challenges of this rapidly developing area. Will remaining fundamental challenges of data-driven AI, such as factual or logical mistakes, be overcome for good if complemented and hybridized with symbolic AI techniques, such as knowledge representation and reasoning? Will systems of artificial general intelligence (AGI) emerge from this, possessing common sense and in fact completing the decades-old quest for AI that motivated the raise of the field in the 1950s? In the light of these questions, we review the likewise, decades-old philosophical debate about capabilities and limitations of computers from a hybrid AI point of view. Here, we discuss how hybrid AI is coming closer to disproving Hubert Dreyfus’ famous statements regarding what computers can not do. At the same time, we shed light on a lesser discussed challenge for hybrid AI: the possibility that its developers might be its biggest limiters.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"80 23","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Diversity, Equity, and Inclusion, and the Deployment of Artificial Intelligence Within the Department of Defense 多样性、公平与包容，以及在国防部内部署人工智能

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31233

Sara Darwish, Alison Bragaw-Butler, Paul Marcelli, Kaylee Gassner

Artificial Intelligence (AI) adoption has seen substantial growth across industries. This paper explores the escalating use of AI within the United States Department of Defense (DoD) and the implications that diversity, equity, and inclusion (DEI) have on Service members and Civilians across the Department. More specifically, this paper explores the DEI considerations within AI technologies on individual, team, and Department readiness. The DoD's AI usage spans various strategic and operational capabilities, however this paper explores two critical domains: healthcare and recruitment.In healthcare, AI offers the promise of early disease detection, enhanced diagnostic capabilities, and streamlined administrative processes. However, potential biases stemming from homogenous training data threaten the accuracy and reliability of these systems, jeopardizing Service member health and eroding trust in AI-assisted medical decision-making and potentially the DoD at large.In recruitment, while AI promises efficiency in identifying ideal candidates, its deployment can perpetuate biases, especially when the training data used is not representative of all demographics. Despite efforts to design "unbiased" systems by excluding demographic data, such strategies may inadvertently overlook the unique challenges faced by marginalized communities, further entrenching existing disparities.Both case studies underscore the importance of considering DEI in the development and deployment of AI systems. As the DoD continues to integrate AI into its operations, this paper’s recommendations stress the necessity of continuous DEI assessment to ensure that AI serves as an asset rather than a liability. The authors recommend the following:1. Data diversity & review2. Continuous monitoring and calibration3. Stakeholder engagement4. Adoption of DEI requirements within Ethical AI Frameworks5. Further research

人工智能（AI）的应用在各行各业都有大幅增长。本文探讨了人工智能在美国国防部（DoD）中不断升级的应用，以及多样性、公平性和包容性（DEI）对整个国防部的军人和文职人员的影响。更具体地说，本文探讨了人工智能技术对个人、团队和国防部战备状态的 DEI 影响。国防部的人工智能应用涵盖各种战略和作战能力，但本文探讨的是两个关键领域：医疗保健和征兵。在医疗保健领域，人工智能为早期疾病检测、增强诊断能力和简化管理流程带来了希望。然而，由同质化训练数据产生的潜在偏见威胁着这些系统的准确性和可靠性，危害着军人的健康，削弱了对人工智能辅助医疗决策的信任，并有可能影响整个国防部。在征兵方面，虽然人工智能有望提高识别理想候选人的效率，但其部署可能会使偏见长期存在，特别是当所使用的训练数据不能代表所有人口统计数据时。尽管努力通过排除人口数据来设计 "无偏见 "的系统，但这种策略可能会无意中忽视边缘化群体所面临的独特挑战，从而进一步巩固现有的差距。随着国防部继续将人工智能整合到其行动中，本文的建议强调有必要持续进行发展指数评估，以确保人工智能成为一种资产而非负债。作者建议如下：1. 数据多样性与审查2.持续监控和校准3.利益相关者的参与4.在人工智能道德框架内采用 DEI 要求5.进一步研究

{"title":"Diversity, Equity, and Inclusion, and the Deployment of Artificial Intelligence Within the Department of Defense","authors":"Sara Darwish, Alison Bragaw-Butler, Paul Marcelli, Kaylee Gassner","doi":"10.1609/aaaiss.v3i1.31233","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31233","url":null,"abstract":"Artificial Intelligence (AI) adoption has seen substantial growth across industries. This paper explores the escalating use of AI within the United States Department of Defense (DoD) and the implications that diversity, equity, and inclusion (DEI) have on Service members and Civilians across the Department. More specifically, this paper explores the DEI considerations within AI technologies on individual, team, and Department readiness. The DoD's AI usage spans various strategic and operational capabilities, however this paper explores two critical domains: healthcare and recruitment.\u0000In healthcare, AI offers the promise of early disease detection, enhanced diagnostic capabilities, and streamlined administrative processes. However, potential biases stemming from homogenous training data threaten the accuracy and reliability of these systems, jeopardizing Service member health and eroding trust in AI-assisted medical decision-making and potentially the DoD at large.\u0000In recruitment, while AI promises efficiency in identifying ideal candidates, its deployment can perpetuate biases, especially when the training data used is not representative of all demographics. Despite efforts to design \"unbiased\" systems by excluding demographic data, such strategies may inadvertently overlook the unique challenges faced by marginalized communities, further entrenching existing disparities.\u0000Both case studies underscore the importance of considering DEI in the development and deployment of AI systems. As the DoD continues to integrate AI into its operations, this paper’s recommendations stress the necessity of continuous DEI assessment to ensure that AI serves as an asset rather than a liability. The authors recommend the following:\u00001. Data diversity & review\u00002. Continuous monitoring and calibration\u00003. Stakeholder engagement\u00004. Adoption of DEI requirements within Ethical AI Frameworks\u00005. Further research","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"74 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141123179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Turing-like Experiment in a Cyber Defense Game 网络防御游戏中的类图灵实验

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31271

Yinuo Du, Baptiste Prébot, Cleotilde Gonzalez

During the past decade, researchers of behavioral cyber security have created cognitive agents that are able to learn and make decisions in dynamic environments in ways that assimilate human decision processes. However, many of these efforts have been limited to simple detection tasks and represent basic cognitive functions rather than a whole set of cognitive capabilities required in dynamic cyber defense scenarios. Our current work aims at advancing the development of cognitive agents that learn and make defense-dynamic decisions during cyber attacks by intelligent attack agents. We also aim to evaluate the capability of these cognitive models in ``Turing-like'' experiments, comparing the decisions and performance of these agents against human cyber defenders. In this paper, we present an initial demonstration of a cognitive model of the defender that relies on a cognitive theory of dynamic decision-making, Instance-Based Learning Theory (IBLT); we also demonstrate the execution of the same defense task by human defenders. We rely on OpenAI Gym and CybORG and adapt an existing CAGE scenario to generate a simulation experiment using an IBL defender. We also offer a new Interactive Defense Game (IDG), where textit{human} defenders can perform the same CAGE scenario simulated with the IBL model. Our results suggest that the IBL model makes decisions against two intelligent attack agents that are similar to those observed in a subsequent human experiment. We conclude with a description of the cognitive foundations required to build autonomous intelligent cyber defense agents that can collaborate with humans in autonomous cyber defense teams.

在过去十年中，行为网络安全研究人员创造了认知代理，它们能够在动态环境中以吸收人类决策过程的方式进行学习和决策。然而，其中许多工作仅限于简单的检测任务，代表的是基本的认知功能，而不是动态网络防御场景所需的一整套认知能力。我们目前的工作旨在推进认知代理的开发，使其能够在智能攻击代理发动网络攻击时学习并做出防御动态决策。我们还致力于在 "类图灵 "实验中评估这些认知模型的能力，将这些代理的决策和性能与人类网络防御者进行比较。在本文中，我们初步展示了依赖于动态决策认知理论--基于实例的学习理论（IBLT）的防御者认知模型；我们还展示了人类防御者执行相同防御任务的情况。我们以 OpenAI Gym 和 CybORG 为基础，对现有的 CAGE 场景进行改编，生成了一个使用 IBL 防御者的模拟实验。我们还提供了一个新的互动防御游戏（IDG），在这个游戏中，textit{human}防御者可以使用 IBL 模型模拟相同的笼式防御场景。我们的结果表明，IBL 模型在面对两个智能攻击代理时做出的决策与在随后的人类实验中观察到的结果相似。最后，我们介绍了建立自主智能网络防御代理所需的认知基础，这些代理可以在自主网络防御团队中与人类协作。

{"title":"Turing-like Experiment in a Cyber Defense Game","authors":"Yinuo Du, Baptiste Prébot, Cleotilde Gonzalez","doi":"10.1609/aaaiss.v3i1.31271","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31271","url":null,"abstract":"During the past decade, researchers of behavioral cyber security have created cognitive agents that are able to learn and make decisions in dynamic environments in ways that assimilate human decision processes. However, many of these efforts have been limited to simple detection tasks and represent basic cognitive functions rather than a whole set of cognitive capabilities required in dynamic cyber defense scenarios. Our current work aims at advancing the development of cognitive agents that learn and make defense-dynamic decisions during cyber attacks by intelligent attack agents. We also aim to evaluate the capability of these cognitive models in ``Turing-like'' experiments, comparing the decisions and performance of these agents against human cyber defenders. In this paper, we present an initial demonstration of a cognitive model of the defender that relies on a cognitive theory of dynamic decision-making, Instance-Based Learning Theory (IBLT); we also demonstrate the execution of the same defense task by human defenders. We rely on OpenAI Gym and CybORG and adapt an existing CAGE scenario to generate a simulation experiment using an IBL defender. We also offer a new Interactive Defense Game (IDG), where textit{human} defenders can perform the same CAGE scenario simulated with the IBL model. Our results suggest that the IBL model makes decisions against two intelligent attack agents that are similar to those observed in a subsequent human experiment. We conclude with a description of the cognitive foundations required to build autonomous intelligent cyber defense agents that can collaborate with humans in autonomous cyber defense teams.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"39 17","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141122173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Modeling Patterns for Neural-Symbolic Reasoning Using Energy-based Models 利用基于能量的模型为神经符号推理建模

Proceedings of the AAAI Symposium Series

Pub Date : 2024-05-20 DOI: 10.1609/aaaiss.v3i1.31187

Charles Dickens, Connor Pryor, Lise Getoor

Neural-symbolic (NeSy) AI strives to empower machine learning and large language models with fast, reliable predictions that exhibit commonsense and trustworthy reasoning by seamlessly integrating neural and symbolic methods. With such a broad scope, several taxonomies have been proposed to categorize this integration, emphasizing knowledge representation, reasoning algorithms, and applications. We introduce a knowledge representation-agnostic taxonomy focusing on the neural-symbolic interface capturing methods that reason with probability, logic, and arithmetic constraints. Moreover, we derive expressions for gradients of a prominent class of learning losses and a formalization of reasoning and learning. Through a rigorous empirical analysis spanning three tasks, we show NeSy approaches reach up to a 37% improvement over neural baselines in a semi-supervised setting and a 19% improvement over GPT-4 on question-answering.

神经-符号（NeSy）人工智能致力于通过无缝集成神经和符号方法，为机器学习和大型语言模型提供快速、可靠的预测，从而展现出常识性和可信赖的推理。由于范围如此广泛，人们提出了几种分类法来对这种整合进行分类，强调知识表示、推理算法和应用。我们引入了一种与知识表示无关的分类法，重点关注神经-符号接口，捕捉利用概率、逻辑和算术约束进行推理的方法。此外，我们还推导出了一类重要学习损失的梯度表达式，以及推理和学习的形式化。通过对三项任务进行严格的实证分析，我们发现 NeSy 方法在半监督设置中比神经基线提高了 37%，在问题解答中比 GPT-4 提高了 19%。

引用次数: 0