Modeling Human-Like Acquisition of Language and Concepts
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31275
Peter Lindes, Steven Jones
Humans acquire language and related concepts along a trajectory that spans a lifetime. Concepts for simple interaction with the world are learned before language. Later, words are learned to name these concepts, along with the structures needed to represent larger meanings. Eventually, language advances to the point where it can drive the learning of new concepts. Throughout this trajectory, a language-processing capability uses architectural mechanisms to interpret language with the knowledge already acquired. We assume that this growing body of knowledge is made up of small units of form-meaning mapping that can be composed in many ways, suggesting that these units are learned incrementally from experience. In prior work we built a system that comprehends human language within an autonomous robot, using hand-developed knowledge in such units. Here we propose a research program to develop the ability of an artificial agent to acquire this knowledge incrementally and autonomously from its experience, along a similar trajectory. We then propose a strategy for evaluating this human-like learning system on a large benchmark originally created for training deep learning systems. We expect that our human-like learning system will achieve better task performance after training on only a small subset of this benchmark.
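To make the notion of composable form-meaning units concrete, here is a minimal Python sketch of such a unit and its composition. The class name, the dictionary-based meaning representation, and the example units are hypothetical illustrations, not the authors' actual representation.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

@dataclass
class Construction:
    """A hypothetical form-meaning unit: a surface pattern paired with a
    function that builds a meaning fragment, possibly from sub-meanings."""
    form: Tuple[str, ...]   # words or category names the unit matches
    meaning: Callable       # composes sub-meanings into a larger meaning

# Word-level units naming concepts learned before language.
red   = Construction(("red",),   lambda: {"property": "red"})
block = Construction(("block",), lambda: {"object": "block"})

# A phrasal unit that composes smaller units into a larger meaning.
adj_noun = Construction(
    ("ADJ", "NOUN"),
    lambda adj, noun: {**noun, "modifiers": [adj]},
)

meaning = adj_noun.meaning(red.meaning(), block.meaning())
print(meaning)  # {'object': 'block', 'modifiers': [{'property': 'red'}]}
```

Incremental learning, in this view, would amount to adding new such units (and generalizing existing ones) from experience rather than retraining a monolithic model.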
{"title":"Modeling Human-Like Acquisition of Language and Concepts","authors":"Peter Lindes, Steven Jones","doi":"10.1609/aaaiss.v3i1.31275","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31275","url":null,"abstract":"Humans acquire language and related concepts in a trajectory over a lifetime. Concepts for simple interaction with the world are learned before language. Later, words are learned to name these concepts along with structures needed to represent larger meanings. Eventually, language advances to where it can drive the learning of new concepts. Throughout this trajectory a language processing capability uses architectural mechanisms to process language using the knowledge already acquired. We assume that this growing body of knowledge is made up of small units of form-meaning mapping that can be composed in many ways, suggesting that these units are learned incrementally from experience. In prior work we have built a system to comprehend human language within an autonomous robot using knowledge in such units developed by hand. Here we propose a research program to develop the ability of an artificial agent to acquire this knowledge incrementally and autonomously from its experience in a similar trajectory. We then propose a strategy for evaluating this human-like learning system using a large benchmark created as a tool for training deep learning systems. We expect that our human-like learning system will produce better task performance from training on only a small subset of this benchmark.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"81 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141121330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Analogy as the Swiss Army Knife of Human-like Learning
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31272
Kenneth D. Forbus
There is ample psychological evidence that analogy is ubiquitous in human learning, suggesting that computational models of analogy can play important roles in AI systems that learn in human-like ways. This talk will provide evidence for this, focusing mostly on recent advances in hierarchical analogical learning and working-memory analogical generalizations.
{"title":"Analogy as the Swiss Army Knife of Human-like Learning","authors":"Kenneth D. Forbus","doi":"10.1609/aaaiss.v3i1.31272","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31272","url":null,"abstract":"There is ample psychological evidence that analogy is ubiquitous in human learning, suggesting that computational models of analogy can play important roles in AI systems that learn in human-like ways. This talk will provide evidence for this, focusing mostly on recent advances in hierarchical analogical learning and working-memory analogical generalizations.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"73 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141123214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Exploiting Machine Learning Bias: Predicting Medical Denials
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31181
Stephen Russell, Fabio Montes Suros, Ashwin Kumar
For a large healthcare system, even setting aside the costs of managing the patient encounter denial process (staffing, contracts, etc.), total denial-related amounts can exceed $1B annually in gross charges. Predicting a denial before it occurs therefore has the potential for tremendous savings, since machine-learning predictions can enable denial-preventing interventions. However, severe data imbalance makes it difficult to build a single generalized model. We instead employ two deliberately biased models in a hybrid voting scheme, achieving results that exceed the state of the art and allowing incremental predictions as the encounter progresses. The models have the added benefit of monitoring the human-driven denial process that affects the underlying distribution on which their bias is based.
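As an illustration of the general idea, the sketch below pairs two deliberately biased classifiers in a simple probability-averaging vote. It assumes scikit-learn-style models; the class weights, averaging rule, and decision threshold are illustrative stand-ins, not the authors' exact scheme.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Two deliberately biased models: one over-weights the rare "denied" class,
# the other over-weights the common "paid" class (weights are illustrative).
deny_biased = LogisticRegression(class_weight={0: 1, 1: 10})
pay_biased  = LogisticRegression(class_weight={0: 10, 1: 1})

def fit_hybrid(X, y):
    deny_biased.fit(X, y)
    pay_biased.fit(X, y)

def predict_denial(X, threshold=0.5):
    """Hybrid vote: average the two biased denial-probability estimates."""
    p = (deny_biased.predict_proba(X)[:, 1] +
         pay_biased.predict_proba(X)[:, 1]) / 2
    return p >= threshold

# Toy imbalanced data: roughly 5% of encounters end in denial.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
y = (rng.random(1000) < 0.05).astype(int)
fit_hybrid(X, y)
print(predict_denial(X[:10]))
```

Because each encounter's features accumulate over time, the same trained pair can be re-queried as the encounter progresses, yielding the incremental predictions the abstract describes.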
{"title":"Exploiting Machine Learning Bias: Predicting Medical Denials","authors":"Stephen Russell, Fabio Montes Suros, Ashwin Kumar","doi":"10.1609/aaaiss.v3i1.31181","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31181","url":null,"abstract":"For a large healthcare system, ignoring costs associated with managing the patient encounter denial process (staffing, contracts, etc.), total denial-related amounts can be more than $1B annually in gross charges. Being able to predict a denial before it occurs has the potential for tremendous savings. Using machine learning to predict denial has the potential to allow denial-preventing interventions. However, challenges of data imbalance make creating a single generalized model difficult. We employ two biased models in a hybrid voting scheme to achieve results that exceed the state-of-the art and allow for incremental predictions as the encounter progresses. The model had the added benefit of monitoring the human-driven denial process that affect the underlying distribution, on which the models’ bias is based.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"60 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141123298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Embodying Human-Like Modes of Balance Control Through Human-In-the-Loop Dyadic Learning
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31278
Sheikh Mannan, V. Vimal, Paul DiZio, Nikhil Krishnaswamy
In this paper, we explore how humans and AIs trained to perform a virtual inverted pendulum (VIP) balancing task converge and differ in their learning and performance strategies. We create a visual analogue of disoriented IP balancing, as may be experienced by pilots suffering from spatial disorientation, and train AI models on data from human subjects performing a real-world disoriented balancing task. We then place the trained AI models in a dyadic human-in-the-loop (HITL) training setting. Episodes in which human subjects disagreed with AI actions were logged and used to fine-tune the AI model. Human subjects then performed the task while being given guidance from pretrained and dyadically fine-tuned versions of an AI model. We examine the effects of HITL training on AI performance, AI guidance on human performance, and the behavior patterns of human subjects and AI models during task performance. We find that in many cases, HITL training improves AI performance, AI guidance improves human performance, and after dyadic training the two converge on similar behavior patterns.
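A minimal sketch of the dyadic loop described above, assuming a PyTorch policy over discrete pendulum actions: episodes where the human disagrees are logged as (state, human action) pairs and replayed as supervised fine-tuning data. The network shape, action set, and loss are hypothetical choices for the example, not the paper's architecture.

```python
import torch
import torch.nn as nn

# Hypothetical policy: 4-dimensional pendulum state -> 3 discrete actions.
model = nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 3))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

disagreements = []  # (state, human_action) episodes logged during dyadic play

def dyadic_step(state, human_action):
    """Act on a state; log the episode if the human disagrees with the AI."""
    ai_action = model(state).argmax().item()
    if ai_action != human_action:
        disagreements.append((state, human_action))
    return ai_action

def fine_tune(epochs=3):
    """Replay logged disagreements as supervised fine-tuning data."""
    for _ in range(epochs):
        for state, target in disagreements:
            optimizer.zero_grad()
            loss = loss_fn(model(state).unsqueeze(0), torch.tensor([target]))
            loss.backward()
            optimizer.step()

# Example: one simulated step with a human correction, then fine-tuning.
state = torch.randn(4)
dyadic_step(state, human_action=2)
fine_tune()
```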
{"title":"Embodying Human-Like Modes of Balance Control Through Human-In-the-Loop Dyadic Learning","authors":"Sheikh Mannan, V. Vimal, Paul DiZio, Nikhil Krishnaswamy","doi":"10.1609/aaaiss.v3i1.31278","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31278","url":null,"abstract":"In this paper, we explore how humans and AIs trained to perform a virtual inverted pendulum (VIP) balancing task converge and differ in their learning and performance strategies. We create a visual analogue of disoriented IP balancing, as may be experienced by pilots suffering from spatial disorientation, and train AI models on data from human subjects performing a real-world disoriented balancing task. We then place the trained AI models in a dyadic human-in-the-loop (HITL) training setting. Episodes in which human subjects disagreed with AI actions were logged and used to fine-tune the AI model. Human subjects then performed the task while being given guidance from pretrained and dyadically fine-tuned versions of an AI model. We examine the effects of HITL training on AI performance, AI guidance on human performance, and the behavior patterns of human subjects and AI models during task performance. We find that in many cases, HITL training improves AI performance, AI guidance improves human performance, and after dyadic training the two converge on similar behavior patterns.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"48 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141118648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On Replacing Humans with Large Language Models in Voice-Based Human-in-the-Loop Systems
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31178
Shih-Hong Huang, Ting-Hao 'Kenneth' Huang
It is easy to assume that Large Language Models (LLMs) will seamlessly take over applications, especially those that are already largely automated. In the case of conversational voice assistants, commercial systems have been widely deployed and used over the past decade. However, are we indeed on the cusp of the future we envisioned? There remains a socio-technical gap between what people want to accomplish and the actual capability of the technology. In this paper, we present a case study comparing two voice assistants built on Amazon Alexa: one employing a human-in-the-loop workflow, the other using an LLM to converse with users. In our comparison, we found that the issues arising in current human-in-the-loop and LLM systems are not identical. However, the presence of a set of similar issues in both systems leads us to believe that focusing on the interaction between users and systems is crucial, perhaps even more so than focusing solely on the underlying technology itself. Merely enhancing the performance of the workers or the models may not adequately address these issues. This observation prompts our research question: what contributing factors in the effort to improve the capabilities of voice assistants have been overlooked or under-emphasized in prior research?
{"title":"On Replacing Humans with Large Language Models in Voice-Based Human-in-the-Loop Systems","authors":"Shih-Hong Huang, Ting-Hao 'Kenneth' Huang","doi":"10.1609/aaaiss.v3i1.31178","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31178","url":null,"abstract":"It is easy to assume that Large Language Models (LLMs) will seamlessly take over applications, especially those that are largely automated. In the case of conversational voice assistants, commercial systems have been widely deployed and used over the past decade. However, are we indeed on the cusp of the future we envisioned? There exists a social-technical gap between what people want to accomplish and the actual capability of technology. In this paper, we present a case study comparing two voice assistants built on Amazon Alexa: one employing a human-in-the-loop workflow, the other utilizes LLM to engage in conversations with users. In our comparison, we discovered that the issues arising in current human-in-the-loop and LLM systems are not identical. However, the presence of a set of similar issues in both systems leads us to believe that focusing on the interaction between users and systems is crucial, perhaps even more so than focusing solely on the underlying technology itself. Merely enhancing the performance of the workers or the models may not adequately address these issues. This observation prompts our research question: What are the overlooked contributing factors in the effort to improve the capabilities of voice assistants, which might not have been emphasized in prior research?","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"45 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141118958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Human-Like Learning of Social Reasoning via Analogy
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31284
Irina Rabkina
Neurotypical adult humans are remarkably good social reasoners. Despite the occasional faux pas, we know how to interact in most social settings and how to consider others' points of view. Young children, on the other hand, do not. Social reasoning, like many of our most important skills, is learned. Much like human children, AI agents are not good social reasoners. While some algorithms can perform some aspects of social reasoning, we are far from AI that can interact naturally and appropriately in the broad range of settings that people can. In this talk, I will argue that learning social reasoning via the same processes used by people will help AI agents reason, and interact, more like people do. Specifically, I will argue that children learn social reasoning via analogy, and that AI agents should, too. I will present evidence from cognitive modeling experiments demonstrating the former and from AI experiments demonstrating the latter. I will also propose future directions for social reasoning research that both demonstrate the need for robust, human-like social reasoning in AI and test the utility of common approaches.
{"title":"Human-Like Learning of Social Reasoning via Analogy","authors":"Irina Rabkina","doi":"10.1609/aaaiss.v3i1.31284","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31284","url":null,"abstract":"Neurotypical adult humans are impeccably good social reasoners. Despite the occasional faux pas, we know how to interact in most social settings and how to consider others' points of view. Young children, on the other hand, do not. Social reasoning, like many of our most important skills, is learned. \u0000\u0000Much like human children, AI agents are not good social reasoners. While some algorithms can perform some aspects of social reasoning, we are a ways off from AI that can interact naturally and appropriately in the broad range of settings that people can. In this talk, I will argue that learning social reasoning via the same processes used by people will help AI agents reason--and interact--more like people do. Specifically, I will argue that children learn social reasoning via analogy, and that AI agents should, too. I will present evidence from cognitive modeling experiments demonstrating the former and AI experiments demonstrating the latter. I will also propose future directions for social reasoning research that both demonstrate the need for robust, human-like social reasoning in AI and test the utility of common approaches.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"10 15","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141119950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Toward Autonomy: Metacognitive Learning for Enhanced AI Performance
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31270
Brendan Conway-Smith, Robert L. West
Large Language Models (LLMs) lack robust metacognitive learning abilities and depend on human-provided algorithms and prompts for learning and output generation. Metacognition involves processes that monitor and enhance cognition. Learning how to learn - metacognitive learning - is crucial for adapting and optimizing learning strategies over time. Although LLMs possess limited metacognitive abilities, they cannot autonomously refine or optimize these strategies. Humans possess innate mechanisms for metacognitive learning that enable at least two unique abilities: discerning which metacognitive strategies are best and automatizing learning strategies. These processes have been effectively modeled in the ACT-R cognitive architecture, providing insights on a path toward greater learning autonomy in AI. Incorporating human-like metacognitive learning abilities into AI could potentially lead to the development of more autonomous and versatile learning mechanisms, as well as improved problem-solving capabilities and performance across diverse tasks.
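As a toy illustration of one such mechanism, the sketch below implements ACT-R-style utility learning (U = U + alpha * (R - U)) with softmax selection, so an agent gradually discerns which strategy pays off best. The strategy names and rewards are invented for the example; this is a sketch of the general mechanism, not the authors' model.

```python
import math
import random

# Utilities of three competing learning strategies (names are invented).
utilities = {"rehearse": 0.0, "elaborate": 0.0, "self-test": 0.0}
ALPHA, TEMP = 0.2, 0.5   # learning rate and selection noise

def choose_strategy():
    """Softmax (noisy-greedy) selection over current utilities."""
    weights = [math.exp(u / TEMP) for u in utilities.values()]
    return random.choices(list(utilities), weights=weights)[0]

def update(strategy, reward):
    """ACT-R-style utility update: U = U + alpha * (R - U)."""
    utilities[strategy] += ALPHA * (reward - utilities[strategy])

# Toy environment in which self-testing yields the highest reward.
payoff = {"rehearse": 0.3, "elaborate": 0.5, "self-test": 0.9}
for _ in range(200):
    s = choose_strategy()
    update(s, payoff[s])
print(max(utilities, key=utilities.get))  # most often: 'self-test'
```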
{"title":"Toward Autonomy: Metacognitive Learning for Enhanced AI Performance","authors":"Brendan Conway-Smith, Robert L. West","doi":"10.1609/aaaiss.v3i1.31270","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31270","url":null,"abstract":"Large Language Models (LLMs) lack robust metacognitive learning abilities and depend on human-provided algorithms and prompts for learning and output generation. Metacognition involves processes that monitor and enhance cognition. Learning how to learn - metacognitive learning - is crucial for adapting and optimizing learning strategies over time. Although LLMs possess limited metacognitive abilities, they cannot autonomously refine or optimize these strategies. Humans possess innate mechanisms for metacognitive learning that enable at least two unique abilities: discerning which metacognitive strategies are best and automatizing learning strategies. These processes have been effectively modeled in the ACT-R cognitive architecture, providing insights on a path toward greater learning autonomy in AI. Incorporating human-like metacognitive learning abilities into AI could potentially lead to the development of more autonomous and versatile learning mechanisms, as well as improved problem-solving capabilities and performance across diverse tasks.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"35 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Leveraging Generative Artificial Intelligence to Broaden Participation in Computer Science
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31262
Devang Jayachandran, P. Maldikar, Tyler S. Love, Jeremy Blum
Generative Artificial Intelligence (AI) was incorporated into a competitive programming event that targeted undergraduate students, including those with little programming experience. The competition used a range of challenge-design approaches that promoted meaningful interaction with generative AI systems while keeping challenge difficulty at an appropriate level. An analysis of survey responses and competition data showed that this format lowered barriers to participation, successfully engaged students throughout the competition, and increased the likelihood that they would participate in a similar event. In an extension of this work, a professional development workshop for high school teachers is being developed, along with a contest for high school students. Participant surveys and logs of interaction with the contest and generative AI systems will be analyzed to measure the effect of generative AI on student self-efficacy and to suggest ways to integrate generative AI instruction into the computer science curriculum.
{"title":"Leveraging Generative Artificial Intelligence to Broaden Participation in Computer Science","authors":"Devang Jayachandran, P. Maldikar, Tyler S. Love, Jeremy Blum","doi":"10.1609/aaaiss.v3i1.31262","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31262","url":null,"abstract":"Generative Artificial Intelligence (AI) was incorporated into a competitive programming event that targeted undergraduate students, including those with little programming experience. The competition incorporated a range of challenge design approaches that promoted meaningful interaction with generative AI system, even while keeping the challenge difficulty level to an appropriate level. An analysis of survey responses and competition data showed that this format lowered barriers to participation, successfully engaged students throughout the competition, and increased the likelihood that they would participate in a similar event. In an extension of this work, a professional development workshop for high school teachers is being developed, along with a contest for high school students. Participant surveys and logs of interaction with the contest and generative AI systems will be analyzed to measure the effect of generative AI on student self-efficacy and suggest ways to integrate generative AI instruction into computer science curriculum.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"21 17","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Faithful Reasoning over Scientific Claims
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31209
N. Tan, Niket Tandon, David Wadden, Oyvind Tafjord, M. Gahegan, Michael Witbrock
Claim verification in scientific domains requires models that faithfully incorporate relevant knowledge from the ever-growing, vast existing literature. Unfaithful claim verification can lead to misinformation such as that observed during the COVID-19 pandemic. Fact-checking systems often fail to capture the complex relationship between claims and evidence, especially for ambiguous claims and implicit assumptions. Relying only on current LLMs poses challenges due to hallucinations and information-traceability issues. To address these challenges, our approach considers multiple viewpoints on the scientific literature, enabling the assessment of contradictory arguments and implicit assumptions. Our proposed inference method adds faithful reasoning to large language models by distilling information from diverse, relevant scientific abstracts. The method produces a verdict label that can be weighted by the reputation of the contributing scientific articles, together with an explanation that can be traced back to its sources. Our findings demonstrate that humans not only perceive our explanations to be significantly superior to those of the off-the-shelf model, but also judge them to faithfully enable tracing evidence back to its original sources.
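To illustrate how a verdict might be weighted by article reputation while remaining traceable, here is a minimal sketch. The verdict labels, reputation scores, and aggregation rule are assumptions made for the example, not the authors' pipeline; per-abstract verdicts would come from an upstream model.

```python
from collections import defaultdict

def weighted_verdict(per_abstract):
    """per_abstract: list of (verdict, reputation, source_id) triples,
    with verdict in {"SUPPORT", "REFUTE", "NEI"}. Returns the label with
    the largest reputation-weighted mass plus the sources behind it."""
    mass = defaultdict(float)
    sources = defaultdict(list)
    for verdict, reputation, source in per_abstract:
        mass[verdict] += reputation
        sources[verdict].append(source)
    label = max(mass, key=mass.get)
    return label, sources[label]   # verdict stays traceable to its sources

label, cited = weighted_verdict([
    ("SUPPORT", 0.9, "doi:10.1000/a"),
    ("REFUTE",  0.4, "doi:10.1000/b"),
    ("SUPPORT", 0.7, "doi:10.1000/c"),
])
print(label, cited)  # SUPPORT ['doi:10.1000/a', 'doi:10.1000/c']
```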
{"title":"Faithful Reasoning over Scientific Claims","authors":"N. Tan, Niket Tandon, David Wadden, Oyvind Tafjord, M. Gahegan, Michael Witbrock","doi":"10.1609/aaaiss.v3i1.31209","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31209","url":null,"abstract":"Claim verification in scientific domains requires models that faithfully incorporate relevant knowledge from the ever-growing, vast existing literature. \u0000Unfaithful claim verifications can lead to misinformation such as those observed during the COVID-19 pandemic. Fact-checking systems often fail to capture the complex relationship between claims and evidence, especially with ambiguous claims and implicit assumptions. Relying only on current LLMs poses challenges due to hallucinations and information traceability issues. To address these challenges, our approach considers multiple viewpoints onto the scientific literature, enabling the assessment of contradictory arguments and implicit assumptions. Our proposed inference method adds faithful reasoning to large language models by distilling information from diverse, relevant scientific abstracts. This method provides a verdict label that can be weighted by the reputation of the scientific articles and an explanation that can be traced back to sources. Our findings demonstrate that humans not only perceive our explanation to be significantly superior to the off-the-shelf model, but they also evaluate it as faithfully enabling the tracing of evidence back to its original sources.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"77 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141120971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Multi-Criterion Client Selection for Efficient Federated Learning
Pub Date: 2024-05-20 | DOI: 10.1609/aaaiss.v3i1.31227
Mehreen Tahir, Muhammad Intizar Ali
Federated Learning (FL) has received tremendous attention as a decentralized machine learning (ML) framework that allows distributed data owners to collaboratively train a global model without sharing raw data. Since FL trains the model directly on edge devices, the heterogeneity of participating clients in terms of data distribution, hardware capabilities, and network connectivity can significantly impact the overall performance of FL systems. Optimizing for model accuracy can extend training time due to the diverse and resource-constrained nature of edge devices, while minimizing training time can compromise the model's accuracy. Effective client selection thus becomes crucial to ensure that the training process is not only efficient but also capitalizes on the diverse data and computational capabilities of different devices. To this end, we propose FedPROM, a novel framework that treats client selection in FL as a multi-criteria optimization problem. By leveraging the PROMETHEE method, FedPROM ranks clients based on their suitability for a given FL task, considering multiple criteria such as system resources, network conditions, and data quality. This approach allows FedPROM to dynamically select the most appropriate set of clients for each learning round, optimizing both model accuracy and training efficiency. Our evaluations on diverse datasets demonstrate that FedPROM outperforms several state-of-the-art FL client selection protocols in terms of convergence speed and accuracy, highlighting the framework's effectiveness and the importance of multi-criteria client selection in FL.
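For concreteness, below is a compact PROMETHEE II net-flow ranking over hypothetical client criteria. The criteria, weights, and the simple "usual" (strict) preference function are illustrative; FedPROM's actual criteria and preference functions may differ.

```python
import numpy as np

def promethee_rank(scores, weights, maximize):
    """PROMETHEE II net-flow ranking with the 'usual' preference function.

    scores:   (n_clients, n_criteria) matrix of criterion values
    weights:  (n_criteria,) criterion weights summing to 1
    maximize: (n_criteria,) booleans; True if higher is better
    """
    n = scores.shape[0]
    s = np.where(maximize, scores, -scores)        # flip cost criteria
    phi = np.zeros(n)                              # net outranking flow
    for a in range(n):
        for b in range(n):
            if a == b:
                continue
            pref = np.sum(weights * (s[a] > s[b]))  # preference of a over b
            phi[a] += pref / (n - 1)                # positive flow share
            phi[b] -= pref / (n - 1)                # negative flow share
    return np.argsort(-phi)                         # best client first

# Hypothetical criteria: CPU score (max), bandwidth (max), label noise (min).
clients = np.array([[2.0, 10.0, 0.10],
                    [3.5,  4.0, 0.30],
                    [1.0, 20.0, 0.05]])
order = promethee_rank(clients,
                       weights=np.array([0.4, 0.3, 0.3]),
                       maximize=np.array([True, True, False]))
print("clients ranked most to least suitable:", order)
```

Re-running the ranking each round with fresh measurements gives the dynamic per-round selection the abstract describes.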
{"title":"Multi-Criterion Client Selection for Efficient Federated Learning","authors":"Mehreen Tahir, Muhammad Intizar Ali","doi":"10.1609/aaaiss.v3i1.31227","DOIUrl":"https://doi.org/10.1609/aaaiss.v3i1.31227","url":null,"abstract":"Federated Learning (FL) has received tremendous attention as a decentralized machine learning (ML) framework that allows distributed data owners to collaboratively train a global model without sharing raw data. Since FL trains the model directly on edge devices, the heterogeneity of participating clients in terms of data distribution, hardware capabilities and network connectivity can significantly impact the overall performance of FL systems. Optimizing for model accuracy could extend the training time due to the diverse and resource-constrained nature of edge devices while minimizing training time could compromise the model's accuracy. Effective client selection thus becomes crucial to ensure that the training process is not only efficient but also capitalizes on the diverse data and computational capabilities of different devices. To this end, we propose FedPROM, a novel framework that tackles client selection in FL as a multi-criteria optimization problem. By leveraging the PROMETHEE method, FedPROM ranks clients based on their suitability for a given FL task, considering multiple criteria such as system resources, network conditions, and data quality. This approach allows FedPROM to dynamically select the most appropriate set of clients for each learning round, optimizing both model accuracy and training efficiency. Our evaluations on diverse datasets demonstrate that FedPROM outperforms several state-of-the-art FL client selection protocols in terms of convergence speed, and accuracy, highlighting the framework's effectiveness and the importance of multi-criteria client selection in FL.","PeriodicalId":516827,"journal":{"name":"Proceedings of the AAAI Symposium Series","volume":"30 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141119064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}