首页 > 最新文献

Ai Magazine最新文献

英文 中文
Understanding political polarization using language models: A dataset and method 使用语言模型理解政治两极分化:数据集和方法
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-07 DOI: 10.1002/aaai.12104
Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo

Our paper aims to analyze political polarization in US political system using language models, and thereby help candidates make an informed decision. The availability of this information will help voters understand their candidates' views on the economy, healthcare, education, and other social issues. Our main contributions are a dataset extracted from Wikipedia that spans the past 120 years and a language model-based method that helps analyze how polarized a candidate is. Our data are divided into two parts, background information and political information about a candidate, since our hypothesis is that the political views of a candidate should be based on reason and be independent of factors such as birthplace, alma mater, and so forth. We further split this data into four phases chronologically, to help understand if and how the polarization amongst candidates changes. This data has been cleaned to remove biases. To understand the polarization, we begin by showing results from some classical language models in Word2Vec and Doc2Vec. And then use more powerful techniques like the Longformer, a transformer-based encoder, to assimilate more information and find the nearest neighbors of each candidate based on their political view and their background. The code and data for the project will be available here: “https://github.com/samirangode/Understanding_Polarization”

本文旨在利用语言模型分析美国政治体系中的政治两极分化,从而帮助候选人做出明智的决定。这些信息的可用性将帮助选民了解候选人对经济、医疗、教育和其他社会问题的看法。我们的主要贡献是从维基百科中提取的一个跨越过去120年的数据集,以及一种基于语言的方法,该方法有助于分析候选人的两极分化程度。我们的数据分为两部分,候选人的背景信息和政治信息,因为我们的假设是,候选人的政治观点应该基于理性,不受出生地、母校等因素的影响。我们进一步将这些数据按时间顺序分为四个阶段,以帮助了解候选人之间的两极分化是否以及如何变化。这些数据已被清除,以消除偏见。为了理解两极分化,我们首先展示Word2Vec和Doc2Vec中一些经典语言模型的结果。然后使用更强大的技术,如基于变压器的编码器Longformer,来吸收更多信息,并根据每个候选人的政治观点和背景找到他们最近的邻居。项目的代码和数据将在此处提供:“https://github.com/samirangode/Understanding_Polarization“
{"title":"Understanding political polarization using language models: A dataset and method","authors":"Samiran Gode,&nbsp;Supreeth Bare,&nbsp;Bhiksha Raj,&nbsp;Hyungon Yoo","doi":"10.1002/aaai.12104","DOIUrl":"https://doi.org/10.1002/aaai.12104","url":null,"abstract":"<p>Our paper aims to analyze political polarization in US political system using language models, and thereby help candidates make an informed decision. The availability of this information will help voters understand their candidates' views on the economy, healthcare, education, and other social issues. Our main contributions are a dataset extracted from Wikipedia that spans the past 120 years and a language model-based method that helps analyze how polarized a candidate is. Our data are divided into two parts, background information and political information about a candidate, since our hypothesis is that the political views of a candidate should be based on reason and be independent of factors such as birthplace, alma mater, and so forth. We further split this data into four phases chronologically, to help understand if and how the polarization amongst candidates changes. This data has been cleaned to remove biases. To understand the polarization, we begin by showing results from some classical language models in Word2Vec and Doc2Vec. And then use more powerful techniques like the Longformer, a transformer-based encoder, to assimilate more information and find the nearest neighbors of each candidate based on their political view and their background. The code and data for the project will be available here: “https://github.com/samirangode/Understanding_Polarization”</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 3","pages":"248-254"},"PeriodicalIF":0.9,"publicationDate":"2023-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12104","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50123694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable Image Classification: The Journey So Far and the Road Ahead 可解释图像分类:目前的旅程和未来的道路
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-01 DOI: 10.3390/ai4030033
V. Kamakshi, N. C. Krishnan
Explainable Artificial Intelligence (XAI) has emerged as a crucial research area to address the interpretability challenges posed by complex machine learning models. In this survey paper, we provide a comprehensive analysis of existing approaches in the field of XAI, focusing on the tradeoff between model accuracy and interpretability. Motivated by the need to address this tradeoff, we conduct an extensive review of the literature, presenting a multi-view taxonomy that offers a new perspective on XAI methodologies. We analyze various sub-categories of XAI methods, considering their strengths, weaknesses, and practical challenges. Moreover, we explore causal relationships in model explanations and discuss approaches dedicated to explaining cross-domain classifiers. The latter is particularly important in scenarios where training and test data are sampled from different distributions. Drawing insights from our analysis, we propose future research directions, including exploring explainable allied learning paradigms, developing evaluation metrics for both traditionally trained and allied learning-based classifiers, and applying neural architectural search techniques to minimize the accuracy–interpretability tradeoff. This survey paper provides a comprehensive overview of the state-of-the-art in XAI, serving as a valuable resource for researchers and practitioners interested in understanding and advancing the field.
可解释人工智能(XAI)已成为解决复杂机器学习模型所带来的可解释性挑战的关键研究领域。在这篇调查论文中,我们对XAI领域的现有方法进行了全面分析,重点关注模型准确性和可解释性之间的权衡。出于解决这种权衡的需要,我们对文献进行了广泛的回顾,提出了一个多视图分类法,为XAI方法提供了一个新的视角。我们分析了XAI方法的各个子类,考虑了它们的优点、缺点和实际挑战。此外,我们探讨了模型解释中的因果关系,并讨论了专门用于解释跨领域分类器的方法。后者在训练和测试数据来自不同分布的情况下尤为重要。根据我们的分析,我们提出了未来的研究方向,包括探索可解释的联合学习范式,为传统训练和基于联合学习的分类器开发评估指标,以及应用神经架构搜索技术来最大限度地减少准确性和可解释性之间的权衡。本调查报告提供了XAI最新技术的全面概述,为有兴趣了解和推进该领域的研究人员和实践者提供了宝贵的资源。
{"title":"Explainable Image Classification: The Journey So Far and the Road Ahead","authors":"V. Kamakshi, N. C. Krishnan","doi":"10.3390/ai4030033","DOIUrl":"https://doi.org/10.3390/ai4030033","url":null,"abstract":"Explainable Artificial Intelligence (XAI) has emerged as a crucial research area to address the interpretability challenges posed by complex machine learning models. In this survey paper, we provide a comprehensive analysis of existing approaches in the field of XAI, focusing on the tradeoff between model accuracy and interpretability. Motivated by the need to address this tradeoff, we conduct an extensive review of the literature, presenting a multi-view taxonomy that offers a new perspective on XAI methodologies. We analyze various sub-categories of XAI methods, considering their strengths, weaknesses, and practical challenges. Moreover, we explore causal relationships in model explanations and discuss approaches dedicated to explaining cross-domain classifiers. The latter is particularly important in scenarios where training and test data are sampled from different distributions. Drawing insights from our analysis, we propose future research directions, including exploring explainable allied learning paradigms, developing evaluation metrics for both traditionally trained and allied learning-based classifiers, and applying neural architectural search techniques to minimize the accuracy–interpretability tradeoff. This survey paper provides a comprehensive overview of the state-of-the-art in XAI, serving as a valuable resource for researchers and practitioners interested in understanding and advancing the field.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"39 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81398236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Deep Learning Techniques for Blind Image Super-Resolution within a High-Scale Multi-Domain Perspective 高尺度多域视角下盲图像超分辨率深度学习技术评价
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-01 DOI: 10.3390/ai4030032
V. A. de Santiago Júnior
Despite several solutions and experiments have been conducted recently addressing image super-resolution (SR), boosted by deep learning (DL), they do not usually design evaluations with high scaling factors. Moreover, the datasets are generally benchmarks which do not truly encompass significant diversity of domains to proper evaluate the techniques. It is also interesting to remark that blind SR is attractive for real-world scenarios since it is based on the idea that the degradation process is unknown, and, hence, techniques in this context rely basically on low-resolution (LR) images. In this article, we present a high-scale (8×) experiment which evaluates five recent DL techniques tailored for blind image SR: Adaptive Pseudo Augmentation (APA), Blind Image SR with Spatially Variant Degradations (BlindSR), Deep Alternating Network (DAN), FastGAN, and Mixture of Experts Super-Resolution (MoESR). We consider 14 datasets from five different broader domains (Aerial, Fauna, Flora, Medical, and Satellite), and another remark is that some of the DL approaches were designed for single-image SR but others not. Based on two no-reference metrics, NIQE and the transformer-based MANIQA score, MoESR can be regarded as the best solution although the perceptual quality of the created high-resolution (HR) images of all the techniques still needs to improve.
尽管在深度学习(DL)的推动下,最近已经进行了一些解决方案和实验来解决图像超分辨率(SR)问题,但它们通常不会设计具有高缩放因子的评估。此外,数据集通常是基准,并没有真正包含重要的领域多样性,以正确评估技术。同样有趣的是,盲SR对于现实世界的场景是有吸引力的,因为它是基于退化过程未知的想法,因此,在这种情况下,技术基本上依赖于低分辨率(LR)图像。在这篇文章中,我们提出了一个高尺度(8倍)实验,评估了最近为盲图像SR定制的五种深度学习技术:自适应伪增强(APA)、具有空间变异退化的盲图像SR (BlindSR)、深度交替网络(DAN)、FastGAN和混合专家超分辨率(MoESR)。我们考虑了来自五个不同更广泛领域(航空、动物、植物、医疗和卫星)的14个数据集,另一个评论是,一些深度学习方法是为单图像SR设计的,而另一些则不是。基于NIQE和基于变压器的MANIQA评分这两个无参考指标,MoESR可以被认为是最佳解决方案,尽管所有技术创建的高分辨率(HR)图像的感知质量仍有待提高。
{"title":"Evaluating Deep Learning Techniques for Blind Image Super-Resolution within a High-Scale Multi-Domain Perspective","authors":"V. A. de Santiago Júnior","doi":"10.3390/ai4030032","DOIUrl":"https://doi.org/10.3390/ai4030032","url":null,"abstract":"Despite several solutions and experiments have been conducted recently addressing image super-resolution (SR), boosted by deep learning (DL), they do not usually design evaluations with high scaling factors. Moreover, the datasets are generally benchmarks which do not truly encompass significant diversity of domains to proper evaluate the techniques. It is also interesting to remark that blind SR is attractive for real-world scenarios since it is based on the idea that the degradation process is unknown, and, hence, techniques in this context rely basically on low-resolution (LR) images. In this article, we present a high-scale (8×) experiment which evaluates five recent DL techniques tailored for blind image SR: Adaptive Pseudo Augmentation (APA), Blind Image SR with Spatially Variant Degradations (BlindSR), Deep Alternating Network (DAN), FastGAN, and Mixture of Experts Super-Resolution (MoESR). We consider 14 datasets from five different broader domains (Aerial, Fauna, Flora, Medical, and Satellite), and another remark is that some of the DL approaches were designed for single-image SR but others not. Based on two no-reference metrics, NIQE and the transformer-based MANIQA score, MoESR can be regarded as the best solution although the perceptual quality of the created high-resolution (HR) images of all the techniques still needs to improve.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"60 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80087246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI and core electoral processes: Mapping the horizons 人工智能和核心选举进程:规划前景
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-01 DOI: 10.1002/aaai.12105
Deepak P, Stanley Simoes, Muiris MacCarthaigh

It is well documented that there has been significant enthusiasm across the globe in respect of using AI for all forms of social activity. However, the electoral process – the time, place, and manner of elections within democratic nations – is one of few sectors in which there has been limited penetration of AI. Electoral management bodies in many countries have recently started exploring and deliberating over the use of AI in the electoral process. In this paper, we consider five avenues within the core electoral process which have potential for AI usage, and map the challenges involved in using AI within them. These five avenues are: voter list maintenance, determining polling booth locations, polling booth protection processes, voter authentication, and video monitoring of elections. Within each avenue, we lay down the context, illustrate current or potential usage of AI, and discuss extant or potential ramifications of AI usage, as well as potential directions for mitigating risks when considering AI usage. We believe that the scant current usage of AI within electoral processes provides a very rare opportunity to deliberate on the risks and mitigation possibilities prior to actual and widespread AI deployment. This paper is an attempt to map the horizons of risks and opportunities in using AI within electoral processes and to help shape the debate around the topic.

有充分的证据表明,在将人工智能用于所有形式的社会活动方面,全球都表现出了极大的热情。然而,选举过程——民主国家内选举的时间、地点和方式——是人工智能渗透有限的少数部门之一。许多国家的选举管理机构最近开始探索和审议在选举过程中使用人工智能的问题。在本文中,我们考虑了核心选举过程中有可能使用人工智能的五种途径,并绘制了在其中使用人工智能所涉及的挑战。这五条途径是:选民名单维护、确定投票站位置、投票站保护程序、选民身份验证和选举视频监控。在每一种途径中,我们都会列出背景,说明人工智能的当前或潜在使用,并讨论人工智能使用的现存或潜在后果,以及在考虑人工智能使用时降低风险的潜在方向。我们认为,目前在选举过程中很少使用人工智能,这为在实际广泛部署人工智能之前仔细考虑风险和缓解可能性提供了一个非常难得的机会。本文试图绘制在选举过程中使用人工智能的风险和机会范围,并帮助形成围绕该主题的辩论。
{"title":"AI and core electoral processes: Mapping the horizons","authors":"Deepak P,&nbsp;Stanley Simoes,&nbsp;Muiris MacCarthaigh","doi":"10.1002/aaai.12105","DOIUrl":"https://doi.org/10.1002/aaai.12105","url":null,"abstract":"<p>It is well documented that there has been significant enthusiasm across the globe in respect of using AI for all forms of social activity. However, the electoral process – the time, place, and manner of elections within democratic nations – is one of few sectors in which there has been limited penetration of AI. Electoral management bodies in many countries have recently started exploring and deliberating over the use of AI in the electoral process. In this paper, we consider five avenues within the core electoral process which have potential for AI usage, and map the challenges involved in using AI within them. These five avenues are: voter list maintenance, determining polling booth locations, polling booth protection processes, voter authentication, and video monitoring of elections. Within each avenue, we lay down the context, illustrate current or potential usage of AI, and discuss extant or potential ramifications of AI usage, as well as potential directions for mitigating risks when considering AI usage. We believe that the scant current usage of AI within electoral processes provides a very rare opportunity to deliberate on the risks and mitigation possibilities prior to actual and widespread AI deployment. This paper is an attempt to map the horizons of risks and opportunities in using AI within electoral processes and to help shape the debate around the topic.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 3","pages":"218-239"},"PeriodicalIF":0.9,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12105","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50115102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying Few-Shot Learning for In-the-Wild Camera-Trap Species Classification 应用少镜头学习进行野外相机陷阱物种分类
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-31 DOI: 10.3390/ai4030031
Haoyu Chen, S. Lindshield, P. Ndiaye, Yaya Hamady Ndiaye, J. Pruetz, A. Reibman
Few-shot learning (FSL) describes the challenge of learning a new task using a minimum amount of labeled data, and we have observed significant progress made in this area. In this paper, we explore the effectiveness of the FSL theory by considering a real-world problem where labels are hard to obtain. To assist a large study on chimpanzee hunting activities, we aim to classify various animal species that appear in our in-the-wild camera traps located in Senegal. Using the philosophy of FSL, we aim to train an FSL network to learn to separate animal species using large public datasets and implement the network on our data with its novel species/classes and unseen environments, needing only to label a few images per new species. Here, we first discuss constraints and challenges caused by having in-the-wild uncurated data, which are often not addressed in benchmark FSL datasets. Considering these new challenges, we create two experiments and corresponding evaluation metrics to determine a network’s usefulness in a real-world implementation scenario. We then compare results from various FSL networks, and describe how factors may affect a network’s potential real-world usefulness. We consider network design factors such as distance metrics or extra pre-training, and examine their roles in a real-world implementation setting. We also consider additional factors such as support set selection and ease of implementation, which are usually ignored when a benchmark dataset has been established.
FSL (Few-shot learning)描述了使用最少的标记数据学习新任务的挑战,我们已经观察到在这一领域取得了重大进展。在本文中,我们通过考虑一个难以获得标签的现实问题来探讨FSL理论的有效性。为了协助一项关于黑猩猩狩猎活动的大型研究,我们的目标是对出现在我们位于塞内加尔的野外相机陷阱中的各种动物进行分类。利用FSL的理念,我们的目标是训练一个FSL网络来学习使用大型公共数据集分离动物物种,并在我们的数据上使用新物种/类别和看不见的环境实现网络,只需要标记每个新物种的一些图像。在这里,我们首先讨论由未经整理的野外数据引起的约束和挑战,这在基准FSL数据集中通常没有得到解决。考虑到这些新的挑战,我们创建了两个实验和相应的评估指标,以确定网络在现实世界实现场景中的有用性。然后,我们比较了来自各种FSL网络的结果,并描述了可能影响网络潜在现实用途的因素。我们考虑网络设计因素,如距离度量或额外的预训练,并检查它们在现实世界的实现设置中的作用。我们还考虑了其他因素,如支持集选择和实现的便利性,这些通常在建立基准数据集时被忽略。
{"title":"Applying Few-Shot Learning for In-the-Wild Camera-Trap Species Classification","authors":"Haoyu Chen, S. Lindshield, P. Ndiaye, Yaya Hamady Ndiaye, J. Pruetz, A. Reibman","doi":"10.3390/ai4030031","DOIUrl":"https://doi.org/10.3390/ai4030031","url":null,"abstract":"Few-shot learning (FSL) describes the challenge of learning a new task using a minimum amount of labeled data, and we have observed significant progress made in this area. In this paper, we explore the effectiveness of the FSL theory by considering a real-world problem where labels are hard to obtain. To assist a large study on chimpanzee hunting activities, we aim to classify various animal species that appear in our in-the-wild camera traps located in Senegal. Using the philosophy of FSL, we aim to train an FSL network to learn to separate animal species using large public datasets and implement the network on our data with its novel species/classes and unseen environments, needing only to label a few images per new species. Here, we first discuss constraints and challenges caused by having in-the-wild uncurated data, which are often not addressed in benchmark FSL datasets. Considering these new challenges, we create two experiments and corresponding evaluation metrics to determine a network’s usefulness in a real-world implementation scenario. We then compare results from various FSL networks, and describe how factors may affect a network’s potential real-world usefulness. We consider network design factors such as distance metrics or extra pre-training, and examine their roles in a real-world implementation setting. We also consider additional factors such as support set selection and ease of implementation, which are usually ignored when a benchmark dataset has been established.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"4 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80887306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In Memoriam: Roger C. Schank, 1946–2023 纪念:罗杰·C·尚克,1946–2023
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-31 DOI: 10.1002/aaai.12106
Richard Granger, David Leake, Christopher K. Riesbeck
<p>A summary of Roger Schank's career might initially appear fairly typical for an eminent academic. Following a PhD in linguistics at the University of Texas at Austin in 1969, Roger held faculty positions in linguistics and computer science at Stanford, computer science and psychology at Yale, and computer science and education at Northwestern. He served terms as chair of computer science at both Yale and Northwestern. After Northwestern, he was Chief Educational Officer for Carnegie Mellon's Silicon Valley campus. He authored over 30 books spanning AI, cognitive science, psychology and education. He advised nearly 50 PhD students. He was a Fellow of AAAI.</p><p>But the hundreds of people who worked or interacted with Roger over the years know there was nothing typical about him. Roger was a force of nature. He questioned everything, especially (and gleefully) focusing on topics that were supposed to be canon.  He came in, broke things apart, and built new things in their place. In linguistics, he rejected the Chomskyian approach to divorce the study of language from the study of meaning, with his seminal work on semantic primitives. In AI, where language processing focused on the propositions, he argued for the importance of much larger memory structures such as scripts and plans, and for memory processes, such as remindings, for modeling understanding. He argued for examples, that is, cases, rather than logical rules, for modeling human reasoning. Much of his work elicited initial pushback, which then transitioned to wary toleration, and finally arrived at such widespread acceptance that now his ideas are often assumed without attribution.</p><p>Roger relished debate, and engaged avidly in ongoing discourse on the issues he studied.  Where many labs have weekly “discussions” or “chats”, Roger fashioned weekly “Friday fights” and an “Indefensible position” seminar. One facet of these was the Socratic investigation of complex topics; another was as a crucible for the courage to make bold claims and the skills to distill, defend, and question them.  He questioned loudly. But under the disputative bearing, to those who knew and worked with him he had abundant loyalty and good will.</p><p>He was an explorer of the mind and of the world, an astute observer of humans and human nature: an intuitive psychologist. He had a knack for identifying key questions, always noticing customs, behaviors, and anomalies to explain, gathering data and categorizing to generate theories. His travels and knowledge of wine and food were a rich source of examples for his work and camaraderie.  He did things in a big way, from academic passions like studying how language and the mind work and how people learn, to personal passions like food and football. Many stories about Roger occur at restaurants because meals were events. Many fans watch weekend football, but Roger created a room with half a dozen separate TVs, most with picture in picture, to monitor a dozen games s
罗杰·尚克(Roger Schank)职业生涯的总结最初可能对一位杰出的学者来说相当典型。1969年,罗杰在得克萨斯大学奥斯汀分校获得语言学博士学位后,曾在斯坦福大学语言学和计算机科学、耶鲁大学计算机科学和心理学以及西北大学计算机科学与教育学院担任教职。他曾在耶鲁大学和西北大学担任计算机科学系主任。西北大学毕业后,他担任卡内基梅隆大学硅谷校区的首席教育官。他写了30多本书,涵盖人工智能、认知科学、心理学和教育。他为近50名博士生提供咨询。他是AAAI的会员。但这些年来与罗杰共事或互动的数百人都知道,罗杰并没有什么典型之处。罗杰是一股自然的力量。他质疑一切,尤其是(高兴地)关注那些本应是正典的话题。他进来了,把东西拆开,在它们的位置上建造新的东西。在语言学方面,他拒绝了乔姆斯基的方法,即将语言研究与意义研究分开,并在语义原语方面进行了开创性的工作。在人工智能中,语言处理侧重于命题,他认为更大的记忆结构(如脚本和计划)的重要性,以及记忆过程(如回忆)对理解建模的重要性。他为人类推理建模举例,即案例,而不是逻辑规则。他的大部分作品最初都遭到了抵制,后来转变为谨慎的宽容,最终被广泛接受,以至于现在他的想法往往被认为没有归属。罗杰喜欢辩论,并热衷于就他所研究的问题进行持续的讨论。在许多实验室每周都会进行“讨论”或“聊天”的地方,罗杰每周都会举办“周五打架”和“不可辩护的立场”研讨会。其中一个方面是对复杂主题的苏格拉底式调查;另一个则是培养大胆主张的勇气和提炼、辩护和质疑这些主张的技能。他大声质问。但在争论的气氛下,对于那些认识他并与他共事的人来说,他有着丰富的忠诚和善意。他是心灵和世界的探索者,是人类和人性的敏锐观察者:一位直觉心理学家。他有识别关键问题的诀窍,总是注意到需要解释的习俗、行为和异常现象,收集数据并进行分类以生成理论。他的旅行以及对葡萄酒和食物的了解是他工作和同志情谊的丰富例证。他做了很多事情,从研究语言和思维如何运作以及人们如何学习等学术激情,到食物和足球等个人激情。很多关于罗杰的故事都发生在餐馆里,因为吃饭是大事。许多球迷观看周末的足球比赛,但罗杰创造了一个房间,里面有六台独立的电视,大多数都是画中画的,可以同时观看十几场比赛。罗杰坚信发展社区,不仅在研究实验室和部门,而且在国家和国际层面。他创立了新的领域,以便在认知科学和教育领域建立持久的志同道合的社区。他对人类记忆的研究开创了基于案例的推理领域,并于今年举行了第31届国际会议。他是认知科学领域、认知科学学会和《认知科学》杂志的联合创始人。他在斯坦福大学、耶鲁大学和西北大学的博士生,以及他的公司和学习科学研究所的许多开发人员、艺术家和内容创作者,将证明他一次又一次建立的社区和文化传统。例如,当博士生完成论文时举办派对并不罕见,但罗杰的派对有额外的派对。每个拥有博士学位的学生(以前和现在)都必须向团队展示一种天赋(真实的、想象的、假装的或滑稽的),最终新的博士学位揭示了他们的秘密天赋。这种仪式在学生之间创造了一种超越学术的人际关系。随着他的热情转向教育,他领导了学习科学领域的建立,将教育、认知科学和计算机科学相结合。1989年,他在西北大学成立了学习科学研究所,并在教育学院创建了第一个学习科学硕士和博士项目。他开创了人工智能故事研究的先河,推动了基于故事的创新教育环境的发展。他自己是一个伟大的故事讲述者;他在个人生活和书籍中都用故事来连接、阐释和教育。最后,罗杰非常关心影响。
{"title":"In Memoriam: Roger C. Schank, 1946–2023","authors":"Richard Granger,&nbsp;David Leake,&nbsp;Christopher K. Riesbeck","doi":"10.1002/aaai.12106","DOIUrl":"https://doi.org/10.1002/aaai.12106","url":null,"abstract":"&lt;p&gt;A summary of Roger Schank's career might initially appear fairly typical for an eminent academic. Following a PhD in linguistics at the University of Texas at Austin in 1969, Roger held faculty positions in linguistics and computer science at Stanford, computer science and psychology at Yale, and computer science and education at Northwestern. He served terms as chair of computer science at both Yale and Northwestern. After Northwestern, he was Chief Educational Officer for Carnegie Mellon's Silicon Valley campus. He authored over 30 books spanning AI, cognitive science, psychology and education. He advised nearly 50 PhD students. He was a Fellow of AAAI.&lt;/p&gt;&lt;p&gt;But the hundreds of people who worked or interacted with Roger over the years know there was nothing typical about him. Roger was a force of nature. He questioned everything, especially (and gleefully) focusing on topics that were supposed to be canon.  He came in, broke things apart, and built new things in their place. In linguistics, he rejected the Chomskyian approach to divorce the study of language from the study of meaning, with his seminal work on semantic primitives. In AI, where language processing focused on the propositions, he argued for the importance of much larger memory structures such as scripts and plans, and for memory processes, such as remindings, for modeling understanding. He argued for examples, that is, cases, rather than logical rules, for modeling human reasoning. Much of his work elicited initial pushback, which then transitioned to wary toleration, and finally arrived at such widespread acceptance that now his ideas are often assumed without attribution.&lt;/p&gt;&lt;p&gt;Roger relished debate, and engaged avidly in ongoing discourse on the issues he studied.  Where many labs have weekly “discussions” or “chats”, Roger fashioned weekly “Friday fights” and an “Indefensible position” seminar. One facet of these was the Socratic investigation of complex topics; another was as a crucible for the courage to make bold claims and the skills to distill, defend, and question them.  He questioned loudly. But under the disputative bearing, to those who knew and worked with him he had abundant loyalty and good will.&lt;/p&gt;&lt;p&gt;He was an explorer of the mind and of the world, an astute observer of humans and human nature: an intuitive psychologist. He had a knack for identifying key questions, always noticing customs, behaviors, and anomalies to explain, gathering data and categorizing to generate theories. His travels and knowledge of wine and food were a rich source of examples for his work and camaraderie.  He did things in a big way, from academic passions like studying how language and the mind work and how people learn, to personal passions like food and football. Many stories about Roger occur at restaurants because meals were events. Many fans watch weekend football, but Roger created a room with half a dozen separate TVs, most with picture in picture, to monitor a dozen games s","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 3","pages":"343-344"},"PeriodicalIF":0.9,"publicationDate":"2023-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12106","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50148872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Alzheimer’s Disease and Brain Tumor Detection Using Deep Learning with Particle Swarm Optimization 基于粒子群优化的深度学习改进阿尔茨海默病和脑肿瘤检测
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-28 DOI: 10.3390/ai4030030
R. Ibrahim, Rawan Ghnemat, Q. Abu Al-haija
Convolutional Neural Networks (CNNs) have exhibited remarkable potential in effectively tackling the intricate task of classifying MRI images, specifically in Alzheimer’s disease detection and brain tumor identification. While CNNs optimize their parameters automatically through training processes, finding the optimal values for these parameters can still be a challenging task due to the complexity of the search space and the potential for suboptimal results. Consequently, researchers often encounter difficulties determining the ideal parameter settings for CNNs. This challenge necessitates using trial-and-error methods or expert judgment, as the search for the best combination of parameters involves exploring a vast space of possibilities. Despite the automatic optimization during training, the process does not guarantee finding the globally-optimal parameter values. Hence, researchers often rely on iterative experimentation and expert knowledge to fine-tune these parameters and maximize CNN performance. This poses a significant obstacle in developing real-world applications that leverage CNNs for MRI image analysis. This paper presents a new hybrid model that combines the Particle Swarm Optimization (PSO) algorithm with CNNs to enhance detection and classification capabilities. Our method utilizes the PSO algorithm to determine the optimal configuration of CNN hyper-parameters. Subsequently, these optimized parameters are applied to the CNN architectures for classification. As a result, our hybrid model exhibits improved prediction accuracy for brain diseases while reducing the loss of function value. To evaluate the performance of our proposed model, we conducted experiments using three benchmark datasets. Two datasets were utilized for Alzheimer’s disease: the Alzheimer’s Disease Neuroimaging Initiative (ADNI) and an international dataset from Kaggle. The third dataset focused on brain tumors. The experimental assessment demonstrated the superiority of our proposed model, achieving unprecedented accuracy rates of 98.50%, 98.83%, and 97.12% for the datasets mentioned earlier, respectively.
卷积神经网络(cnn)在有效处理复杂的MRI图像分类任务方面表现出了显著的潜力,特别是在阿尔茨海默病检测和脑肿瘤识别方面。虽然cnn通过训练过程自动优化参数,但由于搜索空间的复杂性和潜在的次优结果,找到这些参数的最优值仍然是一项具有挑战性的任务。因此,研究人员经常在确定cnn的理想参数设置时遇到困难。这一挑战需要使用试错法或专家判断,因为寻找参数的最佳组合涉及探索广阔的可能性空间。尽管在训练过程中进行了自动优化,但该过程不能保证找到全局最优的参数值。因此,研究人员经常依靠迭代实验和专家知识来微调这些参数并最大化CNN的性能。这对开发利用cnn进行MRI图像分析的实际应用构成了重大障碍。本文提出了一种新的混合模型,将粒子群优化算法(PSO)与cnn相结合,以提高检测和分类能力。我们的方法利用粒子群算法来确定CNN超参数的最优配置。随后,将这些优化后的参数应用到CNN架构中进行分类。因此,我们的混合模型在减少功能价值损失的同时,对脑部疾病的预测精度有所提高。为了评估我们提出的模型的性能,我们使用三个基准数据集进行了实验。两个数据集用于阿尔茨海默病:阿尔茨海默病神经影像学倡议(ADNI)和来自Kaggle的国际数据集。第三个数据集中于脑肿瘤。实验评估证明了我们提出的模型的优越性,在前面提到的数据集上分别达到了前所未有的98.50%、98.83%和97.12%的准确率。
{"title":"Improving Alzheimer’s Disease and Brain Tumor Detection Using Deep Learning with Particle Swarm Optimization","authors":"R. Ibrahim, Rawan Ghnemat, Q. Abu Al-haija","doi":"10.3390/ai4030030","DOIUrl":"https://doi.org/10.3390/ai4030030","url":null,"abstract":"Convolutional Neural Networks (CNNs) have exhibited remarkable potential in effectively tackling the intricate task of classifying MRI images, specifically in Alzheimer’s disease detection and brain tumor identification. While CNNs optimize their parameters automatically through training processes, finding the optimal values for these parameters can still be a challenging task due to the complexity of the search space and the potential for suboptimal results. Consequently, researchers often encounter difficulties determining the ideal parameter settings for CNNs. This challenge necessitates using trial-and-error methods or expert judgment, as the search for the best combination of parameters involves exploring a vast space of possibilities. Despite the automatic optimization during training, the process does not guarantee finding the globally-optimal parameter values. Hence, researchers often rely on iterative experimentation and expert knowledge to fine-tune these parameters and maximize CNN performance. This poses a significant obstacle in developing real-world applications that leverage CNNs for MRI image analysis. This paper presents a new hybrid model that combines the Particle Swarm Optimization (PSO) algorithm with CNNs to enhance detection and classification capabilities. Our method utilizes the PSO algorithm to determine the optimal configuration of CNN hyper-parameters. Subsequently, these optimized parameters are applied to the CNN architectures for classification. As a result, our hybrid model exhibits improved prediction accuracy for brain diseases while reducing the loss of function value. To evaluate the performance of our proposed model, we conducted experiments using three benchmark datasets. Two datasets were utilized for Alzheimer’s disease: the Alzheimer’s Disease Neuroimaging Initiative (ADNI) and an international dataset from Kaggle. The third dataset focused on brain tumors. The experimental assessment demonstrated the superiority of our proposed model, achieving unprecedented accuracy rates of 98.50%, 98.83%, and 97.12% for the datasets mentioned earlier, respectively.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"42 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87853976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
High-Performance and Lightweight AI Model for Robot Vacuum Cleaners with Low Bitwidth Strong Non-Uniform Quantization 低位宽强非均匀量化机器人吸尘器的高性能轻量级AI模型
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-27 DOI: 10.3390/ai4030029
Qian Huang, Zhimin Tang
Artificial intelligence (AI) plays a critical role in the operation of robot vacuum cleaners, enabling them to intelligently navigate to clean and avoid indoor obstacles. Due to limited computational resources, manufacturers must balance performance and cost. This necessitates the development of lightweight AI models that can achieve high performance. Traditional uniform weight quantization assigns the same number of levels to all weights, regardless of their distribution or importance. Consequently, this lack of adaptability may lead to sub-optimal quantization results, as the quantization levels do not align with the statistical properties of the weights. To address this challenge, in this work, we propose a new technique called low bitwidth strong non-uniform quantization, which largely reduces the memory footprint of AI models while maintaining high accuracy. Our proposed non-uniform quantization method, as opposed to traditional uniform quantization, aims to align with the actual weight distribution of well-trained neural network models. The proposed quantization scheme builds upon the observation of weight distribution characteristics in AI models and aims to leverage this knowledge to enhance the efficiency of neural network implementations. Additionally, we adjust the input image size to reduce the computational and memory demands of AI models. The goal is to identify an appropriate image size and its corresponding AI models that can be used in resource-constrained robot vacuum cleaners while still achieving acceptable accuracy on the object classification task. Experimental results indicate that when compared to the state-of-the-art AI models in the literature, the proposed AI model achieves a 2-fold decrease in memory usage from 15.51 MB down to 7.68 MB while maintaining the same accuracy of around 93%. In addition, the proposed non-uniform quantization model reduces memory usage by 20 times (from 15.51 MB down to 0.78 MB) with a slight accuracy drop of 3.11% (the classification accuracy is still above 90%). Thus, our proposed high-performance and lightweight AI model strikes an excellent balance between model complexity, classification accuracy, and computational resources for robot vacuum cleaners.
人工智能(AI)在机器人吸尘器的操作中起着至关重要的作用,使它们能够智能地导航,清洁和避开室内障碍物。由于有限的计算资源,制造商必须平衡性能和成本。这就需要开发能够实现高性能的轻量级AI模型。传统的均匀权重量化为所有权重分配相同数量的级别,而不考虑它们的分布或重要性。因此,这种适应性的缺乏可能导致次优量化结果,因为量化级别与权重的统计属性不一致。为了解决这一挑战,在这项工作中,我们提出了一种称为低位宽强非均匀量化的新技术,该技术在保持高精度的同时大大减少了人工智能模型的内存占用。我们提出的非均匀量化方法,与传统的均匀量化方法相反,旨在与训练良好的神经网络模型的实际权重分布保持一致。提出的量化方案建立在对人工智能模型中权重分布特征的观察基础上,旨在利用这一知识来提高神经网络实现的效率。此外,我们调整输入图像的大小,以减少人工智能模型的计算和内存需求。目标是确定合适的图像大小及其相应的人工智能模型,这些模型可以用于资源受限的机器人吸尘器,同时在对象分类任务上仍然达到可接受的精度。实验结果表明,与文献中最先进的人工智能模型相比,所提出的人工智能模型的内存使用减少了2倍,从15.51 MB降至7.68 MB,同时保持了93%左右的相同准确率。此外,本文提出的非均匀量化模型将内存使用减少了20倍(从15.51 MB下降到0.78 MB),准确率下降了3.11%(分类准确率仍在90%以上)。因此,我们提出的高性能轻量级AI模型在机器人吸尘器的模型复杂性、分类精度和计算资源之间取得了很好的平衡。
{"title":"High-Performance and Lightweight AI Model for Robot Vacuum Cleaners with Low Bitwidth Strong Non-Uniform Quantization","authors":"Qian Huang, Zhimin Tang","doi":"10.3390/ai4030029","DOIUrl":"https://doi.org/10.3390/ai4030029","url":null,"abstract":"Artificial intelligence (AI) plays a critical role in the operation of robot vacuum cleaners, enabling them to intelligently navigate to clean and avoid indoor obstacles. Due to limited computational resources, manufacturers must balance performance and cost. This necessitates the development of lightweight AI models that can achieve high performance. Traditional uniform weight quantization assigns the same number of levels to all weights, regardless of their distribution or importance. Consequently, this lack of adaptability may lead to sub-optimal quantization results, as the quantization levels do not align with the statistical properties of the weights. To address this challenge, in this work, we propose a new technique called low bitwidth strong non-uniform quantization, which largely reduces the memory footprint of AI models while maintaining high accuracy. Our proposed non-uniform quantization method, as opposed to traditional uniform quantization, aims to align with the actual weight distribution of well-trained neural network models. The proposed quantization scheme builds upon the observation of weight distribution characteristics in AI models and aims to leverage this knowledge to enhance the efficiency of neural network implementations. Additionally, we adjust the input image size to reduce the computational and memory demands of AI models. The goal is to identify an appropriate image size and its corresponding AI models that can be used in resource-constrained robot vacuum cleaners while still achieving acceptable accuracy on the object classification task. Experimental results indicate that when compared to the state-of-the-art AI models in the literature, the proposed AI model achieves a 2-fold decrease in memory usage from 15.51 MB down to 7.68 MB while maintaining the same accuracy of around 93%. In addition, the proposed non-uniform quantization model reduces memory usage by 20 times (from 15.51 MB down to 0.78 MB) with a slight accuracy drop of 3.11% (the classification accuracy is still above 90%). Thus, our proposed high-performance and lightweight AI model strikes an excellent balance between model complexity, classification accuracy, and computational resources for robot vacuum cleaners.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"12 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76532360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Federated Learning for IoT Intrusion Detection 物联网入侵检测的联邦学习
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-24 DOI: 10.3390/ai4030028
Riccardo Lazzarini, H. Tianfield, V. Charissis
The number of Internet of Things (IoT) devices has increased considerably in the past few years, resulting in a large growth of cyber attacks on IoT infrastructure. As part of a defense in depth approach to cybersecurity, intrusion detection systems (IDSs) have acquired a key role in attempting to detect malicious activities efficiently. Most modern approaches to IDS in IoT are based on machine learning (ML) techniques. The majority of these are centralized, which implies the sharing of data from source devices to a central server for classification. This presents potentially crucial issues related to privacy of user data as well as challenges in data transfers due to their volumes. In this article, we evaluate the use of federated learning (FL) as a method to implement intrusion detection in IoT environments. FL is an alternative, distributed method to centralized ML models, which has seen a surge of interest in IoT intrusion detection recently. In our implementation, we evaluate FL using a shallow artificial neural network (ANN) as the shared model and federated averaging (FedAvg) as the aggregation algorithm. The experiments are completed on the ToN_IoT and CICIDS2017 datasets in binary and multiclass classification. Classification is performed by the distributed devices using their own data. No sharing of data occurs among participants, maintaining data privacy. When compared against a centralized approach, results have shown that a collaborative FL IDS can be an efficient alternative, in terms of accuracy, precision, recall and F1-score, making it a viable option as an IoT IDS. Additionally, with these results as baseline, we have evaluated alternative aggregation algorithms, namely FedAvgM, FedAdam and FedAdagrad, in the same setting by using the Flower FL framework. The results from the evaluation show that, in our scenario, FedAvg and FedAvgM tend to perform better compared to the two adaptive algorithms, FedAdam and FedAdagrad.
物联网(IoT)设备的数量在过去几年中大幅增加,导致对物联网基础设施的网络攻击大量增加。作为网络安全深度防御方法的一部分,入侵检测系统(ids)在有效检测恶意活动方面发挥了关键作用。物联网中大多数现代IDS方法都是基于机器学习(ML)技术。其中大多数是集中式的,这意味着将数据从源设备共享到中央服务器以进行分类。这就提出了与用户数据隐私相关的潜在关键问题,以及由于数据量而导致的数据传输挑战。在本文中,我们评估了联邦学习(FL)作为在物联网环境中实现入侵检测的方法的使用。FL是集中式机器学习模型的另一种分布式方法,最近对物联网入侵检测的兴趣激增。在我们的实现中,我们使用浅人工神经网络(ANN)作为共享模型和联邦平均(FedAvg)作为聚合算法来评估FL。在ToN_IoT和CICIDS2017数据集上完成了二值分类和多类分类实验。分类由分布式设备使用它们自己的数据执行。参与者之间不会共享数据,维护数据隐私。与集中式方法相比,结果表明,在准确性、精密度、召回率和f1分数方面,协作式FL IDS是一种有效的替代方案,使其成为物联网IDS的可行选择。此外,以这些结果为基准,我们使用Flower FL框架在相同的设置下评估了备选聚合算法,即FedAvgM, FedAdam和FedAdagrad。评估结果表明,在我们的场景中,FedAvg和FedAvgM比FedAdam和FedAdagrad两种自适应算法表现得更好。
{"title":"Federated Learning for IoT Intrusion Detection","authors":"Riccardo Lazzarini, H. Tianfield, V. Charissis","doi":"10.3390/ai4030028","DOIUrl":"https://doi.org/10.3390/ai4030028","url":null,"abstract":"The number of Internet of Things (IoT) devices has increased considerably in the past few years, resulting in a large growth of cyber attacks on IoT infrastructure. As part of a defense in depth approach to cybersecurity, intrusion detection systems (IDSs) have acquired a key role in attempting to detect malicious activities efficiently. Most modern approaches to IDS in IoT are based on machine learning (ML) techniques. The majority of these are centralized, which implies the sharing of data from source devices to a central server for classification. This presents potentially crucial issues related to privacy of user data as well as challenges in data transfers due to their volumes. In this article, we evaluate the use of federated learning (FL) as a method to implement intrusion detection in IoT environments. FL is an alternative, distributed method to centralized ML models, which has seen a surge of interest in IoT intrusion detection recently. In our implementation, we evaluate FL using a shallow artificial neural network (ANN) as the shared model and federated averaging (FedAvg) as the aggregation algorithm. The experiments are completed on the ToN_IoT and CICIDS2017 datasets in binary and multiclass classification. Classification is performed by the distributed devices using their own data. No sharing of data occurs among participants, maintaining data privacy. When compared against a centralized approach, results have shown that a collaborative FL IDS can be an efficient alternative, in terms of accuracy, precision, recall and F1-score, making it a viable option as an IoT IDS. Additionally, with these results as baseline, we have evaluated alternative aggregation algorithms, namely FedAvgM, FedAdam and FedAdagrad, in the same setting by using the Flower FL framework. The results from the evaluation show that, in our scenario, FedAvg and FedAvgM tend to perform better compared to the two adaptive algorithms, FedAdam and FedAdagrad.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"105 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80869430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Training Artificial Neural Networks Using a Global Optimization Method That Utilizes Neural Networks 利用神经网络的全局优化方法训练人工神经网络
IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-20 DOI: 10.3390/ai4030027
I. Tsoulos, Alexandros T. Tzallas
Perhaps one of the best-known machine learning models is the artificial neural network, where a number of parameters must be adjusted to learn a wide range of practical problems from areas such as physics, chemistry, medicine, etc. Such problems can be reduced to pattern recognition problems and then modeled from artificial neural networks, whether these problems are classification problems or regression problems. To achieve the goal of neural networks, they must be trained by appropriately adjusting their parameters using some global optimization methods. In this work, the application of a recent global minimization technique is suggested for the adjustment of neural network parameters. In this technique, an approximation of the objective function to be minimized is created using artificial neural networks and then sampling is performed from the approximation function and not the original one. Therefore, in the present work, learning of the parameters of artificial neural networks is performed using other neural networks. The new training method was tested on a series of well-known problems, a comparative study was conducted against other neural network parameter tuning techniques, and the results were more than promising. From what was seen after performing the experiments and comparing the proposed technique with others that have been used for classification datasets as well as regression datasets, there was a significant difference in the performance of the proposed technique, starting with 30% for classification datasets and reaching 50% for regression problems. However, the proposed technique, because it presupposes the use of global optimization techniques involving artificial neural networks, may require significantly higher execution time than other techniques.
也许最著名的机器学习模型之一是人工神经网络,其中必须调整许多参数才能从物理,化学,医学等领域学习广泛的实际问题。这些问题可以简化为模式识别问题,然后从人工神经网络建模,无论这些问题是分类问题还是回归问题。为了实现神经网络的目标,必须使用一些全局优化方法,通过适当调整神经网络的参数来训练神经网络。在这项工作中,建议应用一种最新的全局最小化技术来调整神经网络参数。在这种技术中,使用人工神经网络创建要最小化的目标函数的近似值,然后从近似值而不是原始函数中进行采样。因此,在目前的工作中,人工神经网络的参数学习是利用其他神经网络来完成的。在一系列已知问题上对该方法进行了测试,并与其他神经网络参数整定技术进行了对比研究,结果令人满意。从进行实验并将所提出的技术与其他用于分类数据集和回归数据集的技术进行比较后所看到的情况来看,所提出的技术的性能存在显着差异,从分类数据集的30%开始,回归问题达到50%。然而,所提出的技术,因为它以使用涉及人工神经网络的全局优化技术为前提,可能需要比其他技术更高的执行时间。
{"title":"Training Artificial Neural Networks Using a Global Optimization Method That Utilizes Neural Networks","authors":"I. Tsoulos, Alexandros T. Tzallas","doi":"10.3390/ai4030027","DOIUrl":"https://doi.org/10.3390/ai4030027","url":null,"abstract":"Perhaps one of the best-known machine learning models is the artificial neural network, where a number of parameters must be adjusted to learn a wide range of practical problems from areas such as physics, chemistry, medicine, etc. Such problems can be reduced to pattern recognition problems and then modeled from artificial neural networks, whether these problems are classification problems or regression problems. To achieve the goal of neural networks, they must be trained by appropriately adjusting their parameters using some global optimization methods. In this work, the application of a recent global minimization technique is suggested for the adjustment of neural network parameters. In this technique, an approximation of the objective function to be minimized is created using artificial neural networks and then sampling is performed from the approximation function and not the original one. Therefore, in the present work, learning of the parameters of artificial neural networks is performed using other neural networks. The new training method was tested on a series of well-known problems, a comparative study was conducted against other neural network parameter tuning techniques, and the results were more than promising. From what was seen after performing the experiments and comparing the proposed technique with others that have been used for classification datasets as well as regression datasets, there was a significant difference in the performance of the proposed technique, starting with 30% for classification datasets and reaching 50% for regression problems. However, the proposed technique, because it presupposes the use of global optimization techniques involving artificial neural networks, may require significantly higher execution time than other techniques.","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"391 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79645574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Ai Magazine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1