Social and moral norms are a fabric for holding human societies together and helping them to function. As such, they will also become a means of evaluating the performance of future human–machine systems. While machine ethics has offered various approaches to endowing machines with normative competence, from the more logic-based to the more data-based, none of the proposals so far have considered the challenge of capturing the “spirit of a norm,” which often eludes rigid interpretation and complicates doing the right thing. We present some paradigmatic scenarios across contexts to illustrate why the spirit of a norm can be critical to make explicit and why it exposes the inadequacies of mere data-driven “value alignment” techniques such as reinforcement learning (RL) for interactive, real-time human–robot interaction. Instead, we argue that norm learning, in particular learning to capture the spirit of a norm, requires combining common-sense inference-based and data-driven approaches.
{"title":"Understanding the spirit of a norm: Challenges for norm-learning agents","authors":"Thomas Arnold, Matthias Scheutz","doi":"10.1002/aaai.12138","DOIUrl":"10.1002/aaai.12138","url":null,"abstract":"<p>Social and moral norms are a fabric for holding human societies together and helping them to function. As such they will also become a means of evaluating the performance of future human–machine systems. While machine ethics has offered various approaches to endowing machines with normative competence, from the more logic-based to the more data-based, none of the proposals so far have considered the challenge of capturing the “spirit of a norm,” which often eludes rigid interpretation and complicates doing the right thing. We present some paradigmatic scenarios across contexts to illustrate why the spirit of a norm can be critical to make explicit and why it exposes the inadequacies of mere data-driven “value alignment” techniques such as reinforcement learning <i>RL</i> for interactive, real-time human–robot interaction. Instead, we argue that norm learning, in particular, learning to capture the spirit of a norm, requires combining common-sense inference-based and data-driven approaches.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"524-536"},"PeriodicalIF":0.9,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12138","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135928142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Machine Learning systems rely heavily on annotated instances. Such annotations are frequently done by human experts, or by tools developed by experts, and so the central message of this book, Noise: A Flaw in Human Judgment (Kahneman, Sibony, and Sunstein 2021), is of considerable importance to the AI/Machine Learning community. The core message is that if a number of experts are asked to annotate tasks that involve judgments, their responses will frequently differ. This observation poses a problem for how analysts choose a particular annotated dataset (from the group), or process the set of responses to give a “balanced” response, or whether to reject all the annotated datasets. A further important aspect of this book is its case studies, which demonstrate that differences in judgments between fellow experts have been reported in a significant number of disciplines, including business, law, government, and medicine. Kahneman, Sibony, and Sunstein (2021), subsequently referred to as KSS, discuss how expert biases can be reduced, but the main focus of this book is a discussion of Noise, that is, differences that often occur between fellow experts, and how Noise can often be reduced. To address the last point, KSS have formulated a set of six decision hygiene principles, which include the recommendation that complex tasks should be subdivided and each subtask then solved separately. A further principle is that each task should be solved by individual experts before the various judgments are discussed with fellow experts. Effectively, the book being reviewed covers three main topics. First, it reports several motivating studies that show how the judgments of fellow experts varied significantly in the pricing of insurance premiums and in setting the lengths of custodial sentences. These motivating studies very effectively illustrate the central concepts of Judgment, Noise, and Bias; that section also provides definitions of these core concepts and discusses how Noise is often amplified in group meetings. Secondly, the authors provide a detailed discussion of further studies, in a variety of domains, which report the levels of disagreement between experts. Thirdly, KSS discuss how to reduce the levels of Noise between experts; as noted above, the authors refer to these as Principles of Noise Hygiene. These three parts are interwoven in a complex way throughout the book; in our view, the best overview of the book is given in the section Review and Conclusions: Taking Noise Seriously (KSS, p. 361).
{"title":"Groups of experts often differ in their decisions: What are the implications for AI and machine learning? A commentary on Noise: A Flaw in Human Judgment, by Kahneman, Sibony, and Sunstein (2021)","authors":"Derek H. Sleeman, Ken Gilhooly","doi":"10.1002/aaai.12135","DOIUrl":"10.1002/aaai.12135","url":null,"abstract":"<p>Machine Learning systems rely heavily on annotated instances. Such annotations are frequently done by human experts, or by tools developed by experts, and so the central message of this book, <i>Noise: A Flaw in Human Judgment</i> (Kahneman, Sibony, and Sunstein 2021) is of considerable importance to AI/Machine Learning community. The core message is that if a number of experts are asked to annotate tasks that involve judgments, these responses will frequently differ. This observation poses a problem for how analysts choose a particular annotated dataset (from the group), or process the set of responses to give a “balanced” response, or whether to reject all the annotated datasets. A further important aspect of this book is the case studies which demonstrate that differences in judgments between fellow experts have been reported in a significant number of disciplines including, business, the law, government, and medicine. Kahneman, Sibony and Sunstein (2021), referred to as KSS subsequently, discuss how Expert Biases can be reduced, but the main focus of this book is a discussion of Noise, that is, differences that often occur between fellow experts, and how Noise can often be reduced. To address the last point KSS have formulated a set of six decision hygiene principles which include the recommendation that complex tasks should be subdivided, and then each subtask should be solved separately. A further principle is that each task should be solved by individual experts before the various judgments are discussed with fellow experts. Effectively, the book being reviewed covers three main topics: First, it reports several motivating studies that show how judgments of fellow experts varied significantly in the pricing of insurance premiums, and in setting the lengths of custodial sentences. These motivating studies very effectively illustrate the central concepts of Judgment, Noise, and Bias; that section also provides definitions of these core concepts and discusses how Noise is often amplified in group meetings. Secondly, the authors provide detailed discussion of further studies, in a variety of domains, which report the levels of disagreement between experts. Thirdly, KSS discusses how to reduce the levels of Noise between experts, as noted above, the authors refer to these as Principles of Noise Hygiene. These three parts are interwoven in a complex way throughout the book; in our view, the best overview of the book is given in the section Review and Conclusions: Taking Noise Seriously (KSS, p. 361).</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"555-567"},"PeriodicalIF":0.9,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12135","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136376496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper, which is part of the New Faculty Highlights Invited Speaker Program of AAAI'23, serves as a comprehensive survey of my research in transfer learning by utilizing embedding spaces. The work reviewed in this paper specifically revolves around the inherent challenges associated with continual learning and limited availability of labeled data. By providing an overview of my past and ongoing contributions, this paper aims to present a holistic understanding of my research, paving the way for future explorations and advancements in the field. My research delves into the various settings of transfer learning, including few-shot learning, zero-shot learning, continual learning, domain adaptation, and distributed learning. I hope this survey provides a forward-looking perspective for researchers who would like to focus on similar research directions.
{"title":"Robust internal representations for domain generalization","authors":"Mohammad Rostami","doi":"10.1002/aaai.12137","DOIUrl":"10.1002/aaai.12137","url":null,"abstract":"<p>This paper, which is part of the New Faculty Highlights Invited Speaker Program of AAAI'23, serves as a comprehensive survey of my research in transfer learning by utilizing embedding spaces. The work reviewed in this paper specifically revolves around the inherent challenges associated with continual learning and limited availability of labeled data. By providing an overview of my past and ongoing contributions, this paper aims to present a holistic understanding of my research, paving the way for future explorations and advancements in the field. My research delves into the various settings of transfer learning, including, few-shot learning, zero-shot learning, continual learning, domain adaptation, and distributed learning. I hope this survey provides a forward-looking perspective for researchers who would like to focus on similar research directions.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"467-481"},"PeriodicalIF":0.9,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12137","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134908326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The residual value (RV) of a vehicle refers to its estimated worth at some point in the future. It is a core component in every auto finance product, used to determine credit lines and leasing rates. As such, an accurate prediction of RV is critical for the auto finance industry, since over-prediction poses a risk of revenue loss while under-prediction renders the financial product uncompetitive. Although there are a number of prior studies on training machine learning models on large amounts of used car sales data, we had to cope with real-world operational requirements such as compliance with regulations (i.e., monotonicity of output with respect to a subset of features) and generalization to unseen input (i.e., new and rare car models). In this paper, we describe how we addressed these practical challenges and created value for our business at Hyundai Capital Services, the top auto financial service provider in Korea.
{"title":"Trustworthy residual vehicle value prediction for auto finance","authors":"Mihye Kim, Jimyung Choi, Jaehyun Kim, Wooyoung Kim, Yeonung Baek, Gisuk Bang, Kwangwoon Son, Yeonman Ryou, Kee-Eung Kim","doi":"10.1002/aaai.12136","DOIUrl":"10.1002/aaai.12136","url":null,"abstract":"<p>The residual value (RV) of a vehicle refers to its estimated worth at some point in the future. It is a core component in every auto finance product, used to determine the credit lines and the leasing rates. As such, an accurate prediction of RV is critical for the auto finance industry, since it can pose a risk of revenue loss by over-prediction or make the financial product incompetent through under-prediction. Although there are a number of prior studies on training machine learning models on a large amount of used car sales data, we had to cope with real-world operational requirements such as compliance with regulations (i.e., monotonicity of output with respect to a subset of features) and generalization to unseen input (i.e., new and rare car models). In this paper, we describe how we addressed these practical challenges and created value for our business at Hyundai Capital Services, the top auto financial service provider in Korea.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"394-405"},"PeriodicalIF":0.9,"publicationDate":"2023-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12136","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135513286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The development of artificial intelligence (AI) agents capable of human-level understanding of video content, and of conducting conversations with humans on that basis, is an application that many people anticipate. However, this is a challenging task that requires the holistic integration of multimodal information with temporal dependencies and reasoning, as well as social and physical commonsense. In addition, the development of appropriate systematic evaluation methods is essential. In this context, we introduce the Video Turing Test (VTT), a blind test used to evaluate human-likeness in terms of video comprehension ability. Moreover, we propose Vincent, a video understanding AI. We explain the configuration of the VTT, the architecture of Vincent designed to take the VTT, and the proposed evaluation methods for video comprehension. We also estimate the current intelligence level of AI based on our results and discuss future research directions.
{"title":"Video Turing Test: A first step towards human-level AI","authors":"Minsu Lee, Yu-Jung Heo, Seongho Choi, Woo Suk Choi, Byoung-Tak Zhang","doi":"10.1002/aaai.12128","DOIUrl":"10.1002/aaai.12128","url":null,"abstract":"<p>The development of artificial intelligence (AI) agents capable of human-level understanding of video content and conducting conversations with humans on this basis is a promising application that people expect. However, this is a challenging task that requires the holistic integration of multimodal information with temporal dependencies and reasoning, as well as social and physical commonsense. In addition, the development of appropriate systematic evaluation methods is essential. In this context, we introduce the Video Turing Test (VTT), a blind test used to evaluate human-likeness in terms of video comprehension ability. Moreover, we propose Vincent as a video understanding AI. We explain the configuration of VTT, the architecture of Vincent to prepare for VTT and the proposed evaluation methods for video comprehension. We also estimate the current intelligence level of AI based on our results and discuss future research directions.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"537-554"},"PeriodicalIF":0.9,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12128","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136034011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In electronics manufacturing, solder joint defects are a common problem affecting a variety of printed circuit board components. To identify and correct solder joint defects, the solder joints on a circuit board are typically inspected manually by trained human inspectors, which is a very time-consuming and error-prone process. To improve both inspection efficiency and accuracy, in this work, we describe an explainable deep learning-based visual quality inspection system tailored for solder joints in electronics manufacturing environments. At the core of this system is an explainable solder joint defect identification system called SolderNet that we design and implement with trust and transparency in mind. While several challenges remain before the full system can be developed and deployed, this study presents important progress towards trustworthy visual inspection of solder joints in electronics manufacturing.
{"title":"SolderNet: Towards trustworthy visual inspection of solder joints in electronics manufacturing using explainable artificial intelligence","authors":"Hayden Gunraj, Paul Guerrier, Sheldon Fernandez, Alexander Wong","doi":"10.1002/aaai.12129","DOIUrl":"10.1002/aaai.12129","url":null,"abstract":"<p>In electronics manufacturing, solder joint defects are a common problem affecting a variety of printed circuit board components. To identify and correct solder joint defects, the solder joints on a circuit board are typically inspected manually by trained human inspectors, which is a very time-consuming and error-prone process. To improve both inspection efficiency and accuracy, in this work, we describe an explainable deep learning-based visual quality inspection system tailored for visual inspection of solder joints in electronics manufacturing environments. At the core of this system is an explainable solder joint defect identification system called <b>SolderNet</b> that we design and implement with trust and transparency in mind. While several challenges remain before the full system can be developed and deployed, this study presents important progress towards trustworthy visual inspection of solder joints in electronics manufacturing.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"442-452"},"PeriodicalIF":0.9,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12129","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136185639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Welding is a fabrication process used to join or fuse two mechanical parts. Modern welding machines have automated lasers that follow a predefined weld seam path between the two parts to create a bond. Previous efforts have used simple computer vision edge detectors to automatically detect the weld seam in an image at the junction of the two metals to be welded. However, these systems lack reliability and accuracy, so the detected edges still require manual human verification. This paper presents a neural network architecture that automatically detects the weld seam edge between two metals with high accuracy. We augment this system with a preclassifier that filters out anomalous workpieces (e.g., incorrect placement). Finally, we justify our design choices by evaluating against several existing deep network pipelines and by demonstrating the system in real-world use. We also describe in detail the process of deploying the system on a real-world shop floor, including evaluation and monitoring. We make public a large, well-labeled laser seam dataset for deep learning-based edge detection in industrial settings.
{"title":"Accurate detection of weld seams for laser welding in real-world manufacturing","authors":"Rabia Ali, Muhammad Sarmad, Jawad Tayyub, Alexander Vogel","doi":"10.1002/aaai.12134","DOIUrl":"10.1002/aaai.12134","url":null,"abstract":"<p>Welding is a fabrication process used to join or fuse two mechanical parts. Modern welding machines have automated lasers that follow a predefined weld seam path between the two parts to create a bond. Previous efforts have used simple computer vision edge detectors to automatically detect the weld seam on an image at the junction of two metals to be welded. However, these systems lack reliability and accuracy resulting in manual human verification of the detected edges. This paper presents a neural network architecture that automatically detects the weld seam edge between two metals with high accuracy. We augment this system with a preclassifier that filters out anomalous workpieces (e.g., incorrect placement). Finally, we justify our design choices by evaluating against several existing deep network pipelines as well as proof through real-world use. We also describe in detail the process of deploying the system in a real-world shop floor including evaluation and monitoring. We make public a large well-labeled laser seam dataset to perform deep learning-based edge detection in industrial settings.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"431-441"},"PeriodicalIF":0.9,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12134","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135856069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Underserved communities face critical health challenges due to lack of access to timely and reliable information. Nongovernmental organizations are leveraging the widespread use of cellphones to combat these healthcare challenges and spread preventative awareness. The health workers at these organizations reach out individually to beneficiaries; however, such programs still suffer from declining engagement. We have deployed Saheli, a system to efficiently utilize the limited availability of health workers for improving maternal and child health in India. Saheli uses the Restless Multi-armed Bandit (RMAB) framework to identify beneficiaries for outreach. It is the first deployed application of RMABs in public health, and is already in continuous use by our partner NGO, ARMMAN. We have already reached ∼130K beneficiaries with Saheli, and are on track to serve one million beneficiaries by the end of 2023. This scale and impact have been achieved through multiple innovations in the RMAB model and its development, in the preparation of real-world data, and in deployment practices, and through careful consideration of responsible AI practices. Specifically, in this paper, we describe our approach to learning from past data to improve the performance of Saheli's RMAB model, the real-world challenges faced during the deployment and adoption of Saheli, and the end-to-end pipeline.
{"title":"Expanding impact of mobile health programs: SAHELI for maternal and child care","authors":"Shresth Verma, Gargi Singh, Aditya Mate, Paritosh Verma, Sruthi Gorantla, Neha Madhiwalla, Aparna Hegde, Divy Thakkar, Manish Jain, Milind Tambe, Aparna Taneja","doi":"10.1002/aaai.12126","DOIUrl":"10.1002/aaai.12126","url":null,"abstract":"<p>Underserved communities face critical health challenges due to lack of access to timely and reliable information. Nongovernmental organizations are leveraging the widespread use of cellphones to combat these healthcare challenges and spread preventative awareness. The health workers at these organizations reach out individually to beneficiaries; however, such programs still suffer from declining engagement. We have deployed <span>Saheli</span>, a system to efficiently utilize the limited availability of health workers for improving maternal and child health in India. <span>Saheli</span> uses the Restless Multi-armed Bandit (RMAB) framework to identify beneficiaries for outreach. It is the <i>first deployed application</i> for RMABs in public health, and is already <i>in continuous use</i> by our partner NGO, ARMMAN. We have already reached ∼130K beneficiaries with <span>Saheli</span>, and are on track to serve one million beneficiaries by the end of 2023. This scale and impact has been achieved through multiple innovations in the RMAB model and its development, in preparation of real world data, and in deployment practices; and through careful consideration of responsible AI practices. Specifically, in this paper, we describe our approach to learn from past data to improve the performance of <span>Saheli</span>'s RMAB model, the real-world challenges faced during deployment and adoption of <span>Saheli</span>, and the end-to-end pipeline.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"363-376"},"PeriodicalIF":0.9,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12126","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136013330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Energy forecasting is crucial in scheduling and planning future electric load, so as to improve the reliability and safety of the power grid. Despite recent developments in forecasting algorithms in the machine learning community, there is a lack of general and advanced algorithms that specifically consider requirements from the power industry perspective. In this paper, we present eForecaster, a unified AI platform including robust, flexible, and explainable machine learning algorithms for diversified energy forecasting applications. Since October 2021, multiple commercial bus load, system load, and renewable energy forecasting systems built upon eForecaster have been deployed in seven provinces of China. The deployed systems consistently reduce the average Mean Absolute Error (MAE) by 39.8% to 77.0%, with reduced manual work and explainable guidance. In particular, eForecaster also integrates multiple interpretation methods to uncover the working mechanisms of the predictive models, which significantly improves forecast adoption and user satisfaction.
{"title":"Energy forecasting with robust, flexible, and explainable machine learning algorithms","authors":"Zhaoyang Zhu, Weiqi Chen, Rui Xia, Tian Zhou, Peisong Niu, Bingqing Peng, Wenwei Wang, Hengbo Liu, Ziqing Ma, Xinyue Gu, Jin Wang, Qiming Chen, Linxiao Yang, Qingsong Wen, Liang Sun","doi":"10.1002/aaai.12130","DOIUrl":"10.1002/aaai.12130","url":null,"abstract":"<p>Energy forecasting is crucial in scheduling and planning future electric load, so as to improve the reliability and safeness of the power grid. Despite recent developments of forecasting algorithms in the machine learning community, there is a lack of general and advanced algorithms specifically considering requirements from the power industry perspective. In this paper, we present eForecaster, a unified AI platform including robust, flexible, and explainable machine learning algorithms for diversified energy forecasting applications. Since October 2021, multiple commercial bus load, system load, and renewable energy forecasting systems built upon eForecaster have been deployed in seven provinces of China. The deployed systems consistently reduce the average Mean Absolute Error (MAE) by 39.8% to 77.0%, with reduced manual work and explainable guidance. In particular, eForecaster also integrates multiple interpretation methods to uncover the working mechanism of the predictive models, which significantly improves forecasts adoption and user satisfaction.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"377-393"},"PeriodicalIF":0.9,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12130","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136212318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In many real-world applications of AI, the sets of actors and tasks are not constant, but instead change over time. Robots tasked with suppressing wildfires eventually run out of limited suppressant resources and need to temporarily disengage from the collaborative work in order to recharge, or they might become damaged and leave the environment permanently. In a large business organization, objectives and goals change with the market, requiring workers to adapt to perform different sets of tasks across time. We call these multiagent systems (MAS) open agent systems (OASYS), and the openness of the sets of agents and tasks necessitates new capabilities and modeling for decision making compared to planning and learning in closed environments. In this article, we discuss three notions of openness: agent openness, task openness, and type openness. We also review the past and current research on addressing the novel challenges brought about by openness in OASYS. We share lessons learned from these efforts and suggest directions for promising future work in this area. We also encourage the community to engage and participate in this area of MAS research to address critical real-world problems in the application of AI to enhance our daily lives.
{"title":"Decision making in open agent systems","authors":"Adam Eck, Leen-Kiat Soh, Prashant Doshi","doi":"10.1002/aaai.12131","DOIUrl":"10.1002/aaai.12131","url":null,"abstract":"<p>In many real-world applications of AI, the set of actors and tasks are not constant, but instead change over time. Robots tasked with suppressing wildfires eventually run out of limited suppressant resources and need to temporarily disengage from the collaborative work in order to recharge, or they might become damaged and leave the environment permanently. In a large business organization, objectives and goals change with the market, requiring workers to adapt to perform different sets of tasks across time. We call these multiagent systems (MAS) <b>open agent systems</b> (OASYS), and the <i>openness</i> of the sets of agents and tasks necessitates new capabilities and modeling for decision making compared to planning and learning in <i>closed</i> environments. In this article, we discuss three notions of openness: agent openness, task openness, and type openness. We also review the past and current research on addressing the novel challenges brought about by openness in OASYS. We share lessons learned from these efforts and suggest directions for promising future work in this area. We also encourage the community to engage and participate in this area of MAS research to address critical real-world problems in the application of AI to enhance our daily lives.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"508-523"},"PeriodicalIF":0.9,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12131","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135146704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}