首页 > 最新文献

Ai Magazine最新文献

英文 中文
Artificial intelligence and the impact of the EU AI Act in business organizations 人工智能和欧盟人工智能法案对商业组织的影响
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-04 DOI: 10.1002/aaai.70039
Marc Selgas Cors, Renata Thiébaut

Artificial intelligence (AI) is transforming industries worldwide, and the e-commerce sector is at the forefront of leveraging its capabilities to drive innovation and efficiency. The paper explores the integration of artificial intelligence in e-commerce, focusing on the ethical and regulatory implications introduced by the EU AI Act. This legislative framework aims to ensure the responsible deployment of AI by classifying AI systems into risk categories and imposing compliance requirements. It also underscores both the opportunities and challenges that AI presents to businesses, particularly in enhancing consumer experiences through automation and data-driven decision-making processes. The paper provides a comprehensive review of the AI landscape in Europe, analyzing the impact of the EU AI Act, particularly on small and medium-sized enterprises and startups. Through a mixed-methods approach, the study investigates how regulatory compliance may influence business innovation, market competitiveness, and consumer trust. The recommendations proposed aim to develop a trustworthy AI ecosystem that could stimulate long-term growth and enhance the global positioning of small European businesses.

人工智能(AI)正在改变全球各行业,电子商务行业正处于利用其能力推动创新和效率的前沿。本文探讨了人工智能在电子商务中的整合,重点是欧盟人工智能法案引入的道德和监管影响。该立法框架旨在通过将人工智能系统划分为风险类别并施加合规要求,确保负责任的人工智能部署。它还强调了人工智能给企业带来的机遇和挑战,特别是通过自动化和数据驱动的决策流程来增强消费者体验。本文全面回顾了欧洲的人工智能形势,分析了欧盟人工智能法案的影响,特别是对中小型企业和初创企业的影响。通过混合方法,本研究探讨了法规遵从如何影响商业创新、市场竞争力和消费者信任。拟议的建议旨在建立一个值得信赖的人工智能生态系统,以刺激长期增长,增强欧洲小企业的全球定位。
{"title":"Artificial intelligence and the impact of the EU AI Act in business organizations","authors":"Marc Selgas Cors,&nbsp;Renata Thiébaut","doi":"10.1002/aaai.70039","DOIUrl":"https://doi.org/10.1002/aaai.70039","url":null,"abstract":"<p>Artificial intelligence (AI) is transforming industries worldwide, and the e-commerce sector is at the forefront of leveraging its capabilities to drive innovation and efficiency. The paper explores the integration of artificial intelligence in e-commerce, focusing on the ethical and regulatory implications introduced by the EU AI Act. This legislative framework aims to ensure the responsible deployment of AI by classifying AI systems into risk categories and imposing compliance requirements. It also underscores both the opportunities and challenges that AI presents to businesses, particularly in enhancing consumer experiences through automation and data-driven decision-making processes. The paper provides a comprehensive review of the AI landscape in Europe, analyzing the impact of the EU AI Act, particularly on small and medium-sized enterprises and startups. Through a mixed-methods approach, the study investigates how regulatory compliance may influence business innovation, market competitiveness, and consumer trust. The recommendations proposed aim to develop a trustworthy AI ecosystem that could stimulate long-term growth and enhance the global positioning of small European businesses.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 4","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70039","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From rights to runtime: Privacy engineering for agentic AI 从权利到运行:代理AI的隐私工程
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-26 DOI: 10.1002/aaai.70036
Keivan Navaie

Agentic AI shifts stacks from request-response to plan-execute. Systems no longer just answer; they act—planning tasks, calling tools, keeping memory, and changing external state. That shift moves privacy from policy docs into the runtime. This opinion piece argues that we do not need a new privacy theory for agents; we need enforceable, observable controls that render existing rights as product behavior. Anchoring on GDPR—with portable touchpoints to CPRA, LGPD, and PDPA, we propose a developer-first toolkit: optional, bounded, user-visible memory; a purpose-aware egress gate that enforces minimization and transfer rules; proportional safeguards that scale with stakes; and traces that tell a coherent story across components and suppliers. We show how the EU AI Act's risk management, logging, and oversight can scaffold these controls and enable evidence reuse. The result is an agentic runtime that keeps people in control and teams audit-ready by design.

代理AI将堆栈从请求-响应转变为计划-执行。系统不再只是回答问题;它们负责计划任务、调用工具、保存内存和更改外部状态。这种转变将隐私从策略文档转移到了运行时。这篇评论文章认为,我们不需要一个新的代理隐私理论;我们需要可执行的、可观察的控件,将现有的权利呈现为产品行为。锚定gdpr -与便携式接触点CPRA, LGPD和PDPA,我们提出了一个开发人员优先的工具包:可选的,有限的,用户可见的内存;执行最小化和转移规则的目的感知出口门;与利害相关的比例保障措施;以及在组件和供应商之间讲述连贯故事的痕迹。我们展示了欧盟人工智能法案的风险管理、记录和监督如何支撑这些控制并实现证据重用。其结果是一个代理运行时,它使人员处于控制之中,并且团队可以根据设计进行审计。
{"title":"From rights to runtime: Privacy engineering for agentic AI","authors":"Keivan Navaie","doi":"10.1002/aaai.70036","DOIUrl":"https://doi.org/10.1002/aaai.70036","url":null,"abstract":"<p>Agentic AI shifts stacks from request-response to plan-execute. Systems no longer just answer; they act—planning tasks, calling tools, keeping memory, and changing external state. That shift moves privacy from policy docs into the runtime. This opinion piece argues that we do not need a new privacy theory for agents; we need enforceable, observable controls that render existing rights as product behavior. Anchoring on GDPR—with portable touchpoints to CPRA, LGPD, and PDPA, we propose a developer-first toolkit: optional, bounded, user-visible memory; a purpose-aware egress gate that enforces minimization and transfer rules; proportional safeguards that scale with stakes; and traces that tell a coherent story across components and suppliers. We show how the EU AI Act's risk management, logging, and oversight can scaffold these controls and enable evidence reuse. The result is an agentic runtime that keeps people in control and teams audit-ready by design.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 4","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70036","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145367007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A community-driven vision for a new knowledge resource for AI 社区驱动的人工智能新知识资源愿景
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-26 DOI: 10.1002/aaai.70035
Vinay K Chaudhri, Chaitan Baru, Brandon Bennett, Mehul Bhatt, Darion Cassel, Anthony G Cohn, Rina Dechter, Esra Erdem, Dave Ferrucci, Ken Forbus, Gregory Gelfond, Michael Genesereth, Andrew S. Gordon, Benjamin Grosof, Gopal Gupta, Jim Hendler, Sharat Israni, Tyler R. Josephson, Patrick Kyllonen, Yuliya Lierler, Vladimir Lifschitz, Clifton McFate, Hande Küçük McGinty, Leora Morgenstern, Alessandro Oltramari, Praveen Paritosh, Dan Roth, Blake Shepard, Cogan Shimizu, Denny Vrandečić, Mark Whiting, Michael Witbrock

The long-standing goal of creating a comprehensive, multi-purpose knowledge resource, reminiscent of the 1984 Cyc project, still persists in AI. Despite the success of knowledge resources like WordNet, ConceptNet, Wolfram|Alpha and other commercial knowledge graphs, verifiable, general-purpose, widely available sources of knowledge remain a critical deficiency in AI infrastructure. Large language models struggle due to knowledge gaps; robotic planning lacks necessary world knowledge; and the detection of factually false information relies heavily on human expertise. What kind of knowledge resource is most needed in AI today? How can modern technology shape its development and evaluation? A recent AAAI workshop gathered over 50 researchers to explore these questions. This paper synthesizes our findings and outlines a community-driven vision for a new knowledge infrastructure. In addition to leveraging contemporary advances in knowledge representation and reasoning, one promising idea is to build an open engineering framework to exploit knowledge modules effectively within the context of practical applications. Such a framework should include sets of conventions and social structures that are adopted by contributors.

创建一个全面的、多用途的知识资源的长期目标,让人想起1984年的Cyc项目,仍然坚持在人工智能中。尽管WordNet、ConceptNet、Wolfram b| Alpha等商业知识图谱等知识资源取得了成功,但可验证的、通用的、广泛可用的知识来源仍然是人工智能基础设施的一个关键缺陷。大型语言模型由于知识差距而挣扎;机器人规划缺乏必要的世界知识;而对虚假信息的检测在很大程度上依赖于人类的专业知识。今天的人工智能最需要什么样的知识资源?现代技术如何塑造其发展和评价?最近的AAAI研讨会聚集了50多名研究人员来探索这些问题。本文综合了我们的研究结果,并概述了社区驱动的新知识基础设施愿景。除了利用知识表示和推理方面的当代进步,一个有希望的想法是建立一个开放的工程框架,在实际应用的背景下有效地利用知识模块。这种框架应包括捐助国采用的若干套公约和社会结构。
{"title":"A community-driven vision for a new knowledge resource for AI","authors":"Vinay K Chaudhri,&nbsp;Chaitan Baru,&nbsp;Brandon Bennett,&nbsp;Mehul Bhatt,&nbsp;Darion Cassel,&nbsp;Anthony G Cohn,&nbsp;Rina Dechter,&nbsp;Esra Erdem,&nbsp;Dave Ferrucci,&nbsp;Ken Forbus,&nbsp;Gregory Gelfond,&nbsp;Michael Genesereth,&nbsp;Andrew S. Gordon,&nbsp;Benjamin Grosof,&nbsp;Gopal Gupta,&nbsp;Jim Hendler,&nbsp;Sharat Israni,&nbsp;Tyler R. Josephson,&nbsp;Patrick Kyllonen,&nbsp;Yuliya Lierler,&nbsp;Vladimir Lifschitz,&nbsp;Clifton McFate,&nbsp;Hande Küçük McGinty,&nbsp;Leora Morgenstern,&nbsp;Alessandro Oltramari,&nbsp;Praveen Paritosh,&nbsp;Dan Roth,&nbsp;Blake Shepard,&nbsp;Cogan Shimizu,&nbsp;Denny Vrandečić,&nbsp;Mark Whiting,&nbsp;Michael Witbrock","doi":"10.1002/aaai.70035","DOIUrl":"https://doi.org/10.1002/aaai.70035","url":null,"abstract":"<p>The long-standing goal of creating a comprehensive, multi-purpose knowledge resource, reminiscent of the 1984 Cyc project, still persists in AI. Despite the success of knowledge resources like WordNet, ConceptNet, Wolfram|Alpha and other commercial knowledge graphs, verifiable, general-purpose, widely available sources of knowledge remain a critical deficiency in AI infrastructure. Large language models struggle due to knowledge gaps; robotic planning lacks necessary world knowledge; and the detection of factually false information relies heavily on human expertise. What kind of knowledge resource is most needed in AI today? How can modern technology shape its development and evaluation? A recent AAAI workshop gathered over 50 researchers to explore these questions. This paper synthesizes our findings and outlines a community-driven vision for a new knowledge infrastructure. In addition to leveraging contemporary advances in knowledge representation and reasoning, one promising idea is to build an open engineering framework to exploit knowledge modules effectively within the context of practical applications. Such a framework should include sets of conventions and social structures that are adopted by contributors.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 4","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70035","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145367001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CAGE challenge 4: A scalable multi-agent reinforcement learning gym for autonomous cyber defence CAGE挑战4:用于自主网络防御的可扩展多智能体强化学习健身房
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-21 DOI: 10.1002/aaai.70021
Mitchell Kiely, Metin Ahiskali, Etienne Borde, Benjamin Bowman, David Bowman, Dirk Van Bruggen, K. C. Cowan, Prithviraj Dasgupta, Erich Devendorf, Ben Edwards, Alex Fitts, Sunny Fugate, Ryan Gabrys, Wayne Gould, H. Howie Huang, Jules Jacobs, Ryan Kerr, Isaiah J. King, Li Li, Luis Martinez, Christopher Moir, Craig Murphy, Olivia Naish, Claire Owens, Miranda Purchase, Ahmad Ridley, Adrian Taylor, Sara Farmer, William John Valentine, Yiyi Zhang

As cyber threats become increasingly automated and sophisticated, novel solutions must be introduced to improve defense of enterprise networks. Deep reinforcement learning (DRL) has demonstrated potential in mitigating these advanced threats. Single DRL agents have proven utility toward execution of autonomous cyber defense. Despite the success of employing single DRL agents, this approach presents significant limitations, especially regarding scalability within large enterprise networks. An attractive alternative to the single-agent approach is the use of multi-agent reinforcement learning (MARL). However, developing MARL agents is costly with few options for examining MARL cyber defense techniques against adversarial agents. This paper presents a MARL network security environment, the fourth iteration of the cyber autonomy gym for experimentation (CAGE) challenges. This challenge was specifically designed to test the efficacy of MARL algorithms in an enterprise network. Our work aims to evaluate the potential of MARL as a robust and scalable solution for autonomous network defense.

随着网络威胁变得越来越自动化和复杂,必须引入新的解决方案来提高企业网络的防御能力。深度强化学习(DRL)已经证明了在缓解这些高级威胁方面的潜力。单一DRL代理已被证明在执行自主网络防御方面具有实用价值。尽管使用单个DRL代理取得了成功,但这种方法存在明显的局限性,特别是在大型企业网络中的可伸缩性方面。一个有吸引力的替代单智能体方法是使用多智能体强化学习(MARL)。然而,开发MARL代理是昂贵的,并且很少有选择来检查针对对抗性代理的MARL网络防御技术。本文提出了在MARL网络安全环境下,第四次迭代网络自治实验馆(CAGE)的挑战。这个挑战是专门为测试MARL算法在企业网络中的有效性而设计的。我们的工作旨在评估MARL作为自主网络防御的鲁棒和可扩展解决方案的潜力。
{"title":"CAGE challenge 4: A scalable multi-agent reinforcement learning gym for autonomous cyber defence","authors":"Mitchell Kiely,&nbsp;Metin Ahiskali,&nbsp;Etienne Borde,&nbsp;Benjamin Bowman,&nbsp;David Bowman,&nbsp;Dirk Van Bruggen,&nbsp;K. C. Cowan,&nbsp;Prithviraj Dasgupta,&nbsp;Erich Devendorf,&nbsp;Ben Edwards,&nbsp;Alex Fitts,&nbsp;Sunny Fugate,&nbsp;Ryan Gabrys,&nbsp;Wayne Gould,&nbsp;H. Howie Huang,&nbsp;Jules Jacobs,&nbsp;Ryan Kerr,&nbsp;Isaiah J. King,&nbsp;Li Li,&nbsp;Luis Martinez,&nbsp;Christopher Moir,&nbsp;Craig Murphy,&nbsp;Olivia Naish,&nbsp;Claire Owens,&nbsp;Miranda Purchase,&nbsp;Ahmad Ridley,&nbsp;Adrian Taylor,&nbsp;Sara Farmer,&nbsp;William John Valentine,&nbsp;Yiyi Zhang","doi":"10.1002/aaai.70021","DOIUrl":"https://doi.org/10.1002/aaai.70021","url":null,"abstract":"<p>As cyber threats become increasingly automated and sophisticated, novel solutions must be introduced to improve defense of enterprise networks. Deep reinforcement learning (DRL) has demonstrated potential in mitigating these advanced threats. Single DRL agents have proven utility toward execution of autonomous cyber defense. Despite the success of employing single DRL agents, this approach presents significant limitations, especially regarding scalability within large enterprise networks. An attractive alternative to the single-agent approach is the use of multi-agent reinforcement learning (MARL). However, developing MARL agents is costly with few options for examining MARL cyber defense techniques against adversarial agents. This paper presents a MARL network security environment, the fourth iteration of the cyber autonomy gym for experimentation (CAGE) challenges. This challenge was specifically designed to test the efficacy of MARL algorithms in an enterprise network. Our work aims to evaluate the potential of MARL as a robust and scalable solution for autonomous network defense.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70021","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model-based AI planning and execution platforms for robotics 基于模型的机器人人工智能规划与执行平台
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-06 DOI: 10.1002/aaai.70034
Or Wertheim, Ronen I. Brafman

Model-based planning and execution systems offer a principled approach to building flexible autonomous robots that can perform diverse tasks by automatically combining a host of basic skills. This idea is almost as old as modern robotics. Yet, while diverse general-purpose reasoning architectures have been proposed since, general-purpose software platforms that support the construction of planner-based controllers and their integration with modern robotic platforms have emerged only recently, starting with the influential ROSPlan system. Since then, a growing number of domain-independent model-based platforms for robot task-level control have emerged. In this paper, we consider the diverse design choices and issues existing platforms attempt to address, the different solutions proposed so far, and suggest avenues for future development. We also briefly discuss the elephant in the room: foundation models.

基于模型的规划和执行系统为构建灵活的自主机器人提供了一种原则性的方法,这种机器人可以通过自动组合大量基本技能来执行各种任务。这个想法几乎和现代机器人技术一样古老。然而,尽管此后提出了各种通用推理架构,但支持基于规划器的控制器构建及其与现代机器人平台集成的通用软件平台直到最近才出现,首先是有影响力的ROSPlan系统。此后,越来越多的基于领域独立模型的机器人任务级控制平台应运而生。在本文中,我们考虑了不同的设计选择和现有平台试图解决的问题,到目前为止提出的不同解决方案,并提出了未来发展的途径。我们还简要讨论了房间里的大象:基础模型。
{"title":"Model-based AI planning and execution platforms for robotics","authors":"Or Wertheim,&nbsp;Ronen I. Brafman","doi":"10.1002/aaai.70034","DOIUrl":"https://doi.org/10.1002/aaai.70034","url":null,"abstract":"<p>Model-based planning and execution systems offer a principled approach to building flexible autonomous robots that can perform diverse tasks by automatically combining a host of basic skills. This idea is almost as old as modern robotics. Yet, while diverse general-purpose reasoning architectures have been proposed since, general-purpose software platforms that support the construction of planner-based controllers and their integration with modern robotic platforms have emerged only recently, starting with the influential ROSPlan system. Since then, a growing number of domain-independent model-based platforms for robot task-level control have emerged. In this paper, we consider the diverse design choices and issues existing platforms attempt to address, the different solutions proposed so far, and suggest avenues for future development. We also briefly discuss the elephant in the room: foundation models.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70034","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145271727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable AI, energy and critical infrastructure systems 可解释的人工智能、能源和关键基础设施系统
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-23 DOI: 10.1002/aaai.70033
Francesco Leofante, André Artelt, Demetrios Eliades, Anna Korre, Francesca Toni, Tim Miller

The AAAI 2025 Bridge on “Explainable AI, Energy and Critical Infrastructure Systems” was held at the Pennsylvania Convention Centre, Philadelphia, Pennsylvania, USA, on February 25, 2025. The bridge gathered researchers and practitioners, bringing together innovation research across explainable AI, energy and critical infrastructure systems so they can enhance each other. The Bridge featured five keynote presentations by experts, one tutorial, poster presentations by authors who contributed their research findings, and three breakout sessions to discuss new challenges arising at the intersection of these exciting disciplines.

2025年2月25日,以“可解释的人工智能、能源和关键基础设施系统”为主题的AAAI 2025年会在美国宾夕法尼亚州费城宾夕法尼亚会议中心举行。这座桥聚集了研究人员和实践者,将可解释的人工智能、能源和关键基础设施系统的创新研究汇集在一起,使它们能够相互增强。“桥梁”有五个专家的主题演讲、一个教程、贡献研究成果的作者的海报演讲,以及三个分组会议,讨论在这些令人兴奋的学科交叉领域出现的新挑战。
{"title":"Explainable AI, energy and critical infrastructure systems","authors":"Francesco Leofante,&nbsp;André Artelt,&nbsp;Demetrios Eliades,&nbsp;Anna Korre,&nbsp;Francesca Toni,&nbsp;Tim Miller","doi":"10.1002/aaai.70033","DOIUrl":"https://doi.org/10.1002/aaai.70033","url":null,"abstract":"<p>The AAAI 2025 Bridge on “Explainable AI, Energy and Critical Infrastructure Systems” was held at the Pennsylvania Convention Centre, Philadelphia, Pennsylvania, USA, on February 25, 2025. The bridge gathered researchers and practitioners, bringing together innovation research across explainable AI, energy and critical infrastructure systems so they can enhance each other. The Bridge featured five keynote presentations by experts, one tutorial, poster presentations by authors who contributed their research findings, and three breakout sessions to discuss new challenges arising at the intersection of these exciting disciplines.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70033","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145181548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Open-source AI at scale: Establishing an enterprise AI strategy through modular frameworks 大规模开源人工智能:通过模块化框架建立企业人工智能战略
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-22 DOI: 10.1002/aaai.70032
Serdar Kadıoğlu

We present a comprehensive enterprise AI strategy developed within the AI Center of Excellence at Fidelity Investments, emphasizing the strategic integration of open-source AI frameworks into scalable, modular, and reproducible enterprise-grade solutions. Our approach is structured around five key pillars: learning from offline data, learning from online feedback, intelligent decision-making, automated assistants, and responsible AI practices. Through a suite of 12 open-source libraries, we demonstrate how modular and interoperable tools can collectively enhance scalability, fairness, and explainability in real-world AI deployments. We further illustrate the impact of this strategy through three enterprise case studies. Finally, we distill a set of best deployment practices to guide organizations in implementing modular, open-source AI strategies at scale.

我们提出了由富达投资人工智能卓越中心开发的全面企业人工智能战略,强调将开源人工智能框架战略性地集成到可扩展、模块化和可复制的企业级解决方案中。我们的方法围绕着五个关键支柱:从离线数据中学习、从在线反馈中学习、智能决策、自动助手和负责任的人工智能实践。通过一套12个开源库,我们展示了模块化和可互操作的工具如何在现实世界的人工智能部署中共同增强可扩展性、公平性和可解释性。我们通过三个企业案例研究进一步说明该策略的影响。最后,我们提炼出一组最佳部署实践,以指导组织大规模实施模块化、开源的人工智能战略。
{"title":"Open-source AI at scale: Establishing an enterprise AI strategy through modular frameworks","authors":"Serdar Kadıoğlu","doi":"10.1002/aaai.70032","DOIUrl":"https://doi.org/10.1002/aaai.70032","url":null,"abstract":"<p>We present a comprehensive enterprise AI strategy developed within the AI Center of Excellence at Fidelity Investments, emphasizing the strategic integration of open-source AI frameworks into scalable, modular, and reproducible enterprise-grade solutions. Our approach is structured around five key pillars: learning from offline data, learning from online feedback, intelligent decision-making, automated assistants, and responsible AI practices. Through a suite of 12 open-source libraries, we demonstrate how modular and interoperable tools can collectively enhance scalability, fairness, and explainability in real-world AI deployments. We further illustrate the impact of this strategy through three enterprise case studies. Finally, we distill a set of best deployment practices to guide organizations in implementing modular, open-source AI strategies at scale.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70032","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145111174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal AI Teacher: Integrating Edge Computing and Reasoning Models for Enhanced Student Error Analysis 多模态人工智能教师:集成边缘计算和推理模型以增强学生错误分析
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-21 DOI: 10.1002/aaai.70030
Tianlong Xu, Yi-Fan Zhang, Zhendong Chu, Qingsong Wen

This paper extends our previously published work on the virtual AI teacher (VATE) system, presented at IAAI-25. VATE is designed to autonomously analyze and correct student errors in mathematical problem-solving using advanced large language models (LLMs). By incorporating student draft images as a primary input for reasoning, the system provides fine-grained error cause analysis and supports real-time, multi-round AI—student dialogues. In this extended version, we introduce a new snap-to-solve module for handling low-reasoning tasks using edge-deployed LLMs, enabling faster and partially offline interaction. We also include expanded benchmarking experiments, including human expert evaluations and ablation studies, to assess model performance and learning outcomes. Deployed on the Squirrel AI platform, VATE demonstrates high accuracy (78.3%) in error analysis and improves student learning efficiency, with strong user satisfaction. These results suggest that VATE is a scalable, cost-effective solution with the potential to transform educational practices.

本文扩展了我们之前在iai -25上发表的关于虚拟人工智能教师(VATE)系统的工作。VATE旨在使用先进的大型语言模型(llm)自主分析和纠正学生在数学问题解决方面的错误。通过将学生草稿图像作为推理的主要输入,该系统提供了细粒度的错误原因分析,并支持实时、多轮人工智能学生对话。在这个扩展版本中,我们引入了一个新的快照解决模块,用于处理使用边缘部署llm的低推理任务,从而实现更快的部分离线交互。我们还包括扩展的基准实验,包括人类专家评估和消融研究,以评估模型的性能和学习结果。VATE部署在Squirrel AI平台上,误差分析准确率高达78.3%,提高了学生的学习效率,用户满意度高。这些结果表明,VATE是一种可扩展的、具有成本效益的解决方案,具有改变教育实践的潜力。
{"title":"Multimodal AI Teacher: Integrating Edge Computing and Reasoning Models for Enhanced Student Error Analysis","authors":"Tianlong Xu,&nbsp;Yi-Fan Zhang,&nbsp;Zhendong Chu,&nbsp;Qingsong Wen","doi":"10.1002/aaai.70030","DOIUrl":"https://doi.org/10.1002/aaai.70030","url":null,"abstract":"<p>This paper extends our previously published work on the virtual AI teacher (VATE) system, presented at IAAI-25. VATE is designed to autonomously analyze and correct student errors in mathematical problem-solving using advanced large language models (LLMs). By incorporating student draft images as a primary input for reasoning, the system provides fine-grained error cause analysis and supports real-time, multi-round AI—student dialogues. In this extended version, we introduce a new snap-to-solve module for handling low-reasoning tasks using edge-deployed LLMs, enabling faster and partially offline interaction. We also include expanded benchmarking experiments, including human expert evaluations and ablation studies, to assess model performance and learning outcomes. Deployed on the Squirrel AI platform, VATE demonstrates high accuracy (78.3%) in error analysis and improves student learning efficiency, with strong user satisfaction. These results suggest that VATE is a scalable, cost-effective solution with the potential to transform educational practices.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70030","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145102329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated vulnerability evaluation with large language models and vulnerability ontologies 使用大型语言模型和漏洞本体的自动化漏洞评估
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-15 DOI: 10.1002/aaai.70031
Rikhiya Ghosh, Hans-Martin von Stockhausen, Martin Schmitt, George Marica Vasile, Sanjeev Kumar Karn, Oladimeji Farri

The National Vulnerability Database (NVD) publishes over a thousand new vulnerabilities monthly, with a projected 25 percent increase in 2024, highlighting the crucial need for rapid vulnerability identification to mitigate cybersecurity attacks and save costs and resources. In this work, we propose using large language models (LLMs) to learn vulnerability evaluation from historical assessments of medical device vulnerabilities in a single manufacturer's portfolio. We highlight the effectiveness and challenges of using LLMs for automatic vulnerability evaluation and introduce a method to enrich historical data with cybersecurity ontologies, enabling the system to understand new vulnerabilities without retraining the LLM. Our LLM system integrates with the in-house application—Cybersecurity Management System (CSMS)—to help Siemens Healthineers (SHS) product cybersecurity experts efficiently assess the vulnerabilities in our products. Also, we present a comprehensive set of experiments that helps showcase the properties of the LLM and dataset, the various guardrails we have implemented to safeguard the system in production, and the guidelines for efficient integration of LLMs into the cybersecurity tool.

国家漏洞数据库(NVD)每月发布1000多个新漏洞,预计到2024年将增加25%,这凸显了快速识别漏洞以减轻网络安全攻击并节省成本和资源的关键需求。在这项工作中,我们建议使用大型语言模型(llm)从单个制造商组合中的医疗设备漏洞的历史评估中学习漏洞评估。我们强调了使用LLM进行自动漏洞评估的有效性和挑战,并引入了一种使用网络安全本体丰富历史数据的方法,使系统能够在不重新训练LLM的情况下理解新的漏洞。我们的法学硕士系统集成了内部应用程序网络安全管理系统(csm),以帮助西门子医疗(SHS)产品网络安全专家有效地评估我们产品中的漏洞。此外,我们还提供了一组全面的实验,有助于展示LLM和数据集的属性,我们为保护生产中的系统而实施的各种护栏,以及将LLM有效集成到网络安全工具中的指导方针。
{"title":"Automated vulnerability evaluation with large language models and vulnerability ontologies","authors":"Rikhiya Ghosh,&nbsp;Hans-Martin von Stockhausen,&nbsp;Martin Schmitt,&nbsp;George Marica Vasile,&nbsp;Sanjeev Kumar Karn,&nbsp;Oladimeji Farri","doi":"10.1002/aaai.70031","DOIUrl":"https://doi.org/10.1002/aaai.70031","url":null,"abstract":"<p>The National Vulnerability Database (NVD) publishes over a thousand new vulnerabilities monthly, with a projected 25 percent increase in 2024, highlighting the crucial need for rapid vulnerability identification to mitigate cybersecurity attacks and save costs and resources. In this work, we propose using large language models (LLMs) to learn vulnerability evaluation from historical assessments of medical device vulnerabilities in a single manufacturer's portfolio. We highlight the effectiveness and challenges of using LLMs for automatic vulnerability evaluation and introduce a method to enrich historical data with cybersecurity ontologies, enabling the system to understand new vulnerabilities without retraining the LLM. Our LLM system integrates with the in-house application—Cybersecurity Management System (CSMS)—to help Siemens Healthineers (SHS) product cybersecurity experts efficiently assess the vulnerabilities in our products. Also, we present a comprehensive set of experiments that helps showcase the properties of the LLM and dataset, the various guardrails we have implemented to safeguard the system in production, and the guidelines for efficient integration of LLMs into the cybersecurity tool.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70031","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145057736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OnAIR: Applications of the NASA on-board artificial intelligence research platform OnAIR: NASA机载人工智能研究平台的应用
IF 3.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-09-15 DOI: 10.1002/aaai.70020
Evana Gizzi, Connor Firth, Caleb Adams, James Berck, P. Timothy Chase Jr, Christian Cassamajor-Paul, Rachael Chertok, Lily Clough, Jonathan Davis, Melissa De La Cruz, Matthew Dosberg, Alan Gibson, Jonathan Hammer, Ibrahim Haroon, Michael A. Johnson, Brian Kempa, James Marshall, Patrick Maynard, Brett McKinney, Leyton McKinney, Michael Monaghan, Robin Onsay, Hayley Owens, Sam Pedrotty, Daniel Rogers, Mahmooda Sultana, Jivko Sinapov, Bethany Theiling, Aaron Woodard, Caroline Zouloumian, Connor Williams

Infusing artificial intelligence algorithms into production aerospace systems can be challenging due to costs, timelines, and a risk-averse industry. We introduce the Onboard Artificial Intelligence Research (OnAIR) platform, an open-source software pipeline and cognitive architecture tool that enables full life cycle AI research for on-board intelligent systems. We begin with a description and user walk-through of the OnAIR tool. Next, we describe four use cases of OnAIR for both research and deployed onboard applications, detailing their use of OnAIR and the benefits it provided to the development and function of each respective scenario. Lastly, we describe two upcoming planned deployments which will leverage OnAIR for crucial mission outcomes. We conclude with remarks on future work and goals for the forward progression of OnAIR as a tool to enable a larger AI and aerospace research community.

由于成本、时间和厌恶风险的行业,将人工智能算法注入生产航空航天系统可能具有挑战性。我们推出车载人工智能研究(OnAIR)平台,这是一个开源软件管道和认知架构工具,可实现车载智能系统的全生命周期人工智能研究。我们从OnAIR工具的描述和用户演练开始。接下来,我们将描述用于研究和部署机载应用程序的OnAIR的四个用例,详细介绍它们对OnAIR的使用以及它为每个各自场景的开发和功能提供的好处。最后,我们描述了两个即将到来的计划部署,它们将利用OnAIR实现关键的任务结果。最后,我们对OnAIR作为一种工具的未来工作和目标进行了评论,以实现更大的人工智能和航空航天研究界。
{"title":"OnAIR: Applications of the NASA on-board artificial intelligence research platform","authors":"Evana Gizzi,&nbsp;Connor Firth,&nbsp;Caleb Adams,&nbsp;James Berck,&nbsp;P. Timothy Chase Jr,&nbsp;Christian Cassamajor-Paul,&nbsp;Rachael Chertok,&nbsp;Lily Clough,&nbsp;Jonathan Davis,&nbsp;Melissa De La Cruz,&nbsp;Matthew Dosberg,&nbsp;Alan Gibson,&nbsp;Jonathan Hammer,&nbsp;Ibrahim Haroon,&nbsp;Michael A. Johnson,&nbsp;Brian Kempa,&nbsp;James Marshall,&nbsp;Patrick Maynard,&nbsp;Brett McKinney,&nbsp;Leyton McKinney,&nbsp;Michael Monaghan,&nbsp;Robin Onsay,&nbsp;Hayley Owens,&nbsp;Sam Pedrotty,&nbsp;Daniel Rogers,&nbsp;Mahmooda Sultana,&nbsp;Jivko Sinapov,&nbsp;Bethany Theiling,&nbsp;Aaron Woodard,&nbsp;Caroline Zouloumian,&nbsp;Connor Williams","doi":"10.1002/aaai.70020","DOIUrl":"https://doi.org/10.1002/aaai.70020","url":null,"abstract":"<p>Infusing artificial intelligence algorithms into production aerospace systems can be challenging due to costs, timelines, and a risk-averse industry. We introduce the Onboard Artificial Intelligence Research (OnAIR) platform, an open-source software pipeline and cognitive architecture tool that enables full life cycle AI research for on-board intelligent systems. We begin with a description and user walk-through of the OnAIR tool. Next, we describe four use cases of OnAIR for both research and deployed onboard applications, detailing their use of OnAIR and the benefits it provided to the development and function of each respective scenario. Lastly, we describe two upcoming planned deployments which will leverage OnAIR for crucial mission outcomes. We conclude with remarks on future work and goals for the forward progression of OnAIR as a tool to enable a larger AI and aerospace research community.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"46 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.70020","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145057735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Ai Magazine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1