Genetic programming (GP) has been widely applied to evolve scheduling heuristics for dynamic flexible job shop scheduling (DFJSS). However, the evaluation of GP individuals is computationally expensive, especially in large-scale DFJSS scenarios. A k-nearest neighbor (KNN) based surrogate has been successfully used to reduce individual evaluation time for GP by predicting the fitness of an individual from its most similar sample in the KNN archive. In particular, the phenotypes of GP individuals have been utilized to generate samples for KNN-based surrogates, under the precondition that individuals with the same phenotype have the same or similar fitness. However, their real fitness may differ greatly due to different input decision situations for fitness calculation in DFJSS. Thus, considering only the phenotypes of GP individuals when extracting samples can decrease the accuracy of KNN surrogates. This article proposes a KNN-based surrogate-assisted GP algorithm that considers both the phenotype and genotype of GP individuals to generate samples. Specifically, a genotypic characterization based on terminal frequency is designed to measure the similarity of individual genotypes. The results show that, with the same training time, the proposed algorithm converges faster and achieves better scheduling heuristics than state-of-the-art algorithms in most examined scenarios. With the same number of generations, the proposed algorithm obtains comparable performance while needing only about one third of the training time of the baseline GP. The effectiveness of the proposed algorithm is also verified from different aspects, e.g., the relation between genotype correlation and fitness difference of individuals, and population diversity.
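The core surrogate step — predict a new individual's fitness as that of its nearest archived sample, with nearness measured in both phenotype and genotype space — can be sketched as follows. This is an illustrative toy, not the paper's implementation; the feature encodings (phenotype as decision ranks, genotype as terminal frequencies) and the distance weights are assumptions:

```python
import math

def knn_surrogate_fitness(candidate, samples, w_pheno=0.5, w_geno=0.5):
    """Predict fitness as that of the nearest archived sample.

    Each individual carries two feature vectors:
      - 'pheno': the ranks the rule assigns to a fixed set of decision situations
      - 'geno' : normalized frequency of each terminal in the GP tree
    Distance is a weighted sum of Euclidean distances in both spaces.
    """
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    best = min(
        samples,
        key=lambda s: w_pheno * dist(candidate["pheno"], s["pheno"])
        + w_geno * dist(candidate["geno"], s["geno"]),
    )
    return best["fitness"]

# Toy archive: two evaluated individuals with identical phenotype but
# different terminal-frequency genotypes and different true fitness.
archive = [
    {"pheno": [1, 2, 3], "geno": [0.6, 0.4, 0.0], "fitness": 10.0},
    {"pheno": [1, 2, 3], "geno": [0.1, 0.1, 0.8], "fitness": 25.0},
]
query = {"pheno": [1, 2, 3], "geno": [0.5, 0.5, 0.0]}
print(knn_surrogate_fitness(query, archive))  # genotype breaks the tie -> 10.0
```

With phenotype alone the two archived samples are indistinguishable; the genotype term is what lets the surrogate pick the sample whose tree structure actually resembles the query.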
Luyao Zhu, Fangfang Zhang, Xiaodong Zhu, Ke Chen, and Mengjie Zhang, "Phenotype and Genotype Based Sample Aware Surrogate-Assisted Genetic Programming in Dynamic Flexible Job Shop Scheduling," IEEE Transactions on Artificial Intelligence, vol. 6, no. 12, pp. 3232–3247, 2025, doi: 10.1109/TAI.2025.3562161.
Pub Date: 2025-04-17 | DOI: 10.1109/TAI.2025.3562160
Yuxing Xing;Caixia Chen;Jie Wu;Jie Chen
The potential game has been widely used to describe multiagent task allocation. However, traditional game-theoretic algorithms have shown unsatisfactory performance in scenarios with a high agent count. To address this, we employ a reinforcement learning algorithm that enables each agent to independently make decisions in response to other agents’ decisions and to variations in the number of agents, ultimately working toward a desired goal. First, we construct a potential game for multiagent task allocation and design a corresponding utility function for each agent. Then, we propose a deep Q-network algorithm based on a graph neural network and enhance the agent selection mechanism in this learning algorithm. During each iteration, a task is randomly selected for an agent from the participant set, and each agent updates its strategy accordingly. Finally, comparing against several representative game-theoretic algorithms, the numerical simulations highlight the advantages and performance of our proposed GDQ-Net algorithm across various tasks and numbers of agents under the constructed model.
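A minimal sketch of the game-theoretic backbone — agents repeatedly best-responding under a marginal-contribution ("wonderful-life") utility, a standard construction that guarantees a potential function exists — is shown below. This is a hand-rolled toy, not the GDQ-Net learning algorithm; the welfare model and all names are assumptions:

```python
import random

def global_welfare(assignment, task_value):
    # Diminishing returns: each task yields value * (1 - 0.5**n_agents_on_it).
    welfare = 0.0
    for t, v in task_value.items():
        n = sum(1 for a in assignment if a == t)
        welfare += v * (1 - 0.5 ** n)
    return welfare

def marginal_utility(i, assignment, task_value):
    # Wonderful-life utility: agent i's contribution to global welfare.
    # Aligning each utility with the welfare makes the game a potential game.
    null = assignment[:i] + [None] + assignment[i + 1:]
    return global_welfare(assignment, task_value) - global_welfare(null, task_value)

def best_response_dynamics(n_agents, task_value, rounds=20, seed=0):
    rng = random.Random(seed)
    tasks = list(task_value)
    assignment = [rng.choice(tasks) for _ in range(n_agents)]
    for _ in range(rounds):
        i = rng.randrange(n_agents)  # one randomly selected agent updates
        assignment[i] = max(tasks, key=lambda t: marginal_utility(
            i, assignment[:i] + [t] + assignment[i + 1:], task_value))
    return assignment
```

Each update weakly increases the potential (here, the global welfare), which is why best-response dynamics converge in finite potential games; the RL machinery in the article replaces this exhaustive best response with learned Q-values.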
Yuxing Xing, Caixia Chen, Jie Wu, and Jie Chen, "Reinforcement Learning for Efficient Multiagent Task Allocation in Potential Game Model," IEEE Transactions on Artificial Intelligence, vol. 6, no. 12, pp. 3217–3231, 2025, doi: 10.1109/TAI.2025.3562160.
Pub Date: 2025-04-16 | DOI: 10.1109/TAI.2025.3560921
Hailong Hu;Jun Pang
Generative adversarial networks (GANs) have shown remarkable success in image synthesis, making GAN models themselves commercially valuable to their legitimate owners. It is therefore critical to technically protect the intellectual property of GANs. Prior works need to tamper with the training set or the training process to verify the ownership of a GAN. In this article, we show that these methods are not robust to emerging model extraction attacks. We then propose a new method, GAN-Guards, which utilizes the common characteristics of a target model and its stolen models for ownership infringement detection. Our method is directly applicable to all well-trained GANs, as it does not require retraining target models. Extensive experimental results show that our new method achieves superior detection performance compared with watermark-based and fingerprint-based methods. Finally, we demonstrate the effectiveness of our method with respect to the number of generations of model extraction attacks, the number of generated samples, and adaptive attacks.
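The underlying intuition — a stolen model's outputs remain statistically much closer to the target's than an independently trained model's — can be illustrated with a toy detector. The mean-feature statistic and the margin rule below are assumptions for illustration, not GAN-Guards itself:

```python
import random
import math

def mean_feature(samples):
    # Average feature vector over a model's generated samples.
    dim = len(samples[0])
    return [sum(s[d] for s in samples) / len(samples) for d in range(dim)]

def model_distance(samples_a, samples_b):
    # Distance between the average features of two sample sets; a stolen
    # model is expected to sit far closer to its target than an
    # independently trained one.
    ma, mb = mean_feature(samples_a), mean_feature(samples_b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(ma, mb)))

def is_stolen(target, suspect, independent, margin=2.0):
    # Flag infringement when the suspect is `margin` times closer to the
    # target than an independently trained reference model is.
    return model_distance(target, suspect) * margin < model_distance(target, independent)

rng = random.Random(42)
target = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(200)]
stolen = [[x + rng.gauss(0.0, 0.1) for x in s] for s in target]       # near-copy
other = [[rng.gauss(3.0, 1.0) for _ in range(8)] for _ in range(200)]  # independent
print(is_stolen(target, stolen, other))  # True
```

Real detectors compare richer statistics (e.g., deep perceptual features) rather than raw means, but the relative-distance comparison is the same shape of test.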
Hailong Hu and Jun Pang, "Ownership Infringement Detection for Generative Adversarial Networks Against Model Stealing," IEEE Transactions on Artificial Intelligence, vol. 6, no. 11, pp. 3018–3029, 2025, doi: 10.1109/TAI.2025.3560921.
Deep learning (DL) has made significant advancements in tomographic imaging, particularly in low-dose computed tomography (LDCT) denoising. A recent trend involves servers training powerful models with enormous self-collected data and providing application programming interfaces (APIs) for users, such as ChatGPT. To avoid model leakage, users are required to upload their data to the server. This approach is particularly advantageous for devices with limited computational capabilities, as it offloads computation to the server, easing the workload on the devices themselves. However, this paradigm raises public concerns about the risk of privacy disclosure. Hence, to alleviate these concerns, we propose to denoise LDCT directly in the encrypted domain, achieving privacy-preserving cloud services without exposing private data to the server. Concretely, we employ homomorphic encryption to encrypt private LDCT images, which are then transferred to the server model trained with plaintext LDCT for denoising. Since fundamental DL operations, such as convolution and linear transformation, cannot be used directly in the encrypted domain, we transform the fundamental mathematical operations in the plaintext domain into operations in the encrypted domain. Moreover, we present two interactive frameworks for linear and nonlinear models, both of which achieve lossless operation. In this way, the proposed method achieves two merits: data privacy is well protected, and the server model is free from the risk of model leakage. Moreover, we provide theoretical proof to validate the lossless property of our framework. Finally, experiments were conducted to demonstrate that the transferred contents are well protected and cannot be reconstructed.¹
¹ The code is released at https://github.com/Zi-YuanYang/Encrypt_LDCT_Recon
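Why linear operations survive encryption can be seen with a textbook additively homomorphic scheme: under Paillier, E(a)·E(b) = E(a+b) and E(a)^k = E(k·a), which is exactly what a convolution or linear layer needs. The sketch below uses tiny, insecure parameters purely for illustration (Python 3.9+ for `math.lcm` and 3.8+ for modular-inverse `pow`); it is the generic textbook scheme, not the paper's framework:

```python
import math
import random

# Minimal textbook Paillier cryptosystem with toy parameters, just to show
# that an additively homomorphic scheme lets a server apply a plaintext
# linear filter to encrypted pixels without ever seeing them.
p, q = 293, 433                      # toy primes; real use needs >=1024-bit primes
n, n2 = p * q, (p * q) ** 2
g = n + 1
lam = math.lcm(p - 1, q - 1)
mu = pow((pow(g, lam, n2) - 1) // n, -1, n)
rng = random.Random(7)

def enc(m):
    while True:
        r = rng.randrange(1, n)
        if math.gcd(r, n) == 1:      # r must be invertible mod n
            break
    return pow(g, m, n2) * pow(r, n, n2) % n2

def dec(c):
    return (pow(c, lam, n2) - 1) // n * mu % n

# Server-side linear filter [1, 2, 1] applied to encrypted pixels:
# E(a)*E(b) = E(a+b), E(a)^k = E(k*a), so the weighted sum is computable
# entirely on ciphertexts.
pixels = [5, 9, 4]
cts = [enc(x) for x in pixels]
filtered_ct = cts[0] * pow(cts[1], 2, n2) % n2 * cts[2] % n2
print(dec(filtered_ct))  # 5 + 2*9 + 4 = 27
```

Nonlinear activations have no such homomorphic shortcut, which is why the article needs interactive protocols for the nonlinear parts of the network.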
Ziyuan Yang, Huijie Huangfu, Maosong Ran, Zhiwen Wang, Hui Yu, Mengyu Sun, and Yi Zhang, "A Novel Privacy-Enhancing Framework for Low-Dose CT Denoising," IEEE Transactions on Artificial Intelligence, vol. 6, no. 11, pp. 3043–3055, 2025, doi: 10.1109/TAI.2025.3561092.
Recovering the structure of causal graphical models from observational data is an essential yet challenging task for causal discovery in scientific scenarios. Domain-specific causal discovery usually relies on expert validation or prior analysis to improve the reliability of recovered causality, yet this is limited by the scarcity of expert resources. Recently, large language models (LLMs) have been used for causal analysis across various domain-specific scenarios, suggesting their potential to serve as autonomous experts guiding data-based structure learning. However, integrating LLMs into causal discovery faces challenges due to inaccuracies in LLM-based reasoning about the actual causal structure. To address this challenge, we propose an error-tolerant LLM-driven causal discovery framework. The error-tolerant mechanism is threefold, with careful consideration of potential inaccuracies. In the LLM-based reasoning process, an accuracy-oriented prompting strategy restricts causal analysis to a reliable range. Next, a knowledge-to-structure transition aligns LLM-derived causal statements with structural causal interactions. In the structure learning process, the goodness-of-fit to data and adherence to LLM-derived priors are balanced to further address prior inaccuracies. Evaluation on eight real-world causal structures demonstrates the efficacy of our LLM-driven approach in improving data-based causal discovery, along with its robustness to inaccurate LLM-derived priors.
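The balancing step — trading off data fit against adherence to possibly wrong LLM priors — can be sketched as a regularized structure score. The scoring form, the weight λ, and the edge-confidence encoding below are all assumptions for illustration, not the paper's objective:

```python
def regularized_score(graph_edges, data_fit, llm_priors, lam=1.0):
    """Score a candidate edge set by balancing fit to data against
    agreement with (possibly inaccurate) LLM-derived edge priors.

    llm_priors maps a (cause, effect) pair to a confidence in [0, 1];
    soft agreement terms let a strong data signal override a wrong prior
    instead of treating LLM output as a hard constraint.
    """
    agreement = 0.0
    for edge, conf in llm_priors.items():
        agreement += conf if edge in graph_edges else -conf
    return data_fit + lam * agreement

# Two candidate orientations with equal (hypothetical) data-fit scores:
priors = {("rain", "wet_grass"): 0.9, ("wet_grass", "rain"): 0.2}
g1 = {("rain", "wet_grass")}
g2 = {("wet_grass", "rain")}
print(regularized_score(g1, data_fit=-10.0, llm_priors=priors))  # ~ -9.3, prior agrees
print(regularized_score(g2, data_fit=-10.0, llm_priors=priors))  # ~ -10.7, prior disagrees
```

When the data cannot distinguish two orientations, the prior tips the balance; when the data-fit gap exceeds λ times the prior confidence, the data wins, which is the error-tolerance property the abstract describes.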
Taiyu Ban, Lyuzhou Chen, Derui Lyu, Xiangyu Wang, Qinrui Zhu, Qiang Tu, and Huanhuan Chen, "Integrating Large Language Model for Improved Causal Discovery," IEEE Transactions on Artificial Intelligence, vol. 6, no. 11, pp. 3030–3042, 2025, doi: 10.1109/TAI.2025.3560927.
Open set domain adaptation (OSDA) copes with distribution and label shifts between the source and target domains simultaneously, performing accurate classification of known classes while identifying unknown-class samples in the target domain. Most existing OSDA approaches, which depend on the final image feature space of deep models, require manually tuned thresholds and may easily misclassify unknown samples as known classes. Mixture-of-experts (MoE) could be a remedy. Within an MoE, different experts handle distinct input features, producing unique expert routing patterns for the various classes in a routing feature space. As a result, unknown-class samples may display expert routing patterns different from those of known classes. This article proposes dual-space detection, which exploits inconsistencies between the image feature space and the routing feature space to detect unknown-class samples without any threshold. A graph router is further introduced to better exploit the spatial information among image patches. Experiments on three datasets validated the effectiveness and superiority of our approach.
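The threshold-free idea — flag a sample as unknown when the two spaces disagree about its nearest known class — can be sketched with class prototypes. The prototype representation and 2-D features are assumptions for illustration, not the article's architecture:

```python
import math

def nearest_label(x, prototypes):
    # Label of the closest class prototype under Euclidean distance.
    return min(prototypes, key=lambda c: math.dist(x, prototypes[c]))

def dual_space_detect(img_feat, route_feat, img_protos, route_protos):
    """Threshold-free unknown detection: a sample is 'unknown' when the
    image-feature space and the expert-routing space disagree on its
    nearest known class; otherwise the agreed class is returned."""
    a = nearest_label(img_feat, img_protos)
    b = nearest_label(route_feat, route_protos)
    return "unknown" if a != b else a

# Toy prototypes per known class in each space:
img_protos = {"cat": [0.0, 0.0], "dog": [1.0, 1.0]}
route_protos = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
print(dual_space_detect([0.1, 0.1], [0.9, 0.1], img_protos, route_protos))  # cat
print(dual_space_detect([0.1, 0.1], [0.1, 0.9], img_protos, route_protos))  # unknown
```

The decision needs no tuned distance cutoff: only cross-space consistency matters, which is the property the abstract emphasizes over threshold-based OSDA detectors.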
Zhenbang Du, Jiayu An, Yunlu Tu, Jiahao Hong, and Dongrui Wu, "Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach," IEEE Transactions on Artificial Intelligence, vol. 6, no. 12, pp. 3207–3216, 2025, doi: 10.1109/TAI.2025.3560590.
Pub Date: 2025-04-14 | DOI: 10.1109/TAI.2025.3560592
Guojie Li;Zhiwen Yu;Kaixiang Yang;Ziwei Fan;C. L. Philip Chen
The broad learning system (BLS) has been widely researched and applied in the field of semisupervised learning. However, current semisupervised BLS methods rely on predefined graph structures. High-dimensional small-sample data, characterized by abundant redundant and noisy features with complex distribution patterns, often leads to poor-quality predefined graphs, thereby constraining model performance. Additionally, the random generation of feature and enhancement nodes in BLS, combined with limited data labels, results in suboptimal performance. To address these issues, this article first proposes a broad learning system with adaptive locality preservation (BLS-ALP). This method employs adaptive locality-preservation constraints in the output space to ensure that similar samples share the same label, iteratively updating the graph structure. To further enhance the performance of BLS-ALP, an incremental ensemble framework (IBLS-ALP) is proposed. This framework effectively mitigates the impact of redundant and noisy features by using multiple random subspaces instead of the original high-dimensional space. Additionally, IBLS-ALP enhances the utilization of a small number of labels by incorporating residual labels, thereby significantly improving the model’s overall performance. Extensive experiments on various high-dimensional small-sample datasets demonstrate that IBLS-ALP exhibits superior performance.
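The random-subspace idea — vote over many low-dimensional views of the data so that redundant and noisy features cannot dominate any single learner — can be sketched with a toy 1-NN ensemble. The base learner and every parameter below are assumptions for illustration, not IBLS-ALP:

```python
import random
from collections import Counter

def random_subspace_ensemble(train_X, train_y, test_x, n_views=5, dim=3, seed=1):
    """Ensemble over random feature subspaces: each view classifies with a
    1-NN rule on its own random subset of features, and the views vote.
    Many low-dimensional views dilute the influence of redundant/noisy
    features in high-dimensional small-sample data."""
    rng = random.Random(seed)
    n_feat = len(train_X[0])
    votes = []
    for _ in range(n_views):
        feats = rng.sample(range(n_feat), dim)   # one random subspace

        def proj(x):
            return [x[f] for f in feats]

        nearest = min(
            range(len(train_X)),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(proj(train_X[i]), proj(test_x))),
        )
        votes.append(train_y[nearest])
    return Counter(votes).most_common(1)[0][0]

# Toy 6-D data: class 0 clusters near the origin, class 1 near (1, ..., 1).
train_X = [[0.0] * 6, [0.1] * 6, [1.0] * 6, [0.9] * 6]
train_y = [0, 0, 1, 1]
print(random_subspace_ensemble(train_X, train_y, [0.05] * 6))  # 0
```

IBLS-ALP additionally learns the graph adaptively and propagates residual labels; the sketch only captures the subspace-ensemble ingredient.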
Guojie Li, Zhiwen Yu, Kaixiang Yang, Ziwei Fan, and C. L. Philip Chen, "Incremental Semisupervised Learning With Adaptive Locality Preservation for High-Dimensional Data," IEEE Transactions on Artificial Intelligence, vol. 6, no. 11, pp. 2990–3004, 2025, doi: 10.1109/TAI.2025.3560592.
One of the main challenges in electroencephalography (EEG) emotion recognition is the limited understanding of the biological properties of the brain and how they relate to emotions. To address this issue, this article proposes an implicit emotion regulatory mechanism inspired contrastive learning framework (CLIER) for EEG emotion recognition. The framework simulates the complex relationship between emotions and the underlying neurobiological processes, mainly through three parts. First, to leverage the interindividual variability of emotional expression, the emotion features of each individual are captured by a dynamic connection graph in the subject-dependent setting. Subsequently, reverse regulation is simulated by contrastive learning based on label information and data augmentation, capturing more biologically specific emotional features. Finally, motivated by the asymmetry between the left and right hemispheres of the human brain in emotional responses, brain-lateralization mutual learning facilitates the fusion of the two hemispheres in determining emotions. Experiments on the SEED, SEED-IV, SEED-V, and EREMUS datasets show impressive results: 93.4% accuracy on SEED, 90.2% on SEED-IV, 82.46% on SEED-V, and 41.63% on EREMUS. Under an identical experimental protocol, our model outperforms the majority of existing methods, showcasing its effectiveness in EEG emotion recognition.
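The hemispheric-fusion step can be sketched as two branches whose class probabilities are averaged before the final decision. The logit encoding and averaging rule below are assumptions for illustration, not CLIER's mutual-learning architecture:

```python
import math

def softmax(z):
    # Numerically stable softmax over a list of logits.
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def fuse_hemispheres(left_logits, right_logits):
    """Fusion of two hemisphere branches: each produces class
    probabilities, and the final emotion is the argmax of their average,
    so neither hemisphere's view alone decides the label."""
    pl, pr = softmax(left_logits), softmax(right_logits)
    fused = [(a + b) / 2 for a, b in zip(pl, pr)]
    return fused.index(max(fused))
```

In mutual learning the two branches would additionally be trained to align their predictions; the sketch shows only the inference-time fusion.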
Dongdong Li, Zhishuo Jin, Yujun Shen, Zhe Wang, and Suo Jiang, "EEG Emotion Recognition Based on an Implicit Emotion Regulatory Mechanism," IEEE Transactions on Artificial Intelligence, vol. 6, no. 11, pp. 3005–3017, 2025, doi: 10.1109/TAI.2025.3560593.
Pub Date: 2025-04-11 | DOI: 10.1109/TAI.2025.3560248
Zhiqiang Ge;Duxin Chen;Wenwu Yu
Recently, probabilistic latent variable models have played an important role in data analytics in various industrial application scenarios, such as process monitoring, fault diagnosis, and soft sensing. Inspired by the idea of lightweight deep learning, this article proposes a new deep residual learning method for the probabilistic partial least squares (PLS) model. First, layerwise probabilistic modeling is carried out to extract supervised latent variables in different hidden layers of the deep model, using a well-designed expectation-maximization algorithm for parameter optimization. Through this layerwise residual learning process, more target-related latent variables can be extracted, supervised by the outputs of the predictive model. Next, an additional probabilistic model is constructed for information fusion and further extraction of supervised latent variables highly related to the modeling target. In fact, this step can be considered an ensemble learning strategy, which has great potential to decrease modeling error and reduce prediction uncertainty. A soft-sensing strategy is then developed for online prediction of key variables. The performance is evaluated on two industrial examples. Compared to the shallow probabilistic model, the performance of the deep model is improved by 10%–20%.
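The layerwise residual principle — fit each new latent component to whatever the previous layers left unexplained — reduces, in the simplest deterministic case, to sequential regression on residuals. The sketch below is that deterministic caricature, not the probabilistic EM-based model of the article:

```python
def fit_slope(xs, ys):
    # Least-squares slope through the origin for one latent direction.
    num = sum(x * y for x, y in zip(xs, ys))
    den = sum(x * x for x in xs)
    return num / den

def layerwise_residual_fit(layers_x, y):
    """Each 'layer' extracts one supervised component, and the next layer
    is fitted to what the previous layers failed to explain, so deeper
    layers capture the remaining target-related information."""
    residual = list(y)
    slopes = []
    for xs in layers_x:
        b = fit_slope(xs, residual)
        slopes.append(b)
        residual = [r - b * x for r, x in zip(residual, xs)]
    return slopes, residual

# Toy target y = 2*x1 + 3*x2 with orthogonal latent directions x1, x2:
x1 = [1.0, 1.0, -1.0, -1.0]
x2 = [1.0, -1.0, 1.0, -1.0]
y = [5.0, -1.0, 1.0, -5.0]
slopes, res = layerwise_residual_fit([x1, x2], y)
print(slopes)  # [2.0, 3.0] — each layer recovers one coefficient
```

The probabilistic version replaces each least-squares fit with an EM-estimated latent variable model and adds a fusion layer over all extracted components, but the residual-passing structure is the same.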
Zhiqiang Ge, Duxin Chen, and Wenwu Yu, "Deep Residual Learning of a Probabilistic Partial Least Squares Model for Predictive Data Analytics," IEEE Transactions on Artificial Intelligence, vol. 6, no. 11, pp. 2977–2989, 2025, doi: 10.1109/TAI.2025.3560248.
Pub Date : 2025-04-09DOI: 10.1109/TAI.2025.3558183
Nilufar Zaman;Angshuman Jana
In today’s world, online services have revolutionized human activities, and consumers expect their service providers to make their online experiences more fruitful by recommending relevant services to them. It becomes genuinely challenging for a service provider to generate recommendations for a user whose information and preferences are unavailable. This issue is handled by the cross-domain approach, which explores similar users across various domains on the same platform. However, the main limitation of the cross-domain approach is that the information must be available in some domain of a single platform. A multidomain recommendation is therefore designed to optimize recommendation performance by analyzing information obtained from multiple platforms. Existing multidomain recommendation models face two main challenges. First, there are no overlapping users from which to learn similarities between them. Second, the transfer learning approach in the multidomain setting allows information to be transferred only from the source to the target domain. Therefore, our proposed approach uses the parallel inductive shift learning (PISL) model to address these two challenges. For the first challenge, we identify user–user and user–item similarities by considering various features of users and items. For the second challenge, our model analyzes the source and target domains simultaneously, performing a parallel transfer of information from the source to the target domain and vice versa. We have tested our model on three real-life movie datasets and three book datasets: for the movie datasets, we used the MovieLens, Amazon, and Netflix datasets; for the book datasets, we used the Amazon, Good Reads, and Book Crossing datasets. Our model outperforms the other state-of-the-art approaches on these benchmarks.
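As a rough illustration of the first ingredient (relating non-overlapping users across platforms by their features), one could compare user feature vectors with cosine similarity and pair each user on one platform with the most similar user on the other, in both directions at once. This is a hypothetical sketch of the idea, not the PISL implementation.

```python
import numpy as np

def cosine_matrix(A, B):
    """Pairwise cosine similarity between rows of A and rows of B."""
    An = A / np.linalg.norm(A, axis=1, keepdims=True)
    Bn = B / np.linalg.norm(B, axis=1, keepdims=True)
    return An @ Bn.T

def match_users(source_feats, target_feats):
    """For each user on one platform, index of the most similar user on the other.

    Both directions are computed from the same similarity matrix, mirroring
    the parallel (bidirectional) transfer idea: source->target and
    target->source matches are obtained simultaneously.
    """
    sims = cosine_matrix(target_feats, source_feats)  # rows: target users
    to_source = sims.argmax(axis=1)   # best source match per target user
    to_target = sims.argmax(axis=0)   # best target match per source user
    return to_source, to_target
```

In a full system, these matches would seed the transfer of preference information in both directions rather than only source-to-target.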
{"title":"Parallel Inductive Shift Learning Based Recommendation System","authors":"Nilufar Zaman;Angshuman Jana","doi":"10.1109/TAI.2025.3558183","DOIUrl":"https://doi.org/10.1109/TAI.2025.3558183","url":null,"abstract":"In today’s world, online services have revolutionized human activities and thus the consumers expect their service providers to make their online experiences more fruitful by recommending the relevant services to them. In this case, it becomes really challenging for the service providers to provide recommendation to a user whose information’s and preferences are unavailable. This issue is handled by cross-domain approach, which explores similar users across various domains in the same platform. However, the main concern with this cross-domain approach is that the information needs to be available in any domain of one platform. Thus, a multidomain recommendation is designed to optimize the recommendation system performance by analyzing the information obtained from multiple platforms. However, existing multidomain recommendation model has mainly two challenges. First, there are no overlapping users to understand the similarities between them. Second, the transfer learning approach in multidomain allows the transfer of information from only the source to the target domain. Therefore, our proposed approach consider the parallel inductive shift learning (PISL) model to address these two above-mentioned challenges. For the first challenge, we have focused to identify the similarities between user–user and user–item by considering various features of user and item. For the next challenge, our proposed model analyzes the source and the target domain simultaneously and thus does a parallel transfer of information from the source to the target domain and vice versa. We have tested our model for three real-life movie and book datasets i.e. for the movie dataset we have used Movielens, Amazon, and Netflix datasets. 
In contrast, for the book dataset, we have used the Amazon, Good Reads, and Book Crossing dataset, which proves to outperform the other state-of-the-art approaches.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"2953-2965"},"PeriodicalIF":0.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145428951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}