Lightweight deep neural networks have played a pivotal role in real-time semantic segmentation for autonomous driving on resource-constrained devices, where local semantics and global context must be learned effectively at multiple scales due to varying object sizes. Recent methods design shallow, lightweight backbones with a small receptive field for faster inference, along with additional mechanisms such as attention to compensate for the accuracy loss caused by the lightweight design. While some methods have exploited multiscale feature learning by attaching pyramid modules at the encoder end, it is often neglected at the fundamental block level because it increases inference time. Furthermore, attention weights are mostly generated at a single object scale using only the high-level feature representations. To solve the first problem, a key module for the basic block, the fast hybrid module, is proposed. This module learns multiscale features by combining dilated kernels and downsampling operations in a parallel three-branch structure. To solve the second problem, a novel attention module, the multiscale attention module (MSAM), is proposed. MSAM generates context weights at varying scales from the low-level features rich in object boundary and edge information and multiplies them by the high-level semantic features obtained from the encoder. With these modules, a novel encoder–decoder network named the context-guided multiscale attention network is proposed. With only 0.54 million parameters, the network achieves 73.4% and 68.1% mean IoU at 128.24 and 85.5 FPS on the Cityscapes and CamVid datasets, respectively. In addition, the network can run in real time on resource-constrained embedded GPUs. Extensive ablation studies show the effectiveness of the proposed modules and network, and qualitative results on unseen data demonstrate the robustness of the method.
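The parallel three-branch idea described above — a local branch, a dilated branch, and a downsampled context branch — could be sketched as follows. This is a hypothetical minimal sketch, not the authors' exact design: the branch widths, dilation rate, depthwise convolutions, and sum-then-fuse rule are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FastHybridModuleSketch(nn.Module):
    """Hypothetical sketch of a three-branch hybrid block: a plain 3x3
    branch (local semantics), a dilated 3x3 branch (larger receptive
    field), and a downsample-then-upsample branch (global context).
    Depthwise convolutions keep the parameter count low."""

    def __init__(self, channels: int, dilation: int = 2):
        super().__init__()
        self.local = nn.Conv2d(channels, channels, 3, padding=1, groups=channels)
        self.dilated = nn.Conv2d(channels, channels, 3, padding=dilation,
                                 dilation=dilation, groups=channels)
        self.pool = nn.AvgPool2d(2)
        self.context = nn.Conv2d(channels, channels, 3, padding=1, groups=channels)
        self.fuse = nn.Conv2d(channels, channels, 1)  # pointwise fusion

    def forward(self, x):
        h, w = x.shape[-2:]
        a = self.local(x)
        b = self.dilated(x)
        c = F.interpolate(self.context(self.pool(x)), size=(h, w),
                          mode="bilinear", align_corners=False)
        return self.fuse(a + b + c) + x  # residual connection

x = torch.randn(1, 16, 32, 32)
y = FastHybridModuleSketch(16)(x)
assert y.shape == x.shape
```

The residual connection and 1x1 fusion mirror common lightweight-backbone practice; the paper's actual block may fuse branches differently.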
"Context-Guided Multiscale Attention for Real-Time Semantic Segmentation of Road Scene" — Saquib Mazhar; Nadeem Atif; M.K. Bhuyan; Shaik Rafi Ahamed. IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1761–1775. Pub Date: 2025-09-08 | DOI: 10.1109/TAI.2025.3606904
Pub Date: 2025-09-05 | DOI: 10.1109/TAI.2025.3605894
Jiaxing Li;Jiayi Gao;Binhao Gu;Youyong Kong
Graph neural networks (GNNs) have achieved strong performance on various graph learning tasks under the assumption of independently and identically distributed (IID) data. However, recent studies reveal that GNNs suffer from performance drops under distribution shifts, prompting growing interest in out-of-distribution (OOD) generalization. In this work, we identify a previously underexplored challenge, Neighbor Shift, which refers to structural inconsistencies in node neighborhoods across environments. We analyze its characteristics and demonstrate its negative impact on node-level classification. To tackle this issue, we propose the neighbor-shift robust GNN (NSRGNN), which disentangles invariant and variant subgraphs through conflict-based structure analysis, infers latent environments using the variant components, and regularizes semantic consistency of node representations across inferred environments. Extensive experiments on both real-world and synthetic benchmarks show that NSRGNN consistently outperforms strong OOD baselines and exhibits robust generalization under diverse structural shifts.
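The abstract's "consistency across inferred environments" regularizer could resemble a variance-across-environment-risks penalty in the spirit of V-REx; this is an illustrative stand-in, and NSRGNN's actual regularizer may take a different form.

```python
import torch

def environment_consistency_penalty(per_env_risks):
    """Variance of per-environment risks: driving this toward zero makes
    the model behave consistently across inferred environments. A
    V-REx-style stand-in, not the paper's exact objective."""
    risks = torch.stack(per_env_risks)
    return risks.var(unbiased=False)

# identical risks across environments incur no penalty
penalty = environment_consistency_penalty(
    [torch.tensor(0.9), torch.tensor(1.1)]
)
```

In a full training loop this penalty would be added, with some weight, to the average task risk computed over the inferred environments.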
"Let Invariant Learning Inspire Neighbor-Shift Generalization on Graphs" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1690–1701.
Pub Date: 2025-09-04 | DOI: 10.1109/TAI.2025.3605902
Ishan Mishra;Vamsi Krishna Sethu;Deepak Mishra
Knowledge distillation (KD) is a common technique for improving the performance of a shallow student network by transferring information from a teacher network, which is, in general, comparatively large and deep. These teacher networks are pretrained and often uncalibrated, as no calibration technique is applied to the teacher model during training. Calibration measures how well a network's predicted confidence reflects the probability that its predictions are correct, which is crucial in high-risk domains where reliable predictions are essential. In this article, we study how to obtain a calibrated student from an uncalibrated teacher. Our approach relies on the fusion of data-augmentation techniques, including but not limited to Mixup and CutMix, with KD. We incorporate and analyze the impact of focal loss in the distillation framework to further improve the calibration of the student model. We perform extensive experiments to validate our approach on various datasets, including CIFAR-100, TinyImageNet, ImageNet, and diabetic retinopathy (DR) datasets, and compare it with various techniques such as contrastive representation distillation (CRD), relational knowledge distillation (RKD), decoupled knowledge distillation (DKD), and multilevel logit distillation (MLLD) to obtain calibrated student models. Furthermore, we conduct an ablation study to dissect the influence of augmentation techniques and the integration of focal loss. Additionally, we assess the robustness of our approach by evaluating its performance on corrupted CIFAR-100C data, demonstrating its consistent and reliable outcomes even under challenging conditions.
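One plausible way to combine focal loss with a temperature-scaled distillation term, as the abstract describes, is sketched below. The weighting `alpha`, temperature `T`, and exact fusion with Mixup/CutMix targets are assumptions, not the authors' final objective; augmented inputs would simply be fed to both networks before this loss is computed.

```python
import torch
import torch.nn.functional as F

def focal_distillation_loss(student_logits, teacher_logits, target,
                            gamma=2.0, alpha=0.5, T=4.0):
    """Sketch: focal cross-entropy (down-weights easy examples, which
    helps calibration) plus a KL distillation term at temperature T."""
    # focal term: (1 - p_t)^gamma * CE, averaged over the batch
    log_p = F.log_softmax(student_logits, dim=1)
    ce = F.nll_loss(log_p, target, reduction="none")
    p_t = log_p.gather(1, target.unsqueeze(1)).squeeze(1).exp()
    focal = ((1.0 - p_t) ** gamma * ce).mean()
    # distillation term: KL between temperature-softened distributions
    kd = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                  F.softmax(teacher_logits / T, dim=1),
                  reduction="batchmean") * (T * T)
    return alpha * focal + (1.0 - alpha) * kd

loss = focal_distillation_loss(torch.randn(4, 10),
                               torch.randn(4, 10),
                               torch.randint(0, 10, (4,)))
```

The T² factor is the standard gradient-scale correction for temperature-softened distillation.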
"Beyond Accurate Distillation: Calibrated Knowledge Distillation for Reliable Predictions" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1702–1714.
Pub Date: 2025-09-04 | DOI: 10.1109/TAI.2025.3605890
Sayantan Saha;Atif Hassan;Jiaul H. Paik
The primary objective of positive unlabeled learning (PU learning) is to train a binary classifier with positively labeled data and unlabeled data. An inherent aspect of this approach involves incorporating the positive class prior of unlabeled data directly into the classification process, which is typically challenging in real-world scenarios. Moreover, existing studies often lack evaluations of PU classifiers without involving the positive class prior (true or estimated) of the unlabeled data, representing a significant research gap. In this article, we introduce a robust, two-step PU learning algorithm by incorporating a potential negative sampler in step 1 (warm start) and minimizing a self-correcting regularized risk function in step 2. The risk function possesses a self-correcting property that attempts to mitigate the weakness of the potential negative sampler in the warm start step. The risk function enables us to enhance robustness in the presence of mislabeled candidate negative samples. We demonstrate the effectiveness of our method on image as well as text benchmarks. Results show that the proposed method consistently outperforms the state-of-the-art (SOTA) PU learning algorithms.
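The two-step scheme above could be sketched as follows. Both the warm-start scorer and the self-correcting weighting rule here are placeholders: a minimal illustration of sampling candidate negatives and then down-weighting the ones the current model believes are actually positive, not the paper's exact sampler or risk function.

```python
import numpy as np

def warm_start_negatives(scores_unlabeled, keep_frac=0.3):
    """Step 1 (warm start): keep the unlabeled points a rough scorer
    ranks least positive-like as candidate negatives."""
    k = int(len(scores_unlabeled) * keep_frac)
    return np.argsort(scores_unlabeled)[:k]

def self_correcting_weights(current_scores, candidate_idx):
    """Step 2: down-weight candidate negatives the current model is
    confident are actually positive, mitigating warm-start mistakes."""
    w = 1.0 - current_scores[candidate_idx]  # high positive score -> low weight
    return w / w.sum()

scores = np.array([0.9, 0.1, 0.5, 0.2, 0.8])  # toy positive-class scores
idx = warm_start_negatives(scores, keep_frac=0.4)
weights = self_correcting_weights(scores, idx)
```

In training, `weights` would rescale the per-sample losses of the candidate negatives inside the regularized risk, and the scores would be refreshed as the classifier improves.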
"A Positive-Unlabeled Learning Approach With Self-Correcting Regularized Risk" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1677–1689.
Pub Date: 2025-09-04 | DOI: 10.1109/TAI.2025.3605569
Haseeb Ullah Khan Shinwari;Muhammad Usama
Conventional low-rank adaptation (LoRA) methods employ a fixed rank, imposing uniform adaptation across transformer layers and attention heads despite their heterogeneous learning dynamics. This article introduces adaptive rank dynamic LoRA (ARD-LoRA), a novel framework that automates rank allocation through learnable scaling factors. These factors are optimized via a meta-objective balancing task performance and parameter efficiency, incorporating $\ell_{1}$ sparsity for minimal rank and total variation regularization for stable rank transitions. ARD-LoRA enables continuous, differentiable, per-head rank adaptation. Experiments on LLAMA-3.1-70B and PaliGemma-2 demonstrate ARD-LoRA’s efficacy, achieving up to 99.3% of full fine-tuning performance with only 0.32% trainable parameters, outperforming strong baselines such as DoRA and AdaLoRA. Furthermore, it reduces multimodal adaptation memory by 41%. These results establish dynamic, fine-grained rank allocation as a critical paradigm for efficient foundation model adaptation.
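The two regularizers the abstract names — an ℓ1 term for minimal rank and a total-variation term for stable rank transitions — could look like the sketch below, applied to a tensor of per-layer, per-head scaling factors. The tensor shape, the direction of the TV difference (across layers), and the λ values are assumptions; the paper's meta-objective may weight or shape these terms differently.

```python
import torch

def rank_regularizer(alpha, lam_l1=1e-3, lam_tv=1e-3):
    """alpha: (num_layers, num_heads) learnable rank-scaling factors.
    l1 pushes factors toward zero (minimal effective rank); the
    total-variation term penalizes abrupt changes between adjacent
    layers (stable rank transitions)."""
    l1 = alpha.abs().sum()
    tv = (alpha[1:] - alpha[:-1]).abs().sum()  # differences across layers
    return lam_l1 * l1 + lam_tv * tv

alpha = torch.ones(4, 8, requires_grad=True)  # 4 layers, 8 heads
reg = rank_regularizer(alpha)
```

During fine-tuning this term would be added to the task loss, and `alpha` would scale each head's low-rank update so that gradient descent can effectively shrink or grow per-head rank.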
"ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models With Heterogeneous Adaptation Needs" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1666–1676.
Pub Date: 2025-09-02 | DOI: 10.1109/TAI.2025.3605117
Agnibh Dasgupta;Abdullah All Tanvir;Xin Zhong
Watermarking the outputs of large language models (LLMs) is critical for provenance tracing, content regulation, and model accountability. Existing approaches often rely on access to model internals or are constrained by static rules and token-level perturbations. Moreover, the idea of steering generative behavior via prompt-based instruction control remains largely underexplored. We introduce a prompt-guided watermarking framework that operates entirely at the input level and requires no access to model parameters or decoding logits. The framework comprises three cooperating components: a prompting LM that synthesizes watermarking instructions from user prompts, a marking LM that generates watermarked outputs conditioned on these instructions, and a detecting LM trained to classify whether a response carries an embedded watermark. This modular design enables dynamic watermarking that adapts to individual prompts while remaining compatible with diverse LLM architectures, including both proprietary and open-weight models. We evaluate the framework over 25 combinations of prompting and marking LMs, such as GPT-4o, Mistral, LLaMA3, and DeepSeek. Experimental results show that watermark signals generalize across architectures and remain robust under fine-tuning, model distillation, and prompt-based adversarial attacks, demonstrating the effectiveness and robustness of the proposed approach.
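The three-component, input-level pipeline described above can be sketched as plain function composition; the three callables are placeholders for real LM endpoints, and the prompt template is invented for illustration.

```python
def watermark_pipeline(user_prompt, prompting_lm, marking_lm, detecting_lm):
    """Sketch of the abstract's pipeline: a prompting LM writes a
    watermark instruction, a marking LM generates text conditioned on
    prompt + instruction, and a detecting LM classifies the result.
    No model internals or decoding logits are touched."""
    instruction = prompting_lm(user_prompt)
    response = marking_lm(
        f"{user_prompt}\n\n[Watermark instruction]: {instruction}"
    )
    is_watermarked = detecting_lm(response)
    return response, is_watermarked

# toy stand-ins for the three LMs
resp, flag = watermark_pipeline(
    "Explain photosynthesis.",
    prompting_lm=lambda p: "prefer sentences with an even word count",
    marking_lm=lambda p: "toy watermarked response",
    detecting_lm=lambda r: True,
)
```

Because only the input is modified, the prompting and marking roles can be filled by any mix of proprietary and open-weight models, which is what makes the 25 prompting/marking combinations in the evaluation possible.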
"Watermarking Language Models Through Language Models" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1642–1651.
Pub Date: 2025-09-02 | DOI: 10.1109/TAI.2025.3605307
Hai Anh Tran;Cuong Ta;Truong X. Tran
Federated learning (FL) is a collaborative paradigm that enables multiple devices to train a global model while preserving local data privacy. A major challenge in FL is the nonindependent and identically distributed (non-IID) nature of data across devices, which hinders training efficiency and slows convergence. To tackle this, we propose federated impurity weighting (FedImp), a novel algorithm that quantifies each device’s contribution based on the informational content of its local data. These contributions are normalized to compute distinct aggregation weights for the global model update. Extensive experiments on EMNIST and CIFAR-10 datasets show that FedImp significantly improves convergence speed, reducing communication rounds by up to 64.4%, 27.8%, and 66.7% on EMNIST, and 44.2%, 44%, and 25.6% on CIFAR-10 compared to FedAvg, FedProx, and FedAdp, respectively. Under highly imbalanced data distributions, FedImp outperforms all baselines and achieves the highest accuracy. Overall, FedImp offers an effective solution to enhance FL efficiency in non-IID settings.
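One way to turn "informational content of local data" into normalized aggregation weights, as the abstract describes, is via the entropy of each device's label distribution. The abstract does not give FedImp's exact formula, so the Shannon-entropy choice below is an assumption.

```python
import numpy as np

def impurity_weights(label_counts_per_device):
    """Sketch: score each device by the Shannon entropy of its local
    label distribution (balanced data -> high impurity -> more
    information), then normalize the scores into aggregation weights."""
    scores = []
    for counts in label_counts_per_device:
        p = np.asarray(counts, dtype=float)
        p = p / p.sum()
        p = p[p > 0]                      # avoid log(0)
        scores.append(-(p * np.log(p)).sum())
    w = np.asarray(scores)
    return w / w.sum()

# device 0 has balanced labels, device 1 is highly skewed
w = impurity_weights([[5, 5], [9, 1]])
```

In a FedAvg-style update, each device's model delta would then be multiplied by its weight before summation, instead of weighting purely by sample count.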
"FedImp: Enhancing Federated Learning Convergence With Impurity-Based Weighting" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1652–1665.
Pub Date: 2025-08-29 | DOI: 10.1109/TAI.2025.3603547
Xuan Xie;Jiayang Song;Zhehua Zhou;Yuheng Huang;Da Song;Lei Ma
While large language models (LLMs) have seen widespread applications across numerous fields, their limited interpretability poses concerns regarding their safe operations from multiple aspects, e.g., truthfulness and toxicity. Recent research has started developing quality assurance methods for LLMs, introducing techniques such as offline detectors or uncertainty estimation methods. However, these approaches mainly focus on postgeneration analysis, leaving online safety analysis for LLMs during the generation phase unexplored. To bridge this gap, in this work we conduct a comprehensive evaluation of the effectiveness of existing online safety analysis methods on LLMs. We begin with a pilot study that validates the feasibility of detecting unsafe outputs in the early generation process. Following this, we establish the first publicly available benchmark of online safety analysis for LLMs, including a broad spectrum of methods, models, tasks, datasets, and evaluation metrics. Utilizing this benchmark, we extensively analyze the performance of state-of-the-art online safety analysis methods on both open-source and closed-source LLMs. This analysis reveals the strengths and weaknesses of individual methods and offers valuable insights into selecting the most appropriate method based on specific application scenarios and task requirements. Furthermore, we also explore the potential of using hybridization methods, i.e., combining multiple methods to derive a collective safety conclusion, to enhance the efficacy of online safety analysis. Our findings indicate a promising direction for the development of trustworthy assurance methodologies for LLMs, facilitating their reliable deployments across diverse domains.
"Online Safety Analysis for LLMs: A Benchmark, an Assessment, and a Path Forward" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1626–1641.
Pub Date: 2025-08-29 | DOI: 10.1109/TAI.2025.3603538
Shafique Ahmed;Ryandhimas E. Zezario;Hui-Guan Yuan;Amir Hussain;Hsin-Min Wang;Wei-Ho Chung;Yu Tsao
The prevalence of hearing aids is increasing. However, optimizing their amplification remains challenging due to the complexity of integrating multiple components in traditional methods. To address this, we present NeuroAMP, a novel deep neural network for end-to-end, personalized amplification in hearing aids. NeuroAMP leverages spectral features and the listener’s audiogram as inputs, and we explore four architectures: convolutional neural network (CNN), long short-term memory (LSTM), convolutional recurrent neural network (CRNN), and Transformer. We also introduce Speech Enhancement NeuroAMP (SE-NeuroAMP), an extension that integrates noise reduction with amplification for improved real-world performance. To enhance generalization, we employed a comprehensive data augmentation strategy during training on diverse speech (TIMIT, TMHINT) and music (Cadenza Challenge MUSIC) datasets. Evaluation using the Hearing Aid Speech Perception Index (HASPI), Hearing Aid Speech Quality Index (HASQI), and Hearing Aid Audio Quality Index (HAAQI) shows that the Transformer-based NeuroAMP achieves the best performance, with SRCC scores of 0.992 (HASPI) and 0.990 (HASQI) on TIMIT, and 0.9738 (HAAQI) on the Cadenza dataset. Notably, the augmentation strategy maintains robust performance on unseen datasets (e.g., VoiceBank-DEMAND and MUSDB18-HQ). Furthermore, SE-NeuroAMP outperforms both the conventional NAL-R+WDRC method and a two-stage baseline on the VoiceBank-DEMAND dataset, achieving HASPI of 0.90 and HASQI of 0.59. These results highlight the strong potential of NeuroAMP and SE-NeuroAMP to provide a novel and effective framework for personalized hearing aid amplification.
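The input scheme the abstract describes — spectral features conditioned on the listener's audiogram — could be wired up as below for the LSTM variant. Layer sizes, the frame-wise concatenation, and the per-band gain head are assumptions for illustration; the paper evaluates CNN, LSTM, CRNN, and Transformer variants.

```python
import torch
import torch.nn as nn

class AudiogramConditionedAmp(nn.Module):
    """Minimal sketch: each spectral frame is concatenated with the
    listener's audiogram and an LSTM predicts per-band amplification
    values, making the amplifier personalized end to end."""

    def __init__(self, n_bands=64, audiogram_dim=8, hidden=128):
        super().__init__()
        self.lstm = nn.LSTM(n_bands + audiogram_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_bands)

    def forward(self, spec, audiogram):
        # spec: (batch, time, n_bands); audiogram: (batch, audiogram_dim)
        a = audiogram.unsqueeze(1).expand(-1, spec.size(1), -1)
        h, _ = self.lstm(torch.cat([spec, a], dim=-1))
        return self.head(h)  # per-frame, per-band output

model = AudiogramConditionedAmp()
out = model(torch.randn(2, 100, 64), torch.randn(2, 8))
```

An SE-NeuroAMP-style extension would insert a noise-reduction stage before (or jointly with) this amplification network.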
"NeuroAMP: A Novel End-to-End General Purpose Deep Neural Amplifier for Personalized Hearing Aids" — IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 1610–1625. Open access PDF: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11145141