首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
Bilateral-Head Region-Based Convolutional Neural Networks: A Unified Approach for Incremental Few-Shot Object Detection 基于双侧头部区域的卷积神经网络:增量少拍物体检测的统一方法
Pub Date : 2024-03-26 DOI: 10.1109/TAI.2024.3381919
Yiting Li;Haiyue Zhu;Sichao Tian;Jun Ma;Cheng Xiang;Prahlad Vadakkepat
Practical object detection systems are highly desired to be open-ended for learning on frequently evolved datasets. Moreover, learning with little supervision further adds flexibility for real-world applications such as autonomous driving and robotics, where large-scale datasets could be prohibitive or expensive to obtain. However, continual adaption with small training examples often results in catastrophic forgetting and dramatic overfitting. To address such issues, a compositional learning system is proposed to enable effective incremental object detection from nonstationary and few-shot data streams. First of all, a novel bilateral–head framework is proposed to decouple the representation learning of base (pretrained) and novel (few-shot) classes into separate embedding spaces, which takes care of novel concept integration and base knowledge retention simultaneously. Moreover, to enhance learning stability, a robust parameter updating rule, i.e., recall and progress mechanism, is carried out to constrain the optimization trajectory of sequential model adaption. Beyond that, to enforce intertask class discrimination with little memory burden, we present a between-class regularization method that expands the decision space of few-shot classes for constructing unbiased feature representation. Final, we deeply investigate the incomplete annotation issue considering the realistic scenario of incremental few-shot object detection (iFSOD) and propose a semisupervised object labeling mechanism to accurately recover the missing annotations for previously encountered classes, which further enhances the robustness of the target detector to counteract catastrophic forgetting. Extensive experiments conducted on both Pascal visual object classes dataset (VOC) and microsoft common objects in context dataset (MS-COCO) datasets demonstrate the effectiveness of our method.
人们非常希望实用的物体检测系统是开放式的,以便在频繁变化的数据集上进行学习。此外,在自动驾驶和机器人等现实世界应用中,大规模数据集的获取可能过于昂贵或令人望而却步。然而,使用少量训练实例进行持续适应往往会导致灾难性遗忘和严重的过拟合。为了解决这些问题,我们提出了一种组合学习系统,以便从非稳态和少量数据流中实现有效的增量目标检测。首先,我们提出了一个新颖的双边头框架,将基础类(预训练)和新类(少量拍摄)的表征学习分离到不同的嵌入空间,从而同时兼顾新概念整合和基础知识保留。此外,为了增强学习的稳定性,还采用了一种稳健的参数更新规则,即召回和进步机制,来约束顺序模型自适应的优化轨迹。此外,为了在减轻记忆负担的情况下实现任务间的类别区分,我们提出了一种类别间正则化方法,该方法扩展了少拍类别的决策空间,以构建无偏的特征表示。最后,我们深入研究了增量少拍目标检测(iFSOD)现实场景中的不完整注释问题,并提出了一种半监督目标标注机制,以准确恢复之前遇到的类的缺失注释,从而进一步增强目标检测器的鲁棒性,抵御灾难性遗忘。在帕斯卡视觉对象类数据集(VOC)和微软上下文中的常见对象数据集(MS-COCO)上进行的大量实验证明了我们方法的有效性。
{"title":"Bilateral-Head Region-Based Convolutional Neural Networks: A Unified Approach for Incremental Few-Shot Object Detection","authors":"Yiting Li;Haiyue Zhu;Sichao Tian;Jun Ma;Cheng Xiang;Prahlad Vadakkepat","doi":"10.1109/TAI.2024.3381919","DOIUrl":"https://doi.org/10.1109/TAI.2024.3381919","url":null,"abstract":"Practical object detection systems are highly desired to be open-ended for learning on frequently evolved datasets. Moreover, learning with little supervision further adds flexibility for real-world applications such as autonomous driving and robotics, where large-scale datasets could be prohibitive or expensive to obtain. However, continual adaption with small training examples often results in catastrophic forgetting and dramatic overfitting. To address such issues, a compositional learning system is proposed to enable effective incremental object detection from nonstationary and few-shot data streams. First of all, a novel bilateral–head framework is proposed to decouple the representation learning of base (pretrained) and novel (few-shot) classes into separate embedding spaces, which takes care of novel concept integration and base knowledge retention simultaneously. Moreover, to enhance learning stability, a robust parameter updating rule, i.e., recall and progress mechanism, is carried out to constrain the optimization trajectory of sequential model adaption. Beyond that, to enforce intertask class discrimination with little memory burden, we present a between-class regularization method that expands the decision space of few-shot classes for constructing unbiased feature representation. Final, we deeply investigate the incomplete annotation issue considering the realistic scenario of incremental few-shot object detection (iFSOD) and propose a semisupervised object labeling mechanism to accurately recover the missing annotations for previously encountered classes, which further enhances the robustness of the target detector to counteract catastrophic forgetting. Extensive experiments conducted on both Pascal visual object classes dataset (VOC) and microsoft common objects in context dataset (MS-COCO) datasets demonstrate the effectiveness of our method.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Progressively Select and Reject Pseudolabeled Samples for Open-Set Domain Adaptation 逐步选择和剔除伪标记样本,实现开放集域自适应
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2024.3379940
Qian Wang;Fanlin Meng;Toby P. Breckon
Domain adaptation solves image classification problems in the target domain by taking advantage of the labeled source data and unlabeled target data. Usually, the source and target domains share the same set of classes. As a special case, open-set domain adaptation (OSDA) assumes there exist additional classes in the target domain but are not present in the source domain. To solve such a domain adaptation problem, our proposed method learns discriminative common subspaces for the source and target domains using a novel open-set locality preserving projection (OSLPP) algorithm. The source and target domain data are aligned in the learned common spaces classwise. To handle the open-set classification problem, our method progressively selects target samples to be pseudolabeled as known classes, rejects the outliers if they are detected as unknown classes, and leaves the remaining target samples as uncertain. The common subspace learning algorithm OSLPP simultaneously aligns the labeled source data and pseudolabeled target data from known classes and pushes the rejected target data away from the known classes. The common subspace learning and the pseudolabeled sample selection/rejection facilitate each other in an iterative learning framework and achieve state-of-the-art performance on four benchmark datasets Office-31, Office-Home, VisDA17, and Syn2Real-O with the average harmonic mean of open-set recognition accuracy (HOS) of 87.6%, 67.0%, 76.1%, and 65.6%, respectively.
域自适应利用已标注的源数据和未标注的目标数据,解决目标域的图像分类问题。通常,源域和目标域共享相同的类集。作为一种特例,开放集域适应(OSDA)假定目标域中存在额外的类,但源域中并不存在。为了解决这样的域适应问题,我们提出的方法使用一种新颖的开放集局部保存投影(OSLPP)算法来学习源域和目标域的判别性共同子空间。源域和目标域数据在学习到的公共空间中按类对齐。为了处理开放集分类问题,我们的方法会逐步选择目标样本作为已知类进行伪标注,如果检测到异常值为未知类,则将其剔除,剩下的目标样本则作为不确定类。公共子空间学习算法 OSLPP 同时将标记的源数据和伪标记的目标数据从已知类别中对齐,并将拒绝的目标数据推离已知类别。在迭代学习框架中,公共子空间学习和伪标签样本选择/剔除相互促进,在 Office-31、Office-Home、VisDA17 和 Syn2Real-O 四个基准数据集上取得了最先进的性能,开放集识别准确率(HOS)的平均谐波平均值分别为 87.6%、67.0%、76.1% 和 65.6%。
{"title":"Progressively Select and Reject Pseudolabeled Samples for Open-Set Domain Adaptation","authors":"Qian Wang;Fanlin Meng;Toby P. Breckon","doi":"10.1109/TAI.2024.3379940","DOIUrl":"https://doi.org/10.1109/TAI.2024.3379940","url":null,"abstract":"Domain adaptation solves image classification problems in the target domain by taking advantage of the labeled source data and unlabeled target data. Usually, the source and target domains share the same set of classes. As a special case, open-set domain adaptation (OSDA) assumes there exist additional classes in the target domain but are not present in the source domain. To solve such a domain adaptation problem, our proposed method learns discriminative common subspaces for the source and target domains using a novel open-set locality preserving projection (OSLPP) algorithm. The source and target domain data are aligned in the learned common spaces classwise. To handle the open-set classification problem, our method progressively selects target samples to be pseudolabeled as known classes, rejects the outliers if they are detected as unknown classes, and leaves the remaining target samples as uncertain. The common subspace learning algorithm OSLPP simultaneously aligns the labeled source data and pseudolabeled target data from known classes and pushes the rejected target data away from the known classes. The common subspace learning and the pseudolabeled sample selection/rejection facilitate each other in an iterative learning framework and achieve state-of-the-art performance on four benchmark datasets Office-31, Office-Home, VisDA17, and Syn2Real-O with the average harmonic mean of open-set recognition accuracy (HOS) of 87.6%, 67.0%, 76.1%, and 65.6%, respectively.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Artificial Intelligence Publication Information IEEE Transactions on Artificial Intelligence 出版信息
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2024.3372794
{"title":"IEEE Transactions on Artificial Intelligence Publication Information","authors":"","doi":"10.1109/TAI.2024.3372794","DOIUrl":"https://doi.org/10.1109/TAI.2024.3372794","url":null,"abstract":"","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10478786","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140291299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Lightweight Multidendritic Pyramidal Neuron Model With Neural Plasticity on Image Recognition 具有图像识别神经可塑性的轻量级多树突锥体神经元模型
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2024.3379968
Yu Zhang;Pengxing Cai;Yanan Sun;Zhiming Zhang;Zhenyu Lei;Shangce Gao
Simulating the method of neurons in the human brain that process signals is crucial for constructing a neural network with biological interpretability. However, existing deep neural networks simplify the function of a single neuron without considering dendritic plasticity. In this article, we present a multidendrite pyramidal neuron model (MDPN) for image classification, which mimics the multilevel dendritic structure of a nerve cell. Unlike the traditional feedforward network model, MDPN discards premature linear summation integration and employs a nonlinear dendritic computation such that improving the neuroplasticity. To model a lightweight and effective classification system, we emphasized the importance of single neuron and redefined the function of each subcomponent. Experimental results verify the effectiveness and robustness of our proposed MDPN in classifying 16 standardized image datasets with different characteristics. Compared to other state-of-the-art and well-known networks, MDPN is superior in terms of classifica-tion accuracy.
模拟人脑中神经元处理信号的方法对于构建具有生物可解释性的神经网络至关重要。然而,现有的深度神经网络简化了单个神经元的功能,没有考虑树突的可塑性。在本文中,我们提出了一种用于图像分类的多树突锥体神经元模型(MDPN),该模型模拟了神经细胞的多级树突结构。与传统的前馈网络模型不同,MDPN 摒弃了过早的线性求和整合,采用了非线性树突计算,从而提高了神经可塑性。为了建立一个轻便有效的分类系统模型,我们强调了单个神经元的重要性,并重新定义了每个子组件的功能。实验结果验证了我们提出的 MDPN 在对 16 个具有不同特征的标准化图像数据集进行分类时的有效性和鲁棒性。与其他最先进的知名网络相比,MDPN 在分类准确性方面更胜一筹。
{"title":"A Lightweight Multidendritic Pyramidal Neuron Model With Neural Plasticity on Image Recognition","authors":"Yu Zhang;Pengxing Cai;Yanan Sun;Zhiming Zhang;Zhenyu Lei;Shangce Gao","doi":"10.1109/TAI.2024.3379968","DOIUrl":"https://doi.org/10.1109/TAI.2024.3379968","url":null,"abstract":"Simulating the method of neurons in the human brain that process signals is crucial for constructing a neural network with biological interpretability. However, existing deep neural networks simplify the function of a single neuron without considering dendritic plasticity. In this article, we present a multidendrite pyramidal neuron model (MDPN) for image classification, which mimics the multilevel dendritic structure of a nerve cell. Unlike the traditional feedforward network model, MDPN discards premature linear summation integration and employs a nonlinear dendritic computation such that improving the neuroplasticity. To model a lightweight and effective classification system, we emphasized the importance of single neuron and redefined the function of each subcomponent. Experimental results verify the effectiveness and robustness of our proposed MDPN in classifying 16 standardized image datasets with different characteristics. Compared to other state-of-the-art and well-known networks, MDPN is superior in terms of classifica-tion accuracy.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Similarity-Based Positional Attention-Aided Deep Learning Model for Copy–Move Forgery Detection 基于相似性的位置注意力辅助深度学习模型,用于仿制-移动伪造检测
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2024.3379941
Ayush Roy;Sk Mohiuddin;Ram Sarkar
The process of modifying digital images has been made significantly easier by the availability of several image editing software. However, in a variety of contexts, including journalism, judicial processes, and historical documentation, the authenticity of images is of utmost importance. In particular, copy–move forgery is a distinct type of image manipulation, where a portion of an image is copied and pasted into another area of the same image, creating a fictitious or altered version of the original. In this research, we present a lightweight MultiResUnet architecture with the similarity-based positional attention module (SPAM) attention module for copy–move forgery detection (CMFD). By using a similarity measure across the patches of the features, this attention module identifies the patches, where a forged region is present. The lightweight network also aids in resource-efficient training and transforms the model into one that can be used in real time. We have employed four commonly used but extremely difficult CMFD datasets, namely CoMoFoD, COVERAGE, CASIA v2, and MICC-F600, to assess the effectiveness of our model. The proposed model significantly lowers false positives, thereby improving the pixel-level accuracy and dependability of CMFD tools.
由于有了多种图像编辑软件,修改数字图像的过程大大简化。然而,在新闻报道、司法程序和历史文献等各种情况下,图像的真实性至关重要。其中,复制移动伪造是一种独特的图像处理方式,即把图像的一部分复制并粘贴到同一图像的另一个区域,从而创建一个虚构或篡改的原始版本。在这项研究中,我们提出了一种轻量级 MultiResUnet 架构,该架构带有基于相似性的位置注意模块(SPAM)注意模块,可用于复制移动伪造检测(CMFD)。该注意模块通过使用特征斑块间的相似性度量,识别出存在伪造区域的斑块。轻量级网络还有助于进行资源节约型训练,并将模型转化为可实时使用的模型。我们采用了四个常用但难度极高的 CMFD 数据集,即 CoMoFoD、COVERAGE、CASIA v2 和 MICC-F600,来评估我们模型的有效性。所提出的模型大大降低了误报率,从而提高了像素级精度和 CMFD 工具的可靠性。
{"title":"A Similarity-Based Positional Attention-Aided Deep Learning Model for Copy–Move Forgery Detection","authors":"Ayush Roy;Sk Mohiuddin;Ram Sarkar","doi":"10.1109/TAI.2024.3379941","DOIUrl":"https://doi.org/10.1109/TAI.2024.3379941","url":null,"abstract":"The process of modifying digital images has been made significantly easier by the availability of several image editing software. However, in a variety of contexts, including journalism, judicial processes, and historical documentation, the authenticity of images is of utmost importance. In particular, copy–move forgery is a distinct type of image manipulation, where a portion of an image is copied and pasted into another area of the same image, creating a fictitious or altered version of the original. In this research, we present a lightweight MultiResUnet architecture with the similarity-based positional attention module (SPAM) attention module for copy–move forgery detection (CMFD). By using a similarity measure across the patches of the features, this attention module identifies the patches, where a forged region is present. The lightweight network also aids in resource-efficient training and transforms the model into one that can be used in real time. We have employed four commonly used but extremely difficult CMFD datasets, namely CoMoFoD, COVERAGE, CASIA v2, and MICC-F600, to assess the effectiveness of our model. The proposed model significantly lowers false positives, thereby improving the pixel-level accuracy and dependability of CMFD tools.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regional Ensemble for Improving Unsupervised Outlier Detectors 改进无监督离群点检测器的区域集合
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2024.3381102
Jiawei Yang;Sylwan Rahardja;Susanto Rahardja
Outlier ensemble is an important methodology for improving outlier detection, but faces severe challenges in unsupervised settings. Unlike traditional outlier ensembles which revised scores by considering only the values of the scores from multiple detectors, we present a novel regional ensemble (RE). RE combines the scores from multiple objects and multiple detectors and simultaneously takes into consideration both the values and the distribution of these scores. RE specifically enhances the score of a given object by using the scores of neighboring objects of the given object, under the assumption that the scores of the majority of neighboring objects are reliable. RE provides many potential applications, particularly in data mining and machine learning. Compared to existing outlier ensembles with 30 real-world datasets tested, RE attained the best performance with 14 datasets, while the current standard achieves superior performance with only eight datasets. RE can significantly improve the best existing from 0.83 to 0.86 AUC on average.
离群点集合是改进离群点检测的一种重要方法,但在无监督环境下面临着严峻的挑战。传统的离群点集合只考虑多个检测器的得分值来修订得分,与此不同,我们提出了一种新颖的区域集合(RE)。RE 结合了来自多个对象和多个检测器的分数,并同时考虑了这些分数的值和分布。RE 特别通过使用给定对象的邻近对象的分数来提高给定对象的分数,前提是大多数邻近对象的分数是可靠的。RE 提供了许多潜在的应用,尤其是在数据挖掘和机器学习方面。在 30 个真实世界数据集的测试中,与现有的离群点集合相比,RE 在 14 个数据集上取得了最佳性能,而目前的标准仅在 8 个数据集上取得了优异性能。RE 能将现有的最佳 AUC 从平均 0.83 大幅提高到 0.86。
{"title":"Regional Ensemble for Improving Unsupervised Outlier Detectors","authors":"Jiawei Yang;Sylwan Rahardja;Susanto Rahardja","doi":"10.1109/TAI.2024.3381102","DOIUrl":"https://doi.org/10.1109/TAI.2024.3381102","url":null,"abstract":"Outlier ensemble is an important methodology for improving outlier detection, but faces severe challenges in unsupervised settings. Unlike traditional outlier ensembles which revised scores by considering only the values of the scores from multiple detectors, we present a novel regional ensemble (RE). RE combines the scores from multiple objects and multiple detectors and simultaneously takes into consideration both the values and the distribution of these scores. RE specifically enhances the score of a given object by using the scores of neighboring objects of the given object, under the assumption that the scores of the majority of neighboring objects are reliable. RE provides many potential applications, particularly in data mining and machine learning. Compared to existing outlier ensembles with 30 real-world datasets tested, RE attained the best performance with 14 datasets, while the current standard achieves superior performance with only eight datasets. RE can significantly improve the best existing from 0.83 to 0.86 AUC on average.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multimodal Multiobjective Evolutionary Algorithm for Filter Feature Selection in Multilabel Classification 多标签分类中过滤器特征选择的多模态多目标进化算法
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2024.3380590
Emrah Hancer;Bing Xue;Mengjie Zhang
Multilabel learning is an emergent topic that addresses the challenge of associating multiple labels with a single instance simultaneously. Multilabel datasets often exhibit high dimensionality with noisy, irrelevant, and redundant features. In recent years, multilabel feature selection (MLFS) has gained prominence as a crucial and emerging machine learning task due to its ability to handle such data effectively. However, existing approaches for MLFS often prioritize top-ranked features based on intrinsic data criteria, disregarding relationships within the feature subset. Additionally, compared with conventional feature selection, multiobjective evolutionary algorithms (MOEAs) have not been widely explored in the context of MLFS. This study aims to address these gaps by proposing a multimodal multiobjective evolutionary algorithm (MMOEA) called MMDE_SICD which incorporates a preelimination scheme, an improved initialization scheme, an exploration scheme inspired by genetic operations and a statistically inspired crowding distance scheme. The results show that the proposed MMDE_SICD algorithm can outperform a variety of MOEAs and MMOEAs as well as conventional MLFS algorithms. Notably, this study is the first of its kind to consider MLFS as a multimodal multiobjective problem.
多标签学习是一个新兴课题,它解决了同时将多个标签与单个实例相关联的难题。多标签数据集通常具有高维度、噪声、不相关和冗余特征。近年来,多标签特征选择(MLFS)作为一项重要的新兴机器学习任务,因其能有效处理此类数据而备受瞩目。然而,现有的多标签特征选择方法通常会根据内在数据标准确定排名靠前的特征的优先级,而忽略特征子集内部的关系。此外,与传统的特征选择相比,多目标进化算法(MOEAs)在 MLFS 中的应用尚未得到广泛探索。本研究旨在通过提出一种名为 MMDE_SICD 的多模态多目标进化算法(MMOEA)来填补这些空白,该算法融合了预消除方案、改进的初始化方案、受遗传操作启发的探索方案以及受统计启发的拥挤距离方案。研究结果表明,所提出的 MMDE_SICD 算法的性能优于各种 MOEA 和 MMOEA 以及传统的 MLFS 算法。值得注意的是,本研究首次将 MLFS 视为多模式多目标问题。
{"title":"A Multimodal Multiobjective Evolutionary Algorithm for Filter Feature Selection in Multilabel Classification","authors":"Emrah Hancer;Bing Xue;Mengjie Zhang","doi":"10.1109/TAI.2024.3380590","DOIUrl":"https://doi.org/10.1109/TAI.2024.3380590","url":null,"abstract":"Multilabel learning is an emergent topic that addresses the challenge of associating multiple labels with a single instance simultaneously. Multilabel datasets often exhibit high dimensionality with noisy, irrelevant, and redundant features. In recent years, multilabel feature selection (MLFS) has gained prominence as a crucial and emerging machine learning task due to its ability to handle such data effectively. However, existing approaches for MLFS often prioritize top-ranked features based on intrinsic data criteria, disregarding relationships within the feature subset. Additionally, compared with conventional feature selection, multiobjective evolutionary algorithms (MOEAs) have not been widely explored in the context of MLFS. This study aims to address these gaps by proposing a multimodal multiobjective evolutionary algorithm (MMOEA) called MMDE_SICD which incorporates a preelimination scheme, an improved initialization scheme, an exploration scheme inspired by genetic operations and a statistically inspired crowding distance scheme. The results show that the proposed MMDE_SICD algorithm can outperform a variety of MOEAs and MMOEAs as well as conventional MLFS algorithms. Notably, this study is the first of its kind to consider MLFS as a multimodal multiobjective problem.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Guest Editorial: Special Issue on Physics-Informed Machine Learning 特邀编辑:物理信息机器学习特刊
Pub Date : 2024-03-25 DOI: 10.1109/TAI.2023.3342563
Francesco Piccialli;Maizar Raissi;Felipe A. C. Viana;Giancarlo Fortino;Huimin Lu;Amir Hussain
The special issue delves into the tantalizing prospects of machine learning for multiscale modeling, a domain where the traditional methodologies often encounter scalability issues. Here, Physics-informed machine learning (PIML) promises to bridge scales from the microscopic to the macroscopic, creating models that are not only scalable but also more accurate and less resource-intensive. Furthermore, the contributors have taken on the challenge of machine learning model interpretability. They have explored how these models can provide insights into physical systems, thus serving a dual purpose of solving complex problems while also contributing to the body of knowledge in physics. The integration of physical laws with machine learning is not just an innovation; it is a renaissance of understanding. The papers in this issue showcase the pioneering works that merge the robustness of physics with the flexibility of machine learning. Here, we provide an overview of the significant contributions made by our authors in advancing the field of PIML.
本特刊深入探讨了机器学习在多尺度建模方面的诱人前景,在这一领域,传统方法经常遇到可扩展性问题。物理信息机器学习(PIML)有望在从微观到宏观的尺度之间架起一座桥梁,从而创建出不仅具有可扩展性,而且更精确、资源消耗更少的模型。此外,撰稿人还接受了机器学习模型可解释性的挑战。他们探讨了这些模型如何为物理系统提供洞察力,从而达到解决复杂问题的双重目的,同时为物理学知识体系做出贡献。物理定律与机器学习的结合不仅是一种创新,更是一种理解的复兴。本期的论文展示了将物理学的稳健性与机器学习的灵活性相结合的开创性工作。在此,我们将概述作者们在推动 PIML 领域发展方面做出的重大贡献。
{"title":"Guest Editorial: Special Issue on Physics-Informed Machine Learning","authors":"Francesco Piccialli;Maizar Raissi;Felipe A. C. Viana;Giancarlo Fortino;Huimin Lu;Amir Hussain","doi":"10.1109/TAI.2023.3342563","DOIUrl":"https://doi.org/10.1109/TAI.2023.3342563","url":null,"abstract":"The special issue delves into the tantalizing prospects of machine learning for multiscale modeling, a domain where the traditional methodologies often encounter scalability issues. Here, Physics-informed machine learning (PIML) promises to bridge scales from the microscopic to the macroscopic, creating models that are not only scalable but also more accurate and less resource-intensive. Furthermore, the contributors have taken on the challenge of machine learning model interpretability. They have explored how these models can provide insights into physical systems, thus serving a dual purpose of solving complex problems while also contributing to the body of knowledge in physics. The integration of physical laws with machine learning is not just an innovation; it is a renaissance of understanding. The papers in this issue showcase the pioneering works that merge the robustness of physics with the flexibility of machine learning. Here, we provide an overview of the significant contributions made by our authors in advancing the field of PIML.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10478751","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140291323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Prescribed-Time Neural Control of Nonlinear Systems via Dynamic Surface Technique 通过动态曲面技术实现非线性系统的自适应规定时间神经控制
Pub Date : 2024-03-24 DOI: 10.1109/TAI.2024.3404914
Ping Wang;Chengpu Yu;Maolong Lv;Zilong Zhao
The adaptive practical prescribed-time (PPT) neural control is studied for multiinput multioutput (MIMO) nonlinear systems with unknown nonlinear functions and unknown input gain matrices. Unlike existing PPT design schemes based on backstepping, this study proposes a novel PPT control framework using the dynamic surface control (DSC) approach. First, a novel nonlinear filter (NLF) with an adaptive parameter estimator and a piecewise function is constructed to effectively compensate for filter errors and facilitate prescribed-time convergence. Based on this, a unified DSC-based adaptive PPT control algorithm, augmented with a neural networks (NNs) approximator, is developed, where NNs are used to approximate unknown nonlinear system functions. This algorithm not only addresses the inherent computational complexity explosion associated with traditional backstepping methods but also reduces the constraints on filter design parameters compared to the DSC algorithm that relies on linear filters. The simulation showcases the effectiveness and superiority of the devised scheme by employing a two-degree-of-freedom robot manipulator.
针对具有未知非线性函数和未知输入增益矩阵的多输入多输出(MIMO)非线性系统,研究了自适应实用规定时间(PPT)神经控制。与现有的基于反步法的 PPT 设计方案不同,本研究提出了一种使用动态表面控制 (DSC) 方法的新型 PPT 控制框架。首先,构建了一个带有自适应参数估计器和片断函数的新型非线性滤波器(NLF),以有效补偿滤波器误差并促进规定时间收敛。在此基础上,开发了一种基于 DSC 的统一自适应 PPT 控制算法,该算法使用神经网络(NNs)近似器进行增强,其中 NNs 用于近似未知的非线性系统函数。与依赖线性滤波器的 DSC 算法相比,该算法不仅解决了传统反步法固有的计算复杂度爆炸问题,还减少了对滤波器设计参数的限制。模拟仿真采用了一个两自由度机器人机械手,展示了所设计方案的有效性和优越性。
{"title":"Adaptive Prescribed-Time Neural Control of Nonlinear Systems via Dynamic Surface Technique","authors":"Ping Wang;Chengpu Yu;Maolong Lv;Zilong Zhao","doi":"10.1109/TAI.2024.3404914","DOIUrl":"https://doi.org/10.1109/TAI.2024.3404914","url":null,"abstract":"The adaptive practical prescribed-time (PPT) neural control is studied for multiinput multioutput (MIMO) nonlinear systems with unknown nonlinear functions and unknown input gain matrices. Unlike existing PPT design schemes based on backstepping, this study proposes a novel PPT control framework using the dynamic surface control (DSC) approach. First, a novel nonlinear filter (NLF) with an adaptive parameter estimator and a piecewise function is constructed to effectively compensate for filter errors and facilitate prescribed-time convergence. Based on this, a unified DSC-based adaptive PPT control algorithm, augmented with a neural networks (NNs) approximator, is developed, where NNs are used to approximate unknown nonlinear system functions. This algorithm not only addresses the inherent computational complexity explosion associated with traditional backstepping methods but also reduces the constraints on filter design parameters compared to the DSC algorithm that relies on linear filters. The simulation showcases the effectiveness and superiority of the devised scheme by employing a two-degree-of-freedom robot manipulator.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer-Based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey 计算机视觉中基于变压器的生成对抗网络:全面调查
Pub Date : 2024-03-24 DOI: 10.1109/TAI.2024.3404910
Shiv Ram Dubey;Satish Kumar Singh
Generative adversarial networks (GANs) have been very successful for synthesizing the images in a given dataset. The artificially generated images by GANs are very realistic. The GANs have shown potential usability in several computer vision applications, including image generation, image-to-image translation, and video synthesis. Conventionally, the generator network is the backbone of GANs, which generates the samples, and the discriminator network is used to facilitate the training of the generator network. The generator and discriminator networks are usually a convolutional neural network (CNN). The convolution-based networks exploit the local relationship in a layer, which requires the deep networks to extract the abstract features. However, recently developed transformer networks are able to exploit the global relationship with tremendous performance improvement for several problems in computer vision. Motivated from the success of transformer networks and GANs, recent works have tried to exploit the transformers in GAN framework for the image/video synthesis. This article presents a comprehensive survey on the developments and advancements in GANs utilizing the transformer networks for computer vision applications. The performance comparison for several applications on benchmark datasets is also performed and analyzed. The conducted survey will be very useful to understand the research trends and gaps related with transformer-based GANs and to develop the advanced GAN architectures by exploiting the global and local relationships for different applications.
生成式对抗网络(GAN)在合成给定数据集中的图像方面非常成功。GAN 人工生成的图像非常逼真。GANs 已在多个计算机视觉应用中显示出潜在的可用性,包括图像生成、图像到图像的翻译和视频合成。传统上,生成器网络是 GAN 的骨干网络,用于生成样本,而判别器网络则用于促进生成器网络的训练。生成器网络和鉴别器网络通常是一个卷积神经网络(CNN)。基于卷积的网络利用层中的局部关系,这就需要深度网络来提取抽象特征。然而,最近开发的变压器网络能够利用全局关系,在计算机视觉的多个问题上取得了巨大的性能提升。受变压器网络和 GAN 的成功启发,最近的研究尝试在 GAN 框架中利用变压器进行图像/视频合成。本文全面介绍了利用变压器网络进行计算机视觉应用的 GAN 的发展和进步。文章还对基准数据集上的多个应用进行了性能比较和分析。所进行的调查将非常有助于了解与基于变压器的 GAN 相关的研究趋势和差距,并通过利用不同应用的全局和局部关系来开发先进的 GAN 架构。
{"title":"Transformer-Based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey","authors":"Shiv Ram Dubey;Satish Kumar Singh","doi":"10.1109/TAI.2024.3404910","DOIUrl":"https://doi.org/10.1109/TAI.2024.3404910","url":null,"abstract":"Generative adversarial networks (GANs) have been very successful for synthesizing the images in a given dataset. The artificially generated images by GANs are very realistic. The GANs have shown potential usability in several computer vision applications, including image generation, image-to-image translation, and video synthesis. Conventionally, the generator network is the backbone of GANs, which generates the samples, and the discriminator network is used to facilitate the training of the generator network. The generator and discriminator networks are usually a convolutional neural network (CNN). The convolution-based networks exploit the local relationship in a layer, which requires the deep networks to extract the abstract features. However, recently developed transformer networks are able to exploit the global relationship with tremendous performance improvement for several problems in computer vision. Motivated from the success of transformer networks and GANs, recent works have tried to exploit the transformers in GAN framework for the image/video synthesis. This article presents a comprehensive survey on the developments and advancements in GANs utilizing the transformer networks for computer vision applications. The performance comparison for several applications on benchmark datasets is also performed and analyzed. The conducted survey will be very useful to understand the research trends and gaps related with transformer-based GANs and to develop the advanced GAN architectures by exploiting the global and local relationships for different applications.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1