首页 > 最新文献

IEEE Transactions on Emerging Topics in Computational Intelligence最新文献

英文 中文
Accelerating Evolutionary Multitasking Optimization With a Generalized GPU-Based Framework 利用基于 GPU 的通用框架加速多任务进化优化
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-12 DOI: 10.1109/TETCI.2024.3381512
Zhitong Ma;Jinghui Zhong;Wei-Li Liu;Jun Zhang
Evolutionary multitasking(EMT), which conducts evolutionary research on multiple tasks simultaneously, is an emerging research topic in the computation intelligence community. It aims to enhance the convergence characteristics by simultaneously conducting evolutionary research on multiple tasks, thereby facilitating knowledge transfer among tasks and achieving exceptional performance in solution quality. However, most of the existing EMT algorithms still suffer from the high computational burden especially when the number of tasks is large. To address this issue, this paper proposes a GPU-based multitasking evolutionary framework, which is able to handle thousands of tasks that arrive asynchronous in a short time. Besides, a concurrent multi-island mechanism is proposed to enable the parallel EMT algorithm to efficiently solve high-dimensional problems. Experimental results on eight problems with differing characteristics have demonstrated that the proposed framework is effective in solving high-dimensional problems and can significantly reduce the search time.
同时对多个任务进行进化研究的多任务进化(EMT)是计算智能界的一个新兴研究课题。它旨在通过同时对多个任务进行进化研究来增强收敛特性,从而促进任务间的知识转移,并在解决方案质量方面取得优异表现。然而,大多数现有的 EMT 算法仍然存在计算负担过重的问题,尤其是当任务数量较多时。为了解决这个问题,本文提出了一种基于 GPU 的多任务进化框架,它能够在短时间内处理数千个异步到达的任务。此外,本文还提出了一种并发多岛机制,使并行 EMT 算法能够高效地解决高维问题。对八个不同特征问题的实验结果表明,所提出的框架能有效解决高维问题,并能显著缩短搜索时间。
{"title":"Accelerating Evolutionary Multitasking Optimization With a Generalized GPU-Based Framework","authors":"Zhitong Ma;Jinghui Zhong;Wei-Li Liu;Jun Zhang","doi":"10.1109/TETCI.2024.3381512","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3381512","url":null,"abstract":"Evolutionary multitasking(EMT), which conducts evolutionary research on multiple tasks simultaneously, is an emerging research topic in the computation intelligence community. It aims to enhance the convergence characteristics by simultaneously conducting evolutionary research on multiple tasks, thereby facilitating knowledge transfer among tasks and achieving exceptional performance in solution quality. However, most of the existing EMT algorithms still suffer from the high computational burden especially when the number of tasks is large. To address this issue, this paper proposes a GPU-based multitasking evolutionary framework, which is able to handle thousands of tasks that arrive asynchronous in a short time. Besides, a concurrent multi-island mechanism is proposed to enable the parallel EMT algorithm to efficiently solve high-dimensional problems. Experimental results on eight problems with differing characteristics have demonstrated that the proposed framework is effective in solving high-dimensional problems and can significantly reduce the search time.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 6","pages":"3995-4010"},"PeriodicalIF":5.3,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142691730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MA-MFCNet: Mixed Attention-Based Multi-Scale Feature Calibration Network for Image Dehazing MA-MFCNet:基于混合注意力的多尺度图像去重特征校准网络
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-11 DOI: 10.1109/TETCI.2024.3382233
Luqiao Li;Zhihua Chen;Lei Dai;Ran Li;Bin Sheng
High-quality clear images are the basis for advanced vision tasks such as target detection and semantic segmentation. This paper proposes an image dehazing algorithm named mixed attention-based multi-scale feature calibration network, aiming at solving the problem of uneven haze distribution in low-quality fuzzy images acquired in foggy environments, which is difficult to remove effectively. Our algorithm adopts a U-shaped structure to extract multi-scale features and deep semantic information. In the encoding module, a mixed attention module is designed to assign different weights to each position in the feature map, focusing on the important information and regions where haze is difficult to be removed in the image. In the decoding module, a self-calibration recovery module is designed to fully integrate different levels of features, calibrate feature information, and restore spatial texture details. Finally, the multi-scale feature information is aggregated by the reconstruction module and accurately mapped into the solution space to obtain a clear image after haze removal. Extensive experiments show that our algorithm outperforms state-of-the-art image dehazing algorithms in various synthetic datasets and real hazy scenes in terms of qualitative and quantitative comparisons, and can effectively remove haze in different scenes and recover images with high quality.
高质量的清晰图像是目标检测和语义分割等高级视觉任务的基础。本文提出了一种名为 "基于混合注意力的多尺度特征校准网络 "的图像去污算法,旨在解决雾霾环境下获取的低质量模糊图像中雾霾分布不均匀、难以有效去除的问题。我们的算法采用 U 型结构提取多尺度特征和深层语义信息。在编码模块中,设计了一个混合注意力模块,为特征图中的每个位置分配不同的权重,重点关注图像中的重要信息和难以去除雾霾的区域。在解码模块中,设计了一个自校准恢复模块,以充分整合不同层次的特征,校准特征信息,恢复空间纹理细节。最后,多尺度特征信息由重构模块汇总,并精确映射到解算空间,从而获得去除雾霾后的清晰图像。大量实验表明,在各种合成数据集和真实雾霾场景中,我们的算法在定性和定量比较方面都优于最先进的图像去雾霾算法,能有效去除不同场景中的雾霾,恢复出高质量的图像。
{"title":"MA-MFCNet: Mixed Attention-Based Multi-Scale Feature Calibration Network for Image Dehazing","authors":"Luqiao Li;Zhihua Chen;Lei Dai;Ran Li;Bin Sheng","doi":"10.1109/TETCI.2024.3382233","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3382233","url":null,"abstract":"High-quality clear images are the basis for advanced vision tasks such as target detection and semantic segmentation. This paper proposes an image dehazing algorithm named mixed attention-based multi-scale feature calibration network, aiming at solving the problem of uneven haze distribution in low-quality fuzzy images acquired in foggy environments, which is difficult to remove effectively. Our algorithm adopts a U-shaped structure to extract multi-scale features and deep semantic information. In the encoding module, a mixed attention module is designed to assign different weights to each position in the feature map, focusing on the important information and regions where haze is difficult to be removed in the image. In the decoding module, a self-calibration recovery module is designed to fully integrate different levels of features, calibrate feature information, and restore spatial texture details. Finally, the multi-scale feature information is aggregated by the reconstruction module and accurately mapped into the solution space to obtain a clear image after haze removal. Extensive experiments show that our algorithm outperforms state-of-the-art image dehazing algorithms in various synthetic datasets and real hazy scenes in terms of qualitative and quantitative comparisons, and can effectively remove haze in different scenes and recover images with high quality.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 5","pages":"3408-3421"},"PeriodicalIF":5.3,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142368262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Heavy-Ball Over Time-Varying Digraphs With Barzilai-Borwein Step Sizes 在具有 Barzilai-Borwein 步长的时变数图上进行分布式重球计算
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-10 DOI: 10.1109/TETCI.2024.3382232
Shaofu Yang;Yang Shen;Jinde Cao;Tingwen Huang
In this paper, we consider the problem of distributed optimization over time-varying directed graphs, where each agent maintains a private objective function and the goal of all agents is to cooperatively minimize the sum of their objects. By combining Heavy-Ball method with Barzilai-Borwein step sizes, a novel discrete-time accelerated distributed algorithm termed as HBBB is proposed. Compared with existing distributed algorithms over digraphs, HBBB exploits the merit of both momentum and adaptive step sizes for acceleration but requiring very few additional computational costs. It is proved that the algorithm converges to the exact optimal solution at a geometric rate as long as a scale factor for step sizes and the momentum coefficient do not exceed certain bounds. Finally, numerical experiments are performed to show the effectiveness of HBBB as well as its performance, which is comparable to and sometimes even better than existing distributed optimization algorithms over general directed graphs.
在本文中,我们考虑了时变有向图上的分布式优化问题,其中每个代理都有一个私有目标函数,所有代理的目标都是合作最小化其对象之和。通过将重球法与 Barzilai-Borwein 步长相结合,我们提出了一种新型离散时间加速分布式算法,称为 HBBB。与现有的数图分布式算法相比,HBBB 利用了动量和自适应步长的优点来加速,但只需要很少的额外计算成本。实验证明,只要步长和动量系数的比例系数不超过一定的界限,该算法就能以几何速度收敛到精确的最优解。最后,通过数值实验证明了 HBBB 的有效性及其性能,其性能可与一般有向图上的现有分布式优化算法相媲美,有时甚至优于它们。
{"title":"Distributed Heavy-Ball Over Time-Varying Digraphs With Barzilai-Borwein Step Sizes","authors":"Shaofu Yang;Yang Shen;Jinde Cao;Tingwen Huang","doi":"10.1109/TETCI.2024.3382232","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3382232","url":null,"abstract":"In this paper, we consider the problem of distributed optimization over time-varying directed graphs, where each agent maintains a private objective function and the goal of all agents is to cooperatively minimize the sum of their objects. By combining \u0000<underline>H</u>\u0000eavy-\u0000<underline>B</u>\u0000all method with \u0000<underline>B</u>\u0000arzilai-\u0000<underline>B</u>\u0000orwein step sizes, a novel discrete-time accelerated distributed algorithm termed as HBBB is proposed. Compared with existing distributed algorithms over digraphs, HBBB exploits the merit of both momentum and adaptive step sizes for acceleration but requiring very few additional computational costs. It is proved that the algorithm converges to the exact optimal solution at a geometric rate as long as a scale factor for step sizes and the momentum coefficient do not exceed certain bounds. Finally, numerical experiments are performed to show the effectiveness of HBBB as well as its performance, which is comparable to and sometimes even better than existing distributed optimization algorithms over general directed graphs.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 6","pages":"4011-4021"},"PeriodicalIF":5.3,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142691758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Citywide Crowd Flows in Critical Areas Based on Dynamic Spatio-Temporal Network 基于动态时空网络预测重要区域的全市人流量
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-10 DOI: 10.1109/TETCI.2024.3372420
Heli Sun;Ruirui Xue;Tingting Hu;Tengfei Pan;Liang He;Yuan Rao;Zhi Wang;Yingxue Wang;Yuan Chen;Hui He
Citywide crowd flow prediction is an important problem for traffic control, risk assessment, and public safety, especially in critical areas. However, the large scale of the city and the interactions between multiple regions make this problem more challenging. Furthermore, it is impacted by temporal closeness, period, and trend features. Besides, geographic information and meta-features, such as periods of a day and days of a week also affect spatio-temporal correlation. Simultaneously, the influence between different regions will change over time, which is called dynamic correlation. We concentrate on how to concurrently model the important features and dynamic spatial correlation to increase prediction accuracy and simplify the problem. To forecast the crowd flow in critical areas, we propose a two-step framework. First, the grid density peak clustering algorithm is used to set the temporal attenuation factor, which selects the critical areas. Then, the effects of geographic information on spatio-temporal correlation are modeled by graph embedding and the effects of different temporal features are represented by graph convolutional neural networks. In addition, we use the multi-attention mechanism to capture the dynamic spatio-temporal correlation. On two real datasets, experimental results show that our model can balance time complexity and prediction accuracy well. It is 20% better in accuracy than other baselines, and the prediction speed is better than most models.
全城人流预测是交通控制、风险评估和公共安全方面的一个重要问题,尤其是在关键区域。然而,城市的巨大规模和多个区域之间的相互作用使这一问题更具挑战性。此外,它还受到时间紧密性、周期和趋势特征的影响。此外,地理信息和元特征(如一天中的时段和一周中的天数)也会影响时空相关性。同时,不同地区之间的影响也会随时间发生变化,这就是所谓的动态相关性。我们主要研究如何同时对重要特征和动态空间相关性进行建模,以提高预测精度并简化问题。为了预测关键区域的人流,我们提出了一个分两步走的框架。首先,利用网格密度峰值聚类算法设置时间衰减系数,从而选择关键区域。然后,通过图嵌入来模拟地理信息对时空相关性的影响,并通过图卷积神经网络来表示不同时间特征的影响。此外,我们还利用多关注机制来捕捉动态时空相关性。在两个真实数据集上的实验结果表明,我们的模型能很好地平衡时间复杂性和预测准确性。其准确率比其他基线模型高出 20%,预测速度也优于大多数模型。
{"title":"Predicting Citywide Crowd Flows in Critical Areas Based on Dynamic Spatio-Temporal Network","authors":"Heli Sun;Ruirui Xue;Tingting Hu;Tengfei Pan;Liang He;Yuan Rao;Zhi Wang;Yingxue Wang;Yuan Chen;Hui He","doi":"10.1109/TETCI.2024.3372420","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3372420","url":null,"abstract":"Citywide crowd flow prediction is an important problem for traffic control, risk assessment, and public safety, especially in critical areas. However, the large scale of the city and the interactions between multiple regions make this problem more challenging. Furthermore, it is impacted by temporal closeness, period, and trend features. Besides, geographic information and meta-features, such as periods of a day and days of a week also affect spatio-temporal correlation. Simultaneously, the influence between different regions will change over time, which is called dynamic correlation. We concentrate on how to concurrently model the important features and dynamic spatial correlation to increase prediction accuracy and simplify the problem. To forecast the crowd flow in critical areas, we propose a two-step framework. First, the grid density peak clustering algorithm is used to set the temporal attenuation factor, which selects the critical areas. Then, the effects of geographic information on spatio-temporal correlation are modeled by graph embedding and the effects of different temporal features are represented by graph convolutional neural networks. In addition, we use the multi-attention mechanism to capture the dynamic spatio-temporal correlation. On two real datasets, experimental results show that our model can balance time complexity and prediction accuracy well. It is 20% better in accuracy than other baselines, and the prediction speed is better than most models.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 5","pages":"3703-3715"},"PeriodicalIF":5.3,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142377137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-View Graph Contrastive Learning Framework for Defending Against Adversarial Attacks 抵御对抗性攻击的多视图图对比学习框架
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-08 DOI: 10.1109/TETCI.2024.3382230
Feilong Cao;Xing Ye;Hailiang Ye
Graph neural networks are easily deceived by adversarial attacks that intentionally modify the graph structure. Particularly, homophilous edges connecting similar nodes can be maliciously deleted when adversarial edges are inserted into the graph. Graph structure learning (GSL) reconstructs an optimal graph structure and corresponding representation and has recently received considerable attention in adversarial attacks. However, constrained by a single topology view of the poisoned graph and few labels, most GSL techniques are difficult to effectively learn robust representations that sufficiently carry precise structure information and similar node information. Therefore, this paper develops a robust multi-view graph contrastive learning (RM-GCL) framework to defend against adversarial attacks. It exploits additional structural information and contrastive supervision signals from the data to guide graph structure optimization. In particular, an adaptive graph-augmented contrastive learning (AGCL) module is devised to obtain reliable representations. Besides, a node-level attention mechanism is incorporated to fuse these representations adaptively acquired from AGCL and then complete node classification tasks. Experiments on multiple datasets manifest that RM-GCL exceeds the state-of-the-art approaches and successfully defends against various attacks.
图神经网络很容易被故意修改图结构的恶意攻击所欺骗。特别是,当对抗性边缘被插入图中时,连接相似节点的同亲边缘会被恶意删除。图结构学习(GSL)可重建最佳图结构和相应的表示,最近在对抗性攻击中受到广泛关注。然而,受制于中毒图的单一拓扑视图和少量标签,大多数 GSL 技术难以有效学习到能充分承载精确结构信息和相似节点信息的鲁棒表示。因此,本文开发了一种鲁棒多视图图对比学习(RM-GCL)框架,以抵御对抗性攻击。它利用数据中的附加结构信息和对比监督信号来指导图结构优化。特别是,它设计了一个自适应图增强对比学习(AGCL)模块,以获得可靠的表征。此外,还加入了节点级关注机制,以融合 AGCL 自适应获得的这些表征,然后完成节点分类任务。在多个数据集上的实验表明,RM-GCL 超越了最先进的方法,并成功抵御了各种攻击。
{"title":"A Multi-View Graph Contrastive Learning Framework for Defending Against Adversarial Attacks","authors":"Feilong Cao;Xing Ye;Hailiang Ye","doi":"10.1109/TETCI.2024.3382230","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3382230","url":null,"abstract":"Graph neural networks are easily deceived by adversarial attacks that intentionally modify the graph structure. Particularly, homophilous edges connecting similar nodes can be maliciously deleted when adversarial edges are inserted into the graph. Graph structure learning (GSL) reconstructs an optimal graph structure and corresponding representation and has recently received considerable attention in adversarial attacks. However, constrained by a single topology view of the poisoned graph and few labels, most GSL techniques are difficult to effectively learn robust representations that sufficiently carry precise structure information and similar node information. Therefore, this paper develops a robust multi-view graph contrastive learning (RM-GCL) framework to defend against adversarial attacks. It exploits additional structural information and contrastive supervision signals from the data to guide graph structure optimization. In particular, an adaptive graph-augmented contrastive learning (AGCL) module is devised to obtain reliable representations. Besides, a node-level attention mechanism is incorporated to fuse these representations adaptively acquired from AGCL and then complete node classification tasks. Experiments on multiple datasets manifest that RM-GCL exceeds the state-of-the-art approaches and successfully defends against various attacks.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 6","pages":"4022-4032"},"PeriodicalIF":5.3,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142691736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Unlearning: Solutions and Challenges 机器学习:解决方案与挑战
IF 5.3 3区 计算机科学 Q1 Mathematics Pub Date : 2024-04-04 DOI: 10.1109/TETCI.2024.3379240
Jie Xu;Zihan Wu;Cong Wang;Xiaohua Jia
Machine learning models may inadvertently memorize sensitive, unauthorized, or malicious data, posing risks of privacy breaches, security vulnerabilities, and performance degradation. To address these issues, machine unlearning has emerged as a critical technique to selectively remove specific training data points' influence on trained models. This paper provides a comprehensive taxonomy and analysis of the solutions in machine unlearning. We categorize existing solutions into exact unlearning approaches that remove data influence thoroughly and approximate unlearning approaches that efficiently minimize data influence. By comprehensively reviewing solutions, we identify and discuss their strengths and limitations. Furthermore, we propose future directions to advance machine unlearning and establish it as an essential capability for trustworthy and adaptive machine learning models. This paper provides researchers with a roadmap of open problems, encouraging impactful contributions to address real-world needs for selective data removal.
机器学习模型可能会无意中记住敏感、未经授权或恶意的数据,从而带来隐私泄露、安全漏洞和性能下降的风险。为了解决这些问题,机器解除学习已成为一种关键技术,可选择性地消除特定训练数据点对训练模型的影响。本文对机器非学习的解决方案进行了全面的分类和分析。我们将现有解决方案分为彻底消除数据影响的精确解除学习方法和有效减少数据影响的近似解除学习方法。通过全面回顾解决方案,我们确定并讨论了它们的优势和局限性。此外,我们还提出了推进机器解除学习的未来方向,并将其确立为值得信赖的自适应机器学习模型的基本能力。本文为研究人员提供了一个开放问题路线图,鼓励他们为解决选择性数据移除的实际需求做出有影响力的贡献。
{"title":"Machine Unlearning: Solutions and Challenges","authors":"Jie Xu;Zihan Wu;Cong Wang;Xiaohua Jia","doi":"10.1109/TETCI.2024.3379240","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3379240","url":null,"abstract":"Machine learning models may inadvertently memorize sensitive, unauthorized, or malicious data, posing risks of privacy breaches, security vulnerabilities, and performance degradation. To address these issues, machine unlearning has emerged as a critical technique to selectively remove specific training data points' influence on trained models. This paper provides a comprehensive taxonomy and analysis of the solutions in machine unlearning. We categorize existing solutions into exact unlearning approaches that remove data influence thoroughly and approximate unlearning approaches that efficiently minimize data influence. By comprehensively reviewing solutions, we identify and discuss their strengths and limitations. Furthermore, we propose future directions to advance machine unlearning and establish it as an essential capability for trustworthy and adaptive machine learning models. This paper provides researchers with a roadmap of open problems, encouraging impactful contributions to address real-world needs for selective data removal.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 3","pages":"2150-2168"},"PeriodicalIF":5.3,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141096368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unifying Global-Local Representations in Salient Object Detection With Transformers 利用变换器统一突出物体检测中的全局-局部表征
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-02 DOI: 10.1109/TETCI.2024.3380442
Sucheng Ren;Nanxuan Zhao;Qiang Wen;Guoqiang Han;Shengfeng He
The fully convolutional network (FCN) has dominated salient object detection for a long period. However, the locality of CNN requires the model deep enough to have a global receptive field and such a deep model always leads to the loss of local details. In this paper, we introduce a new attention-based encoder, vision transformer, into salient object detection to ensure the globalization of the representations from shallow to deep layers. With the global view in very shallow layers, the transformer encoder preserves more local representations to recover the spatial details in final saliency maps. Besides, as each layer can capture a global view of its previous layer, adjacent layers can implicitly maximize the representation differences and minimize the redundant features, making every output feature of transformer layers contribute uniquely to the final prediction. To decode features from the transformer, we propose a simple yet effective deeply-transformed decoder. The decoder densely decodes and upsamples the transformer features, generating the final saliency map with less noise injection. Experimental results demonstrate that our method significantly outperforms other FCN-based and transformer-based methods in five benchmarks by a large margin, with an average of 12.17% improvement in terms of Mean Absolute Error (MAE).
长期以来,全卷积网络(FCN)一直主导着突出物体检测。然而,全卷积网络的局部性要求模型足够深,以拥有全局感受野,而这样的深层模型总是会导致局部细节的丢失。在本文中,我们在突出物体检测中引入了一种新的基于注意力的编码器--视觉转换器,以确保表征从浅层到深层的全局化。在极浅层的全局视图中,变换器编码器保留了更多的局部表征,以恢复最终显著性图中的空间细节。此外,由于每一层都能捕捉到上一层的全局视图,相邻层可以隐含地最大化表征差异,最小化冗余特征,从而使变换器层的每个输出特性都能为最终预测做出独特贡献。为了对变换层的特征进行解码,我们提出了一种简单而有效的深度变换解码器。解码器对变换器特征进行密集解码和高采样,在生成最终突出图时减少噪声注入。实验结果表明,在五个基准测试中,我们的方法明显优于其他基于 FCN 和变压器的方法,平均绝对误差(MAE)提高了 12.17%。
{"title":"Unifying Global-Local Representations in Salient Object Detection With Transformers","authors":"Sucheng Ren;Nanxuan Zhao;Qiang Wen;Guoqiang Han;Shengfeng He","doi":"10.1109/TETCI.2024.3380442","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3380442","url":null,"abstract":"The fully convolutional network (FCN) has dominated salient object detection for a long period. However, the locality of CNN requires the model deep enough to have a global receptive field and such a deep model always leads to the loss of local details. In this paper, we introduce a new attention-based encoder, vision transformer, into salient object detection to ensure the globalization of the representations from shallow to deep layers. With the global view in very shallow layers, the transformer encoder preserves more local representations to recover the spatial details in final saliency maps. Besides, as each layer can capture a global view of its previous layer, adjacent layers can implicitly maximize the representation differences and minimize the redundant features, making every output feature of transformer layers contribute uniquely to the final prediction. To decode features from the transformer, we propose a simple yet effective deeply-transformed decoder. The decoder densely decodes and upsamples the transformer features, generating the final saliency map with less noise injection. Experimental results demonstrate that our method significantly outperforms other FCN-based and transformer-based methods in five benchmarks by a large margin, with an average of 12.17% improvement in terms of Mean Absolute Error (MAE).","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 4","pages":"2870-2879"},"PeriodicalIF":5.3,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141965864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Relational Inference for Few-Shot Learning in 3D Left Atrial Segmentation 三维左心房分段中的分层关系推理(Hierarchical Relational Inference for Few-Shot Learning in 3D Left Atrial Segmentation
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-02 DOI: 10.1109/TETCI.2024.3377267
Xuejiao Li;Jun Chen;Heye Zhang;Yongwon Cho;Sung Ho Hwang;Zhifan Gao;Guang Yang
Three-dimensional left atrial (LA) segmentation from late gadolinium-enhanced cardiac magnetic resonance (LGE CMR) images is of great significance in the prevention and treatment of atrial fibrillation. Despite deep learning-based approaches have made significant progress in 3D LA segmentation, they usually require a large number of labeled images for training. Few-shot learning can quickly adapt to novel tasks with only a few data samples. However, the resolution discrepancy of LGE CMR images presents challenges for few-shot learning in 3D LA segmentation. To address this issue, we propose the Hierarchical Relational Inference Network (HRIN), which extracts the interactive features of support and query volumes through a bidirectional hierarchical relationship learning module. HRIN learns the commonality and discrepancy between support and query volumes by modeling the higher-order relations. Notably, we embed the bidirectional interaction information between support and query volumes into the prototypes to adaptively predict the query. Additionally, we leverage prior knowledge of foreground and background information in the support volume to model queries. We validated the performance of our method on a total of 369 scans from two centers. Our proposed HRIN achieves higher segmentation performance compared to other state-of-the-art segmentation methods. With only 5% data samples, the average Dice Similarity Coefficient of the two centers respectively reaches 0.8454 and 0.8110. Compared with other methods under the same conditions, the highest values only reach 0.7012 and 0.6898. Our approach improves the adaptability and generalization of few-shot segmentation from LGE CMR images, enabling precise evaluation of LA remodeling.
从晚期钆增强心脏磁共振(LGE CMR)图像中进行三维左心房(LA)分割对预防和治疗心房颤动具有重要意义。尽管基于深度学习的方法在三维 LA 分割方面取得了重大进展,但它们通常需要大量标记图像进行训练。少量学习只需少量数据样本就能快速适应新任务。然而,LGE CMR 图像的分辨率差异给三维 LA 分割中的少量学习带来了挑战。为解决这一问题,我们提出了层次关系推理网络(HRIN),通过双向层次关系学习模块提取支持量和查询量的交互特征。HRIN 通过对高阶关系建模来学习支持量和查询量之间的共性和差异。值得注意的是,我们将支持量和查询量之间的双向交互信息嵌入到原型中,以便自适应地预测查询。此外,我们还利用支持卷中的前景和背景信息的先验知识对查询进行建模。我们在两个中心的总共 369 次扫描中验证了我们方法的性能。与其他最先进的分割方法相比,我们提出的 HRIN 实现了更高的分割性能。在只有 5%数据样本的情况下,两个中心的平均骰子相似系数分别达到了 0.8454 和 0.8110。与相同条件下的其他方法相比,最高值仅为 0.7012 和 0.6898。我们的方法提高了从 LGE CMR 图像中进行少次分割的适应性和通用性,从而能够精确评估 LA 重塑情况。
{"title":"Hierarchical Relational Inference for Few-Shot Learning in 3D Left Atrial Segmentation","authors":"Xuejiao Li;Jun Chen;Heye Zhang;Yongwon Cho;Sung Ho Hwang;Zhifan Gao;Guang Yang","doi":"10.1109/TETCI.2024.3377267","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3377267","url":null,"abstract":"Three-dimensional left atrial (LA) segmentation from late gadolinium-enhanced cardiac magnetic resonance (LGE CMR) images is of great significance in the prevention and treatment of atrial fibrillation. Despite deep learning-based approaches have made significant progress in 3D LA segmentation, they usually require a large number of labeled images for training. Few-shot learning can quickly adapt to novel tasks with only a few data samples. However, the resolution discrepancy of LGE CMR images presents challenges for few-shot learning in 3D LA segmentation. To address this issue, we propose the Hierarchical Relational Inference Network (HRIN), which extracts the interactive features of support and query volumes through a bidirectional hierarchical relationship learning module. HRIN learns the commonality and discrepancy between support and query volumes by modeling the higher-order relations. Notably, we embed the bidirectional interaction information between support and query volumes into the prototypes to adaptively predict the query. Additionally, we leverage prior knowledge of foreground and background information in the support volume to model queries. We validated the performance of our method on a total of 369 scans from two centers. Our proposed HRIN achieves higher segmentation performance compared to other state-of-the-art segmentation methods. With only 5% data samples, the average Dice Similarity Coefficient of the two centers respectively reaches 0.8454 and 0.8110. Compared with other methods under the same conditions, the highest values only reach 0.7012 and 0.6898. Our approach improves the adaptability and generalization of few-shot segmentation from LGE CMR images, enabling precise evaluation of LA remodeling.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 5","pages":"3352-3367"},"PeriodicalIF":5.3,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142368450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancements in Deep Learning for B-Mode Ultrasound Segmentation: A Comprehensive Review 深度学习在 B 型超声波分割中的应用进展:全面回顾
IF 5.3 3区 计算机科学 Q1 Mathematics Pub Date : 2024-04-02 DOI: 10.1109/TETCI.2024.3377676
Mohammed Yusuf Ansari;Iffa Afsa Changaai Mangalote;Pramod Kumar Meher;Omar Aboumarzouk;Abdulla Al-Ansari;Osama Halabi;Sarada Prasad Dakua
Ultrasound (US) is generally preferred because it is of low-cost, safe, and non-invasive. US image segmentation is crucial in image analysis. Recently, deep learning-based methods are increasingly being used to segment US images. This survey systematically summarizes and highlights crucial aspects of the deep learning techniques developed in the last five years for US segmentation of various body regions. We investigate and analyze the most popular loss functions and metrics for training and evaluating the neural network for US segmentation. Furthermore, we study the patterns in neural network architectures proposed for the segmentation of various regions of interest. We present neural network modules and priors that address the anatomical challenges associated with different body organs in US images. We have found that variants of U-Net that have dedicated modules to overcome the low-contrast and blurry nature of images are suitable for US image segmentation. Finally, we also discuss the advantages and challenges associated with deep learning methods in the context of US image segmentation.
超声波(US)因其成本低、安全、无创伤而受到普遍青睐。US 图像分割在图像分析中至关重要。最近,基于深度学习的方法越来越多地被用于 US 图像分割。本调查系统地总结并强调了过去五年中开发的深度学习技术的关键方面,这些技术用于对不同身体区域的 US 图像进行分割。我们研究并分析了最流行的损失函数和指标,用于训练和评估用于 US 分割的神经网络。此外,我们还研究了为分割各种感兴趣区域而提出的神经网络架构的模式。我们提出了神经网络模块和先验,以应对 US 图像中与不同人体器官相关的解剖学挑战。我们发现,具有专用模块以克服图像低对比度和模糊特性的 U-Net 变体适用于 US 图像分割。最后,我们还讨论了深度学习方法在 US 图像分割方面的优势和挑战。
{"title":"Advancements in Deep Learning for B-Mode Ultrasound Segmentation: A Comprehensive Review","authors":"Mohammed Yusuf Ansari;Iffa Afsa Changaai Mangalote;Pramod Kumar Meher;Omar Aboumarzouk;Abdulla Al-Ansari;Osama Halabi;Sarada Prasad Dakua","doi":"10.1109/TETCI.2024.3377676","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3377676","url":null,"abstract":"Ultrasound (US) is generally preferred because it is of low-cost, safe, and non-invasive. US image segmentation is crucial in image analysis. Recently, deep learning-based methods are increasingly being used to segment US images. This survey systematically summarizes and highlights crucial aspects of the deep learning techniques developed in the last five years for US segmentation of various body regions. We investigate and analyze the most popular loss functions and metrics for training and evaluating the neural network for US segmentation. Furthermore, we study the patterns in neural network architectures proposed for the segmentation of various regions of interest. We present neural network modules and priors that address the anatomical challenges associated with different body organs in US images. We have found that variants of U-Net that have dedicated modules to overcome the low-contrast and blurry nature of images are suitable for US image segmentation. Finally, we also discuss the advantages and challenges associated with deep learning methods in the context of US image segmentation.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 3","pages":"2126-2149"},"PeriodicalIF":5.3,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141095529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Impact of Mental Activities and Age on Brain Network: An Analysis From Complex Network Perspective 智力活动和年龄对大脑网络的影响:复杂网络视角下的分析
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-02 DOI: 10.1109/TETCI.2024.3374957
Cemre Candemir;Vahid Khalilpour Akram;Ali Saffet Gonul
The functional connections in the human brain offer many opportunities to explore changing dynamic patterns of the brain under different circumstances. Different factors such as age, mental activity, and health status may affect functional connectivity, connected regions, and the robustness of connections in the brain. In this study, we evaluate the functional connectivity of the whole brain changing with age from a complex network perspective during different processes in healthy adults. We conducted a functional Magnetic Resonance Imaging (fMRI) study that includes both resting and cognitive states with elderly and young participants (n = 38). To analyze the functional connectivity structure in view of graph theory, we used the minimum dominating sets (MDS) and then minimum hitting sets (MHS) of the connectivity networks. Based on our analysis, age, and mental activity show a significant effect on the hitting sets and dominating sets of the brain regions. The results also indicate that the working mechanism of the brain changes from local to diffused under the circumstances of a particular computational load with age. In this manner, the proposed method can be used as a complementary method for clinical procedures to evaluate and measure the effect of aging on the human brain.
人脑中的功能连接为探索大脑在不同情况下的动态变化模式提供了很多机会。年龄、心理活动和健康状况等不同因素可能会影响大脑的功能连接、连接区域和连接的稳健性。在本研究中,我们从复杂网络的角度评估了健康成年人在不同过程中全脑功能连接性随年龄的变化。我们进行了一项功能磁共振成像(fMRI)研究,其中包括老年和年轻参与者(38 人)的静息和认知状态。为了根据图论分析功能连接结构,我们使用了连接网络的最小支配集(MDS)和最小命中集(MHS)。根据我们的分析,年龄和心理活动对脑区的命中集和支配集有显著影响。结果还表明,随着年龄的增长,大脑的工作机制在特定计算负荷的情况下会从局部变为扩散。因此,所提出的方法可作为临床程序的补充方法,用于评估和测量衰老对人脑的影响。
{"title":"The Impact of Mental Activities and Age on Brain Network: An Analysis From Complex Network Perspective","authors":"Cemre Candemir;Vahid Khalilpour Akram;Ali Saffet Gonul","doi":"10.1109/TETCI.2024.3374957","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3374957","url":null,"abstract":"The functional connections in the human brain offer many opportunities to explore changing dynamic patterns of the brain under different circumstances. Different factors such as age, mental activity, and health status may affect functional connectivity, connected regions, and the robustness of connections in the brain. In this study, we evaluate the functional connectivity of the whole brain changing with age from a complex network perspective during different processes in healthy adults. We conducted a functional Magnetic Resonance Imaging (fMRI) study that includes both resting and cognitive states with elderly and young participants (n = 38). To analyze the functional connectivity structure in view of graph theory, we used the minimum dominating sets (MDS) and then minimum hitting sets (MHS) of the connectivity networks. Based on our analysis, age, and mental activity show a significant effect on the hitting sets and dominating sets of the brain regions. The results also indicate that the working mechanism of the brain changes from local to diffused under the circumstances of a particular computational load with age. In this manner, the proposed method can be used as a complementary method for clinical procedures to evaluate and measure the effect of aging on the human brain.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 4","pages":"2791-2803"},"PeriodicalIF":5.3,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141965841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Emerging Topics in Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1