自动驾驶图像识别系统的自适应注意力模块

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE International Journal of Intelligent Systems Pub Date : 2024-07-12 DOI:10.1155/2024/3934270

Ma Xianghua, Hu Kaitao, Sun Xiangyu, Shining Chen

{"title":"自动驾驶图像识别系统的自适应注意力模块","authors":"Ma Xianghua, Hu Kaitao, Sun Xiangyu, Shining Chen","doi":"10.1155/2024/3934270","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Lightweight, high-performance networks are important in vision perception systems. Recent research on convolutional neural networks has shown that attention mechanisms can significantly improve the network performance. However, existing approaches either ignore the significance of using both types of attention mechanisms (channel and space) simultaneously or increase the model complexity. In this study, we propose the adaptive attention module (AAM), which is a truly lightweight yet effective module that comprises channel and spatial submodules to balance model performance and complexity. The AAM initially utilizes the channel submodule to generate intermediate channel-refined features. In this module, an adaptive mechanism enables the model to autonomously learn the weights between features extracted by global max pooling and global average pooling to adapt to different stages of the model, thus enhancing performance. The spatial submodule employs a group-interact-aggregate strategy to enhance the expression of important features. It groups the intermediate channel-refined features along the channel dimension into multiple subfeatures for parallel processing and generates spatial attention feature descriptors and channelwise refined subfeatures for each subfeature; subsequently, it aggregates all the refined subfeatures and employs a “channel shuffle” operator to transfer information between different subfeatures, thereby generating the final refined features and adaptively emphasizing important regions. Additionally, AAM is a plug-and-play architectural unit that can be directly used to replace standard convolutions in various convolutional neural networks. Extensive tests on CIFAR-100, ImageNet-1k, BDD100K, and MS COCO demonstrate that AAM improves the baseline network performance under various models and tasks, thereby validating its versatility.</p>\n </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/3934270","citationCount":"0","resultStr":"{\"title\":\"Adaptive Attention Module for Image Recognition Systems in Autonomous Driving\",\"authors\":\"Ma Xianghua, Hu Kaitao, Sun Xiangyu, Shining Chen\",\"doi\":\"10.1155/2024/3934270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n <p>Lightweight, high-performance networks are important in vision perception systems. Recent research on convolutional neural networks has shown that attention mechanisms can significantly improve the network performance. However, existing approaches either ignore the significance of using both types of attention mechanisms (channel and space) simultaneously or increase the model complexity. In this study, we propose the adaptive attention module (AAM), which is a truly lightweight yet effective module that comprises channel and spatial submodules to balance model performance and complexity. The AAM initially utilizes the channel submodule to generate intermediate channel-refined features. In this module, an adaptive mechanism enables the model to autonomously learn the weights between features extracted by global max pooling and global average pooling to adapt to different stages of the model, thus enhancing performance. The spatial submodule employs a group-interact-aggregate strategy to enhance the expression of important features. It groups the intermediate channel-refined features along the channel dimension into multiple subfeatures for parallel processing and generates spatial attention feature descriptors and channelwise refined subfeatures for each subfeature; subsequently, it aggregates all the refined subfeatures and employs a “channel shuffle” operator to transfer information between different subfeatures, thereby generating the final refined features and adaptively emphasizing important regions. Additionally, AAM is a plug-and-play architectural unit that can be directly used to replace standard convolutions in various convolutional neural networks. Extensive tests on CIFAR-100, ImageNet-1k, BDD100K, and MS COCO demonstrate that AAM improves the baseline network performance under various models and tasks, thereby validating its versatility.</p>\\n </div>\",\"PeriodicalId\":14089,\"journal\":{\"name\":\"International Journal of Intelligent Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2024-07-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/3934270\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Intelligent Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1155/2024/3934270\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/2024/3934270","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

轻量级、高性能的网络在视觉感知系统中非常重要。最近对卷积神经网络的研究表明，注意力机制能显著提高网络性能。然而，现有的方法要么忽略了同时使用两种注意机制（通道和空间）的重要性，要么增加了模型的复杂性。在本研究中，我们提出了自适应注意力模块（AAM），这是一个真正轻量级但有效的模块，由通道和空间子模块组成，以平衡模型性能和复杂性。AAM 最初利用信道子模块生成中间信道提炼特征。在该模块中，自适应机制使模型能够自主学习全局最大池化和全局平均池化提取的特征之间的权重，以适应模型的不同阶段，从而提高性能。空间子模块采用分组-交互-聚合策略来增强重要特征的表达。它将沿通道维度的中间通道细化特征分组为多个子特征进行并行处理，并为每个子特征生成空间注意力特征描述符和通道细化子特征；随后，它汇总所有细化子特征，并采用 "通道洗牌 "算子在不同子特征之间传递信息，从而生成最终细化特征，并自适应地强调重要区域。此外，AAM 还是一个即插即用的架构单元，可直接用于替代各种卷积神经网络中的标准卷积。在 CIFAR-100、ImageNet-1k、BDD100K 和 MS COCO 上进行的广泛测试表明，在各种模型和任务下，AAM 都能提高基线网络的性能，从而验证了它的多功能性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Adaptive Attention Module for Image Recognition Systems in Autonomous Driving

Lightweight, high-performance networks are important in vision perception systems. Recent research on convolutional neural networks has shown that attention mechanisms can significantly improve the network performance. However, existing approaches either ignore the significance of using both types of attention mechanisms (channel and space) simultaneously or increase the model complexity. In this study, we propose the adaptive attention module (AAM), which is a truly lightweight yet effective module that comprises channel and spatial submodules to balance model performance and complexity. The AAM initially utilizes the channel submodule to generate intermediate channel-refined features. In this module, an adaptive mechanism enables the model to autonomously learn the weights between features extracted by global max pooling and global average pooling to adapt to different stages of the model, thus enhancing performance. The spatial submodule employs a group-interact-aggregate strategy to enhance the expression of important features. It groups the intermediate channel-refined features along the channel dimension into multiple subfeatures for parallel processing and generates spatial attention feature descriptors and channelwise refined subfeatures for each subfeature; subsequently, it aggregates all the refined subfeatures and employs a “channel shuffle” operator to transfer information between different subfeatures, thereby generating the final refined features and adaptively emphasizing important regions. Additionally, AAM is a plug-and-play architectural unit that can be directly used to replace standard convolutions in various convolutional neural networks. Extensive tests on CIFAR-100, ImageNet-1k, BDD100K, and MS COCO demonstrate that AAM improves the baseline network performance under various models and tasks, thereby validating its versatility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Intelligent Systems 工程技术-计算机：人工智能

CiteScore

11.30

自引率

14.30%

发文量

304

审稿时长

9 months

期刊介绍： The International Journal of Intelligent Systems serves as a forum for individuals interested in tapping into the vast theories based on intelligent systems construction. With its peer-reviewed format, the journal explores several fascinating editorials written by today''s experts in the field. Because new developments are being introduced each day, there''s much to be learned — examination, analysis creation, information retrieval, man–computer interactions, and more. The International Journal of Intelligent Systems uses charts and illustrations to demonstrate these ground-breaking issues, and encourages readers to share their thoughts and experiences.