Revisiting non-learned operators based deep learning for image classification: a lightweight directional-aware network

IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Artificial Intelligence Review Pub Date : 2024-12-20 DOI:10.1007/s10462-024-11038-0
Yuwei Guo, Wenhao Zhang, Yupeng Gao, Licheng Jiao, Shuo Wang, Jiabo Du, Fang Liu
{"title":"Revisiting non-learned operators based deep learning for image classification: a lightweight directional-aware network","authors":"Yuwei Guo,&nbsp;Wenhao Zhang,&nbsp;Yupeng Gao,&nbsp;Licheng Jiao,&nbsp;Shuo Wang,&nbsp;Jiabo Du,&nbsp;Fang Liu","doi":"10.1007/s10462-024-11038-0","DOIUrl":null,"url":null,"abstract":"<div><p>Due to the stable feature representation capability provided by non-learned operators, the integration with deep learning models, i.e., non-learned operator based deep learning models, has become a paradigm, however, performance-wise, it is still not promising. In this paper, by revisiting non-learned operator based deep learning models, we reveal the reasons for their underperformance: lack of geometric invariance, insufficient sparsity, and neglect of directional importance. In response, we present a Lightweight Directional-Aware Network (LDAN) for image classification. Specifically, to generate sparse geometric-invariant features, we propose a ShearletNet to capture multi-directional features in three different levels. Then, a Directional-Aware module is designed to highlight the discriminative multi-directional features and generate multi-scale features. Finally, a Pointwise Convolution module is used to integrate the multi-directional features with the multi-scale ones for reducing the computational resources. Experiments on the commonly used CIFAR10, CIFAR100, Self-Taught Learning 10 (STL10), and Tiny ImageNet datasets demonstrate the efficiency and effectiveness of the proposed LDAN. Compared to the existing non-learned operator based models, LDAN reduces the parameter count by 80.83% while achieving a 6.32% increase in accuracy.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 2","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11038-0.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-024-11038-0","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Due to the stable feature representation capability provided by non-learned operators, the integration with deep learning models, i.e., non-learned operator based deep learning models, has become a paradigm, however, performance-wise, it is still not promising. In this paper, by revisiting non-learned operator based deep learning models, we reveal the reasons for their underperformance: lack of geometric invariance, insufficient sparsity, and neglect of directional importance. In response, we present a Lightweight Directional-Aware Network (LDAN) for image classification. Specifically, to generate sparse geometric-invariant features, we propose a ShearletNet to capture multi-directional features in three different levels. Then, a Directional-Aware module is designed to highlight the discriminative multi-directional features and generate multi-scale features. Finally, a Pointwise Convolution module is used to integrate the multi-directional features with the multi-scale ones for reducing the computational resources. Experiments on the commonly used CIFAR10, CIFAR100, Self-Taught Learning 10 (STL10), and Tiny ImageNet datasets demonstrate the efficiency and effectiveness of the proposed LDAN. Compared to the existing non-learned operator based models, LDAN reduces the parameter count by 80.83% while achieving a 6.32% increase in accuracy.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
重新审视基于非学习算子的图像分类深度学习:轻量级方向感知网络
由于非学习算子提供了稳定的特征表示能力,因此与深度学习模型的整合,即基于非学习算子的深度学习模型,已成为一种范式,但从性能上看,其前景仍不容乐观。本文通过重新审视基于非学习算子的深度学习模型,揭示了其性能不佳的原因:缺乏几何不变性、稀疏性不足以及忽视方向重要性。为此,我们提出了一种用于图像分类的轻量级方向感知网络(LDAN)。具体来说,为了生成稀疏的几何不变特征,我们提出了一个 ShearletNet 来捕捉三个不同层次的多方向特征。然后,我们设计了一个方向感知模块,以突出具有区分性的多方向特征,并生成多尺度特征。最后,使用点式卷积模块将多方向特征与多尺度特征整合在一起,以减少计算资源。在常用的 CIFAR10、CIFAR100、Self-Taught Learning 10 (STL10) 和 Tiny ImageNet 数据集上的实验证明了所提出的 LDAN 的效率和有效性。与现有的基于算子的非学习模型相比,LDAN 减少了 80.83% 的参数数量,同时准确率提高了 6.32%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Artificial Intelligence Review
Artificial Intelligence Review 工程技术-计算机:人工智能
CiteScore
22.00
自引率
3.30%
发文量
194
审稿时长
5.3 months
期刊介绍: Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.
期刊最新文献
Reinforcement learning for road pricing: a review and future directions On the fairness, diversity and reliability of text-to-image generative models Scaling transformers for time series forecasting: do pretrained large models outperform small-scale alternatives? Evaluating large language models effectiveness for flow-based intrusion detection: a comparative study with ML and DL baselines Synergizing blockchain and AI to fortify IoT security: a comprehensive review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1