多任务学习的可变多尺度注意融合网络和自适应校正梯度优化

IF 9.1 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Pub Date : 2025-06-01 Epub Date: 2025-02-01 DOI:10.1016/j.patcog.2025.111423
Naihua Ji , Yongqiang Sun , Fanyun Meng , Liping Pang , Yuzhu Tian
{"title":"多任务学习的可变多尺度注意融合网络和自适应校正梯度优化","authors":"Naihua Ji ,&nbsp;Yongqiang Sun ,&nbsp;Fanyun Meng ,&nbsp;Liping Pang ,&nbsp;Yuzhu Tian","doi":"10.1016/j.patcog.2025.111423","DOIUrl":null,"url":null,"abstract":"<div><div>Network architecture and optimization are two indispensable parts in multi-task learning, which together improve the performance of multi-task learning. Previous work has rarely focused on both aspects simultaneously. In this paper, we analyze the multi-task learning from network architecture and optimization. In network architecture aspect, we propose a variable multi-scale attention fusion network, which overcomes the issue of feature loss when processing small-scale feature maps during upsampling and resolves the problem of inadequate learning in conventional multi-scale models due to significant spatial size disparities. In optimization aspect, a adaptive correcting gradient scheme is put forward to treat the defects of conflicts and dominance among multiple tasks during the process of training, and it effectively alleviates the imbalance of multi-task training. Various ablation experiments and comparative experiments demonstrate that simultaneously considering the network framework and optimization can make great improvement for the performance of multi-task learning. Our code is available at <span><span>https://github.com/SyqxhSt/Net-Opt-MTL</span><svg><path></path></svg></span></div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"162 ","pages":"Article 111423"},"PeriodicalIF":9.1000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Variable multi-scale attention fusion network and adaptive correcting gradient optimization for multi-task learning\",\"authors\":\"Naihua Ji ,&nbsp;Yongqiang Sun ,&nbsp;Fanyun Meng ,&nbsp;Liping Pang ,&nbsp;Yuzhu Tian\",\"doi\":\"10.1016/j.patcog.2025.111423\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Network architecture and optimization are two indispensable parts in multi-task learning, which together improve the performance of multi-task learning. Previous work has rarely focused on both aspects simultaneously. In this paper, we analyze the multi-task learning from network architecture and optimization. In network architecture aspect, we propose a variable multi-scale attention fusion network, which overcomes the issue of feature loss when processing small-scale feature maps during upsampling and resolves the problem of inadequate learning in conventional multi-scale models due to significant spatial size disparities. In optimization aspect, a adaptive correcting gradient scheme is put forward to treat the defects of conflicts and dominance among multiple tasks during the process of training, and it effectively alleviates the imbalance of multi-task training. Various ablation experiments and comparative experiments demonstrate that simultaneously considering the network framework and optimization can make great improvement for the performance of multi-task learning. Our code is available at <span><span>https://github.com/SyqxhSt/Net-Opt-MTL</span><svg><path></path></svg></span></div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"162 \",\"pages\":\"Article 111423\"},\"PeriodicalIF\":9.1000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325000834\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/1 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325000834","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/1 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

网络架构和优化是多任务学习中不可缺少的两个部分,它们共同提高了多任务学习的性能。以往的工作很少同时关注这两个方面。本文从网络结构和优化两个方面分析了多任务学习。在网络架构方面,我们提出了一种可变多尺度注意力融合网络,克服了上采样过程中处理小尺度特征映射时的特征丢失问题,解决了传统多尺度模型由于空间大小差异较大而学习不足的问题。在优化方面,针对训练过程中多任务间存在的冲突和优势等缺陷,提出了一种自适应校正梯度方案,有效缓解了多任务训练的不平衡性。各种消融实验和对比实验表明,同时考虑网络框架和优化可以大大提高多任务学习的性能。我们的代码可在https://github.com/SyqxhSt/Net-Opt-MTL上获得
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Variable multi-scale attention fusion network and adaptive correcting gradient optimization for multi-task learning
Network architecture and optimization are two indispensable parts in multi-task learning, which together improve the performance of multi-task learning. Previous work has rarely focused on both aspects simultaneously. In this paper, we analyze the multi-task learning from network architecture and optimization. In network architecture aspect, we propose a variable multi-scale attention fusion network, which overcomes the issue of feature loss when processing small-scale feature maps during upsampling and resolves the problem of inadequate learning in conventional multi-scale models due to significant spatial size disparities. In optimization aspect, a adaptive correcting gradient scheme is put forward to treat the defects of conflicts and dominance among multiple tasks during the process of training, and it effectively alleviates the imbalance of multi-task training. Various ablation experiments and comparative experiments demonstrate that simultaneously considering the network framework and optimization can make great improvement for the performance of multi-task learning. Our code is available at https://github.com/SyqxhSt/Net-Opt-MTL
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
期刊最新文献
Unfolded ISTA for deep sparse subspace clustering ACSD-Net: SSM-based feature extraction with confidence-guided dynamic fusion for multimodal AxSpA abnormal pattern recognition M-MambaS: Multimodal Mamba for small lesion segmentation Spinal segment landmark localization method under arbitrary view Improving face forgery detection via hierarchical mixture of experts and fine-grained visual-text alignment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1