Distilled fine-grained domain adversarial network prompted by normalization and regularization

Information Sciences | IF 6.8 | CAS Tier 1, Computer Science | JCR Category: COMPUTER SCIENCE, INFORMATION SYSTEMS | Pub Date: 2025-06-01 | Epub Date: 2025-02-12 | DOI: 10.1016/j.ins.2025.121970
Zhiqun Pan, Yongxiong Wang, Jiapeng Zhang, Yihan Shan, Zhe Wang, Jin Peng
{"title":"Distilled fine-grained domain adversarial network prompted by normalization and regularization","authors":"Zhiqun Pan ,&nbsp;Yongxiong Wang ,&nbsp;Jiapeng Zhang ,&nbsp;Yihan Shan ,&nbsp;Zhe Wang ,&nbsp;Jin Peng","doi":"10.1016/j.ins.2025.121970","DOIUrl":null,"url":null,"abstract":"<div><div>Fine-grained domain adaptation presents a significant challenge in machine learning, where subtle differences between classes and domains often lead to poor generalization. Addressing this issue is crucial for improving the accuracy and robustness of models when applied to unseen data from different but related domains. We propose a distilled fine-grained domain adversarial network in which feature distributions across diverse domains are effectively aligned, and classification discrimination is enhanced during the mini-batch training phase. Based on knowledge distillation, the capability for fine-grained feature extraction is transferred from the teacher network to the student feature extractor, all while preserving the original data and prediction distributions. Consequently, low-entropy prediction distributions can be effectively leveraged by the domain adversarial network for unsupervised training in the target domain, leading to notable improvements in fine-grained recognition performance. In order to train the domain adversarial network more effectively, a linear transformation bottleneck layer for feature normalization is introduced after the feature extractor. This layer helps alleviate the normalization conflict between training and inference, which arises due to domain shift in mini-batches of data. Moreover, to strike a balance between the competing effects of the classifier and discriminator, we devise a double bilinear dropout module in the domain discriminator.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"704 ","pages":"Article 121970"},"PeriodicalIF":6.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525001021","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/12 0:00:00","PubModel":"Epub","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Citations: 0

Abstract

Fine-grained domain adaptation presents a significant challenge in machine learning, where subtle differences between classes and domains often lead to poor generalization. Addressing this issue is crucial for improving the accuracy and robustness of models when applied to unseen data from different but related domains. We propose a distilled fine-grained domain adversarial network in which feature distributions across diverse domains are effectively aligned, and classification discrimination is enhanced during the mini-batch training phase. Based on knowledge distillation, the capability for fine-grained feature extraction is transferred from the teacher network to the student feature extractor, all while preserving the original data and prediction distributions. Consequently, low-entropy prediction distributions can be effectively leveraged by the domain adversarial network for unsupervised training in the target domain, leading to notable improvements in fine-grained recognition performance. In order to train the domain adversarial network more effectively, a linear transformation bottleneck layer for feature normalization is introduced after the feature extractor. This layer helps alleviate the normalization conflict between training and inference, which arises due to domain shift in mini-batches of data. Moreover, to strike a balance between the competing effects of the classifier and discriminator, we devise a double bilinear dropout module in the domain discriminator.
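The abstract names three concrete mechanisms: teacher-to-student knowledge distillation of prediction distributions, a linear bottleneck layer for feature normalization after the feature extractor, and adversarial domain alignment that exploits low-entropy target predictions. The paper's implementation details are not given on this page, so the following is a minimal PyTorch sketch of how such a pipeline is commonly wired; every module name, dimension, and loss weight is an illustrative assumption, and the paper's "double bilinear dropout" discriminator module is replaced here by a generic discriminator passed in by the caller.

```python
# Illustrative sketch of the components described in the abstract:
# knowledge distillation, a bottleneck normalization layer, and domain
# adversarial training via a gradient reversal layer (GRL). All shapes,
# names, and loss weights are assumptions, not the authors' implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GradientReversal(torch.autograd.Function):
    """Identity on the forward pass; negates and scales gradients on the
    backward pass, so the feature extractor learns to confuse the
    domain discriminator."""
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambd * grad_output, None

class Bottleneck(nn.Module):
    """Linear-transformation bottleneck used here as a stand-in for the
    paper's feature-normalization layer after the feature extractor."""
    def __init__(self, in_dim=2048, out_dim=256):
        super().__init__()
        self.fc = nn.Linear(in_dim, out_dim)
        self.bn = nn.BatchNorm1d(out_dim)  # assumption: BN as the normalizer

    def forward(self, x):
        return self.bn(self.fc(x))

def distillation_loss(student_logits, teacher_logits, T=4.0):
    """KL divergence between softened teacher and student predictions,
    transferring the teacher's fine-grained prediction distribution."""
    p_teacher = F.softmax(teacher_logits / T, dim=1)
    log_p_student = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T * T

def entropy_loss(logits):
    """Encourages low-entropy (confident) predictions on unlabeled
    target-domain data."""
    p = F.softmax(logits, dim=1)
    return -(p * p.log().clamp(min=-100)).sum(dim=1).mean()

def train_step(features_s, labels_s, features_t, teacher_logits_t,
               bottleneck, classifier, discriminator, lambd=1.0):
    """One loss computation on a mixed source/target mini-batch (sketch)."""
    f = bottleneck(torch.cat([features_s, features_t], dim=0))
    logits = classifier(f)
    n_s = features_s.size(0)

    cls = F.cross_entropy(logits[:n_s], labels_s)            # supervised source loss
    kd = distillation_loss(logits[n_s:], teacher_logits_t)   # teacher -> student
    ent = entropy_loss(logits[n_s:])                         # target confidence

    domain = torch.cat([torch.zeros(n_s),                    # 0 = source, 1 = target
                        torch.ones(f.size(0) - n_s)]).long().to(f.device)
    d_logits = discriminator(GradientReversal.apply(f, lambd))
    adv = F.cross_entropy(d_logits, domain)                  # domain confusion

    return cls + kd + 0.1 * ent + adv                        # weights are assumptions
```

In this sketch the gradient-reversal scale λ and the 0.1 entropy weight would be tuned per dataset. Note that the BatchNorm placed in the bottleneck is also where the training/inference normalization conflict mentioned in the abstract would surface: mixed source/target mini-batch statistics during training differ from the running statistics used at inference, which motivates the paper's linear-transformation normalization layer.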
Source Journal
Information Sciences (Engineering & Technology – Computer Science: Information Systems)
CiteScore: 14.00
Self-citation rate: 17.30%
Annual articles: 1322
Review time: 10.4 months
Journal Description: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
Latest Articles in This Journal
A material-aware, multi-modal synthetic dataset for smart city applications: Design and construction
Dynamic control of discrete event systems with unreliable resources by Petri nets
Enhancing friend recommendations in combined social networks: A dynamic approach using label-aware Riemannian residual neural networks
Coordinated optimization of emergency repair for the post-disaster transportation network and the emergency resource allocation scheme
Frequent subgraph-based persistent homology for graph classification