Junbo Yang , Borui Hu , Hanyu Li , Yang Liu , Xinbo Gao , Jungong Han , Fanglin Chen , Xuangou Wu
{"title":"Dynamic VAEs via semantic-aligned matching for continual zero-shot learning","authors":"Junbo Yang , Borui Hu , Hanyu Li , Yang Liu , Xinbo Gao , Jungong Han , Fanglin Chen , Xuangou Wu","doi":"10.1016/j.patcog.2024.111199","DOIUrl":null,"url":null,"abstract":"<div><div>Continual Zero-shot Learning (CZSL) is capable of classifying unseen categories across a sequence of tasks. However, CZSL is often plagued by the challenge of catastrophic forgetting. While recent studies have shown that preserving past data for experience replay can effectively address this issue, it may be limited to specific scenarios and pose a risk of data leakage. Additionally, many existing CZSL models fail to adequately highlight the correlation between semantic and visual features. To tackle these shortcomings, we introduce dynamic Variational Autoencoders (VAEs) via semantic-aligned matching for CZSL. The proposed model utilizes both semantic and visual VAEs to enhance the transfer capability of knowledge from past tasks. Leveraging generative experience replay, our model effectively combats catastrophic forgetting. Our approach was assessed on five datasets: aPY, AWA1, AWA2, CUB, and SUN, yielding superior performance to baseline models.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"160 ","pages":"Article 111199"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320324009506","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Continual Zero-shot Learning (CZSL) is capable of classifying unseen categories across a sequence of tasks. However, CZSL is often plagued by the challenge of catastrophic forgetting. While recent studies have shown that preserving past data for experience replay can effectively address this issue, it may be limited to specific scenarios and pose a risk of data leakage. Additionally, many existing CZSL models fail to adequately highlight the correlation between semantic and visual features. To tackle these shortcomings, we introduce dynamic Variational Autoencoders (VAEs) via semantic-aligned matching for CZSL. The proposed model utilizes both semantic and visual VAEs to enhance the transfer capability of knowledge from past tasks. Leveraging generative experience replay, our model effectively combats catastrophic forgetting. Our approach was assessed on five datasets: aPY, AWA1, AWA2, CUB, and SUN, yielding superior performance to baseline models.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.