{"title":"A Comprehensive Survey on Deep Multimodal Learning with Missing Modality","authors":"Renjie Wu, Hu Wang, Hsiang-Ting Chen","doi":"arxiv-2409.07825","DOIUrl":null,"url":null,"abstract":"During multimodal model training and reasoning, data samples may miss certain\nmodalities and lead to compromised model performance due to sensor limitations,\ncost constraints, privacy concerns, data loss, and temporal and spatial\nfactors. This survey provides an overview of recent progress in Multimodal\nLearning with Missing Modality (MLMM), focusing on deep learning techniques. It\nis the first comprehensive survey that covers the historical background and the\ndistinction between MLMM and standard multimodal learning setups, followed by a\ndetailed analysis of current MLMM methods, applications, and datasets,\nconcluding with a discussion about challenges and potential future directions\nin the field.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
During multimodal model training and reasoning, data samples may miss certain
modalities and lead to compromised model performance due to sensor limitations,
cost constraints, privacy concerns, data loss, and temporal and spatial
factors. This survey provides an overview of recent progress in Multimodal
Learning with Missing Modality (MLMM), focusing on deep learning techniques. It
is the first comprehensive survey that covers the historical background and the
distinction between MLMM and standard multimodal learning setups, followed by a
detailed analysis of current MLMM methods, applications, and datasets,
concluding with a discussion about challenges and potential future directions
in the field.