{"title":"在多模态变异自动编码器中分辨共享和私有潜在因素","authors":"Kaspar Märtens, Christopher Yau","doi":"arxiv-2403.06338","DOIUrl":null,"url":null,"abstract":"Generative models for multimodal data permit the identification of latent\nfactors that may be associated with important determinants of observed data\nheterogeneity. Common or shared factors could be important for explaining\nvariation across modalities whereas other factors may be private and important\nonly for the explanation of a single modality. Multimodal Variational\nAutoencoders, such as MVAE and MMVAE, are a natural choice for inferring those\nunderlying latent factors and separating shared variation from private. In this\nwork, we investigate their capability to reliably perform this disentanglement.\nIn particular, we highlight a challenging problem setting where\nmodality-specific variation dominates the shared signal. Taking a cross-modal\nprediction perspective, we demonstrate limitations of existing models, and\npropose a modification how to make them more robust to modality-specific\nvariation. Our findings are supported by experiments on synthetic as well as\nvarious real-world multi-omics data sets.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Disentangling shared and private latent factors in multimodal Variational Autoencoders\",\"authors\":\"Kaspar Märtens, Christopher Yau\",\"doi\":\"arxiv-2403.06338\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generative models for multimodal data permit the identification of latent\\nfactors that may be associated with important determinants of observed data\\nheterogeneity. Common or shared factors could be important for explaining\\nvariation across modalities whereas other factors may be private and important\\nonly for the explanation of a single modality. Multimodal Variational\\nAutoencoders, such as MVAE and MMVAE, are a natural choice for inferring those\\nunderlying latent factors and separating shared variation from private. In this\\nwork, we investigate their capability to reliably perform this disentanglement.\\nIn particular, we highlight a challenging problem setting where\\nmodality-specific variation dominates the shared signal. Taking a cross-modal\\nprediction perspective, we demonstrate limitations of existing models, and\\npropose a modification how to make them more robust to modality-specific\\nvariation. Our findings are supported by experiments on synthetic as well as\\nvarious real-world multi-omics data sets.\",\"PeriodicalId\":501070,\"journal\":{\"name\":\"arXiv - QuanBio - Genomics\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2403.06338\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.06338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Disentangling shared and private latent factors in multimodal Variational Autoencoders
Generative models for multimodal data permit the identification of latent
factors that may be associated with important determinants of observed data
heterogeneity. Common or shared factors could be important for explaining
variation across modalities whereas other factors may be private and important
only for the explanation of a single modality. Multimodal Variational
Autoencoders, such as MVAE and MMVAE, are a natural choice for inferring those
underlying latent factors and separating shared variation from private. In this
work, we investigate their capability to reliably perform this disentanglement.
In particular, we highlight a challenging problem setting where
modality-specific variation dominates the shared signal. Taking a cross-modal
prediction perspective, we demonstrate limitations of existing models, and
propose a modification how to make them more robust to modality-specific
variation. Our findings are supported by experiments on synthetic as well as
various real-world multi-omics data sets.