ESCS: An Expandable Semantic Communication System for Multimodal Data Based on Contrastive Learning
Xupeng Niu; Dengao Li; Jumin Zhao; Long Tan; Ruiqin Bai
IEEE Communications Letters, vol. 29, no. 2, pp. 368-372. Published 2024-12-16. Journal Article.
DOI: 10.1109/LCOMM.2024.3518538
URL: https://ieeexplore.ieee.org/document/10804137/
Impact factor: 3.7. JCR: Q2 (Telecommunications).
Citation count: 0
Abstract
Semantic communication represents a novel paradigm for meeting the substantial data transmission demands of sixth-generation (6G) mobile communications. To address the challenges of information redundancy, conflicts, asynchrony, and internal interference among multimodal data, this letter introduces an expandable semantic communication system (ESCS). We propose a generic multimodal cross-attention (MMCA) module that enhances interactions among heterogeneous features under the guidance of an autonomously selected leader. By employing dual-contrastive learning, we impose stringent requirements for the feature representation capabilities of the transmitter, enabling it to differentiate between samples containing heterogeneous data. Evaluation results from four tasks under additive white Gaussian noise (AWGN) and fading channels indicate that the proposed system significantly outperforms state-of-the-art methods regarding storage overhead, task accuracy, and channel utilization.
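The abstract names two standard ingredients, contrastive learning and an AWGN channel, without giving details. The following is a minimal sketch of those generic building blocks only, not the authors' ESCS implementation: it assumes a plain InfoNCE loss in place of the letter's unspecified dual-contrastive objective, real-valued signals, and hypothetical names (`awgn`, `info_nce`, `features`) chosen purely for illustration.

```python
import math
import random


def awgn(signal, snr_db, rng):
    """Add real-valued white Gaussian noise at the given SNR in dB."""
    power = sum(x * x for x in signal) / len(signal)
    noise_std = math.sqrt(power / (10 ** (snr_db / 10)))
    return [x + rng.gauss(0.0, noise_std) for x in signal]


def _normalize(v):
    """Scale a vector to unit length (zero vectors are left unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def info_nce(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss: anchors[i] should match positives[i] and no other row.

    Assumption: generic InfoNCE with cosine similarity, used here as a
    stand-in for the dual-contrastive objective mentioned in the abstract.
    """
    a = [_normalize(v) for v in anchors]
    p = [_normalize(v) for v in positives]
    total = 0.0
    for i, ai in enumerate(a):
        logits = [sum(x * y for x, y in zip(ai, pj)) / temperature for pj in p]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_sum - logits[i]
    return total / len(a)


# Toy "semantic features" (hypothetical), sent through the channel at two SNRs.
rng = random.Random(0)
features = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
received_clean = [awgn(f, snr_db=20, rng=rng) for f in features]
received_noisy = [awgn(f, snr_db=-10, rng=rng) for f in features]
loss_clean = info_nce(features, received_clean)
loss_noisy = info_nce(features, received_noisy)
```

In this toy setup the loss compares each transmitted feature with its received copy, so a low value means the samples remain distinguishable after the channel. A fading channel would additionally scale the signal by a random gain before the noise is added; that step is omitted here.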
About the journal:
The IEEE Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of communication over different media and channels including wire, underground, waveguide, optical fiber, and storage channels. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of communication systems.