{"title":"用于人机通信的词嵌入式语义-拓扑保存量化技术","authors":"Zhenyi Lin;Lin Yang;Yi Gong;Kaibin Huang","doi":"10.1109/TCOMM.2024.3471992","DOIUrl":null,"url":null,"abstract":"The vision of 6G mobile networks aims to connect intelligent machines to humans to provide the latter with cooperation, care, and assistance. The mainstream approach for human-to-machine (H2M) semantic communication is to map words into (word) embedding vectors which are clustered according to their semantic similarity to facilitate machines’ interpretation of human languages. The computation-intensive tasks of text-to-embedding mapping are usually delegated to an edge server that senses human commands, maps them into embedding vectors, and then transmits the vectors to a machine over a wireless link. In this work, we propose a quantization framework customized for embedding vectors, called semantic-topology preserving VQ (SemTop-VQ), to overcome the communication bottleneck due to the vectors’ high dimensionality. While traditional VQ focuses on minimizing the distortion of individual vectors, SemTop-VQ aims to minimize the distortion of the topology of embedding matrix, referring to the vectors’ relative positions that represent semantics. To this end, we adopt a topology-distortion metric, termed pointwise-inner-product (PIP) loss, a hierarchical VQ architecture targeting high-dimensional VQ. In this architecture, an embedding vector is decomposed into blocks; the norm and shape (normalized vector) are quantized separately using a scalar and a Grassmannian quantizers, respectively. The main feature of SemTop-VQ lies in deriving from the PIP loss a set of so-called semantic-importance indicators, which reflect the level of influences of individual blocks’ quantization errors on the topology distortion. Then the indicators are applied to optimize quantization-bit allocation for decomposed vector blocks under the criterion of PIP-loss minimization. In practice, the usage probabilities of embedding vectors for a specific machine task are highly skewed and the task is time-varying. We exploit this fact to further develop SemTop-VQ to feature task adaptation that can attain a higher communication efficiency. The task-adaptive VQ is realized via the use of a frequently used (quantization) codebook that is much smaller in size than the original codebook and continuously updated via estimation of embedding-usage distribution. Our experiments using real embedding datasets, namely Word2Vec and Glove, demonstrate the effectiveness of SemTop-VQ as a goal-oriented technique for efficient H2M communications.","PeriodicalId":13041,"journal":{"name":"IEEE Transactions on Communications","volume":"73 4","pages":"2401-2415"},"PeriodicalIF":8.3000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic-Topology Preserving Quantization of Word Embeddings for Human-to-Machine Communications\",\"authors\":\"Zhenyi Lin;Lin Yang;Yi Gong;Kaibin Huang\",\"doi\":\"10.1109/TCOMM.2024.3471992\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The vision of 6G mobile networks aims to connect intelligent machines to humans to provide the latter with cooperation, care, and assistance. The mainstream approach for human-to-machine (H2M) semantic communication is to map words into (word) embedding vectors which are clustered according to their semantic similarity to facilitate machines’ interpretation of human languages. The computation-intensive tasks of text-to-embedding mapping are usually delegated to an edge server that senses human commands, maps them into embedding vectors, and then transmits the vectors to a machine over a wireless link. In this work, we propose a quantization framework customized for embedding vectors, called semantic-topology preserving VQ (SemTop-VQ), to overcome the communication bottleneck due to the vectors’ high dimensionality. While traditional VQ focuses on minimizing the distortion of individual vectors, SemTop-VQ aims to minimize the distortion of the topology of embedding matrix, referring to the vectors’ relative positions that represent semantics. To this end, we adopt a topology-distortion metric, termed pointwise-inner-product (PIP) loss, a hierarchical VQ architecture targeting high-dimensional VQ. In this architecture, an embedding vector is decomposed into blocks; the norm and shape (normalized vector) are quantized separately using a scalar and a Grassmannian quantizers, respectively. The main feature of SemTop-VQ lies in deriving from the PIP loss a set of so-called semantic-importance indicators, which reflect the level of influences of individual blocks’ quantization errors on the topology distortion. Then the indicators are applied to optimize quantization-bit allocation for decomposed vector blocks under the criterion of PIP-loss minimization. In practice, the usage probabilities of embedding vectors for a specific machine task are highly skewed and the task is time-varying. We exploit this fact to further develop SemTop-VQ to feature task adaptation that can attain a higher communication efficiency. The task-adaptive VQ is realized via the use of a frequently used (quantization) codebook that is much smaller in size than the original codebook and continuously updated via estimation of embedding-usage distribution. Our experiments using real embedding datasets, namely Word2Vec and Glove, demonstrate the effectiveness of SemTop-VQ as a goal-oriented technique for efficient H2M communications.\",\"PeriodicalId\":13041,\"journal\":{\"name\":\"IEEE Transactions on Communications\",\"volume\":\"73 4\",\"pages\":\"2401-2415\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2024-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Communications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10713295/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10713295/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Semantic-Topology Preserving Quantization of Word Embeddings for Human-to-Machine Communications
The vision of 6G mobile networks aims to connect intelligent machines to humans to provide the latter with cooperation, care, and assistance. The mainstream approach for human-to-machine (H2M) semantic communication is to map words into (word) embedding vectors which are clustered according to their semantic similarity to facilitate machines’ interpretation of human languages. The computation-intensive tasks of text-to-embedding mapping are usually delegated to an edge server that senses human commands, maps them into embedding vectors, and then transmits the vectors to a machine over a wireless link. In this work, we propose a quantization framework customized for embedding vectors, called semantic-topology preserving VQ (SemTop-VQ), to overcome the communication bottleneck due to the vectors’ high dimensionality. While traditional VQ focuses on minimizing the distortion of individual vectors, SemTop-VQ aims to minimize the distortion of the topology of embedding matrix, referring to the vectors’ relative positions that represent semantics. To this end, we adopt a topology-distortion metric, termed pointwise-inner-product (PIP) loss, a hierarchical VQ architecture targeting high-dimensional VQ. In this architecture, an embedding vector is decomposed into blocks; the norm and shape (normalized vector) are quantized separately using a scalar and a Grassmannian quantizers, respectively. The main feature of SemTop-VQ lies in deriving from the PIP loss a set of so-called semantic-importance indicators, which reflect the level of influences of individual blocks’ quantization errors on the topology distortion. Then the indicators are applied to optimize quantization-bit allocation for decomposed vector blocks under the criterion of PIP-loss minimization. In practice, the usage probabilities of embedding vectors for a specific machine task are highly skewed and the task is time-varying. We exploit this fact to further develop SemTop-VQ to feature task adaptation that can attain a higher communication efficiency. The task-adaptive VQ is realized via the use of a frequently used (quantization) codebook that is much smaller in size than the original codebook and continuously updated via estimation of embedding-usage distribution. Our experiments using real embedding datasets, namely Word2Vec and Glove, demonstrate the effectiveness of SemTop-VQ as a goal-oriented technique for efficient H2M communications.
期刊介绍:
The IEEE Transactions on Communications is dedicated to publishing high-quality manuscripts that showcase advancements in the state-of-the-art of telecommunications. Our scope encompasses all aspects of telecommunications, including telephone, telegraphy, facsimile, and television, facilitated by electromagnetic propagation methods such as radio, wire, aerial, underground, coaxial, and submarine cables, as well as waveguides, communication satellites, and lasers. We cover telecommunications in various settings, including marine, aeronautical, space, and fixed station services, addressing topics such as repeaters, radio relaying, signal storage, regeneration, error detection and correction, multiplexing, carrier techniques, communication switching systems, data communications, and communication theory. Join us in advancing the field of telecommunications through groundbreaking research and innovation.