首页 > 最新文献

IEEE Journal of Selected Topics in Signal Processing最新文献

英文 中文
IEEE Signal Processing Society Information IEEE信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-22 DOI: 10.1109/JSTSP.2025.3527894
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2025.3527894","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3527894","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 7","pages":"C2-C2"},"PeriodicalIF":8.7,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10850494","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142993324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial Introduction for the Special Issue on Intelligent Signal Processing and Learning for Next Generation Multiple Access 《下一代多址智能信号处理与学习》特刊编辑导言
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-22 DOI: 10.1109/JSTSP.2024.3522636
Wei Chen;Yuanwei Liu;Hamid Jafarkhani;Yonina Eldar;Peiying Zhu;Khaled B. Letaief
{"title":"Editorial Introduction for the Special Issue on Intelligent Signal Processing and Learning for Next Generation Multiple Access","authors":"Wei Chen;Yuanwei Liu;Hamid Jafarkhani;Yonina Eldar;Peiying Zhu;Khaled B. Letaief","doi":"10.1109/JSTSP.2024.3522636","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3522636","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 7","pages":"1139-1145"},"PeriodicalIF":8.7,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10850495","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142993253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Complexity Joint Radar-Communication Beamforming: From Optimization to Deep Unfolding 低复杂度联合雷达通信波束形成:从优化到深度展开
IF 13.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-13 DOI: 10.1109/JSTSP.2024.3522787
Jianjun Zhang;Christos Masouros;Fan Liu;Yongming Huang;A. Lee Swindlehurst
By sharing the same hardware platform, spectral resource as well as transmit waveform, dual-functional radar-communication (DFRC) systems have been envisioned as a key technology for the future wireless networks. However, advanced signal processing algorithms for DFRC, which can achieve better performance, tradeoff or other design goals, often suffer from prohibitive computational complexity. This motivates us to design low-complexity joint radar sensing and communication beamforming algorithms in this paper, so as to achieve better energy efficiency, communication-sensing tradeoff, and so on. First, we formulate the problem of joint radar-communication beamforming based on symbol-level precoding (SLP) by incorporating constructive interference so as to improve the energy efficiency. To address the formulated problem, we tailor highly parallelizable iterative optimization algorithms that are shown to converge to stationary (or locally optimal) points. To achieve better performance, we propose efficient recursive optimizations that monotonically improve the performance metric of interest. Simulation results indicate that the proposed iterative algorithms outperform the previous approaches. Finally, to further reduce the complexity, we employ deep unfolding to design efficient learning-based algorithms. Besides parallelizability, the learning-based algorithms also enjoy appealing advantages of scalability in the number of served users, the number of transmit antennas and the length of the radar pulse.
通过共享相同的硬件平台、频谱资源和传输波形,双功能雷达通信(DFRC)系统已被设想为未来无线网络的关键技术。然而,用于DFRC的先进信号处理算法可以实现更好的性能、权衡或其他设计目标,但往往存在令人望而却步的计算复杂性。这促使我们在本文中设计低复杂度的联合雷达传感与通信波束形成算法,以达到更好的能效、通信与传感的权衡等目的。首先,我们提出了基于符号级预编码(SLP)的联合雷达通信波束形成问题,通过引入建设性干扰来提高能量效率。为了解决公式化问题,我们定制高度并行化的迭代优化算法,这些算法被证明收敛于平稳(或局部最优)点。为了获得更好的性能,我们提出了有效的递归优化,单调地提高感兴趣的性能指标。仿真结果表明,所提出的迭代算法优于以往的方法。最后,为了进一步降低复杂性,我们采用深度展开来设计高效的基于学习的算法。除了可并行性外,基于学习的算法在服务用户数量、发射天线数量和雷达脉冲长度等方面也具有显著的可扩展性。
{"title":"Low-Complexity Joint Radar-Communication Beamforming: From Optimization to Deep Unfolding","authors":"Jianjun Zhang;Christos Masouros;Fan Liu;Yongming Huang;A. Lee Swindlehurst","doi":"10.1109/JSTSP.2024.3522787","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3522787","url":null,"abstract":"By sharing the same hardware platform, spectral resource as well as transmit waveform, dual-functional radar-communication (DFRC) systems have been envisioned as a key technology for the future wireless networks. However, advanced signal processing algorithms for DFRC, which can achieve better performance, tradeoff or other design goals, often suffer from prohibitive computational complexity. This motivates us to design low-complexity joint radar sensing and communication beamforming algorithms in this paper, so as to achieve better energy efficiency, communication-sensing tradeoff, and so on. First, we formulate the problem of joint radar-communication beamforming based on symbol-level precoding (SLP) by incorporating constructive interference so as to improve the energy efficiency. To address the formulated problem, we tailor highly parallelizable iterative optimization algorithms that are shown to converge to stationary (or locally optimal) points. To achieve better performance, we propose efficient recursive optimizations that monotonically improve the performance metric of interest. Simulation results indicate that the proposed iterative algorithms outperform the previous approaches. Finally, to further reduce the complexity, we employ deep unfolding to design efficient learning-based algorithms. Besides parallelizability, the learning-based algorithms also enjoy appealing advantages of scalability in the number of served users, the number of transmit antennas and the length of the radar pulse.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 5","pages":"856-871"},"PeriodicalIF":13.7,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Massive MIMO With Low Resolution ADCs for Massive Random Access 基于低分辨率adc的大规模随机接入分布式大规模MIMO
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-10 DOI: 10.1109/JSTSP.2024.3516382
Yuhui Song;Zijun Gong;Yuanzhu Chen;Cheng Li
Massive machine-type communications (mMTC), an essential fifth-generation (5G) usage scenario, aims to provide services for a large number of users that intermittently transmit small data packets in smart cities, manufacturing, and agriculture. Massive random access (MRA) emerges as a promising candidate for multiple access in mMTC characterized by the sporadic data traffic. Despite the use of massive multiple-input multiple-output (mMIMO) in MRA to achieve spatial division multiple access and mitigate small-scale fading, existing research endeavors overlook the near-far effect of large-scale fading by assuming perfect power control. In this paper, we present a cost-efficient, effective, and fully distributed solution for MRA to combat large-scale fading, wherein distributed access points (APs) cooperatively detect and serve active users. Each AP is equipped with low resolution analog-to-digital converters (ADCs) for energy-efficient system implementation. Specifically, we derive a rigorous closed-form expression for the uplink achievable rate, considering the impact of non-orthogonal pilots and low resolution ADCs. We also propose a scalable distributed algorithm for user activity detection under flat fading channels, and further adapt it to handle frequency-selective fading in popular orthogonal frequency division multiplexing (OFDM) systems. The proposed solution is fully distributed, since most processing tasks, such as activity detection, channel estimation, and data detection, are localized at each AP. Simulation results demonstrate the significant advantage of distributed systems over co-located systems in accommodating more users while achieving higher activity detection accuracy, and quantify performance loss resulting from the use of low resolution ADCs.
mMTC (Massive machine-type communications)是5G必不可少的使用场景,旨在为智慧城市、制造业、农业等领域大量用户间歇性传输小数据包提供服务。海量随机接入(MRA)是具有零星数据流量特点的多址通信(mMTC)中一种很有前途的多址接入方式。尽管在MRA中使用了大规模多输入多输出(mMIMO)来实现空分多址和缓解小规模衰落,但现有的研究通过假设完美的功率控制而忽略了大规模衰落的近远效应。在本文中,我们提出了一种经济、有效和完全分布式的MRA解决方案来对抗大规模衰落,其中分布式接入点(ap)协同检测和服务活跃用户。每个AP都配备了低分辨率模数转换器(adc),以实现节能系统。具体来说,考虑到非正交导频和低分辨率adc的影响,我们推导了上行可达速率的严格封闭表达式。我们还提出了一种可扩展的分布式算法用于平坦衰落信道下的用户活动检测,并进一步将其应用于处理常见的正交频分复用(OFDM)系统中的频率选择性衰落。所提出的解决方案是完全分布式的,因为大多数处理任务,如活动检测、信道估计和数据检测,都定位在每个AP上。仿真结果表明,分布式系统在容纳更多用户的同时,实现了更高的活动检测精度,并量化了使用低分辨率adc造成的性能损失。
{"title":"Distributed Massive MIMO With Low Resolution ADCs for Massive Random Access","authors":"Yuhui Song;Zijun Gong;Yuanzhu Chen;Cheng Li","doi":"10.1109/JSTSP.2024.3516382","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3516382","url":null,"abstract":"Massive machine-type communications (mMTC), an essential fifth-generation (5G) usage scenario, aims to provide services for a large number of users that intermittently transmit small data packets in smart cities, manufacturing, and agriculture. Massive random access (MRA) emerges as a promising candidate for multiple access in mMTC characterized by the sporadic data traffic. Despite the use of massive multiple-input multiple-output (mMIMO) in MRA to achieve spatial division multiple access and mitigate small-scale fading, existing research endeavors overlook the near-far effect of large-scale fading by assuming perfect power control. In this paper, we present a cost-efficient, effective, and fully distributed solution for MRA to combat large-scale fading, wherein distributed access points (APs) cooperatively detect and serve active users. Each AP is equipped with low resolution analog-to-digital converters (ADCs) for energy-efficient system implementation. Specifically, we derive a rigorous closed-form expression for the uplink achievable rate, considering the impact of non-orthogonal pilots and low resolution ADCs. We also propose a scalable distributed algorithm for user activity detection under flat fading channels, and further adapt it to handle frequency-selective fading in popular orthogonal frequency division multiplexing (OFDM) systems. The proposed solution is fully distributed, since most processing tasks, such as activity detection, channel estimation, and data detection, are localized at each AP. Simulation results demonstrate the significant advantage of distributed systems over co-located systems in accommodating more users while achieving higher activity detection accuracy, and quantify performance loss resulting from the use of low resolution ADCs.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 7","pages":"1381-1395"},"PeriodicalIF":8.7,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142993328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Signal Processing Society Information IEEE信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3511064
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2024.3511064","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3511064","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 5","pages":"C2-C2"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10832404","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Signal Processing Society Information IEEE信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3511060
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2024.3511060","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3511060","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 5","pages":"C3-C3"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10832440","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent Mixup Knowledge Distillation for Single Channel Speech Enhancement 用于单通道语音增强的潜在混淆知识蒸馏
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3524022
Behnam Gholami;Mostafa El-Khamy;Kee-Bong Song
Traditional speech enhancement methods often rely on complex signal processing algorithms, which may not be efficient for real-time applications or may suffer from limitations in handling various types of noise. Deploying complex Deep Neural Network (DNN) models in resource-constrained environments can be challenging due to their high computational requirements. In this paper, we propose a Knowledge Distillation (KD) method for speech enhancement leveraging the information stored in the intermediate latent features of a very complex DNN (teacher) model to train a smaller, more efficient (student) model. Experimental results on a two benchmark speech enhancement datasets demonstrate the effectiveness of the proposed KD method for speech enhancement. The student model trained with knowledge distillation outperforms SOTA speech enhancement methods and achieves comparable performance to the teacher model. Furthermore, our method achieves significant reductions in computational complexity, making it suitable for deployment in resource-constrained environments such as embedded systems and mobile devices.
传统的语音增强方法通常依赖于复杂的信号处理算法,这些算法对于实时应用来说可能并不高效,或者在处理各种类型的噪声时可能会受到限制。由于计算要求较高,在资源有限的环境中部署复杂的深度神经网络(DNN)模型可能具有挑战性。在本文中,我们提出了一种用于语音增强的知识蒸馏(KD)方法,利用存储在非常复杂的 DNN(教师)模型的中间潜在特征中的信息来训练一个更小、更高效的(学生)模型。在两个基准语音增强数据集上的实验结果证明了所提出的 KD 方法在语音增强方面的有效性。采用知识提炼方法训练的学生模型优于 SOTA 语音增强方法,其性能与教师模型相当。此外,我们的方法大大降低了计算复杂度,使其适用于嵌入式系统和移动设备等资源有限的环境。
{"title":"Latent Mixup Knowledge Distillation for Single Channel Speech Enhancement","authors":"Behnam Gholami;Mostafa El-Khamy;Kee-Bong Song","doi":"10.1109/JSTSP.2024.3524022","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3524022","url":null,"abstract":"Traditional speech enhancement methods often rely on complex signal processing algorithms, which may not be efficient for real-time applications or may suffer from limitations in handling various types of noise. Deploying complex Deep Neural Network (DNN) models in resource-constrained environments can be challenging due to their high computational requirements. In this paper, we propose a Knowledge Distillation (KD) method for speech enhancement leveraging the information stored in the intermediate latent features of a very complex DNN (teacher) model to train a smaller, more efficient (student) model. Experimental results on a two benchmark speech enhancement datasets demonstrate the effectiveness of the proposed KD method for speech enhancement. The student model trained with knowledge distillation outperforms SOTA speech enhancement methods and achieves comparable performance to the teacher model. Furthermore, our method achieves significant reductions in computational complexity, making it suitable for deployment in resource-constrained environments such as embedded systems and mobile devices.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 8","pages":"1544-1556"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143184465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial Introduction to the Special Issue on Learning-Based Signal Processing for Integrated Sensing and Communications 基于学习的集成传感与通信信号处理特刊编辑导言
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3522437
Kumar Vijay Mishra;M. R. Bhavani Shankar;Nuria González-Prelcic;Mikko Valkama;Wei Yu;Björn Ottersten
Signal processing techniques have played a pivotal role in the early development of joint sensing and communication systems [1]. These efforts were driven by the need to address spectrum scarcity and to reduce hardware size and cost. Initially focused on dual-function radar-communication systems, this field has since evolved into the broader paradigm of Integrated Sensing and Communication (ISAC). ISAC encompasses a wide range of interactions between sensing and communication, incorporating not just radar but also other sensors, and leveraging their capabilities for applications such as autonomous driving, drone-based services, radio-frequency identification, and weather monitoring. With wireless networks now operating at higher frequencies, their dual role as communication networks and environmental sensors has become increasingly significant, providing critical information for both user needs and network operations [2].
信号处理技术在联合传感和通信系统的早期发展中起着举足轻重的作用。这些努力是由解决频谱稀缺和减少硬件尺寸和成本的需求驱动的。该领域最初侧重于双功能雷达通信系统,此后发展成为更广泛的综合传感和通信(ISAC)范式。ISAC涵盖了传感和通信之间的广泛互动,不仅包括雷达,还包括其他传感器,并利用其应用能力,如自动驾驶、无人机服务、射频识别和天气监测。随着无线网络以更高的频率运行,其作为通信网络和环境传感器的双重作用变得越来越重要,为用户需求和网络运营提供关键信息。
{"title":"Editorial Introduction to the Special Issue on Learning-Based Signal Processing for Integrated Sensing and Communications","authors":"Kumar Vijay Mishra;M. R. Bhavani Shankar;Nuria González-Prelcic;Mikko Valkama;Wei Yu;Björn Ottersten","doi":"10.1109/JSTSP.2024.3522437","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3522437","url":null,"abstract":"Signal processing techniques have played a pivotal role in the early development of joint sensing and communication systems [1]. These efforts were driven by the need to address spectrum scarcity and to reduce hardware size and cost. Initially focused on dual-function radar-communication systems, this field has since evolved into the broader paradigm of Integrated Sensing and Communication (ISAC). ISAC encompasses a wide range of interactions between sensing and communication, incorporating not just radar but also other sensors, and leveraging their capabilities for applications such as autonomous driving, drone-based services, radio-frequency identification, and weather monitoring. With wireless networks now operating at higher frequencies, their dual role as communication networks and environmental sensors has become increasingly significant, providing critical information for both user needs and network operations [2].","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 5","pages":"731-736"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10832414","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Latency Deep Analog Speech Transmission Using Joint Source Channel Coding 使用联合源信道编码的低延迟深度模拟语音传输
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-06 DOI: 10.1109/JSTSP.2024.3521277
Mohammad Bokaei;Jesper Jensen;Simon Doclo;Jan Østergaard
Low-latency configurable speech transmission presents significant challenges in modern communication systems. Traditional methods rely on separate source and channel coding, which often degrades performance under low-latency constraints. Moreover, non-configurable systems require separate training for each condition, limiting their adaptability in resource-constrained scenarios. This paper proposes a configurable low-latency deep Joint Source-Channel Coding (JSCC) system for speech transmission. The system can be configured for varying signal-to-noise ratios (SNR), wireless channel conditions, or bandwidths. A joint source-channel encoder based on deep neural networks (DNN) is used to compress and transmit analog-coded information, while a configurable decoder reconstructs speech from noisy compressed signals. The system latency is adaptable based on the input speech length, achieving a minimum latency of 2 ms, with a lightweight architecture of 25 k parameters, significantly fewer than state-of-the-art systems. The simulation results demonstrate that the proposed system outperforms conventional separate source-channel coding systems in terms of speech quality and intelligibility, particularly in low-latency and noisy channel conditions. It also shows robustness in fixed configured scenarios, though higher latency conditions and better channel environments favor traditional coding systems.
低延迟可配置语音传输给现代通信系统带来了巨大挑战。传统方法依赖于单独的信源和信道编码,这往往会降低低延迟限制下的性能。此外,不可配置系统需要对每种条件进行单独训练,限制了其在资源受限情况下的适应性。本文提出了一种用于语音传输的可配置低延迟深度联合信源信道编码(JSCC)系统。该系统可根据不同的信噪比 (SNR)、无线信道条件或带宽进行配置。基于深度神经网络(DNN)的联合源信道编码器用于压缩和传输模拟编码信息,而可配置的解码器则从有噪声的压缩信号中重建语音。系统延迟可根据输入语音的长度进行调整,实现了 2 毫秒的最低延迟,其轻量级架构包含 25 k 个参数,大大少于最先进的系统。仿真结果表明,在语音质量和可懂度方面,特别是在低延迟和高噪声信道条件下,拟议系统优于传统的独立源信道编码系统。它还显示了在固定配置情况下的鲁棒性,尽管更高的延迟条件和更好的信道环境有利于传统编码系统。
{"title":"Low-Latency Deep Analog Speech Transmission Using Joint Source Channel Coding","authors":"Mohammad Bokaei;Jesper Jensen;Simon Doclo;Jan Østergaard","doi":"10.1109/JSTSP.2024.3521277","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3521277","url":null,"abstract":"Low-latency configurable speech transmission presents significant challenges in modern communication systems. Traditional methods rely on separate source and channel coding, which often degrades performance under low-latency constraints. Moreover, non-configurable systems require separate training for each condition, limiting their adaptability in resource-constrained scenarios. This paper proposes a configurable low-latency deep Joint Source-Channel Coding (JSCC) system for speech transmission. The system can be configured for varying signal-to-noise ratios (SNR), wireless channel conditions, or bandwidths. A joint source-channel encoder based on deep neural networks (DNN) is used to compress and transmit analog-coded information, while a configurable decoder reconstructs speech from noisy compressed signals. The system latency is adaptable based on the input speech length, achieving a minimum latency of 2 ms, with a lightweight architecture of 25 k parameters, significantly fewer than state-of-the-art systems. The simulation results demonstrate that the proposed system outperforms conventional separate source-channel coding systems in terms of speech quality and intelligibility, particularly in low-latency and noisy channel conditions. It also shows robustness in fixed configured scenarios, though higher latency conditions and better channel environments favor traditional coding systems.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 8","pages":"1401-1413"},"PeriodicalIF":8.7,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143184263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence and Privacy of Decentralized Nonconvex Optimization With Gradient Clipping and Communication Compression 梯度裁剪和通信压缩分散非凸优化的收敛性和保密性
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-03 DOI: 10.1109/JSTSP.2025.3526081
Boyue Li;Yuejie Chi
Achieving communication efficiency in decentralized machine learning has been attracting significant attention, with communication compression recognized as an effective technique in algorithm design. This paper takes a first step to understand the role of gradient clipping, a popular strategy in practice, in decentralized nonconvex optimization with communication compression. We propose PORTER, which considers two variants of gradient clipping added before or after taking a mini-batch of stochastic gradients, where the former variant PORTER-DP allows local differential privacy analysis with additional Gaussian perturbation, and the latter variant PORTER-GC helps to stabilize training. We develop a novel analysis framework that establishes their convergence guarantees without assuming the stringent bounded gradient assumption. To the best of our knowledge, our work provides the first convergence analysis for decentralized nonconvex optimization with gradient clipping and communication compression, highlighting the trade-offs between convergence rate, compression ratio, network connectivity, and privacy.
在去中心化机器学习中实现通信效率一直备受关注,通信压缩被认为是算法设计中的一种有效技术。本文首先阐述了梯度裁剪在通信压缩分散非凸优化中的作用。我们提出了PORTER,它考虑了在获取小批量随机梯度之前或之后添加的梯度裁剪的两种变体,其中前一种变体PORTER- dp允许在附加高斯扰动的情况下进行局部差分隐私分析,后一种变体PORTER- gc有助于稳定训练。我们开发了一种新的分析框架,在不假设严格有界梯度假设的情况下建立了它们的收敛性保证。据我们所知,我们的工作为梯度裁剪和通信压缩的分散非凸优化提供了第一个收敛分析,突出了收敛速度、压缩比、网络连接和隐私之间的权衡。
{"title":"Convergence and Privacy of Decentralized Nonconvex Optimization With Gradient Clipping and Communication Compression","authors":"Boyue Li;Yuejie Chi","doi":"10.1109/JSTSP.2025.3526081","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3526081","url":null,"abstract":"Achieving communication efficiency in decentralized machine learning has been attracting significant attention, with communication compression recognized as an effective technique in algorithm design. This paper takes a first step to understand the role of gradient clipping, a popular strategy in practice, in decentralized nonconvex optimization with communication compression. We propose <monospace><b>PORTER</b></monospace>, which considers two variants of gradient clipping added before or after taking a mini-batch of stochastic gradients, where the former variant <monospace><b>PORTER-DP</b></monospace> allows local differential privacy analysis with additional Gaussian perturbation, and the latter variant <monospace><b>PORTER-GC</b></monospace> helps to stabilize training. We develop a novel analysis framework that establishes their convergence guarantees without assuming the stringent bounded gradient assumption. To the best of our knowledge, our work provides the first convergence analysis for decentralized nonconvex optimization with gradient clipping and communication compression, highlighting the trade-offs between convergence rate, compression ratio, network connectivity, and privacy.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 1","pages":"273-282"},"PeriodicalIF":8.7,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143512922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Selected Topics in Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1