首页 > 最新文献

IEEE Journal of Selected Topics in Signal Processing最新文献

英文 中文
Distributed Massive MIMO With Low Resolution ADCs for Massive Random Access 基于低分辨率adc的大规模随机接入分布式大规模MIMO
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-10 DOI: 10.1109/JSTSP.2024.3516382
Yuhui Song;Zijun Gong;Yuanzhu Chen;Cheng Li
Massive machine-type communications (mMTC), an essential fifth-generation (5G) usage scenario, aims to provide services for a large number of users that intermittently transmit small data packets in smart cities, manufacturing, and agriculture. Massive random access (MRA) emerges as a promising candidate for multiple access in mMTC characterized by the sporadic data traffic. Despite the use of massive multiple-input multiple-output (mMIMO) in MRA to achieve spatial division multiple access and mitigate small-scale fading, existing research endeavors overlook the near-far effect of large-scale fading by assuming perfect power control. In this paper, we present a cost-efficient, effective, and fully distributed solution for MRA to combat large-scale fading, wherein distributed access points (APs) cooperatively detect and serve active users. Each AP is equipped with low resolution analog-to-digital converters (ADCs) for energy-efficient system implementation. Specifically, we derive a rigorous closed-form expression for the uplink achievable rate, considering the impact of non-orthogonal pilots and low resolution ADCs. We also propose a scalable distributed algorithm for user activity detection under flat fading channels, and further adapt it to handle frequency-selective fading in popular orthogonal frequency division multiplexing (OFDM) systems. The proposed solution is fully distributed, since most processing tasks, such as activity detection, channel estimation, and data detection, are localized at each AP. Simulation results demonstrate the significant advantage of distributed systems over co-located systems in accommodating more users while achieving higher activity detection accuracy, and quantify performance loss resulting from the use of low resolution ADCs.
mMTC (Massive machine-type communications)是5G必不可少的使用场景,旨在为智慧城市、制造业、农业等领域大量用户间歇性传输小数据包提供服务。海量随机接入(MRA)是具有零星数据流量特点的多址通信(mMTC)中一种很有前途的多址接入方式。尽管在MRA中使用了大规模多输入多输出(mMIMO)来实现空分多址和缓解小规模衰落,但现有的研究通过假设完美的功率控制而忽略了大规模衰落的近远效应。在本文中,我们提出了一种经济、有效和完全分布式的MRA解决方案来对抗大规模衰落,其中分布式接入点(ap)协同检测和服务活跃用户。每个AP都配备了低分辨率模数转换器(adc),以实现节能系统。具体来说,考虑到非正交导频和低分辨率adc的影响,我们推导了上行可达速率的严格封闭表达式。我们还提出了一种可扩展的分布式算法用于平坦衰落信道下的用户活动检测,并进一步将其应用于处理常见的正交频分复用(OFDM)系统中的频率选择性衰落。所提出的解决方案是完全分布式的,因为大多数处理任务,如活动检测、信道估计和数据检测,都定位在每个AP上。仿真结果表明,分布式系统在容纳更多用户的同时,实现了更高的活动检测精度,并量化了使用低分辨率adc造成的性能损失。
{"title":"Distributed Massive MIMO With Low Resolution ADCs for Massive Random Access","authors":"Yuhui Song;Zijun Gong;Yuanzhu Chen;Cheng Li","doi":"10.1109/JSTSP.2024.3516382","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3516382","url":null,"abstract":"Massive machine-type communications (mMTC), an essential fifth-generation (5G) usage scenario, aims to provide services for a large number of users that intermittently transmit small data packets in smart cities, manufacturing, and agriculture. Massive random access (MRA) emerges as a promising candidate for multiple access in mMTC characterized by the sporadic data traffic. Despite the use of massive multiple-input multiple-output (mMIMO) in MRA to achieve spatial division multiple access and mitigate small-scale fading, existing research endeavors overlook the near-far effect of large-scale fading by assuming perfect power control. In this paper, we present a cost-efficient, effective, and fully distributed solution for MRA to combat large-scale fading, wherein distributed access points (APs) cooperatively detect and serve active users. Each AP is equipped with low resolution analog-to-digital converters (ADCs) for energy-efficient system implementation. Specifically, we derive a rigorous closed-form expression for the uplink achievable rate, considering the impact of non-orthogonal pilots and low resolution ADCs. We also propose a scalable distributed algorithm for user activity detection under flat fading channels, and further adapt it to handle frequency-selective fading in popular orthogonal frequency division multiplexing (OFDM) systems. The proposed solution is fully distributed, since most processing tasks, such as activity detection, channel estimation, and data detection, are localized at each AP. Simulation results demonstrate the significant advantage of distributed systems over co-located systems in accommodating more users while achieving higher activity detection accuracy, and quantify performance loss resulting from the use of low resolution ADCs.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 7","pages":"1381-1395"},"PeriodicalIF":8.7,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142993328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Signal Processing Society Information IEEE信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3511064
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2024.3511064","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3511064","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 5","pages":"C2-C2"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10832404","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Signal Processing Society Information IEEE信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3511060
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2024.3511060","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3511060","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 5","pages":"C3-C3"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10832440","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent Mixup Knowledge Distillation for Single Channel Speech Enhancement 用于单通道语音增强的潜在混淆知识蒸馏
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3524022
Behnam Gholami;Mostafa El-Khamy;Kee-Bong Song
Traditional speech enhancement methods often rely on complex signal processing algorithms, which may not be efficient for real-time applications or may suffer from limitations in handling various types of noise. Deploying complex Deep Neural Network (DNN) models in resource-constrained environments can be challenging due to their high computational requirements. In this paper, we propose a Knowledge Distillation (KD) method for speech enhancement leveraging the information stored in the intermediate latent features of a very complex DNN (teacher) model to train a smaller, more efficient (student) model. Experimental results on a two benchmark speech enhancement datasets demonstrate the effectiveness of the proposed KD method for speech enhancement. The student model trained with knowledge distillation outperforms SOTA speech enhancement methods and achieves comparable performance to the teacher model. Furthermore, our method achieves significant reductions in computational complexity, making it suitable for deployment in resource-constrained environments such as embedded systems and mobile devices.
传统的语音增强方法通常依赖于复杂的信号处理算法,这些算法对于实时应用来说可能并不高效,或者在处理各种类型的噪声时可能会受到限制。由于计算要求较高,在资源有限的环境中部署复杂的深度神经网络(DNN)模型可能具有挑战性。在本文中,我们提出了一种用于语音增强的知识蒸馏(KD)方法,利用存储在非常复杂的 DNN(教师)模型的中间潜在特征中的信息来训练一个更小、更高效的(学生)模型。在两个基准语音增强数据集上的实验结果证明了所提出的 KD 方法在语音增强方面的有效性。采用知识提炼方法训练的学生模型优于 SOTA 语音增强方法,其性能与教师模型相当。此外,我们的方法大大降低了计算复杂度,使其适用于嵌入式系统和移动设备等资源有限的环境。
{"title":"Latent Mixup Knowledge Distillation for Single Channel Speech Enhancement","authors":"Behnam Gholami;Mostafa El-Khamy;Kee-Bong Song","doi":"10.1109/JSTSP.2024.3524022","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3524022","url":null,"abstract":"Traditional speech enhancement methods often rely on complex signal processing algorithms, which may not be efficient for real-time applications or may suffer from limitations in handling various types of noise. Deploying complex Deep Neural Network (DNN) models in resource-constrained environments can be challenging due to their high computational requirements. In this paper, we propose a Knowledge Distillation (KD) method for speech enhancement leveraging the information stored in the intermediate latent features of a very complex DNN (teacher) model to train a smaller, more efficient (student) model. Experimental results on a two benchmark speech enhancement datasets demonstrate the effectiveness of the proposed KD method for speech enhancement. The student model trained with knowledge distillation outperforms SOTA speech enhancement methods and achieves comparable performance to the teacher model. Furthermore, our method achieves significant reductions in computational complexity, making it suitable for deployment in resource-constrained environments such as embedded systems and mobile devices.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 8","pages":"1544-1556"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143184465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial Introduction to the Special Issue on Learning-Based Signal Processing for Integrated Sensing and Communications 基于学习的集成传感与通信信号处理特刊编辑导言
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-07 DOI: 10.1109/JSTSP.2024.3522437
Kumar Vijay Mishra;M. R. Bhavani Shankar;Nuria González-Prelcic;Mikko Valkama;Wei Yu;Björn Ottersten
Signal processing techniques have played a pivotal role in the early development of joint sensing and communication systems [1]. These efforts were driven by the need to address spectrum scarcity and to reduce hardware size and cost. Initially focused on dual-function radar-communication systems, this field has since evolved into the broader paradigm of Integrated Sensing and Communication (ISAC). ISAC encompasses a wide range of interactions between sensing and communication, incorporating not just radar but also other sensors, and leveraging their capabilities for applications such as autonomous driving, drone-based services, radio-frequency identification, and weather monitoring. With wireless networks now operating at higher frequencies, their dual role as communication networks and environmental sensors has become increasingly significant, providing critical information for both user needs and network operations [2].
信号处理技术在联合传感和通信系统的早期发展中起着举足轻重的作用。这些努力是由解决频谱稀缺和减少硬件尺寸和成本的需求驱动的。该领域最初侧重于双功能雷达通信系统,此后发展成为更广泛的综合传感和通信(ISAC)范式。ISAC涵盖了传感和通信之间的广泛互动,不仅包括雷达,还包括其他传感器,并利用其应用能力,如自动驾驶、无人机服务、射频识别和天气监测。随着无线网络以更高的频率运行,其作为通信网络和环境传感器的双重作用变得越来越重要,为用户需求和网络运营提供关键信息。
{"title":"Editorial Introduction to the Special Issue on Learning-Based Signal Processing for Integrated Sensing and Communications","authors":"Kumar Vijay Mishra;M. R. Bhavani Shankar;Nuria González-Prelcic;Mikko Valkama;Wei Yu;Björn Ottersten","doi":"10.1109/JSTSP.2024.3522437","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3522437","url":null,"abstract":"Signal processing techniques have played a pivotal role in the early development of joint sensing and communication systems [1]. These efforts were driven by the need to address spectrum scarcity and to reduce hardware size and cost. Initially focused on dual-function radar-communication systems, this field has since evolved into the broader paradigm of Integrated Sensing and Communication (ISAC). ISAC encompasses a wide range of interactions between sensing and communication, incorporating not just radar but also other sensors, and leveraging their capabilities for applications such as autonomous driving, drone-based services, radio-frequency identification, and weather monitoring. With wireless networks now operating at higher frequencies, their dual role as communication networks and environmental sensors has become increasingly significant, providing critical information for both user needs and network operations [2].","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 5","pages":"731-736"},"PeriodicalIF":8.7,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10832414","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Latency Deep Analog Speech Transmission Using Joint Source Channel Coding 使用联合源信道编码的低延迟深度模拟语音传输
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-06 DOI: 10.1109/JSTSP.2024.3521277
Mohammad Bokaei;Jesper Jensen;Simon Doclo;Jan Østergaard
Low-latency configurable speech transmission presents significant challenges in modern communication systems. Traditional methods rely on separate source and channel coding, which often degrades performance under low-latency constraints. Moreover, non-configurable systems require separate training for each condition, limiting their adaptability in resource-constrained scenarios. This paper proposes a configurable low-latency deep Joint Source-Channel Coding (JSCC) system for speech transmission. The system can be configured for varying signal-to-noise ratios (SNR), wireless channel conditions, or bandwidths. A joint source-channel encoder based on deep neural networks (DNN) is used to compress and transmit analog-coded information, while a configurable decoder reconstructs speech from noisy compressed signals. The system latency is adaptable based on the input speech length, achieving a minimum latency of 2 ms, with a lightweight architecture of 25 k parameters, significantly fewer than state-of-the-art systems. The simulation results demonstrate that the proposed system outperforms conventional separate source-channel coding systems in terms of speech quality and intelligibility, particularly in low-latency and noisy channel conditions. It also shows robustness in fixed configured scenarios, though higher latency conditions and better channel environments favor traditional coding systems.
低延迟可配置语音传输给现代通信系统带来了巨大挑战。传统方法依赖于单独的信源和信道编码,这往往会降低低延迟限制下的性能。此外,不可配置系统需要对每种条件进行单独训练,限制了其在资源受限情况下的适应性。本文提出了一种用于语音传输的可配置低延迟深度联合信源信道编码(JSCC)系统。该系统可根据不同的信噪比 (SNR)、无线信道条件或带宽进行配置。基于深度神经网络(DNN)的联合源信道编码器用于压缩和传输模拟编码信息,而可配置的解码器则从有噪声的压缩信号中重建语音。系统延迟可根据输入语音的长度进行调整,实现了 2 毫秒的最低延迟,其轻量级架构包含 25 k 个参数,大大少于最先进的系统。仿真结果表明,在语音质量和可懂度方面,特别是在低延迟和高噪声信道条件下,拟议系统优于传统的独立源信道编码系统。它还显示了在固定配置情况下的鲁棒性,尽管更高的延迟条件和更好的信道环境有利于传统编码系统。
{"title":"Low-Latency Deep Analog Speech Transmission Using Joint Source Channel Coding","authors":"Mohammad Bokaei;Jesper Jensen;Simon Doclo;Jan Østergaard","doi":"10.1109/JSTSP.2024.3521277","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3521277","url":null,"abstract":"Low-latency configurable speech transmission presents significant challenges in modern communication systems. Traditional methods rely on separate source and channel coding, which often degrades performance under low-latency constraints. Moreover, non-configurable systems require separate training for each condition, limiting their adaptability in resource-constrained scenarios. This paper proposes a configurable low-latency deep Joint Source-Channel Coding (JSCC) system for speech transmission. The system can be configured for varying signal-to-noise ratios (SNR), wireless channel conditions, or bandwidths. A joint source-channel encoder based on deep neural networks (DNN) is used to compress and transmit analog-coded information, while a configurable decoder reconstructs speech from noisy compressed signals. The system latency is adaptable based on the input speech length, achieving a minimum latency of 2 ms, with a lightweight architecture of 25 k parameters, significantly fewer than state-of-the-art systems. The simulation results demonstrate that the proposed system outperforms conventional separate source-channel coding systems in terms of speech quality and intelligibility, particularly in low-latency and noisy channel conditions. It also shows robustness in fixed configured scenarios, though higher latency conditions and better channel environments favor traditional coding systems.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 8","pages":"1401-1413"},"PeriodicalIF":8.7,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143184263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence and Privacy of Decentralized Nonconvex Optimization With Gradient Clipping and Communication Compression 梯度裁剪和通信压缩分散非凸优化的收敛性和保密性
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-01-03 DOI: 10.1109/JSTSP.2025.3526081
Boyue Li;Yuejie Chi
Achieving communication efficiency in decentralized machine learning has been attracting significant attention, with communication compression recognized as an effective technique in algorithm design. This paper takes a first step to understand the role of gradient clipping, a popular strategy in practice, in decentralized nonconvex optimization with communication compression. We propose PORTER, which considers two variants of gradient clipping added before or after taking a mini-batch of stochastic gradients, where the former variant PORTER-DP allows local differential privacy analysis with additional Gaussian perturbation, and the latter variant PORTER-GC helps to stabilize training. We develop a novel analysis framework that establishes their convergence guarantees without assuming the stringent bounded gradient assumption. To the best of our knowledge, our work provides the first convergence analysis for decentralized nonconvex optimization with gradient clipping and communication compression, highlighting the trade-offs between convergence rate, compression ratio, network connectivity, and privacy.
在去中心化机器学习中实现通信效率一直备受关注,通信压缩被认为是算法设计中的一种有效技术。本文首先阐述了梯度裁剪在通信压缩分散非凸优化中的作用。我们提出了PORTER,它考虑了在获取小批量随机梯度之前或之后添加的梯度裁剪的两种变体,其中前一种变体PORTER- dp允许在附加高斯扰动的情况下进行局部差分隐私分析,后一种变体PORTER- gc有助于稳定训练。我们开发了一种新的分析框架,在不假设严格有界梯度假设的情况下建立了它们的收敛性保证。据我们所知,我们的工作为梯度裁剪和通信压缩的分散非凸优化提供了第一个收敛分析,突出了收敛速度、压缩比、网络连接和隐私之间的权衡。
{"title":"Convergence and Privacy of Decentralized Nonconvex Optimization With Gradient Clipping and Communication Compression","authors":"Boyue Li;Yuejie Chi","doi":"10.1109/JSTSP.2025.3526081","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3526081","url":null,"abstract":"Achieving communication efficiency in decentralized machine learning has been attracting significant attention, with communication compression recognized as an effective technique in algorithm design. This paper takes a first step to understand the role of gradient clipping, a popular strategy in practice, in decentralized nonconvex optimization with communication compression. We propose <monospace><b>PORTER</b></monospace>, which considers two variants of gradient clipping added before or after taking a mini-batch of stochastic gradients, where the former variant <monospace><b>PORTER-DP</b></monospace> allows local differential privacy analysis with additional Gaussian perturbation, and the latter variant <monospace><b>PORTER-GC</b></monospace> helps to stabilize training. We develop a novel analysis framework that establishes their convergence guarantees without assuming the stringent bounded gradient assumption. To the best of our knowledge, our work provides the first convergence analysis for decentralized nonconvex optimization with gradient clipping and communication compression, highlighting the trade-offs between convergence rate, compression ratio, network connectivity, and privacy.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 1","pages":"273-282"},"PeriodicalIF":8.7,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143512922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IRAU-Net: Inception Residual Attention U-Net in Adversarial Network for Cardiac MRI Segmentation IRAU-Net:用于心脏MRI分割的初始剩余注意U-Net对抗网络
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-12-26 DOI: 10.1109/JSTSP.2024.3523233
Maryam Talebi Rostami;Seyed Ahmad Motamedi
Due to the significant advancements in medical imaging, the application of artificial intelligence for early disease diagnosis has greatly contributed to reducing mortality caused by heart diseases. Cardiac MRI segmentation into the left ventricle (LV), right ventricle (RV) and myocardium (MYO) is a key step towards the early treatment and diagnosis of heart diseases. The proposed method consists of two main components: a localization part to extract the heart organ from other parts of the image, and a GAN model for segmentation. In the generator part of the generative adversarial network (GAN) model, a customized U-Net network is employed to segment the cardiac image. A combination of inception and residual blocks and utilizing an attention mechanism yield an improved version of U-Net. On the other hand, the discriminator part of the GAN model is designed to accurately distinguish between the ground truth image and the segmented image generated by the generator part. Our method is evaluated on two cardiac MRI datasets. The evaluation results on the ACDC 2017 challenge dataset show mean Dice scores of 0.947 for LV, 0.919 for RV, and 0.907 for MYO in cardiac MRI segmentation. The experimental results highlight that our proposed IRAU-Net method outperforms other state-of-the-art methods in terms of accuracy while significantly reducing computational costs.
由于医学影像学的显著进步,人工智能在疾病早期诊断中的应用为降低心脏病死亡率做出了巨大贡献。心脏MRI对左心室(LV)、右心室(RV)和心肌(MYO)的分割是心脏病早期治疗和诊断的关键一步。该方法由两个主要部分组成:从图像的其他部分提取心脏器官的定位部分和用于分割的GAN模型。在生成对抗网络(GAN)模型的生成部分,采用定制的U-Net网络对心脏图像进行分割。将初始块和剩余块结合起来,并利用注意机制,产生了U-Net的改进版本。另一方面,GAN模型的鉴别器部分被设计用于准确区分地真图像和由生成器部分生成的分割图像。我们的方法在两个心脏MRI数据集上进行了评估。在ACDC 2017挑战数据集上的评估结果显示,心脏MRI分割中LV的平均Dice得分为0.947,RV为0.919,MYO为0.907。实验结果表明,我们提出的IRAU-Net方法在精度方面优于其他最先进的方法,同时显着降低了计算成本。
{"title":"IRAU-Net: Inception Residual Attention U-Net in Adversarial Network for Cardiac MRI Segmentation","authors":"Maryam Talebi Rostami;Seyed Ahmad Motamedi","doi":"10.1109/JSTSP.2024.3523233","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3523233","url":null,"abstract":"Due to the significant advancements in medical imaging, the application of artificial intelligence for early disease diagnosis has greatly contributed to reducing mortality caused by heart diseases. Cardiac MRI segmentation into the left ventricle (LV), right ventricle (RV) and myocardium (MYO) is a key step towards the early treatment and diagnosis of heart diseases. The proposed method consists of two main components: a localization part to extract the heart organ from other parts of the image, and a GAN model for segmentation. In the generator part of the generative adversarial network (GAN) model, a customized U-Net network is employed to segment the cardiac image. A combination of inception and residual blocks and utilizing an attention mechanism yield an improved version of U-Net. On the other hand, the discriminator part of the GAN model is designed to accurately distinguish between the ground truth image and the segmented image generated by the generator part. Our method is evaluated on two cardiac MRI datasets. The evaluation results on the ACDC 2017 challenge dataset show mean Dice scores of 0.947 for LV, 0.919 for RV, and 0.907 for MYO in cardiac MRI segmentation. The experimental results highlight that our proposed IRAU-Net method outperforms other state-of-the-art methods in terms of accuracy while significantly reducing computational costs.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 1","pages":"260-272"},"PeriodicalIF":8.7,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143512840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Reinforcement Learning for Resource Allocation in V2X Networks V2X网络中资源分配的联邦强化学习
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-12-17 DOI: 10.1109/JSTSP.2024.3513692
Kaidi Xu;Shenglong Zhou;Geoffrey Ye Li
Resource allocation significantly impacts the performance of vehicle-to-everything (V2X) networks in next generation multiple access (NGMA). Most existing algorithms for resource allocation are based on optimization or machine learning (e.g., reinforcement learning). In this paper, we explore resource allocation in a NGMA V2X network under the framework of federated reinforcement learning (FRL). On one hand, the usage of RL overcomes many challenges from the model-based optimization schemes. On the other hand, federated learning (FL) enables agents to deal with a number of practical issues, such as privacy, communication overhead, distributed learning, and exploration efficiency. The framework of FRL is then implemented by the inexact alternative direction method of multipliers (ADMM), where subproblems are solved approximately using policy gradients and accelerated by an adaptive step size calculated from their second moments. The developed algorithm, PASM, is proven to be convergent under mild conditions and has a nice numerical performance compared with some baseline methods for solving the resource allocation problems in a NGMA V2X network.
在下一代多址(NGMA)技术中,资源分配对V2X (vehicle-to-everything)网络的性能影响很大。大多数现有的资源分配算法都是基于优化或机器学习(如强化学习)。本文研究了联邦强化学习(FRL)框架下NGMA V2X网络中的资源分配问题。一方面,强化学习的使用克服了基于模型的优化方案的许多挑战。另一方面,联邦学习(FL)使代理能够处理许多实际问题,如隐私、通信开销、分布式学习和探索效率。FRL的框架然后通过乘法器的不精确替代方向方法(ADMM)实现,其中子问题使用策略梯度近似求解,并通过从其第二矩计算的自适应步长加速。该算法在温和条件下具有收敛性,与一些基准方法相比,在求解NGMA V2X网络中的资源分配问题上具有良好的数值性能。
{"title":"Federated Reinforcement Learning for Resource Allocation in V2X Networks","authors":"Kaidi Xu;Shenglong Zhou;Geoffrey Ye Li","doi":"10.1109/JSTSP.2024.3513692","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3513692","url":null,"abstract":"Resource allocation significantly impacts the performance of vehicle-to-everything (V2X) networks in next generation multiple access (NGMA). Most existing algorithms for resource allocation are based on optimization or machine learning (e.g., reinforcement learning). In this paper, we explore resource allocation in a NGMA V2X network under the framework of federated reinforcement learning (FRL). On one hand, the usage of RL overcomes many challenges from the model-based optimization schemes. On the other hand, federated learning (FL) enables agents to deal with a number of practical issues, such as privacy, communication overhead, distributed learning, and exploration efficiency. The framework of FRL is then implemented by the inexact alternative direction method of multipliers (ADMM), where subproblems are solved approximately using policy gradients and accelerated by an adaptive step size calculated from their second moments. The developed algorithm, PASM, is proven to be convergent under mild conditions and has a nice numerical performance compared with some baseline methods for solving the resource allocation problems in a NGMA V2X network.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 7","pages":"1210-1221"},"PeriodicalIF":8.7,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142993249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Channel Learning for Large-Scale Radio Speaker Verification 大规模无线电扬声器验证的鲁棒信道学习
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-12-16 DOI: 10.1109/JSTSP.2024.3518257
Wenhao Yang;Jianguo Wei;Wenhuan Lu;Lei Li;Xugang Lu
Recent research in speaker verification has increasingly focused on achieving robust and reliable recognition under challenging channel conditions and noisy environments. Identifying speakers in radio communications is particularly difficult due to inherent limitations such as constrained bandwidth and pervasive noise interference. To address this issue, we present a Channel Robust Speaker Learning (CRSL) framework that enhances the robustness of the current speaker verification pipeline, considering data source, data augmentation, and the efficiency of model transfer processes. Our framework introduces an augmentation module that mitigates bandwidth variations in radio speech datasets by manipulating the bandwidth of training inputs. It also addresses unknown noise by introducing noise within the manifold space. Additionally, we propose an efficient fine-tuning method that reduces the need for extensive additional training time and large amounts of data. Moreover, we develop a toolkit for assembling a large-scale radio speech corpus and establish a benchmark specifically tailored for radio scenario speaker verification studies. Experimental results demonstrate that our proposed methodology effectively enhances performance and mitigates degradation caused by radio transmission in speaker verification tasks.
近年来对说话人验证的研究越来越关注于如何在具有挑战性的信道条件和噪声环境下实现鲁棒和可靠的识别。由于固有的限制,例如受限的带宽和普遍的噪声干扰,在无线电通信中识别说话人特别困难。为了解决这个问题,我们提出了一个通道鲁棒说话人学习(CRSL)框架,该框架考虑了数据源、数据增强和模型迁移过程的效率,增强了当前说话人验证管道的鲁棒性。我们的框架引入了一个增强模块,通过操纵训练输入的带宽来减轻无线电语音数据集的带宽变化。它还通过在流形空间中引入噪声来处理未知噪声。此外,我们提出了一种有效的微调方法,减少了对大量额外训练时间和大量数据的需求。此外,我们开发了一个用于组装大规模无线电语音语料库的工具包,并建立了专门为无线电场景说话者验证研究量身定制的基准。实验结果表明,我们提出的方法有效地提高了性能,减轻了说话人验证任务中无线电传输引起的退化。
{"title":"Robust Channel Learning for Large-Scale Radio Speaker Verification","authors":"Wenhao Yang;Jianguo Wei;Wenhuan Lu;Lei Li;Xugang Lu","doi":"10.1109/JSTSP.2024.3518257","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3518257","url":null,"abstract":"Recent research in speaker verification has increasingly focused on achieving robust and reliable recognition under challenging channel conditions and noisy environments. Identifying speakers in radio communications is particularly difficult due to inherent limitations such as constrained bandwidth and pervasive noise interference. To address this issue, we present a Channel Robust Speaker Learning (CRSL) framework that enhances the robustness of the current speaker verification pipeline, considering data source, data augmentation, and the efficiency of model transfer processes. Our framework introduces an augmentation module that mitigates bandwidth variations in radio speech datasets by manipulating the bandwidth of training inputs. It also addresses unknown noise by introducing noise within the manifold space. Additionally, we propose an efficient fine-tuning method that reduces the need for extensive additional training time and large amounts of data. Moreover, we develop a toolkit for assembling a large-scale radio speech corpus and establish a benchmark specifically tailored for radio scenario speaker verification studies. Experimental results demonstrate that our proposed methodology effectively enhances performance and mitigates degradation caused by radio transmission in speaker verification tasks.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 1","pages":"248-259"},"PeriodicalIF":8.7,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143512843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Selected Topics in Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1