IEEE Journal of Selected Topics in Signal Processing最新文献

英文中文

Editorial JSTSP NSAC Editorial 编辑JSTSP NSAC编辑

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-02-05 DOI: 10.1109/JSTSP.2024.3522737

Jan Skoglund;Minje Kim;Xiulian Peng;Lars Villemoes

引用次数: 0

Distributed Distortion-Aware Beamforming Designs for Cell-Free mMIMO Systems 无小区mimo系统的分布式畸变感知波束形成设计

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-02-03 DOI: 10.1109/JSTSP.2025.3537798

Mengzhen Liu;Ming Li;Rang Liu;Qian Liu

Cell-free massive multi-input multi-output (CF-mMIMO) systems have emerged as a promising paradigm for next-generation wireless communications, offering enhanced spectral efficiency and coverage through distributed antenna arrays. However, the non-linearity of power amplifiers (PAs) in these arrays introduce spatial distortion, which may significantly degrade system performance. This paper presents the first investigation of distortion-aware beamforming in a distributed framework tailored for CF-mMIMO systems, enabling pre-compensation for beam dispersion caused by nonlinear PA distortion. Using a third-order memoryless polynomial distortion model, the impact of the nonlinear PA on the performance of CF-mMIMO systems is firstly analyzed by evaluating the signal-to-interference-noise-and-distortion ratio (SINDR) at user equipment (UE). Then, we develop two distributed distortion-aware beamforming designs based on ring topology and star topology, respectively. In particular, the ring-topology-based fully-distributed approach reduces interconnection costs and computational complexity, while the star-topology-based partially-distributed scheme leverages the superior computation capability of the central processor to achieve improved sum-rate performance. Extensive simulations demonstrate the effectiveness of the proposed distortion-aware beamforming designs in mitigating the effect of nonlinear PA distortion, while also reducing computational complexity and backhaul information exchange in CF-mMIMO systems.

无蜂窝大规模多输入多输出（CF-mMIMO）系统已经成为下一代无线通信的一个有前途的范例，通过分布式天线阵列提供更高的频谱效率和覆盖范围。然而，在这些阵列中，功率放大器（PAs）的非线性引入了空间失真，这可能会严重降低系统性能。本文首次研究了为CF-mMIMO系统量身定制的分布式框架中的畸变感知波束形成，可以对非线性PA失真引起的波束色散进行预补偿。首先利用三阶无记忆多项式失真模型，通过评估用户设备（UE）的信噪比（SINDR），分析了非线性PA对CF-mMIMO系统性能的影响。然后，我们分别开发了基于环形拓扑和星形拓扑的两种分布式畸变感知波束形成设计。特别是，基于环拓扑的全分布式方案降低了互连成本和计算复杂度，而基于星型拓扑的部分分布式方案利用中央处理器优越的计算能力，实现了更高的和速率性能。大量的仿真证明了所提出的失真感知波束形成设计在减轻非线性PA失真影响方面的有效性，同时还降低了CF-mMIMO系统的计算复杂性和回程信息交换。

{"title":"Distributed Distortion-Aware Beamforming Designs for Cell-Free mMIMO Systems","authors":"Mengzhen Liu;Ming Li;Rang Liu;Qian Liu","doi":"10.1109/JSTSP.2025.3537798","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3537798","url":null,"abstract":"Cell-free massive multi-input multi-output (CF-mMIMO) systems have emerged as a promising paradigm for next-generation wireless communications, offering enhanced spectral efficiency and coverage through distributed antenna arrays. However, the non-linearity of power amplifiers (PAs) in these arrays introduce spatial distortion, which may significantly degrade system performance. This paper presents the first investigation of distortion-aware beamforming in a distributed framework tailored for CF-mMIMO systems, enabling pre-compensation for beam dispersion caused by nonlinear PA distortion. Using a third-order memoryless polynomial distortion model, the impact of the nonlinear PA on the performance of CF-mMIMO systems is firstly analyzed by evaluating the signal-to-interference-noise-and-distortion ratio (SINDR) at user equipment (UE). Then, we develop two distributed distortion-aware beamforming designs based on ring topology and star topology, respectively. In particular, the ring-topology-based fully-distributed approach reduces interconnection costs and computational complexity, while the star-topology-based partially-distributed scheme leverages the superior computation capability of the central processor to achieve improved sum-rate performance. Extensive simulations demonstrate the effectiveness of the proposed distortion-aware beamforming designs in mitigating the effect of nonlinear PA distortion, while also reducing computational complexity and backhaul information exchange in CF-mMIMO systems.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 2","pages":"381-397"},"PeriodicalIF":8.7,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Spatial Ultra-Sparse Array Formation on LEO Distributed Satellite Cluster: An Enhanced Hybrid Particle Swarm Method LEO分布式卫星簇空间超稀疏阵列形成：一种增强的混合粒子群方法

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-27 DOI: 10.1109/JSTSP.2025.3534428

Yuanzhi He;Peng Yang;Yunying Man;Changxu Wang;Chengwu Qi

The rapid development of Direct-to-device (D2D) services has put forward higher requirements for the performance of satellite antenna systems. The Spatial Ultra-Sparse Distributed Array (SUSDA) constructed by Distributed Satellite Cluster (DSC) has the characteristics of strong directivity, high flexibility and strong anti-jamming ability, which can better meet the communication requirements in future D2D scenarios. However, the non-uniform arrangement of SUSDA leads to the increase of the side lobe level (SLL) and the decrease of the overall antenna performance. To solve this problem, this paper proposes for the first time a configuration design method for a Low Earth Orbit (LEO) SUSDA capable of supporting D2D services in future 6G scenarios. It constructs a mathematical model related to the configuration design of the LEO SUSDA and provides a rapid prediction of the performance of the SUSDA radiation pattern function based on a probabilistic model. Then, an Enhanced Hybrid Particle Swarm Optimization (EHPSO) algorithm is proposed to solve the configuration design problem, which overcomes the slow convergence problem of traditional HPSO algorithm particularly when the array scale is large. The EHPSO algorithm adapts to the search requirements of different stages by adjusting parameters adaptively. It introduces a single suboptimal particle solution to enhance competition and cooperation among particles and employs a local search strategy to precisely narrow the search domain. Simulation results show that the algorithm can significantly reduce the number of iterations and running time of the algorithm while ensuring computational accuracy, which provides a new solution to the configuration design problem of large-scale LEO SUSDA in the future.

D2D业务的快速发展对卫星天线系统的性能提出了更高的要求。由分布式卫星集群（DSC）构建的空间超稀疏分布式阵列（SUSDA）具有指向性强、灵活性高、抗干扰能力强等特点，能够更好地满足未来D2D场景下的通信需求。然而，SUSDA的非均匀排列导致了旁瓣电平（SLL）的增加和天线整体性能的下降。针对这一问题，本文首次提出了未来6G场景下支持D2D业务的近地轨道SUSDA的配置设计方法。构建了与低轨道SUSDA结构设计相关的数学模型，并基于概率模型对SUSDA辐射方向图函数性能进行了快速预测。然后，提出了一种改进的混合粒子群优化算法（EHPSO）来解决构型设计问题，克服了传统的混合粒子群优化算法在阵列规模较大时收敛速度慢的问题。EHPSO算法通过自适应调整参数来适应不同阶段的搜索需求。该算法引入单个次优粒子解来增强粒子间的竞争与合作，并采用局部搜索策略来精确缩小搜索范围。仿真结果表明，该算法在保证计算精度的同时显著减少了算法的迭代次数和运行时间，为未来大规模LEO SUSDA的构型设计问题提供了一种新的解决方案。

{"title":"Spatial Ultra-Sparse Array Formation on LEO Distributed Satellite Cluster: An Enhanced Hybrid Particle Swarm Method","authors":"Yuanzhi He;Peng Yang;Yunying Man;Changxu Wang;Chengwu Qi","doi":"10.1109/JSTSP.2025.3534428","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3534428","url":null,"abstract":"The rapid development of Direct-to-device (D2D) services has put forward higher requirements for the performance of satellite antenna systems. The Spatial Ultra-Sparse Distributed Array (SUSDA) constructed by Distributed Satellite Cluster (DSC) has the characteristics of strong directivity, high flexibility and strong anti-jamming ability, which can better meet the communication requirements in future D2D scenarios. However, the non-uniform arrangement of SUSDA leads to the increase of the side lobe level (SLL) and the decrease of the overall antenna performance. To solve this problem, this paper proposes for the first time a configuration design method for a Low Earth Orbit (LEO) SUSDA capable of supporting D2D services in future 6G scenarios. It constructs a mathematical model related to the configuration design of the LEO SUSDA and provides a rapid prediction of the performance of the SUSDA radiation pattern function based on a probabilistic model. Then, an Enhanced Hybrid Particle Swarm Optimization (EHPSO) algorithm is proposed to solve the configuration design problem, which overcomes the slow convergence problem of traditional HPSO algorithm particularly when the array scale is large. The EHPSO algorithm adapts to the search requirements of different stages by adjusting parameters adaptively. It introduces a single suboptimal particle solution to enhance competition and cooperation among particles and employs a local search strategy to precisely narrow the search domain. Simulation results show that the algorithm can significantly reduce the number of iterations and running time of the algorithm while ensuring computational accuracy, which provides a new solution to the configuration design problem of large-scale LEO SUSDA in the future.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 2","pages":"447-460"},"PeriodicalIF":8.7,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stochastic Geometry Analysis of Scalable Cell-Free RAN With Dynamic Association and Deployment 具有动态关联和部署的可扩展无小区无线局域网的随机几何分析

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-27 DOI: 10.1109/JSTSP.2025.3533897

Yunxiang Guo;Dongming Wang;Xinjiang Xia;Ziyang Zhang;Jiamin Li;Pengcheng Zhu;Xiaohu You

Cell-free radio access network (CF-RAN) breaks away from the traditional cellular network, forming a scalable wireless access network structure. Based on the conventional cell-free massive multiple input multiple output (CF-mMIMO) system, CF-RAN strategically partitions physical layer functionalities into remote radio unit (RRU), edge distributed unit (EDU) and user-centric distributed unit (UCDU), which enable the CF-mMIMO system to achieve a trade-off between complexity and performance in cooperative transmission. We use scalable full-pilot zero-forcing (FZF) combining/precoding in uplink/downlink and consider the impact of channel estimation error and pilot contamination, the closed-form expressions of uplink/downlink achievable signal-to-interference-noise ratio (SINR) of CF-RAN are given. For both uplink and downlink transmissions, we derive the closed-form achievable rate expressions when channel distribution information (CDI) or channel state information (CSI) is known in signal detection, respectively. Addressing the scalability of CF-RAN, the initial access of user equipment (UE) and dynamic RRU association scheme based on the contention mechanism, multiple RRU-EDU deployment schemes, as well as fractional uplink power control and downlink power allocation is considered. The deployment between RRU and EDU determines the performance of CF-RAN, in which we adopt random deployment, clustering deployment based on k-means algorithm, interleaving deployment based on genetic algorithm (GA), interleaving deployment based on graph coloring algorithm (GCA), respectively. Considering the spatial location randomness of UE and RRU, we model the locations of UE and RRU as two independent binomial point processes (BPP) within a limited area, and derive the expression of user rate coverage probability. Finally, the accuracy of our theoretical results is verified through Monte Carlo simulation.

无蜂窝无线接入网（CF-RAN）脱离了传统的蜂窝网络，形成了可扩展的无线接入网结构。CF-RAN基于传统的无蜂窝大规模多输入多输出（CF-mMIMO）系统，将物理层功能战略性地划分为远程无线电单元（RRU）、边缘分布单元（EDU）和以用户为中心的分布单元（UCDU），使CF-mMIMO系统在协同传输中实现了复杂性和性能之间的权衡。在考虑信道估计误差和导频污染影响的基础上，采用可扩展的全导频强制零（FZF）组合/预编码方法，给出了CF-RAN上行/下行可实现信噪比的封闭表达式。对于上行链路和下行链路传输，我们分别推导了信号检测中信道分布信息（CDI）和信道状态信息（CSI）已知时的封闭可达速率表达式。考虑了CF-RAN的可扩展性、用户设备的初始接入和基于争用机制的动态RRU关联方案、多RRU- edu部署方案以及上行功率分式控制和下行功率分式分配。RRU和EDU之间的部署决定了CF-RAN的性能，其中我们分别采用随机部署、基于k-means算法的聚类部署、基于遗传算法（GA）的交错部署、基于图着色算法（GCA）的交错部署。考虑到UE和RRU的空间位置随机性，将UE和RRU在有限区域内的位置建模为两个独立的二项点过程（BPP），推导出用户速率覆盖概率的表达式。最后，通过蒙特卡罗仿真验证了理论结果的准确性。

{"title":"Stochastic Geometry Analysis of Scalable Cell-Free RAN With Dynamic Association and Deployment","authors":"Yunxiang Guo;Dongming Wang;Xinjiang Xia;Ziyang Zhang;Jiamin Li;Pengcheng Zhu;Xiaohu You","doi":"10.1109/JSTSP.2025.3533897","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3533897","url":null,"abstract":"Cell-free radio access network (CF-RAN) breaks away from the traditional cellular network, forming a scalable wireless access network structure. Based on the conventional cell-free massive multiple input multiple output (CF-mMIMO) system, CF-RAN strategically partitions physical layer functionalities into remote radio unit (RRU), edge distributed unit (EDU) and user-centric distributed unit (UCDU), which enable the CF-mMIMO system to achieve a trade-off between complexity and performance in cooperative transmission. We use scalable full-pilot zero-forcing (FZF) combining/precoding in uplink/downlink and consider the impact of channel estimation error and pilot contamination, the closed-form expressions of uplink/downlink achievable signal-to-interference-noise ratio (SINR) of CF-RAN are given. For both uplink and downlink transmissions, we derive the closed-form achievable rate expressions when channel distribution information (CDI) or channel state information (CSI) is known in signal detection, respectively. Addressing the scalability of CF-RAN, the initial access of user equipment (UE) and dynamic RRU association scheme based on the contention mechanism, multiple RRU-EDU deployment schemes, as well as fractional uplink power control and downlink power allocation is considered. The deployment between RRU and EDU determines the performance of CF-RAN, in which we adopt random deployment, clustering deployment based on k-means algorithm, interleaving deployment based on genetic algorithm (GA), interleaving deployment based on graph coloring algorithm (GCA), respectively. Considering the spatial location randomness of UE and RRU, we model the locations of UE and RRU as two independent binomial point processes (BPP) within a limited area, and derive the expression of user rate coverage probability. Finally, the accuracy of our theoretical results is verified through Monte Carlo simulation.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 2","pages":"398-411"},"PeriodicalIF":8.7,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Federated Learning-Assisted Predictive Beamforming for Extremely Large-Scale Antenna Array Systems With Rate-Splitting Multiple Access 具有速率分裂多址的超大型天线阵列系统的联邦学习辅助预测波束形成

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-24 DOI: 10.1109/JSTSP.2025.3532040

Shengyu Zhang;Yijie Mao;Zihan Chen;Bruno Clerckx;Tony Q.S. Quek

Achieving perfect Channel State Information at the Transmitter (CSIT) is often infeasible in Extremely Large-scale Antenna Array (ELAA) systems due to user mobility and feedback/processing delay. This results in severe multi-user interference. Therefore, how to effectively and efficiently manage interference with partial/historical CSIT is one of the most important challenges for implementing ELAA. In this paper, we propose a Federated Learning (FL)-assisted predictive beamforming framework for ELAA systems to address this challenge. Specifically, we introduce Rate-Splitting Multiple Access (RSMA) to relax the sensitivity to imperfect CSIT while still benefiting from the spatial resolution. Moreover, a predictive beamforming protocol is designed to optimize the precoder design under the imperfections in the channel estimate quality originating from user mobility and latency. To calculate the beamformers, we first propose a lightweight patch-mixing approach to split the historical CSIT data samples into smaller manageable segments. Then, we propose an FL-based training method that enables parallel processing of these CSI segments, thereby accelerating the training process. Simulation results show the effectiveness and efficacy of the proposed FL-assisted predictive beamforming framework, which paves the way for real-world implementation of ELAA.

由于用户移动性和反馈/处理延迟，在超大规模天线阵列（ELAA）系统中，在发射机（CSIT）上实现完美的信道状态信息通常是不可行的。这将导致严重的多用户干扰。因此，如何有效和高效地管理部分/历史CSIT的干扰是实施ELAA的最重要挑战之一。在本文中，我们为ELAA系统提出了一个联邦学习（FL）辅助的预测波束形成框架来解决这一挑战。具体地说，我们引入了速率分割多址（RSMA）来放松对不完美CSIT的敏感性，同时仍然受益于空间分辨率。此外，针对用户移动和时延导致的信道估计质量缺陷，设计了一种预测波束形成协议，对预编码器设计进行了优化。为了计算波束形成器，我们首先提出了一种轻量级的贴片混合方法，将历史CSIT数据样本分成更小的可管理的部分。然后，我们提出了一种基于fl的训练方法，可以并行处理这些CSI片段，从而加快了训练过程。仿真结果表明了所提出的fl辅助预测波束形成框架的有效性和有效性，为ELAA的实际实现奠定了基础。

{"title":"Federated Learning-Assisted Predictive Beamforming for Extremely Large-Scale Antenna Array Systems With Rate-Splitting Multiple Access","authors":"Shengyu Zhang;Yijie Mao;Zihan Chen;Bruno Clerckx;Tony Q.S. Quek","doi":"10.1109/JSTSP.2025.3532040","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3532040","url":null,"abstract":"Achieving perfect Channel State Information at the Transmitter (CSIT) is often infeasible in Extremely Large-scale Antenna Array (ELAA) systems due to user mobility and feedback/processing delay. This results in severe multi-user interference. Therefore, how to effectively and efficiently manage interference with partial/historical CSIT is one of the most important challenges for implementing ELAA. In this paper, we propose a Federated Learning (FL)-assisted predictive beamforming framework for ELAA systems to address this challenge. Specifically, we introduce Rate-Splitting Multiple Access (RSMA) to relax the sensitivity to imperfect CSIT while still benefiting from the spatial resolution. Moreover, a predictive beamforming protocol is designed to optimize the precoder design under the imperfections in the channel estimate quality originating from user mobility and latency. To calculate the beamformers, we first propose a lightweight patch-mixing approach to split the historical CSIT data samples into smaller manageable segments. Then, we propose an FL-based training method that enables parallel processing of these CSI segments, thereby accelerating the training process. Simulation results show the effectiveness and efficacy of the proposed FL-assisted predictive beamforming framework, which paves the way for real-world implementation of ELAA.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 2","pages":"461-476"},"PeriodicalIF":8.7,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IEEE Signal Processing Society Information IEEE信号处理学会信息

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-24 DOI: 10.1109/JSTSP.2025.3526289

引用次数: 0

StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis StyleTTS：一个基于风格的生成模型，用于自然和多样化的文本到语音合成

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-24 DOI: 10.1109/JSTSP.2025.3530171

Yinghao Aaron Li;Cong Han;Nima Mesgarani

Text-to-Speech (TTS) has recently seen great progress in synthesizing high-quality speech owing to the rapid development of parallel TTS systems. Yet producing speech with naturalistic prosodic variations, speaking styles, and emotional tones remains challenging. In addition, many existing parallel TTS models often struggle with identifying optimal monotonic alignments since speech and duration generation typically occur independently. Here, we propose StyleTTS, a style-based generative model for parallel TTS that can synthesize diverse speech with natural prosody from a reference speech utterance. Using our novel Transferable Monotonic Aligner (TMA) and duration-invariant data augmentation, StyleTTS significantly outperforms other baseline models on both single and multi-speaker datasets in subjective tests of speech naturalness and synthesized speaker similarity. It also demonstrates higher robustness and emotional similarity to the reference speech as indicated by word error rate (WER) and acoustic feature correlations. Through self-supervised learning, StyleTTS can generate speech with the same emotional and prosodic tone as the reference speech without needing explicit labels for these categories. In addition, when trained with a large number of speakers, our model can perform zero-shot speaker adaption. The source code and audio samples can be found on our demo page at https://styletts.github.io/.

近年来，并行文本转语音系统（TTS）在合成高质量语音方面取得了很大进展。然而，用自然的韵律变化、说话风格和情感语调来演讲仍然具有挑战性。此外，由于语音和持续时间的生成通常是独立发生的，许多现有的并行TTS模型往往难以识别最佳单调对齐。在此，我们提出了一种基于风格的并行TTS生成模型StyleTTS，该模型可以从参考语音话语中合成具有自然韵律的多种语音。使用我们新颖的可转移单调对齐器（TMA）和持续不变数据增强，StyleTTS在语音自然度和合成说话人相似度的主观测试中，在单说话人和多说话人数据集上都明显优于其他基线模型。从单词错误率（WER）和声学特征相关性来看，它也比参考语音显示出更高的鲁棒性和情感相似性。通过自我监督学习，StyleTTS可以生成与参考语音具有相同情感语调和韵律语调的语音，而不需要对这些类别进行明确的标记。此外，当使用大量扬声器进行训练时，我们的模型可以进行零射击扬声器自适应。源代码和音频示例可以在我们的演示页面https://styletts.github.io/上找到。

{"title":"StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis","authors":"Yinghao Aaron Li;Cong Han;Nima Mesgarani","doi":"10.1109/JSTSP.2025.3530171","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3530171","url":null,"abstract":"Text-to-Speech (TTS) has recently seen great progress in synthesizing high-quality speech owing to the rapid development of parallel TTS systems. Yet producing speech with naturalistic prosodic variations, speaking styles, and emotional tones remains challenging. In addition, many existing parallel TTS models often struggle with identifying optimal monotonic alignments since speech and duration generation typically occur independently. Here, we propose StyleTTS, a style-based generative model for parallel TTS that can synthesize diverse speech with natural prosody from a reference speech utterance. Using our novel Transferable Monotonic Aligner (TMA) and duration-invariant data augmentation, StyleTTS significantly outperforms other baseline models on both single and multi-speaker datasets in subjective tests of speech naturalness and synthesized speaker similarity. It also demonstrates higher robustness and emotional similarity to the reference speech as indicated by word error rate (WER) and acoustic feature correlations. Through self-supervised learning, StyleTTS can generate speech with the same emotional and prosodic tone as the reference speech without needing explicit labels for these categories. In addition, when trained with a large number of speakers, our model can perform zero-shot speaker adaption. The source code and audio samples can be found on our demo page at <uri>https://styletts.github.io/</uri>.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 1","pages":"283-296"},"PeriodicalIF":8.7,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143512981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Editorial Introduction to the Special Issue Seeking Low-Dimensionality in Deep Neural Networks (SLowDNN) 特刊《寻求深度神经网络中的低维数》（SLowDNN）社论导言

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-24 DOI: 10.1109/JSTSP.2024.3522438

Yi Ma;Yuejie Chi;Ivan Dokmanić;Bihan Wen;John N. Wright;Zhihui Zhu

引用次数: 0

IEEE Signal Processing Society Information IEEE信号处理学会信息

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-24 DOI: 10.1109/JSTSP.2025.3526293

引用次数: 0

Joint Localization, Synchronization and Mapping via Phase-Coherent Distributed Arrays 基于相参分布式阵列的联合定位、同步和映射

IF 8.7 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Signal Processing

Pub Date : 2025-01-23 DOI: 10.1109/JSTSP.2025.3533111

Alessio Fascista;Benjamin J. B. Deutschmann;Musa Furkan Keskin;Thomas Wilding;Angelo Coluccia;Klaus Witrisal;Erik Leitinger;Gonzalo Seco-Granados;Henk Wymeersch

Extremely large-scale antenna array (ELAA) systems emerge as a promising technology in beyond 5G and 6G wireless networks to support the deployment of distributed architectures. This paper explores the use of ELAAs to enable joint localization, synchronization and mapping in sub-6 GHz uplink channels, capitalizing on the near-field effects of phase-coherent distributed arrays. We focus on a scenario where a single-antenna user equipment (UE) communicates with a network of access points (APs) distributed in an indoor environment, considering both specular reflections from walls and scattering from objects. The UE is assumed to be unsynchronized to the network, while the APs can be time- and phase-synchronized to each other. We formulate the problem of joint estimation of location, clock offset and phase offset of the UE, and the locations of scattering points (SPs) (i.e., mapping). Through comprehensive Fisher information analysis, we assess the impact of bandwidth, AP array size, wall reflections, SPs and phase synchronization on localization accuracy. Furthermore, we derive the maximum likelihood (ML) estimator for the joint localization, synchronization, and mapping problem, which optimally combines the information collected by all the distributed arrays. To overcome its intractable high dimensionality, we propose a novel three-stage algorithm that first estimates phase offset leveraging carrier phase information of line-of-sight (LoS) paths, then determines the UE location and clock offset via LoS paths and wall reflections, and finally locates SPs using a null-space transformation technique. Simulation results demonstrate the effectiveness of our approach in distributed architectures supported by radio stripes (RSs)—an innovative alternative for implementing ELAAs—while revealing the benefits of carrier phase exploitation and showcasing the interplay between delay and angular information under different bandwidth regimes.

超大规模天线阵列（ELAA）系统在5G和6G以外的无线网络中成为一项有前途的技术，以支持分布式架构的部署。本文探讨了利用elaa在sub-6 GHz上行信道中实现联合定位、同步和映射，利用相参分布式阵列的近场效应。我们专注于单天线用户设备（UE）与分布在室内环境中的接入点（ap）网络通信的场景，同时考虑到墙壁的镜面反射和物体的散射。假设终端与网络不同步，ap之间可以时间同步和相位同步。我们提出了联合估计UE的位置、时钟偏移和相位偏移以及散射点（SPs）的位置（即映射）的问题。通过全面的Fisher信息分析，我们评估了带宽、AP阵列大小、壁反射、SPs和相位同步对定位精度的影响。在此基础上，我们推导了联合定位、同步和映射问题的最大似然估计量，该估计量将所有分布式阵列收集的信息进行了最优组合。为了克服其难以处理的高维性，我们提出了一种新的三阶段算法，首先利用视距（LoS）路径的载波相位信息估计相位偏移，然后通过视距路径和壁反射确定UE位置和时钟偏移，最后使用零空间变换技术定位SPs。仿真结果证明了我们的方法在由无线电条纹（RSs）支持的分布式架构中的有效性——一种实现elaa的创新替代方案——同时揭示了载波相位利用的好处，并展示了不同带宽制度下延迟和角度信息之间的相互作用。

{"title":"Joint Localization, Synchronization and Mapping via Phase-Coherent Distributed Arrays","authors":"Alessio Fascista;Benjamin J. B. Deutschmann;Musa Furkan Keskin;Thomas Wilding;Angelo Coluccia;Klaus Witrisal;Erik Leitinger;Gonzalo Seco-Granados;Henk Wymeersch","doi":"10.1109/JSTSP.2025.3533111","DOIUrl":"https://doi.org/10.1109/JSTSP.2025.3533111","url":null,"abstract":"Extremely large-scale antenna array (ELAA) systems emerge as a promising technology in beyond 5G and 6G wireless networks to support the deployment of distributed architectures. This paper explores the use of ELAAs to enable joint localization, synchronization and mapping in sub-6 GHz uplink channels, capitalizing on the near-field effects of phase-coherent distributed arrays. We focus on a scenario where a single-antenna user equipment (UE) communicates with a network of access points (APs) distributed in an indoor environment, considering both specular reflections from walls and scattering from objects. The UE is assumed to be unsynchronized to the network, while the APs can be time- and phase-synchronized to each other. We formulate the problem of joint estimation of location, clock offset and phase offset of the UE, and the locations of scattering points (SPs) (i.e., mapping). Through comprehensive Fisher information analysis, we assess the impact of bandwidth, AP array size, wall reflections, SPs and phase synchronization on localization accuracy. Furthermore, we derive the maximum likelihood (ML) estimator for the joint localization, synchronization, and mapping problem, which optimally combines the information collected by all the distributed arrays. To overcome its intractable high dimensionality, we propose a novel three-stage algorithm that first estimates phase offset leveraging carrier phase information of line-of-sight (LoS) paths, then determines the UE location and clock offset via LoS paths and wall reflections, and finally locates SPs using a null-space transformation technique. Simulation results demonstrate the effectiveness of our approach in distributed architectures supported by radio stripes (RSs)—an innovative alternative for implementing ELAAs—while revealing the benefits of carrier phase exploitation and showcasing the interplay between delay and angular information under different bandwidth regimes.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"19 2","pages":"412-429"},"PeriodicalIF":8.7,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

IEEE Journal of Selected Topics in Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀