EURASIP Journal on Advances in Signal Processing: Latest Articles, Page 8

Variable step size VLF/ELF nonlinear channel adaptive filtering algorithm based on Sigmoid function

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-06 DOI: 10.1186/s13634-023-01102-2

Sumou Hu, Hui Xie, Danling Liu, Jie Hu

The signals received by very low-frequency/extremely low-frequency (VLF/ELF) nonlinear receivers are frequently affected by intense atmospheric pulse noise stemming from thunderstorms and global lightning activity. Current noise processing algorithms designed for nonlinear channels within these frequency ranges, which are predicated on fractional p-order moment alpha stable distribution criteria (where 0 < p < α < 2, and p and α denote distinct characteristic indices of alpha stable distribution noise), are constrained by their reliance on limited p-order moment statistics. As a result, the performance of low-frequency nonlinear channel receivers degrades significantly when confronted with strong pulse noise interference (0 < p < α < 2). To tackle this challenge, the present study introduces a novel variable step size robust mixed norm (RMN) adaptive filtering algorithm, designated SVS-RMN, which is based on the Sigmoid function. Leveraging the nonlinearity of the Sigmoid function and building upon the power function Hammerstein nonlinear channel model, the algorithm aims to enhance the RMN algorithm by deriving new cost functions and adaptive iteration formulas. The proposed algorithm is compared with conventional RMN algorithms based on fractional low-order moment (FLOM) criteria (0 < p < 2), as well as with other algorithms employing variable step sizes and either FLOM or radial basis function (RBF) criteria, across various intensities of pulse noise and mixed signal-to-noise ratios. The experimental results reveal the following: (1) The proposed algorithm effectively mitigates strong pulse noise interference and significantly enhances the tracking performance of the RMN algorithm compared to conventional RMN algorithms based on FLOM criteria. (2) In terms of computational efficiency, simplicity of structure, convergence speed, and stability, the proposed algorithm surpasses other algorithms based on FLOM or RBF criteria.
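
The abstract does not give the SVS-RMN update itself, so the following is only a minimal sketch of the general idea behind a Sigmoid-shaped variable step size: the step grows with the error magnitude but saturates, so a single large impulse cannot produce an arbitrarily large update. The function name `sigmoid_step` and the parameters `alpha` (slope) and `beta` (scale) are assumptions for illustration, not quantities taken from the paper.

```python
import math

def sigmoid_step(error, alpha=1.0, beta=0.2):
    """Hypothetical Sigmoid-shaped step size.

    Returns 0 when the error is 0 and saturates at beta / 2 as |error| grows.
    alpha and beta are illustrative tuning parameters, not values from the paper.
    """
    return beta * (1.0 / (1.0 + math.exp(-alpha * abs(error))) - 0.5)

# The step size rises with |error| but stays bounded even for impulsive errors.
for e in (0.0, 0.5, 5.0, 500.0):
    print(e, round(sigmoid_step(e), 4))
```

The bounded output is the property that matters under impulsive noise: a fixed step size would scale the update linearly with an outlier error, whereas this mapping caps it.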

Citations: 0

Multi-tier dynamic sampling weak RF signal estimation theory

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-06 DOI: 10.1186/s13634-023-01093-0

Brett Smith, Mary Lanzerotti

This paper presents a theoretical analysis in discrete time for a multi-tier weak radiofrequency (RF) signal estimation process with N simultaneous signals. Discrete time dynamic sampling is introduced and is shown to extract signal parameter values with greater accuracy than the estimates obtained in prior work. The paper thereby advances phase measurement approaches for weak signal parameter estimation. For N = 2 simultaneous signals with a strong signal at 850 MHz and a weak signal at 855 MHz, the results show that dynamically sampling the instantaneous frequency at 24 times the Nyquist rate provides weak signal frequency estimates that are within 1.7 × 10^-5 of the actual weak signal frequency and weak signal amplitude estimates that are within 428 PPM of the actual weak signal amplitude. Results are also presented for situations with N = 2 simultaneous 5G signals. In one case, the strong signal is 3950 MHz, and the weak signal is 3955 MHz; in the other case, the strong signal is 5950 MHz, and the weak signal is 5955 MHz. The results for these cases show that estimates obtained with dynamic sampling are more accurate than estimates provided using a single sample rate of 65 MSPS. This work has promising applications for weak signal parameter estimation using instantaneous frequency measurements.

Citations: 0

ABYOLOv4: improved YOLOv4 human object detection based on enhanced multi-scale feature fusion

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-04 DOI: 10.1186/s13634-023-01105-z

Rui Li, Xin Zeng, Shiqiang Yang, Qi Li, An Yan, Dexin Li

The purpose of human object detection is to obtain the number of people and their positions in images, which is one of the core problems in the field of machine vision. However, the wide variation in human scale still leads to a high missed-detection rate for small- and medium-sized human bodies, which limits detection performance. To solve this problem, this paper proposes an improved ASPP_BiFPN_YOLOv4 (ABYOLOv4) method for human object detection. In detail, an Atrous Spatial Pyramid Pooling (ASPP) module replaces the original Spatial Pyramid Pooling module to enlarge the receptive field of the network and improve its perception of multi-scale targets. Then, the original Path Aggregation Network (PANet) multi-scale fusion module is replaced by a self-built bi-layer bidirectional feature pyramid network (Bi-FPN). Meanwhile, a new feature is imported into the proposed model to reuse the mid- and low-level features, which enhances the ability of the network to express the characteristics of small- and medium-sized targets. Finally, the standard convolution in Bi-FPN is replaced by depth-separable convolution so that the network balances accuracy against the number of parameters. To evaluate the proposed ABYOLOv4 model, human object detection experiments were carried out on the public VOC2007 and VOC2012 data sets. The AP of the improved YOLOv4 algorithm is 0.5% higher than that of the original algorithm, and the weight file size of the model is reduced by 45.3 M. The experimental results demonstrate that the proposed ABYOLOv4 network has higher accuracy and lower computational cost for human target detection.
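
The parameter saving from swapping a standard convolution for a depth-separable one can be checked with simple arithmetic. The sketch below is generic and ignores bias terms; the kernel size and channel counts are hypothetical values chosen for illustration and are not taken from the ABYOLOv4 architecture.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a k x k standard convolution (bias ignored)."""
    return k * k * c_in * c_out

def depthwise_separable_params(k, c_in, c_out):
    """Weights in a k x k depthwise convolution followed by a 1 x 1 pointwise one."""
    return k * k * c_in + c_in * c_out

# Hypothetical layer: 3 x 3 kernel, 256 input and 256 output channels.
k, c_in, c_out = 3, 256, 256
std = standard_conv_params(k, c_in, c_out)
sep = depthwise_separable_params(k, c_in, c_out)
print(std, sep, round(sep / std, 3))
```

For a 3 x 3 kernel the ratio is roughly 1/9 + 1/c_out, which is why the substitution shrinks the weight file while leaving the layer's input and output shapes unchanged.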

Citations: 0

Blind CFO estimation based on weighted subspace fitting criterion with fuzzy adaptive gravitational search algorithm

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-03 DOI: 10.1186/s13634-023-01091-2

Chih-Chang Shen, Ming-Hua Zhang

This paper deals with blind carrier frequency offset (CFO) estimation based on the weighted subspace fitting (WSF) criterion with a fuzzy adaptive gravitational search algorithm (GSA) for the interleaved orthogonal frequency-division multiplexing access (OFDMA) uplink system. For the CFO estimation problem, it is well known that the WSF has superior statistical characteristics and better estimation performance. However, this type of CFO estimation requires a search over a high-dimensional space. Optimizing complex nonlinear multimodal functions requires a large computational load, which makes it difficult to maximize or minimize nonlinear cost functions in large parameter spaces. This paper first presents swarm intelligence (SI) optimization algorithms such as GSA, particle swarm optimization (PSO), and hybrid PSO and GSA (PSOGSA) to improve estimation accuracy and reduce the computational load of the search. It also integrates a fuzzy inference system into WSF-GSA to dynamically adjust the gravitational constant, which not only reduces the computational load of the search but also improves the global optimization performance and solution accuracy of GSA. Finally, several simulation results are provided to illustrate the effectiveness of the proposed estimator.

Citations: 0

A multi-task learning speech synthesis optimization method based on CWT: a case study of Tacotron2

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-02 DOI: 10.1186/s13634-023-01096-x

Guoqiang Hu, Zhuofan Ruan, Wenqiu Guo, Yujuan Quan

Text-to-speech synthesis plays an essential role in facilitating human-computer interaction. Currently, the predominant approach in text-to-speech acoustic models selects only the Mel spectrum as an intermediate feature for converting text to speech. However, the resulting Mel spectrograms may be ambiguous in some respects, owing to the limited capability of the Fourier transform to capture abrupt signal changes when the Mel spectrograms are computed. With the aim of improving the clarity of synthesized speech, this study proposes a multi-task learning optimization method and conducts experiments on the Tacotron2 speech synthesis system to demonstrate the effectiveness of the proposed method. The method introduces an additional task: wavelet spectrograms. The continuous wavelet transform has gained significant popularity in various applications, including speech enhancement and speech recognition, primarily because it can adaptively vary the time-frequency resolution and performs well in capturing non-stationary signals. Through theoretical and experimental analysis, this study shows that the clarity of Tacotron2 synthesized speech can be improved by introducing the wavelet spectrogram as an auxiliary task: a feature extraction network is added, and wavelet spectrogram features are extracted from the Mel spectrum output generated by the decoder. Experimental findings indicate that the Mean Opinion Score of the speech synthesized by the model using multi-task learning is 0.17 higher than that of the baseline model. Furthermore, by analyzing the factors contributing to the success of the continuous wavelet transform-based multi-task learning method in the Tacotron2 model, as well as the effectiveness of multi-task learning, the study conjectures that the proposed method has the potential to enhance the performance of other acoustic models.

Citations: 0

An experimental study of neural estimators of the mutual information between random vectors modeling power spectrum features

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-02 DOI: 10.1186/s13634-023-01092-1

Donghoon Shin, Hyung Soon Kim

Mutual information (MI) quantifies the statistical dependency between a pair of random variables and plays a central role in signal processing and data analysis. Recent advances in machine learning have enabled the estimation of MI from a dataset using the expressive power of neural networks. In this study, we conducted a comparative experimental analysis of several existing neural estimators of MI between random vectors that model power spectrum features. We explored alternative models of power spectrum features by leveraging information-theoretic data processing inequality and bijective transformations. Empirical results demonstrated that each neural estimator of MI covered in this study has its limitations. In practical applications, we recommend the collective use of existing neural estimators in a complementary manner for the problem of estimating MI between power spectrum features.
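
As a small worked illustration of the data processing inequality mentioned above, the sketch below uses the closed-form MI of jointly Gaussian scalars, I = -0.5 ln(1 - rho^2) nats, which is a standard result. The setup (a unit-variance X observed through one and then two stages of independent additive Gaussian noise) and the noise variances are assumptions made for this example; they are not the power spectrum models studied in the paper.

```python
import math

def gaussian_mi(rho):
    """MI in nats between two jointly Gaussian scalars with correlation rho."""
    return -0.5 * math.log(1.0 - rho * rho)

def corr_after_noise(noise_var):
    """Correlation between unit-variance X and X + N, with Var(N) = noise_var."""
    return 1.0 / math.sqrt(1.0 + noise_var)

# Markov chain X -> Y -> Z with Y = X + N1 and Z = Y + N2 (hypothetical variances 0.5 each).
mi_xy = gaussian_mi(corr_after_noise(0.5))
mi_xz = gaussian_mi(corr_after_noise(0.5 + 0.5))
print(round(mi_xy, 4), round(mi_xz, 4))
```

Further processing of Y can only lose information about X, so `mi_xz` cannot exceed `mi_xy`. Closed-form cases like this are a common way to obtain a ground truth against which a neural MI estimator can be checked.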

Citations: 0

Classification of drought severity in contiguous USA during the past 21 years using fractal geometry

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-02 DOI: 10.1186/s13634-023-01094-z

Sepideh Azizi, Tahmineh Azizi

Drought is characterized by a moisture deficit that can adversely impact the environment, economy, and society. In North America, like many regions worldwide, predicting the timing of drought events is challenging. However, our novel study in climate research explores whether the Drought Monitor database exhibits fractal characteristics, represented by a single scaling exponent. This database categorizes drought areas by intensity, ranging from D0 (abnormally dry) to D4 (exceptional drought). Through vibration analysis using power spectral densities (PSD), we investigate the presence of power-law scaling in various statistical moments across different scales within the database. Our multi-fractal analysis estimates the multi-fractal spectrum for each category, and the Higuchi algorithm assesses the fractal complexity, revealing that D4 follows a multi-fractal pattern with a wide range of exponents, while D0 to D3 exhibit a mono-fractal nature with a narrower range of exponents.
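
For readers unfamiliar with the Higuchi algorithm named above, here is a minimal textbook-style sketch in pure Python. It is not the authors' code: the function name `higuchi_fd`, the default `k_max`, and the two synthetic test series are assumptions for illustration, and the series must be non-constant so that the curve lengths are positive.

```python
import math
import random

def higuchi_fd(x, k_max=8):
    """Estimate the Higuchi fractal dimension of a 1-D sequence x."""
    n = len(x)
    log_inv_k, log_len = [], []
    for k in range(1, k_max + 1):
        lengths = []
        for m in range(k):                      # k sub-series with offsets 0..k-1
            steps = (n - 1 - m) // k            # increments available at this offset
            if steps < 1:
                continue
            dist = sum(abs(x[m + i * k] - x[m + (i - 1) * k])
                       for i in range(1, steps + 1))
            # Higuchi normalisation of the curve length for delay k
            lengths.append(dist * (n - 1) / (steps * k) / k)
        log_inv_k.append(math.log(1.0 / k))
        log_len.append(math.log(sum(lengths) / len(lengths)))
    # Least-squares slope of log L(k) against log(1/k)
    mx = sum(log_inv_k) / len(log_inv_k)
    my = sum(log_len) / len(log_len)
    num = sum((a - mx) * (b - my) for a, b in zip(log_inv_k, log_len))
    den = sum((a - mx) ** 2 for a in log_inv_k)
    return num / den

rng = random.Random(0)
line = [float(i) for i in range(1000)]               # smooth curve
noise = [rng.gauss(0.0, 1.0) for _ in range(2000)]   # white noise
print(round(higuchi_fd(line), 3))
print(round(higuchi_fd(noise), 3))
```

A straight line gives a dimension of 1 and white noise gives a value close to 2, the two ends of the range for a one-dimensional series, so a real drought-area series would be expected to fall between them.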

Citations: 0

Neighbor-based joint spatial division and multiplexing in massive MIMO: user scheduling and dynamic beam allocation

IF 1.9, Zone 4 (Engineering & Technology), Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-02 DOI: 10.1186/s13634-023-01099-8

Huibin Liang, Chen Liu, Yunchao Song, Tianbao Gao, Yulong Zou

Two-stage precoding schemes have been developed to reduce the channel estimation overhead in FDD systems. By integrating user scheduling into these schemes, it becomes possible to meet the quality-of-service requirements of high-density wireless communication systems, despite the limitations on spatial resources and transmit power budget. In this paper, we propose a user scheduling and dynamic beam allocation method for neighbor-based joint spatial division multiplexing (N-JSDM) transmission. The user scheduling problem is formulated as a 0–1 quadratic programming problem to maximize effective spectral efficiency (ESE) using directional channel properties. To address the complexity issue, convex relaxation and linearization methods are employed to transform the problem into a 0–1 mixed integer linear programming, and a dimensionality reduction method is introduced. The proposed user scheduling-aided N-JSDM scheme reduces downlink training length and feedback of channel state information. Additionally, a dynamic configuration form is used for pre-beamforming matrix design based on user distribution, outperforming conventional approaches. Simulation results demonstrate higher ESE achieved by the proposed scheme compared to previous methods.
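
The abstract does not say which linearization is used to turn the 0–1 quadratic program into a mixed integer linear one. One standard device, shown here purely as an illustration, replaces each product x_i * x_j of binary variables by a new variable y with three linear constraints: y <= x_i, y <= x_j, and y >= x_i + x_j - 1. The helper name `feasible_y` is hypothetical; the sketch enumerates all binary cases to confirm that the constraints leave exactly one feasible y, equal to the product.

```python
from itertools import product

def feasible_y(xi, xj):
    """Binary values of y allowed by the three linearization constraints."""
    return [y for y in (0, 1)
            if y <= xi and y <= xj and y >= xi + xj - 1]

# For every binary pair, the only feasible y equals the product xi * xj.
for xi, xj in product((0, 1), repeat=2):
    print(xi, xj, feasible_y(xi, xj))
```

Because the constraints pin y to the product, the quadratic objective can be rewritten as a linear function of the y variables, at the cost of one extra variable and three constraints per product term.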

Citations: 0

Suicide Risk, Self-Injury, and Sleep: An Exploration of the Associations in a Sample of Juvenile Justice Involved Adolescents.

IF 1.9 Zone 4 Engineering Technology Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2024-01-01 Epub Date: 2022-03-30 DOI: 10.1080/24732850.2022.2057268

Selby M Conrad, Margaret Webb, Katelyn Affleck, Erik Hood, Kathleen Kemp

Court-involved youth living in the community represent a vulnerable, yet understudied, group that is at risk for a variety of concerning outcomes, including increased suicidal ideation, suicide attempts, and non-suicidal self-injury (NSSI). Additionally, sleep disruption, which has been associated with an increase in impulsive decision making, appears to be disproportionately high in this population. However, little is known about any connection between poor sleep and increased suicide risk and NSSI in this group of youth. This study explores the associations between sleep disruption, suicidal ideation, suicide attempts, and NSSI in a sample of court-involved youth in the community referred for mental health evaluation at a court-based mental health clinic. Findings suggest that sleep disruption is related to NSSI in this population but not to suicidal ideation or suicide attempts. Additional relationships were found between NSSI and being female, as well as having a lifetime history of trauma and marijuana use. Findings suggest that court clinics may wish to screen for sleep disruption as a risk factor for NSSI, and future studies may wish to explore improved sleep as a protective factor for CINI youth.



Citations: 0

RF fingerprint extraction and device recognition algorithm based on multi-scale fractal features and APWOA-LSSVM

IF 1.9 Zone 4 Engineering Technology Q2 Engineering

EURASIP Journal on Advances in Signal Processing

Pub Date : 2023-12-21 DOI: 10.1186/s13634-023-01098-9

Wenjiang Feng, Yuan Li, Chongchong Wu, Juntao Zhang

Citations: 0