首页 > 最新文献

Digital Signal Processing最新文献

英文 中文
Enhancing Hopfield network performance for pattern retrieval using sparse recovery algorithm and Parzen estimator 利用稀疏恢复算法和 Parzen 估计器提高模式检索的 Hopfield 网络性能
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-10 DOI: 10.1016/j.dsp.2024.104814
Djordje Stanković , Andjela Draganić , Cornel Ioana , Irena Orović
An improved pattern recovery approach that integrates the Hopfield neural network (HNN) with the iterative signal reconstruction and Parzen window-based classification is proposed. The HNN is observed as a form of associative memory network, used for various pattern recognition and optimization tasks. However, when the input pattern is highly damaged with a very limited set of available samples, the Hopfield network fails to perform the retrieval. The convex optimization-based gradient descent algorithm is considered for pattern recovery of damaged inputs in order to provide an improved pattern approximation for further processing within the HNN, enabling successful network performance. Additionally, in the case of grayscale images, the Parzen window approach is used to classify the probability density functions (pdfs) of the training set and to choose those being comparable to the pdf of the input pattern, therefore refining the selection of patterns and providing better convergence to the exact retrieval. The theoretical considerations are verified experimentally, showing the high performance of the proposed approach when only 10 % of the pixels are available for binary patterns and 40 % of pixels for grayscale patterns.
本文提出了一种改进的模式恢复方法,它将 Hopfield 神经网络(HNN)与迭代信号重建和基于 Parzen 窗口的分类相结合。据观察,HNN 是一种联想记忆网络,用于各种模式识别和优化任务。然而,当输入模式受到严重破坏且可用样本非常有限时,Hopfield 网络就无法进行检索。我们考虑采用基于凸优化的梯度下降算法来恢复受损输入的模式,以便为 HNN 的进一步处理提供改进的模式近似值,从而成功实现网络性能。此外,在灰度图像的情况下,使用 Parzen 窗口方法对训练集的概率密度函数(pdf)进行分类,并选择那些与输入模式的 pdf 具有可比性的概率密度函数,从而改进模式的选择,更好地收敛到精确检索。实验验证了理论考虑因素,表明当二进制模式只有 10% 的像素可用,灰度模式只有 40% 的像素可用时,所建议的方法具有很高的性能。
{"title":"Enhancing Hopfield network performance for pattern retrieval using sparse recovery algorithm and Parzen estimator","authors":"Djordje Stanković ,&nbsp;Andjela Draganić ,&nbsp;Cornel Ioana ,&nbsp;Irena Orović","doi":"10.1016/j.dsp.2024.104814","DOIUrl":"10.1016/j.dsp.2024.104814","url":null,"abstract":"<div><div>An improved pattern recovery approach that integrates the Hopfield neural network (HNN) with the iterative signal reconstruction and Parzen window-based classification is proposed. The HNN is observed as a form of associative memory network, used for various pattern recognition and optimization tasks. However, when the input pattern is highly damaged with a very limited set of available samples, the Hopfield network fails to perform the retrieval. The convex optimization-based gradient descent algorithm is considered for pattern recovery of damaged inputs in order to provide an improved pattern approximation for further processing within the HNN, enabling successful network performance. Additionally, in the case of grayscale images, the Parzen window approach is used to classify the probability density functions (pdfs) of the training set and to choose those being comparable to the pdf of the input pattern, therefore refining the selection of patterns and providing better convergence to the exact retrieval. The theoretical considerations are verified experimentally, showing the high performance of the proposed approach when only 10 % of the pixels are available for binary patterns and 40 % of pixels for grayscale patterns.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104814"},"PeriodicalIF":2.9,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142446376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IRS aided visible light positioning with a single LED transmitter 使用单个 LED 发射器进行 IRS 辅助可见光定位
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-10 DOI: 10.1016/j.dsp.2024.104799
Efe Tarhan , Furkan Kokdogan , Sinan Gezici
We propose a visible light positioning (VLP) system with a single light emitting diode (LED) transmitter and an intelligent reflecting surface (IRS) for estimating the position of a receiver equipped with a single photo-detector. By performing a number of transmissions from the LED transmitter and optimizing the orientation vectors of the IRS elements for each transmission, position information is extracted by the receiver based on power measurements of the signals reflecting from the IRS. The theoretical limit and the maximum likelihood (ML) estimator are presented for the proposed setting. In addition, an algorithm, named IRS focusing, is proposed for determining the orientations of the IRS elements during the localization process. The effectiveness of the proposed localization approach is demonstrated through simulations. Furthermore, extensions are provided to apply the proposed approach in the presence of partial prior information about the receiver position and when the IRS is located at the LED transmitter.
我们提出了一种可见光定位(VLP)系统,该系统由一个发光二极管(LED)发射器和一个智能反射面(IRS)组成,用于估计装有单个光电探测器的接收器的位置。通过 LED 发射器进行多次发射,并优化每次发射的 IRS 元件方向向量,接收器可根据 IRS 反射信号的功率测量值提取位置信息。针对提议的设置,提出了理论极限和最大似然 (ML) 估计器。此外,还提出了一种名为 IRS 聚焦的算法,用于在定位过程中确定 IRS 元件的方向。通过模拟演示了所提出的定位方法的有效性。此外,还提供了扩展功能,以便在接收器位置存在部分先验信息以及 IRS 位于 LED 发射器时应用所提出的方法。
{"title":"IRS aided visible light positioning with a single LED transmitter","authors":"Efe Tarhan ,&nbsp;Furkan Kokdogan ,&nbsp;Sinan Gezici","doi":"10.1016/j.dsp.2024.104799","DOIUrl":"10.1016/j.dsp.2024.104799","url":null,"abstract":"<div><div>We propose a visible light positioning (VLP) system with a single light emitting diode (LED) transmitter and an intelligent reflecting surface (IRS) for estimating the position of a receiver equipped with a single photo-detector. By performing a number of transmissions from the LED transmitter and optimizing the orientation vectors of the IRS elements for each transmission, position information is extracted by the receiver based on power measurements of the signals reflecting from the IRS. The theoretical limit and the maximum likelihood (ML) estimator are presented for the proposed setting. In addition, an algorithm, named IRS focusing, is proposed for determining the orientations of the IRS elements during the localization process. The effectiveness of the proposed localization approach is demonstrated through simulations. Furthermore, extensions are provided to apply the proposed approach in the presence of partial prior information about the receiver position and when the IRS is located at the LED transmitter.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104799"},"PeriodicalIF":2.9,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142433291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drift suppression method based on signal stability detection and adaptive Kalman filter for NMR sensor 基于信号稳定性检测和自适应卡尔曼滤波器的核磁共振传感器漂移抑制方法
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-09 DOI: 10.1016/j.dsp.2024.104812
Qipeng Wang , Zhanchao Liu , Zekun Wu , Jingsong Wang , Chunyu Qu , Jianli Li
The small volume, high precision, and low cost of Nuclear Magnetic Resonance (NMR) sensors make them one of the best choices for future miniaturized and chip-scale Inertial Navigation System (INS). Due to technical and process limitations, NMR sensors inevitably exhibit random drift. To suppress these errors, a drift suppression method based on Signal Stability Detection and Adaptive Kalman Filter (SSD-AKF) for NMR sensors is proposed. Firstly, a state space model for the Kalman filter is established based on an Auto Regressive Moving Average (ARMA) sequence model. Secondly, to address the issue of reduced filtering accuracy caused by unstable signal noise in innovation-based AKF, an adaptive filtering method aided by a signal stability detection is proposed. The proposed method utilizes the standard deviation of prior information to assess the stability of the signal. Based on this assessment, the adaptive filter adjusts the gain matrix, ultimately enhancing the stability of the filter. The dynamic experimental results show that the proposed method can effectively improve filter performance and reduce sensor drift.
核磁共振(NMR)传感器体积小、精度高、成本低,是未来微型化和芯片级惯性导航系统(INS)的最佳选择之一。由于技术和工艺的限制,核磁共振传感器不可避免地会出现随机漂移。为了抑制这些误差,本文提出了一种基于信号稳定性检测和自适应卡尔曼滤波器(SSD-AKF)的核磁共振传感器漂移抑制方法。首先,基于自回归移动平均(ARMA)序列模型建立了卡尔曼滤波器的状态空间模型。其次,针对基于创新的 AKF 中不稳定信号噪声导致滤波精度降低的问题,提出了一种以信号稳定性检测为辅助的自适应滤波方法。该方法利用先验信息的标准偏差来评估信号的稳定性。在此基础上,自适应滤波器调整增益矩阵,最终增强滤波器的稳定性。动态实验结果表明,所提出的方法能有效提高滤波器性能,减少传感器漂移。
{"title":"Drift suppression method based on signal stability detection and adaptive Kalman filter for NMR sensor","authors":"Qipeng Wang ,&nbsp;Zhanchao Liu ,&nbsp;Zekun Wu ,&nbsp;Jingsong Wang ,&nbsp;Chunyu Qu ,&nbsp;Jianli Li","doi":"10.1016/j.dsp.2024.104812","DOIUrl":"10.1016/j.dsp.2024.104812","url":null,"abstract":"<div><div>The small volume, high precision, and low cost of Nuclear Magnetic Resonance (NMR) sensors make them one of the best choices for future miniaturized and chip-scale Inertial Navigation System (INS). Due to technical and process limitations, NMR sensors inevitably exhibit random drift. To suppress these errors, a drift suppression method based on Signal Stability Detection and Adaptive Kalman Filter (SSD-AKF) for NMR sensors is proposed. Firstly, a state space model for the Kalman filter is established based on an Auto Regressive Moving Average (ARMA) sequence model. Secondly, to address the issue of reduced filtering accuracy caused by unstable signal noise in innovation-based AKF, an adaptive filtering method aided by a signal stability detection is proposed. The proposed method utilizes the standard deviation of prior information to assess the stability of the signal. Based on this assessment, the adaptive filter adjusts the gain matrix, ultimately enhancing the stability of the filter. The dynamic experimental results show that the proposed method can effectively improve filter performance and reduce sensor drift.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104812"},"PeriodicalIF":2.9,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142427691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Error Performance of DF Cooperative SMTs with I/Q Imbalance over Beckmann Fading Channels 贝克曼衰减信道上具有 I/Q 不平衡的 DF 合作 SMT 的误差性能
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-09 DOI: 10.1016/j.dsp.2024.104804
Ayse Elif Canbilen , Ibrahim Develi , Seyfettin Sinan Gültekin
The direct down-conversion principle, which has generally been used in the design of multiple-input multiple-output (MIMO) schemes, including space modulation techniques (SMTs), is attractive to researchers because of its low cost, low power consumption with fewer components, flexible and simple structure. However, hardware imperfections such as in-phase (I) and quadrature-phase (Q) imbalance (IQI) negatively affect the performance of the systems with direct down-conversion in practice. On the other hand, cooperative communication is a promising technology that can be utilized in the design of future wireless networks due to its significant advantages such as increasing system reliability, extending network coverage, reducing channel degradation, and providing high quality of service. In this study, SMT-based methods are integrated into cooperative systems, and a flexible and comprehensive model is presented that is applicable to many channel structures. Specifically, the error performance analysis of space shift keying (SSK), spatial modulation (SM), and quadrature SM (QSM) systems in the presence of IQI in decode-and-forward (DF) cooperative communication is carried out by analytical derivations and computer simulations over generalized Beckmann fading channels. The obtained results show that the performance of SMT-based DF cooperative systems is superior to the conventional schemes, and the effects of receiver IQI can be eliminated by optimal detector designs.
直接下变频原理通常用于多输入多输出(MIMO)方案(包括空间调制技术(SMT))的设计,因其成本低、功耗低、元件少、结构灵活简单而备受研究人员青睐。然而,在实际应用中,同相(I)和正交相(Q)不平衡(IQI)等硬件缺陷会对直接下变频系统的性能产生负面影响。另一方面,合作通信是一种前景广阔的技术,可用于未来无线网络的设计,因为它具有显著的优势,如提高系统可靠性、扩大网络覆盖范围、减少信道劣化和提供高质量服务。在本研究中,基于 SMT 的方法被集成到了合作系统中,并提出了一个适用于多种信道结构的灵活而全面的模型。具体而言,通过分析推导和计算机仿真,在广义贝克曼衰落信道上对存在 IQI 的合作通信中的空间移调(SSK)、空间调制(SM)和正交 SM(QSM)系统进行了误差性能分析。结果表明,基于 SMT 的 DF 合作系统的性能优于传统方案,而且接收器 IQI 的影响可以通过优化检测器设计来消除。
{"title":"Error Performance of DF Cooperative SMTs with I/Q Imbalance over Beckmann Fading Channels","authors":"Ayse Elif Canbilen ,&nbsp;Ibrahim Develi ,&nbsp;Seyfettin Sinan Gültekin","doi":"10.1016/j.dsp.2024.104804","DOIUrl":"10.1016/j.dsp.2024.104804","url":null,"abstract":"<div><div>The direct down-conversion principle, which has generally been used in the design of multiple-input multiple-output (MIMO) schemes, including space modulation techniques (SMTs), is attractive to researchers because of its low cost, low power consumption with fewer components, flexible and simple structure. However, hardware imperfections such as in-phase (I) and quadrature-phase (Q) imbalance (IQI) negatively affect the performance of the systems with direct down-conversion in practice. On the other hand, cooperative communication is a promising technology that can be utilized in the design of future wireless networks due to its significant advantages such as increasing system reliability, extending network coverage, reducing channel degradation, and providing high quality of service. In this study, SMT-based methods are integrated into cooperative systems, and a flexible and comprehensive model is presented that is applicable to many channel structures. Specifically, the error performance analysis of space shift keying (SSK), spatial modulation (SM), and quadrature SM (QSM) systems in the presence of IQI in decode-and-forward (DF) cooperative communication is carried out by analytical derivations and computer simulations over generalized Beckmann fading channels. The obtained results show that the performance of SMT-based DF cooperative systems is superior to the conventional schemes, and the effects of receiver IQI can be eliminated by optimal detector designs.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104804"},"PeriodicalIF":2.9,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142427695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A robust double-channel affine projection sign algorithm for blind acoustic interferences cancellation 用于消除盲声干扰的稳健双通道仿射投影符号算法
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-06 DOI: 10.1016/j.dsp.2024.104795
Boumegouas Rahil, Djendi Mohamed
This brief proposes a double-channel direct affine projection sign (DC-DAPS) algorithm for blind speech enhancement applications, especially in the presence of strong acoustic noise, and intensive non-Gaussian impulsive interferences. We also derive a full analysis of the stability of the proposed DC-DAPS algorithm. The performance evaluation of the proposed algorithm confirms its accuracy even under different noisy scenarios, and demonstrates that our approach outperforms the double-channel direct affine projection (DC-DAP), and the double-channel direct normalized least mean square (DC-DNLMS) algorithms. Simulation results showed that the proposed DC-DAPS algorithm could effectively achieve improved performance in combating acoustic noise and impulsive interferences, and speeding up the convergence rate even with highly correlated input signals such as the speech signal.
本摘要提出了一种双通道直接仿射投影符号(DC-DAPS)算法,用于盲语音增强应用,尤其是在存在强声学噪声和密集非高斯脉冲干扰的情况下。我们还对所提出的 DC-DAPS 算法的稳定性进行了全面分析。对所提算法的性能评估证实了它在不同噪声场景下的准确性,并证明我们的方法优于双通道直接仿射投影(DC-DAP)和双通道直接归一化最小均方(DC-DNLMS)算法。仿真结果表明,所提出的 DC-DAPS 算法能有效提高对抗声学噪声和脉冲干扰的性能,即使在输入信号(如语音信号)高度相关的情况下也能加快收敛速度。
{"title":"A robust double-channel affine projection sign algorithm for blind acoustic interferences cancellation","authors":"Boumegouas Rahil,&nbsp;Djendi Mohamed","doi":"10.1016/j.dsp.2024.104795","DOIUrl":"10.1016/j.dsp.2024.104795","url":null,"abstract":"<div><div>This brief proposes a double-channel direct affine projection sign (DC-DAPS) algorithm for blind speech enhancement applications, especially in the presence of strong acoustic noise, and intensive non-Gaussian impulsive interferences. We also derive a full analysis of the stability of the proposed DC-DAPS algorithm. The performance evaluation of the proposed algorithm confirms its accuracy even under different noisy scenarios, and demonstrates that our approach outperforms the double-channel direct affine projection (DC-DAP), and the double-channel direct normalized least mean square (DC-DNLMS) algorithms. Simulation results showed that the proposed DC-DAPS algorithm could effectively achieve improved performance in combating acoustic noise and impulsive interferences, and speeding up the convergence rate even with highly correlated input signals such as the speech signal.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104795"},"PeriodicalIF":2.9,"publicationDate":"2024-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142531112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative analysis of speaker identification performance using deep learning, machine learning, and novel subspace classifiers with multiple feature extraction techniques 使用深度学习、机器学习和新型子空间分类器以及多种特征提取技术的扬声器识别性能对比分析
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-05 DOI: 10.1016/j.dsp.2024.104811
Serkan Keser , Esra Gezer
Speaker identification is vital in various application domains, such as automation, security, and enhancing user experience. In the literature, convolutional neural network (CNN) or recurrent neural network (RNN) classifiers are generally used due to the one-dimensional time series of speech signals. However, new approaches using subspace classifiers are also crucial in speaker identification. In this study, in addition to the newly developed subspace classifiers for speaker identification, traditional classification algorithms, and various hybrid algorithms are analyzed in terms of performance. Stacked Features-Common Vector Approach (SF-CVA) and Hybrid CVA-Fisher Linear Discriminant Analysis (HCF) subspace classifiers are used for speaker identification for the first time in the literature. In addition, CVA is evaluated for the first time for speaker identification using hybrid deep learning algorithms. The study includes Recurrent Neural Network-Long Short-Term Memory (RNN-LSTM), i-vector + Probabilistic Linear Discriminant Analysis (i-vector+PLDA), Time Delayed Neural Network (TDNN), AutoEncoder+Softmax (AE+Softmax), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Common Vector Approach (CVA), SF-CVA, HCF, and Alexnet classifiers for speaker identification. This study uses MNIST, TIMIT and Voxceleb1 databases for clean and noisy speech signals. Six different feature structures are tested in the study. The six different feature extraction approaches consist of Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Filter Bank Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+seven spectral features, spectrograms,i-vectors, and Alexnet feature vectors. High accuracy rates were obtained, especially in tests using SF-CVA. RNN-LSTM, i-vector+KNN, AE+Softmax, TDNN, and i-vector+HCF classifiers also gave high test accuracy rates.
在自动化、安全和提升用户体验等多个应用领域,说话人识别都至关重要。由于语音信号是一维时间序列,因此文献中通常使用卷积神经网络(CNN)或递归神经网络(RNN)分类器。然而,使用子空间分类器的新方法对于扬声器识别也至关重要。在本研究中,除了新开发的用于扬声器识别的子空间分类器外,还分析了传统分类算法和各种混合算法的性能。堆叠特征-通用向量法(SF-CVA)和混合 CVA-费舍线性判别分析(HCF)子空间分类器在文献中首次用于说话人识别。此外,还首次评估了使用混合深度学习算法进行说话人识别的 CVA。该研究包括用于识别说话人的递归神经网络-长短期记忆(RNN-LSTM)、i-vector + 概率线性判别分析(i-vector+PLDA)、延时神经网络(TDNN)、自动编码器+软最大值(AE+Softmax)、K-近邻(KNN)、支持向量机(SVM)、共向量方法(CVA)、SF-CVA、HCF 和 Alexnet 分类器。本研究使用 MNIST、TIMIT 和 Voxceleb1 数据库来处理干净和有噪声的语音信号。研究中测试了六种不同的特征结构。这六种不同的特征提取方法包括 Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Filter Bank Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+seven spectral features, spectrograms, i-vectors, 和 Alexnet feature vectors。特别是在使用 SF-CVA 的测试中,获得了很高的准确率。RNN-LSTM、i-vector+KNN、AE+Softmax、TDNN 和 i-vector+HCF分类器的测试准确率也很高。
{"title":"Comparative analysis of speaker identification performance using deep learning, machine learning, and novel subspace classifiers with multiple feature extraction techniques","authors":"Serkan Keser ,&nbsp;Esra Gezer","doi":"10.1016/j.dsp.2024.104811","DOIUrl":"10.1016/j.dsp.2024.104811","url":null,"abstract":"<div><div>Speaker identification is vital in various application domains, such as automation, security, and enhancing user experience. In the literature, convolutional neural network (CNN) or recurrent neural network (RNN) classifiers are generally used due to the one-dimensional time series of speech signals. However, new approaches using subspace classifiers are also crucial in speaker identification. In this study, in addition to the newly developed subspace classifiers for speaker identification, traditional classification algorithms, and various hybrid algorithms are analyzed in terms of performance. Stacked Features-Common Vector Approach (SF-CVA) and Hybrid CVA-Fisher Linear Discriminant Analysis (HCF) subspace classifiers are used for speaker identification for the first time in the literature. In addition, CVA is evaluated for the first time for speaker identification using hybrid deep learning algorithms. The study includes Recurrent Neural Network-Long Short-Term Memory (RNN-LSTM), i-vector + Probabilistic Linear Discriminant Analysis (i-vector+PLDA), Time Delayed Neural Network (TDNN), AutoEncoder+Softmax (AE+Softmax), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Common Vector Approach (CVA), SF-CVA, HCF, and Alexnet classifiers for speaker identification. This study uses MNIST, TIMIT and Voxceleb1 databases for clean and noisy speech signals. Six different feature structures are tested in the study. The six different feature extraction approaches consist of Mel Frequency Cepstral Coefficients (MFCC)+Pitch, Gammatone Filter Bank Cepstral Coefficients (GTCC)+Pitch, MFCC+GTCC+Pitch+seven spectral features, spectrograms,i-vectors, and Alexnet feature vectors. High accuracy rates were obtained, especially in tests using SF-CVA. RNN-LSTM, i-vector+KNN, AE+Softmax, TDNN, and i-vector+HCF classifiers also gave high test accuracy rates.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104811"},"PeriodicalIF":2.9,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142428065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel differential chaos shift keying system based on joint time slot and code index of carrier phase modulation 基于载波相位调制联合时隙和编码指数的新型差分混沌移调系统
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-05 DOI: 10.1016/j.dsp.2024.104803
Lifang He , Xinyu Xiong , Chaofan Li
This paper proposes a novel differential chaos shift keying (DCSK) system that utilizes carrier phase modulation in combination with time slot and code index modulation, referred to as CP-TCIM-DCSK system, to achieve high data rate transmission. In the proposed CP-TCIM-DCSK system, the carrier modulated by the carrier phase is used to transmit the information-bearing signal. In addition to physically transmitted information bits, extra information bits are conveyed through time slot and Walsh code modulation, as well as carrier phase modulation, making full use of the benefits of multidimensional modulation. To successfully estimate carrier phase bits and code index bits, an effective joint detection algorithm for carrier phase and code index tailored for the CP-TCIM-DCSK system is introduced. The theoretical Bit Error Rate (BER) expressions for the CP-TCIM-DCSK scheme are derived for both Additive White Gaussian Noise (AWGN) and multipath Rayleigh fading channels. Furthermore, a comparison of data rates, energy efficiency, and complexity between the CP-TCIM-DCSK system and the most advanced systems in the same category at present is conducted. The results validate the accuracy of theoretical analysis through simulation and demonstrate that the proposed scheme outperforms its competitive systems in terms of BER performance.
本文提出了一种新型差分混沌移动键控(DCSK)系统,该系统利用载波相位调制与时隙和编码索引调制相结合,实现了高数据传输速率,被称为 CP-TCIM-DCSK 系统。在拟议的 CP-TCIM-DCSK 系统中,使用载波相位调制的载波来传输信息信号。除了物理传输的信息比特外,还通过时隙和沃尔什编码调制以及载波相位调制来传输额外的信息比特,从而充分利用了多维调制的优势。为了成功估计载波相位比特和编码索引比特,介绍了一种为 CP-TCIM-DCSK 系统量身定制的有效载波相位和编码索引联合检测算法。针对加性白高斯噪声(AWGN)和多径瑞利衰落信道,得出了 CP-TCIM-DCSK 方案的理论误码率(BER)表达式。此外,还对 CP-TCIM-DCSK 系统与目前同类最先进系统的数据传输率、能效和复杂性进行了比较。结果通过仿真验证了理论分析的准确性,并证明所提出的方案在误码率性能方面优于同类系统。
{"title":"A novel differential chaos shift keying system based on joint time slot and code index of carrier phase modulation","authors":"Lifang He ,&nbsp;Xinyu Xiong ,&nbsp;Chaofan Li","doi":"10.1016/j.dsp.2024.104803","DOIUrl":"10.1016/j.dsp.2024.104803","url":null,"abstract":"<div><div>This paper proposes a novel differential chaos shift keying (DCSK) system that utilizes carrier phase modulation in combination with time slot and code index modulation, referred to as CP-TCIM-DCSK system, to achieve high data rate transmission. In the proposed CP-TCIM-DCSK system, the carrier modulated by the carrier phase is used to transmit the information-bearing signal. In addition to physically transmitted information bits, extra information bits are conveyed through time slot and Walsh code modulation, as well as carrier phase modulation, making full use of the benefits of multidimensional modulation. To successfully estimate carrier phase bits and code index bits, an effective joint detection algorithm for carrier phase and code index tailored for the CP-TCIM-DCSK system is introduced. The theoretical Bit Error Rate (BER) expressions for the CP-TCIM-DCSK scheme are derived for both Additive White Gaussian Noise (AWGN) and multipath Rayleigh fading channels. Furthermore, a comparison of data rates, energy efficiency, and complexity between the CP-TCIM-DCSK system and the most advanced systems in the same category at present is conducted. The results validate the accuracy of theoretical analysis through simulation and demonstrate that the proposed scheme outperforms its competitive systems in terms of BER performance.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104803"},"PeriodicalIF":2.9,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142427692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAEE-Net: SAR ship target detection network based on multi-input attention and edge feature enhancement MAEE-Net:基于多输入关注和边缘特征增强的 SAR 船舶目标检测网络
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-05 DOI: 10.1016/j.dsp.2024.104810
Zonghao Li, Hui Ma, Zishuo Guo
Synthetic Aperture Radar (SAR) imagery has a wide range of applications in search and rescue ships lost contact and military reconnaissance. When detecting multi-scale targets, better determination of the target edge is conducive to improving the detection accuracy of the model, but most of the existing methods lack research on this aspect. To fix the problems mentioned earlier, this paper suggests using a SAR Ship target detection network called MAEE-Net. In this paper, a multi-input attention-based feature fusion module (MAFM) and an edge feature enhancement module (EFEM) are proposed. MAFM uses attention mechanism with multi-input and multiple-output to improve attention to shallow feature map target and suppress invalid information, so as to improve the information utilization rate of each layer. To make the network better at detecting the edges of ships, EFEM uses double-branched structure to carry out fine-grained information retention and edge feature extraction. PIoU v2 is introduced to enhance multi-target processing capability. Experiments were carried out on SSDD dataset and SAR-Ship-Dataset, the overall detection accuracy was as high as 98.6% and 94.7%. The detection accuracy was 93.5% and 99.3% on inshore and offshore sub-datasets of SSDD dataset. Experimental results on two datasets show that our model is impactful.
合成孔径雷达(SAR)图像在搜救失去联系的船只和军事侦察方面有着广泛的应用。在探测多尺度目标时,更好地确定目标边缘有利于提高模型的探测精度,但现有方法大多缺乏这方面的研究。为了解决上述问题,本文建议使用一种名为 MAEE-Net 的合成孔径雷达舰船目标检测网络。本文提出了基于多输入注意的特征融合模块(MAFM)和边缘特征增强模块(EFEM)。MAFM 采用多输入多输出的注意力机制,提高对浅层特征图目标的注意力,抑制无效信息,从而提高各层的信息利用率。为了使网络更好地检测船舶边缘,EFEM 采用双分支结构来进行细粒度信息保留和边缘特征提取。引入 PIoU v2 增强多目标处理能力。在 SSDD 数据集和 SAR-Ship 数据集上进行了实验,总体检测精度分别高达 98.6% 和 94.7%。在 SSDD 数据集的近岸和离岸子数据集上,检测精度分别为 93.5%和 99.3%。在两个数据集上的实验结果表明,我们的模型是有影响力的。
{"title":"MAEE-Net: SAR ship target detection network based on multi-input attention and edge feature enhancement","authors":"Zonghao Li,&nbsp;Hui Ma,&nbsp;Zishuo Guo","doi":"10.1016/j.dsp.2024.104810","DOIUrl":"10.1016/j.dsp.2024.104810","url":null,"abstract":"<div><div>Synthetic Aperture Radar (SAR) imagery has a wide range of applications in search and rescue ships lost contact and military reconnaissance. When detecting multi-scale targets, better determination of the target edge is conducive to improving the detection accuracy of the model, but most of the existing methods lack research on this aspect. To fix the problems mentioned earlier, this paper suggests using a SAR Ship target detection network called MAEE-Net. In this paper, a multi-input attention-based feature fusion module (MAFM) and an edge feature enhancement module (EFEM) are proposed. MAFM uses attention mechanism with multi-input and multiple-output to improve attention to shallow feature map target and suppress invalid information, so as to improve the information utilization rate of each layer. To make the network better at detecting the edges of ships, EFEM uses double-branched structure to carry out fine-grained information retention and edge feature extraction. PIoU v2 is introduced to enhance multi-target processing capability. Experiments were carried out on SSDD dataset and SAR-Ship-Dataset, the overall detection accuracy was as high as 98.6% and 94.7%. The detection accuracy was 93.5% and 99.3% on inshore and offshore sub-datasets of SSDD dataset. Experimental results on two datasets show that our model is impactful.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104810"},"PeriodicalIF":2.9,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142427693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CDAN: Convolutional dense attention-guided network for low-light image enhancement CDAN:用于弱光图像增强的卷积密集注意力引导网络
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-04 DOI: 10.1016/j.dsp.2024.104802
Hossein Shakibania , Sina Raoufi , Hassan Khotanlou
Low-light images, characterized by inadequate illumination, pose challenges of diminished clarity, muted colors, and reduced details. Low-light image enhancement, an essential task in computer vision, aims to rectify these issues by improving brightness, contrast, and overall perceptual quality, thereby facilitating accurate analysis and interpretation. This paper introduces the Convolutional Dense Attention-guided Network (CDAN), a novel solution for enhancing low-light images. CDAN integrates an autoencoder-based architecture with convolutional and dense blocks, complemented by an attention mechanism and skip connections. This architecture ensures efficient information propagation and feature learning. During training, we utilize a composite loss function that merges L2 and VGG losses for improved numeric and perceptual results. Furthermore, a dedicated post-processing phase refines color balance and contrast. Our approach demonstrates notable progress compared to state-of-the-art results in low-light image enhancement, showcasing its robustness across a wide range of challenging scenarios. Our model performs remarkably on benchmark datasets, effectively mitigating under-exposure and proficiently restoring textures and colors in diverse low-light scenarios. This achievement underscores CDAN's potential for diverse computer vision tasks, notably enabling robust object detection and recognition in challenging low-light conditions.
低照度图像的特点是光照不足,会带来清晰度降低、色彩暗淡和细节减少等挑战。低照度图像增强是计算机视觉领域的一项重要任务,旨在通过改善亮度、对比度和整体感知质量来纠正这些问题,从而促进准确的分析和解读。本文介绍了卷积密集注意力引导网络(CDAN),这是一种用于增强弱光图像的新型解决方案。CDAN 将基于自动编码器的架构与卷积和密集块整合在一起,并辅以注意力机制和跳过连接。这种架构可确保高效的信息传播和特征学习。在训练过程中,我们采用了一种复合损失函数,将 L2 和 VGG 损失合并,以改善数值和感知结果。此外,专门的后处理阶段还能改善色彩平衡和对比度。与最先进的弱光图像增强结果相比,我们的方法取得了显著的进步,在各种具有挑战性的场景中展示了其鲁棒性。我们的模型在基准数据集上表现出色,有效地减轻了曝光不足的问题,并在各种弱光场景中熟练地恢复了纹理和色彩。这一成就凸显了 CDAN 在各种计算机视觉任务中的潜力,特别是在极具挑战性的弱光条件下实现稳健的物体检测和识别。
{"title":"CDAN: Convolutional dense attention-guided network for low-light image enhancement","authors":"Hossein Shakibania ,&nbsp;Sina Raoufi ,&nbsp;Hassan Khotanlou","doi":"10.1016/j.dsp.2024.104802","DOIUrl":"10.1016/j.dsp.2024.104802","url":null,"abstract":"<div><div>Low-light images, characterized by inadequate illumination, pose challenges of diminished clarity, muted colors, and reduced details. Low-light image enhancement, an essential task in computer vision, aims to rectify these issues by improving brightness, contrast, and overall perceptual quality, thereby facilitating accurate analysis and interpretation. This paper introduces the Convolutional Dense Attention-guided Network (CDAN), a novel solution for enhancing low-light images. CDAN integrates an autoencoder-based architecture with convolutional and dense blocks, complemented by an attention mechanism and skip connections. This architecture ensures efficient information propagation and feature learning. During training, we utilize a composite loss function that merges L2 and VGG losses for improved numeric and perceptual results. Furthermore, a dedicated post-processing phase refines color balance and contrast. Our approach demonstrates notable progress compared to state-of-the-art results in low-light image enhancement, showcasing its robustness across a wide range of challenging scenarios. Our model performs remarkably on benchmark datasets, effectively mitigating under-exposure and proficiently restoring textures and colors in diverse low-light scenarios. This achievement underscores CDAN's potential for diverse computer vision tasks, notably enabling robust object detection and recognition in challenging low-light conditions.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104802"},"PeriodicalIF":2.9,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142428119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensor placement method for water distribution networks based on sampling of non-bandlimited graph signals 基于非带限图信号采样的配水管网传感器安置方法
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-03 DOI: 10.1016/j.dsp.2024.104809
Juan Li, Baoyi Cai
Monitoring water distribution networks (WDNs) requires careful consideration of sensor placement, which is essential for obtaining comprehensive information about the network. A natural graphical structure underlies WDN, making graph sampling theory advantageous for selecting monitoring nodes. However, graph sampling theory is only applied only to restrictive band-limited signals, while the pressure data of WDN is a restrictive non-band-limited signal. To address this issue, this paper presents an approximate conversion method for transforming non-band-limited signals into band-limited signals, accompanied by an optimal spectrum threshold formula. This formula is used to perform spectral screening in the graph frequency domain, effectively converting non-band-limited signals into band-limited signals that preserve the major frequency components while ignoring smaller-value frequency components. By sampling band-limited signal, we identify sampling nodes that perfectly recover the signal. These sampling nodes act as monitoring nodes that can perform a comprehensive inspection of the WDN and accurately locate leaks. The accuracy of our method in recovering the signal and locating the leak is demonstrated by comparing it with two existing sensor placement optimization methods.
监测配水管网(WDN)需要仔细考虑传感器的布置,这对获取网络的全面信息至关重要。WDN 具有天然的图形结构,这使得图形采样理论在选择监测节点时具有优势。然而,图抽样理论只适用于限制性带限信号,而 WDN 的压力数据属于限制性非带限信号。针对这一问题,本文提出了一种将非带限信号转换为带限信号的近似转换方法,并附有一个最佳频谱阈值公式。该公式用于在图频域中进行频谱筛选,有效地将非带限信号转换为带限信号,保留主要的频率成分,同时忽略较小的频率成分。通过对带限信号进行采样,我们可以找出完美恢复信号的采样节点。这些采样节点可作为监测节点,对 WDN 进行全面检查,并准确定位泄漏点。通过与现有的两种传感器放置优化方法进行比较,我们证明了我们的方法在恢复信号和定位泄漏方面的准确性。
{"title":"Sensor placement method for water distribution networks based on sampling of non-bandlimited graph signals","authors":"Juan Li,&nbsp;Baoyi Cai","doi":"10.1016/j.dsp.2024.104809","DOIUrl":"10.1016/j.dsp.2024.104809","url":null,"abstract":"<div><div>Monitoring water distribution networks (WDNs) requires careful consideration of sensor placement, which is essential for obtaining comprehensive information about the network. A natural graphical structure underlies WDN, making graph sampling theory advantageous for selecting monitoring nodes. However, graph sampling theory is only applied only to restrictive band-limited signals, while the pressure data of WDN is a restrictive non-band-limited signal. To address this issue, this paper presents an approximate conversion method for transforming non-band-limited signals into band-limited signals, accompanied by an optimal spectrum threshold formula. This formula is used to perform spectral screening in the graph frequency domain, effectively converting non-band-limited signals into band-limited signals that preserve the major frequency components while ignoring smaller-value frequency components. By sampling band-limited signal, we identify sampling nodes that perfectly recover the signal. These sampling nodes act as monitoring nodes that can perform a comprehensive inspection of the WDN and accurately locate leaks. The accuracy of our method in recovering the signal and locating the leak is demonstrated by comparing it with two existing sensor placement optimization methods.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104809"},"PeriodicalIF":2.9,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142428117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1