首页 > 最新文献

Digital Signal Processing最新文献

英文 中文
Optimal microphone subset selection for beamforming 波束形成的最佳麦克风子集选择
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-06 DOI: 10.1016/j.dsp.2026.105881
Yuhan Zhang , Ka-Fai Cedric Yiu , Zhibao Li
Microphone arrays are widely utilized in various speech-related applications. However, using all available microphones enlarges the number of filter coefficients to be estimated, thereby increasing the computational burden without benefitting the overall performance. Consequently, selecting an optimal subset of microphones is crucial for enhancing beamformer performance. This problem is inherently combinatorial and conventionally solved through greedy-based methodologies. In this paper, we propose a novel microphone subset selection problem for beamforming and reformulate the combinatorial constraints into algebraic constraints, thereby transforming the problem into a novel mixed-integer linear programming (MILP) problem. The optimal subset is derived from a multi-objective optimization problem that maximizes beamforming performance while minimizing the number of selected microphones. The branch-and-bound method is employed to guarantee global optimality. Numerical experiments demonstrate the proposed method achieves similar beamforming performance to the greedy method and genetic algorithm (GA) while utilizing fewer microphones. This makes it particularly valuable in applications where hardware scale is strictly constrained.
麦克风阵列广泛应用于各种语音相关的应用中。然而,使用所有可用的麦克风会增加需要估计的滤波器系数的数量,从而增加计算负担,而不会对整体性能产生影响。因此,选择一个最佳的麦克风子集对于提高波束形成器的性能是至关重要的。这个问题本质上是组合的,通常通过基于贪婪的方法来解决。本文提出了一种新的波束形成麦克风子集选择问题,并将组合约束重新表述为代数约束,从而将该问题转化为一个新的混合整数线性规划(MILP)问题。最优子集是由一个多目标优化问题衍生而来,该问题在最小化所选麦克风数量的同时最大化波束成形性能。采用分支定界法保证全局最优性。数值实验表明,该方法在使用较少麦克风的情况下,获得了与贪心算法和遗传算法相似的波束形成性能。这使得它在硬件规模受到严格限制的应用程序中特别有价值。
{"title":"Optimal microphone subset selection for beamforming","authors":"Yuhan Zhang ,&nbsp;Ka-Fai Cedric Yiu ,&nbsp;Zhibao Li","doi":"10.1016/j.dsp.2026.105881","DOIUrl":"10.1016/j.dsp.2026.105881","url":null,"abstract":"<div><div>Microphone arrays are widely utilized in various speech-related applications. However, using all available microphones enlarges the number of filter coefficients to be estimated, thereby increasing the computational burden without benefitting the overall performance. Consequently, selecting an optimal subset of microphones is crucial for enhancing beamformer performance. This problem is inherently combinatorial and conventionally solved through greedy-based methodologies. In this paper, we propose a novel microphone subset selection problem for beamforming and reformulate the combinatorial constraints into algebraic constraints, thereby transforming the problem into a novel mixed-integer linear programming (MILP) problem. The optimal subset is derived from a multi-objective optimization problem that maximizes beamforming performance while minimizing the number of selected microphones. The branch-and-bound method is employed to guarantee global optimality. Numerical experiments demonstrate the proposed method achieves similar beamforming performance to the greedy method and genetic algorithm (GA) while utilizing fewer microphones. This makes it particularly valuable in applications where hardware scale is strictly constrained.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105881"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CPMNet: an enhanced residual network for continuous phase modulation signal detection CPMNet:用于连续相位调制信号检测的增强残差网络
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-13 DOI: 10.1016/j.dsp.2026.105914
Yang He , Ning Cao , Hao Lu , Can Hu , Yajuan Guo
Continuous Phase Modulation (CPM) signals offer excellent spectral efficiency and constant envelope properties for wireless communications, but traditional detection methods suffer from prohibitive computational complexity. This paper presents CPMNet, a novel deep learning-based detection framework that addresses these limitations through an enhanced residual network architecture incorporating spatial attention mechanisms, multi-scale feature fusion, and bidirectional LSTM networks. CPMNet performs sequence-to-sequence detection without requiring channel estimation or equalization. Experimental results on Advanced Range Telemetry (ARTM) Tier 2 signals show performance varies with modulation complexity: while exhibiting 2–4 dB gaps compared to Maximum Likelihood Sequence Detection (MLSD) in high signal-to-noise ratio (SNR) AWGN channels for lower-order modulations, CPMNet maintains robust performance for high-order modulations where MLSD becomes impractical. In multipath fading channels, CPMNet significantly outperforms MLSD by 3–6 dB across various conditions, demonstrating superior resilience to channel impairments. The framework exhibits excellent generalization with only 1–2 dB degradation in unseen environments. Most critically, CPMNet maintains constant computational complexity regardless of CPM parameters, contrasting sharply with MLSD’s exponential complexity growth, making it particularly advantageous for high-order CPM signals that are computationally prohibitive for traditional methods.
连续相位调制(CPM)信号为无线通信提供了优异的频谱效率和恒定的包络特性,但传统的检测方法存在计算复杂度过高的问题。本文提出了一种新的基于深度学习的检测框架CPMNet,该框架通过一种增强的残差网络架构,结合空间注意机制、多尺度特征融合和双向LSTM网络,解决了这些限制。CPMNet执行序列到序列检测,而不需要信道估计或均衡。先进距离遥测(ARTM)第2层信号的实验结果表明,调制复杂性不同,性能也不同:在高信噪比(SNR) AWGN信道中,与最大似然序列检测(MLSD)相比,CPMNet在低阶调制中表现出2 - 4 dB的差距,而在MLSD变得不切实际的高阶调制中,CPMNet保持了强大的性能。在多径衰落信道中,CPMNet在各种条件下都明显优于MLSD 3-6 dB,显示出对信道损伤的优越恢复能力。该框架具有出色的泛化性能,在不可见的环境中只有1-2 dB的退化。最关键的是,无论CPM参数如何,CPMNet都保持恒定的计算复杂度,这与MLSD的指数复杂度增长形成鲜明对比,这使得CPMNet对传统方法难以计算的高阶CPM信号特别有利。
{"title":"CPMNet: an enhanced residual network for continuous phase modulation signal detection","authors":"Yang He ,&nbsp;Ning Cao ,&nbsp;Hao Lu ,&nbsp;Can Hu ,&nbsp;Yajuan Guo","doi":"10.1016/j.dsp.2026.105914","DOIUrl":"10.1016/j.dsp.2026.105914","url":null,"abstract":"<div><div>Continuous Phase Modulation (CPM) signals offer excellent spectral efficiency and constant envelope properties for wireless communications, but traditional detection methods suffer from prohibitive computational complexity. This paper presents CPMNet, a novel deep learning-based detection framework that addresses these limitations through an enhanced residual network architecture incorporating spatial attention mechanisms, multi-scale feature fusion, and bidirectional LSTM networks. CPMNet performs sequence-to-sequence detection without requiring channel estimation or equalization. Experimental results on Advanced Range Telemetry (ARTM) Tier 2 signals show performance varies with modulation complexity: while exhibiting 2–4 dB gaps compared to Maximum Likelihood Sequence Detection (MLSD) in high signal-to-noise ratio (SNR) AWGN channels for lower-order modulations, CPMNet maintains robust performance for high-order modulations where MLSD becomes impractical. In multipath fading channels, CPMNet significantly outperforms MLSD by 3–6 dB across various conditions, demonstrating superior resilience to channel impairments. The framework exhibits excellent generalization with only 1–2 dB degradation in unseen environments. Most critically, CPMNet maintains constant computational complexity regardless of CPM parameters, contrasting sharply with MLSD’s exponential complexity growth, making it particularly advantageous for high-order CPM signals that are computationally prohibitive for traditional methods.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105914"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unique word orthogonal signal division multiplexing with complex unitary neural network for underwater acoustic communication 基于复杂酉神经网络的水声通信独字正交信分复用
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-17 DOI: 10.1016/j.dsp.2026.105926
Zeyad A.H. Qasem , Xingbin Tu , Chunyi Song , Hamada Esmaiel , Waheb A. Jabbar , Fengzhong Qu
Although orthogonal signal division multiplexing (OSDM) offers improved performance for underwater acoustic communication (UWAC), it still faces two major challenges related to the high peak-to-average power ratio (PAPR) and increased sensitivity to inter-vector interference (IVI). This paper proposes a novel OSDM system, termed precoded unique word OSDM based on unitary neural network (UW-OSDM-UNN), to address these issues effectively. The proposed scheme embeds the guard interval within the fast Fourier transform duration to mitigate inter-symbol interference and employs a UNN-based precoder at the transmitter to reduce PAPR and significantly overcome the IVI sensitivity. The UNN-based transmitter is completely independent of the UWAC channel, eliminating the need for receiver-side training or additional testing-stage training. Furthermore, zero vectors and frequency-shifted Chu sequences are incorporated to enable robust Doppler shift estimation and multipath compensation, respectively. The Chu sequences are inserted in the frequency domain to generate deterministic sequences within the guard interval without introducing additional inter-symbol interference. The system is validated through both simulations and real-world sea trials over a 300-meter underwater connection. Results show that the proposed scheme achieves up to a 4 dB PAPR reduction, a 5 dB improvement in bit error rate (BER), and superior robustness against challenging UWAC channel conditions compared to state-of-the-art OSDM-based systems.
尽管正交信号分复用(OSDM)为水声通信(UWAC)提供了更好的性能,但它仍然面临着两个主要挑战,即高峰值平均功率比(PAPR)和对矢量间干扰(IVI)的灵敏度增加。本文提出了一种新的基于统一神经网络的预编码唯一字OSDM系统(UW-OSDM-UNN)来有效地解决这些问题。该方案在快速傅里叶变换持续时间内嵌入保护间隔以减轻符号间干扰,并在发射机处采用基于unn的预编码器来降低PAPR并显著克服IVI灵敏度。基于unn的发射机完全独立于UWAC信道,消除了接收机侧训练或额外测试阶段训练的需要。此外,零矢量和频移Chu序列分别用于鲁棒多普勒频移估计和多径补偿。在不引入额外符号间干扰的情况下,将Chu序列插入频域以在保护区间内生成确定性序列。该系统通过模拟和现实世界中300米水下连接的海上试验进行了验证。结果表明,与最先进的基于osdm的系统相比,该方案实现了高达4 dB的PAPR降低,5 dB的误码率(BER)提高,以及对具有挑战性的UWAC信道条件的卓越鲁棒性。
{"title":"Unique word orthogonal signal division multiplexing with complex unitary neural network for underwater acoustic communication","authors":"Zeyad A.H. Qasem ,&nbsp;Xingbin Tu ,&nbsp;Chunyi Song ,&nbsp;Hamada Esmaiel ,&nbsp;Waheb A. Jabbar ,&nbsp;Fengzhong Qu","doi":"10.1016/j.dsp.2026.105926","DOIUrl":"10.1016/j.dsp.2026.105926","url":null,"abstract":"<div><div>Although orthogonal signal division multiplexing (OSDM) offers improved performance for underwater acoustic communication (UWAC), it still faces two major challenges related to the high peak-to-average power ratio (PAPR) and increased sensitivity to inter-vector interference (IVI). This paper proposes a novel OSDM system, termed precoded unique word OSDM based on unitary neural network (UW-OSDM-UNN), to address these issues effectively. The proposed scheme embeds the guard interval within the fast Fourier transform duration to mitigate inter-symbol interference and employs a UNN-based precoder at the transmitter to reduce PAPR and significantly overcome the IVI sensitivity. The UNN-based transmitter is completely independent of the UWAC channel, eliminating the need for receiver-side training or additional testing-stage training. Furthermore, zero vectors and frequency-shifted Chu sequences are incorporated to enable robust Doppler shift estimation and multipath compensation, respectively. The Chu sequences are inserted in the frequency domain to generate deterministic sequences within the guard interval without introducing additional inter-symbol interference. The system is validated through both simulations and real-world sea trials over a 300-meter underwater connection. Results show that the proposed scheme achieves up to a 4 dB PAPR reduction, a 5 dB improvement in bit error rate (BER), and superior robustness against challenging UWAC channel conditions compared to state-of-the-art OSDM-based systems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105926"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-series clustering algorithm based on common tightest neighbors and local embedding 基于公共最紧邻居和局部嵌入的时间序列聚类算法
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-12 DOI: 10.1016/j.dsp.2026.105895
Lei Gao, Taichang Tian, Luosheng Wen
Time-series clustering is an important method in data mining, which is widely used in various fields. However, the traditional clustering algorithms directly deal with the time-series data, which will lead to the serious issue of “dimensionality catastrophe”. It is an important method to capture the local features of time-series data by using the neighbor information. In this paper, we propose a hierarchical graph clustering algorithm (CTNG) based on common tightest neighbors(CTN), which is able to cluster various kinds of complex streaming data and noisy data by using the ratio of common tightest neighbors between data points to determine whether the edges are connected in the tightest neighbors graph(TNG) or not. In order to solve the issue of “dimension disaster”, combined with the local linear embedding algorithm (LLE), this paper proposes a time-series clustering algorithm based on LLE_CTNG, which can make full use of the local structure of the data to realize the dimensionality reduction and clustering. Through a large number of experiments, it is shown that the algorithm has superior and stable clustering performance, has certain advantages in running speed, and is robust to the number of the tightest neighbors parameter.
时间序列聚类是数据挖掘中的一种重要方法,广泛应用于各个领域。然而,传统的聚类算法直接处理时间序列数据,这将导致严重的“维数突变”问题。利用邻域信息捕捉时间序列数据的局部特征是一种重要的方法。本文提出了一种基于共同最紧邻居(CTN)的分层图聚类算法(CTNG),该算法能够利用数据点之间的共同最紧邻居比率来确定最紧邻居图(TNG)中的边缘是否连通,从而对各种复杂的流数据和噪声数据进行聚类。为了解决“维数灾难”问题,结合局部线性嵌入算法(LLE),本文提出了一种基于LLE_CTNG的时间序列聚类算法,该算法可以充分利用数据的局部结构实现降维聚类。通过大量实验表明,该算法具有优越而稳定的聚类性能,在运行速度上具有一定优势,并且对最紧密邻居参数的个数具有鲁棒性。
{"title":"Time-series clustering algorithm based on common tightest neighbors and local embedding","authors":"Lei Gao,&nbsp;Taichang Tian,&nbsp;Luosheng Wen","doi":"10.1016/j.dsp.2026.105895","DOIUrl":"10.1016/j.dsp.2026.105895","url":null,"abstract":"<div><div>Time-series clustering is an important method in data mining, which is widely used in various fields. However, the traditional clustering algorithms directly deal with the time-series data, which will lead to the serious issue of “dimensionality catastrophe”. It is an important method to capture the local features of time-series data by using the neighbor information. In this paper, we propose a hierarchical graph clustering algorithm (CTNG) based on common tightest neighbors(CTN), which is able to cluster various kinds of complex streaming data and noisy data by using the ratio of common tightest neighbors between data points to determine whether the edges are connected in the tightest neighbors graph(TNG) or not. In order to solve the issue of “dimension disaster”, combined with the local linear embedding algorithm (LLE), this paper proposes a time-series clustering algorithm based on LLE_CTNG, which can make full use of the local structure of the data to realize the dimensionality reduction and clustering. Through a large number of experiments, it is shown that the algorithm has superior and stable clustering performance, has certain advantages in running speed, and is robust to the number of the tightest neighbors parameter.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105895"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trainable joint time-vertex fractional Fourier transform 可训练联合时顶点分数傅里叶变换
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-13 DOI: 10.1016/j.dsp.2026.105909
Ziqi Yan , Zhichao Zhang
To address the limitations of the graph fractional Fourier transform (GFRFT) Wiener filtering and the traditional joint time-vertex fractional Fourier transform (JFRFT) Wiener filtering, this study proposes a filtering method based on the hyper-differential form of the JFRFT. The gradient backpropagation mechanism is employed to establish the adaptive selection of transform order pair and filter coefficients. First, leveraging the hyper-differential form of the GFRFT and the fractional Fourier transform, the hyper-differential form of the JFRFT is constructed and its properties are analyzed. Second, time-varying graph signals are divided into dynamic graph sequences of equal span along the temporal dimension. A spatiotemporal joint representation is then established through vectorized reorganization, followed by the joint time-vertex Wiener filtering. Furthermore, by rigorously proving the differentiability of the transform orders, both the transform orders and filter coefficients are embedded as learnable parameters within a neural network architecture. Through gradient backpropagation, their synchronized iterative optimization is achieved, constructing a parameters-adaptive learning filtering framework. This method leverages a model-driven approach to learn the optimal transform order pair and filter coefficients. Experimental results indicate that the proposed framework improves the time-varying graph signals denoising performance, while reducing the computational burden of the traditional grid search strategy.
针对图分数阶傅里叶变换(GFRFT)维纳滤波和传统联合时间顶点分数阶傅里叶变换(JFRFT)维纳滤波的局限性,提出了一种基于JFRFT超微分形式的滤波方法。利用梯度反向传播机制建立了变换阶对和滤波系数的自适应选择。首先,利用GFRFT的超微分形式和分数阶傅里叶变换,构造了JFRFT的超微分形式并分析了其性质。其次,将时变图信号沿时间维划分为等跨度的动态图序列;然后通过向量化重组建立时空联合表示,然后进行联合时间-顶点维纳滤波。此外,通过严格证明变换阶数的可微性,将变换阶数和滤波系数作为可学习参数嵌入到神经网络结构中。通过梯度反向传播,实现了它们的同步迭代优化,构造了一个参数自适应学习滤波框架。该方法利用模型驱动的方法来学习最优变换阶对和过滤系数。实验结果表明,该框架在提高时变图信号去噪性能的同时,减少了传统网格搜索策略的计算量。
{"title":"Trainable joint time-vertex fractional Fourier transform","authors":"Ziqi Yan ,&nbsp;Zhichao Zhang","doi":"10.1016/j.dsp.2026.105909","DOIUrl":"10.1016/j.dsp.2026.105909","url":null,"abstract":"<div><div>To address the limitations of the graph fractional Fourier transform (GFRFT) Wiener filtering and the traditional joint time-vertex fractional Fourier transform (JFRFT) Wiener filtering, this study proposes a filtering method based on the hyper-differential form of the JFRFT. The gradient backpropagation mechanism is employed to establish the adaptive selection of transform order pair and filter coefficients. First, leveraging the hyper-differential form of the GFRFT and the fractional Fourier transform, the hyper-differential form of the JFRFT is constructed and its properties are analyzed. Second, time-varying graph signals are divided into dynamic graph sequences of equal span along the temporal dimension. A spatiotemporal joint representation is then established through vectorized reorganization, followed by the joint time-vertex Wiener filtering. Furthermore, by rigorously proving the differentiability of the transform orders, both the transform orders and filter coefficients are embedded as learnable parameters within a neural network architecture. Through gradient backpropagation, their synchronized iterative optimization is achieved, constructing a parameters-adaptive learning filtering framework. This method leverages a model-driven approach to learn the optimal transform order pair and filter coefficients. Experimental results indicate that the proposed framework improves the time-varying graph signals denoising performance, while reducing the computational burden of the traditional grid search strategy.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105909"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EnFuseNet: A Dual-Module approach combining tail-Class enhancement and dynamic fusion for long-Tail skin lesion diagnosis EnFuseNet:一种结合尾级增强和动态融合的双模块方法用于长尾皮肤病变诊断
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-21 DOI: 10.1016/j.dsp.2026.105929
Yongcai Tao , Renwei Xiao , Yucheng Shi , Zhe Li , Qing Zhang , Xiaotian Yuan , Lei Shi
The low incidence of skin diseases leads to a highly imbalanced class distribution, which complicates computer-aided diagnosis. While supervised contrastive learning has been applied to address this long-tail distribution, two challenges remain: first, the significant variation between intra-class and inter-class feature distributions, which hampers effective sample discrimination; and second, the insufficient number of tail-class samples, which limits their representation and impedes improvements in diagnostic accuracy. To address these challenges, we propose EnFuseNet, a novel contrastive learning framework. EnFuseNet incorporates two key modules: the Dual-view Interactive Fusion (DIF) module and the Tail Representation Enhancement (TREM) module. The DIF module enhances intra-class compactness and inter-class separability by combining dual-view features through a channel- and spatially interactive attention mechanism. The TREM module mitigates the issue of limited tail-class samples by generating and dynamically updating prototypes for these classes using a sliding window mechanism. Additionally, the Stage-Adaptive Weighted Cross-Entropy (SAW-CE) loss function, based on curriculum learning and dynamic weighting, guides the model toward more balanced inter-class learning, thereby alleviating diagnosis difficulties during training. Experimental results on the ISIC2018 and ISIC2019 skin disease datasets demonstrate that EnFuseNet achieves accuracy and AUC values of 86%-88% and 97%, respectively, outperforming state-of-the-art methods. These results highlight the potential of EnFuseNet in diagnosing rare and long-tail skin diseases. The source code is available on GitHub.
皮肤疾病的低发病率导致分类分布高度不平衡,这给计算机辅助诊断带来了复杂性。虽然监督对比学习已被应用于解决这种长尾分布,但仍然存在两个挑战:首先,类内和类间特征分布之间存在显著差异,这阻碍了有效的样本区分;其次,尾类样本数量不足,这限制了它们的代表性,阻碍了诊断准确性的提高。为了应对这些挑战,我们提出了一种新的对比学习框架EnFuseNet。EnFuseNet包含两个关键模块:双视图交互融合(DIF)模块和尾部表示增强(TREM)模块。DIF模块通过通道和空间交互关注机制结合双视图特性,增强了类内的紧凑性和类间的可分离性。TREM模块通过使用滑动窗口机制为这些类生成和动态更新原型,缓解了尾类样本有限的问题。此外,基于课程学习和动态加权的阶段自适应加权交叉熵(SAW-CE)损失函数,引导模型更平衡地进行班级间学习,从而减轻训练过程中的诊断困难。在ISIC2018和ISIC2019皮肤病数据集上的实验结果表明,EnFuseNet的准确率和AUC值分别为86%-88%和97%,优于目前最先进的方法。这些结果突出了EnFuseNet在诊断罕见和长尾皮肤病方面的潜力。源代码可在GitHub上获得。
{"title":"EnFuseNet: A Dual-Module approach combining tail-Class enhancement and dynamic fusion for long-Tail skin lesion diagnosis","authors":"Yongcai Tao ,&nbsp;Renwei Xiao ,&nbsp;Yucheng Shi ,&nbsp;Zhe Li ,&nbsp;Qing Zhang ,&nbsp;Xiaotian Yuan ,&nbsp;Lei Shi","doi":"10.1016/j.dsp.2026.105929","DOIUrl":"10.1016/j.dsp.2026.105929","url":null,"abstract":"<div><div>The low incidence of skin diseases leads to a highly imbalanced class distribution, which complicates computer-aided diagnosis. While supervised contrastive learning has been applied to address this long-tail distribution, two challenges remain: first, the significant variation between intra-class and inter-class feature distributions, which hampers effective sample discrimination; and second, the insufficient number of tail-class samples, which limits their representation and impedes improvements in diagnostic accuracy. To address these challenges, we propose EnFuseNet, a novel contrastive learning framework. EnFuseNet incorporates two key modules: the Dual-view Interactive Fusion (DIF) module and the Tail Representation Enhancement (TREM) module. The DIF module enhances intra-class compactness and inter-class separability by combining dual-view features through a channel- and spatially interactive attention mechanism. The TREM module mitigates the issue of limited tail-class samples by generating and dynamically updating prototypes for these classes using a sliding window mechanism. Additionally, the Stage-Adaptive Weighted Cross-Entropy (SAW-CE) loss function, based on curriculum learning and dynamic weighting, guides the model toward more balanced inter-class learning, thereby alleviating diagnosis difficulties during training. Experimental results on the ISIC2018 and ISIC2019 skin disease datasets demonstrate that EnFuseNet achieves accuracy and AUC values of 86%-88% and 97%, respectively, outperforming state-of-the-art methods. These results highlight the potential of EnFuseNet in diagnosing rare and long-tail skin diseases. The source code is available on <span><span>GitHub</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105929"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance of HEVC video coding for delivery over IP networks HEVC视频编码在IP网络上传输的性能
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-04-01 Epub Date: 2026-01-15 DOI: 10.1016/j.dsp.2026.105911
Khalid Abdullah M. Salih , Ismail Amin Ali , Ramadhan J. Mstafa
Efficient video streaming over IP networks faces significant challenges due to packet loss and network congestion, particularly when using User Datagram Protocol (UDP), which lacks inherent error correction mechanisms. This study provides a comprehensive framework for selecting HEVC encoding configurations based on motion content and network condition. The paper evaluates the packet loss resilience of various HEVC encoding configurations across video content with high-motion, intermediate-motion, and low-motion activity. Utilizing UDP streaming in conjunction with the MPEG Transport Stream (MPEG-TS) container, video quality was quantified using the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) under packet loss rates of up to 1.0%. Three HEVC encoding configurations IPPP, periodic I, and periodic IDR were assessed. The results indicate that periodic IDR, with its closed GOP structure, achieves the highest resilience to packet loss, rendering it ideal for unreliable networks. Specifically, for high-motion video content, periodic IDR limited PSNR degradation to 6.97 dB (from 28.78 dB to 21.87 dB) under a 0.5% packet loss rate. For intermediate-motion content (Mobcal), PSNR decreased by 9.26 dB (from 34.85 dB to 25.23 dB), and for low-motion content (FourPeople), PSNR degraded by 6.96 dB (from 40.87 dB to 33.91 dB), consistently outperforming the other configurations. In contrast, periodic I demonstrated moderate resilience, with PSNR degradation of 9.6 dB for high-motion content, up to 14.36 dB for intermediate-motion content, and approximately 11.46 dB for low-motion content. The IPPP configuration exhibited the greatest vulnerability, with PSNR degradations of 12.66 dB, 18.7 dB, and 11.95 dB for Crowd_run, Mobcal, and FourPeople, respectively, due to extensive error propagation inherent in its open GOP structure. The findings advance the understanding of error resilience in video compression and offer practical guidelines for maximizing video quality in real-world streaming scenarios over lossy IP networks.
由于数据包丢失和网络拥塞,IP网络上的高效视频流面临着巨大的挑战,特别是当使用用户数据报协议(UDP)时,它缺乏固有的纠错机制。本研究提供了一个基于运动内容和网络条件选择HEVC编码配置的综合框架。本文评估了各种HEVC编码配置在高运动、中运动和低运动视频内容中的丢包弹性。将UDP流与MPEG传输流(MPEG- ts)容器结合使用,在丢包率高达1.0%的情况下,使用峰值信噪比(PSNR)和结构相似性指数度量(SSIM)对视频质量进行量化。评估了三种HEVC编码配置IPPP、周期性I和周期性IDR。结果表明,周期IDR具有封闭的GOP结构,具有最高的丢包弹性,是不可靠网络的理想选择。具体来说,对于高运动视频内容,在丢包率为0.5%的情况下,周期性IDR将PSNR降至6.97 dB(从28.78 dB降至21.87 dB)。对于中运动内容(Mobcal), PSNR下降了9.26 dB(从34.85 dB下降到25.23 dB),对于低运动内容(FourPeople), PSNR下降了6.96 dB(从40.87 dB下降到33.91 dB),始终优于其他配置。相比之下,周期I表现出中等的弹性,高运动内容的PSNR下降为9.6 dB,中运动内容的PSNR下降为14.36 dB,低运动内容的PSNR下降约为11.46 dB。IPPP配置的漏洞最大,Crowd_run、Mobcal和FourPeople的PSNR分别下降了12.66 dB、18.7 dB和11.95 dB,这是由于其开放GOP结构固有的广泛错误传播。这些发现促进了对视频压缩中的错误恢复能力的理解,并为在有损IP网络上的真实流场景中最大限度地提高视频质量提供了实用指南。
{"title":"Performance of HEVC video coding for delivery over IP networks","authors":"Khalid Abdullah M. Salih ,&nbsp;Ismail Amin Ali ,&nbsp;Ramadhan J. Mstafa","doi":"10.1016/j.dsp.2026.105911","DOIUrl":"10.1016/j.dsp.2026.105911","url":null,"abstract":"<div><div>Efficient video streaming over IP networks faces significant challenges due to packet loss and network congestion, particularly when using User Datagram Protocol (UDP), which lacks inherent error correction mechanisms. This study provides a comprehensive framework for selecting HEVC encoding configurations based on motion content and network condition. The paper evaluates the packet loss resilience of various HEVC encoding configurations across video content with high-motion, intermediate-motion, and low-motion activity. Utilizing UDP streaming in conjunction with the MPEG Transport Stream (MPEG-TS) container, video quality was quantified using the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) under packet loss rates of up to 1.0%. Three HEVC encoding configurations <em>IPPP, periodic I</em>, and <em>periodic IDR</em> were assessed. The results indicate that <em>periodic IDR</em>, with its closed GOP structure, achieves the highest resilience to packet loss, rendering it ideal for unreliable networks. Specifically, for high-motion video content, <em>periodic IDR</em> limited PSNR degradation to 6.97 dB (from 28.78 dB to 21.87 dB) under a 0.5% packet loss rate. For intermediate-motion content (Mobcal), PSNR decreased by 9.26 dB (from 34.85 dB to 25.23 dB), and for low-motion content (FourPeople), PSNR degraded by 6.96 dB (from 40.87 dB to 33.91 dB), consistently outperforming the other configurations. In contrast, <em>periodic I</em> demonstrated moderate resilience, with PSNR degradation of 9.6 dB for high-motion content, up to 14.36 dB for intermediate-motion content, and approximately 11.46 dB for low-motion content. The <em>IPPP</em> configuration exhibited the greatest vulnerability, with PSNR degradations of 12.66 dB, 18.7 dB, and 11.95 dB for Crowd_run, Mobcal, and FourPeople, respectively, due to extensive error propagation inherent in its open GOP structure. The findings advance the understanding of error resilience in video compression and offer practical guidelines for maximizing video quality in real-world streaming scenarios over lossy IP networks.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105911"},"PeriodicalIF":3.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance analysis and robust DOA estimation using acoustic vector sensor array under non-orthogonal deviation 非正交偏差下声矢量传感器阵列性能分析及鲁棒DOA估计
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-02 DOI: 10.1016/j.dsp.2025.105867
Weidong Wang , Tianyou Wang , Hui Li , Wentao Shi , Wasiq Ali
In this paper, the problem of direction of arrival (DOA) estimation under the non-orthogonal deviation (NOD) in an acoustic vector sensor array (AVSA) is systematically addressed. First, by incorporating NOD information into the ideal AVSA model, two AVSA models with NOD are established. Subsequently, closed-form expressions for DOA estimation bias, the Cramér-Rao lower bound (CRLB), and the root mean square error (RMSE) are analytically derived for scenarios where each AVS exhibits NOD to illustrate the degrading influence of NOD on DOA estimation accuracy. To mitigate the effect of NOD, an innovative optimal modification matrix construction (OMMC) method is proposed. The NOD range of each AVS is initially coarsely estimated using prior information from a known auxiliary source and the theoretical RMSE. Based on the estimated deviation range, an overcomplete redundant correction matrix is constructed, which is used to calibrate the measurement data of each AVS. The optimal correction matrix is selected by minimizing the deviation between the estimated and true DOAs, and a global correction matrix for the entire array is formed by extracting the optimal correction sub-matrix for each AVS, thereby enabling accurate array calibration. A comprehensive performance evaluation is conducted through extensive simulations, where the proposed OMMC method is demonstrated to significantly outperform existing techniques, especially in challenging environments with large NOD or limited snapshot.
本文系统地研究了声矢量传感器阵列(AVSA)在非正交偏差(NOD)条件下的到达方向估计问题。首先,将NOD信息引入理想AVSA模型,建立了两个带NOD的AVSA模型。随后,在每个AVS都显示NOD的情况下,解析导出了DOA估计偏差、cram - rao下限(CRLB)和均方根误差(RMSE)的封闭表达式,以说明NOD对DOA估计精度的退化影响。为了减轻NOD的影响,提出了一种创新的最优修正矩阵构造(OMMC)方法。每个AVS的NOD范围最初是使用已知辅助源的先验信息和理论RMSE粗略估计的。根据估计的偏差范围,构造过完备冗余校正矩阵,用于标定各AVS的测量数据。通过最小化估计doa与真实doa之间的偏差来选择最优校正矩阵,并通过提取每个AVS的最优校正子矩阵形成整个阵列的全局校正矩阵,从而实现精确的阵列校准。通过广泛的模拟进行了全面的性能评估,其中提出的OMMC方法被证明明显优于现有技术,特别是在具有大NOD或有限快照的挑战性环境中。
{"title":"Performance analysis and robust DOA estimation using acoustic vector sensor array under non-orthogonal deviation","authors":"Weidong Wang ,&nbsp;Tianyou Wang ,&nbsp;Hui Li ,&nbsp;Wentao Shi ,&nbsp;Wasiq Ali","doi":"10.1016/j.dsp.2025.105867","DOIUrl":"10.1016/j.dsp.2025.105867","url":null,"abstract":"<div><div>In this paper, the problem of direction of arrival (DOA) estimation under the non-orthogonal deviation (NOD) in an acoustic vector sensor array (AVSA) is systematically addressed. First, by incorporating NOD information into the ideal AVSA model, two AVSA models with NOD are established. Subsequently, closed-form expressions for DOA estimation bias, the Cramér-Rao lower bound (CRLB), and the root mean square error (RMSE) are analytically derived for scenarios where each AVS exhibits NOD to illustrate the degrading influence of NOD on DOA estimation accuracy. To mitigate the effect of NOD, an innovative optimal modification matrix construction (OMMC) method is proposed. The NOD range of each AVS is initially coarsely estimated using prior information from a known auxiliary source and the theoretical RMSE. Based on the estimated deviation range, an overcomplete redundant correction matrix is constructed, which is used to calibrate the measurement data of each AVS. The optimal correction matrix is selected by minimizing the deviation between the estimated and true DOAs, and a global correction matrix for the entire array is formed by extracting the optimal correction sub-matrix for each AVS, thereby enabling accurate array calibration. A comprehensive performance evaluation is conducted through extensive simulations, where the proposed OMMC method is demonstrated to significantly outperform existing techniques, especially in challenging environments with large NOD or limited snapshot.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105867"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting low-rate DDoS attacks using Rényi entropy and trans-KAN hybrid model in SDN 基于rsamnyi熵和跨kan混合模型的SDN低速率DDoS攻击检测
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2025-12-31 DOI: 10.1016/j.dsp.2025.105859
Shiyue Na, Lin Chen, Muyu Lin
Software-Defined Networking (SDN) centralizes network control, enhancing management efficiency but increasing vulnerability to Distributed Denial-of-Service (DDoS) attacks. Low-rate DDoS (LDDoS) attacks are particularly challenging to detect, as their sporadic traffic closely mimics legitimate flows. Existing hybrid detection approaches often employ static fusion strategies that fail to adapt to the diverse characteristics of different LDDoS variants. This paper proposes a novel two-stage detection framework that fundamentally advances hybrid detection through adaptive feature fusion. The first stage utilizes Rényi entropy to efficiently filter 98.77% of benign traffic while retaining potential attack signatures. The second stage employs Trans-KAN, an innovative hybrid model that integrates Kolmogorov-Arnold Networks with Transformer architecture via an adaptive gating mechanism that dynamically balances their contributions through learnable weight matrices based on traffic characteristics. On custom SDN datasets, the proposed framework achieves 98.56% detection accuracy, 97.05% precision, 99.12% recall, and a 98.08% F1-Score with only 1.23% false positives, demonstrating improvements of 3.24% in accuracy over standalone Transformer and 5.48% over KAN. The synergistic combination of entropy-based pre-filtering and adaptive deep learning fusion establishes a new paradigm for LDDoS detection, offering theoretical insights into dynamic feature fusion and Kolmogorov-Arnold representations for hybrid deep learning, with practical applicability for next-generation network security systems.
SDN (software defined Networking)是一种网络集中控制技术,提高了网络管理效率,但也增加了受到DDoS (Distributed Denial-of-Service)攻击的脆弱性。低速率DDoS (LDDoS)攻击尤其难以检测,因为它们的零星流量非常接近合法流量。现有的混合检测方法通常采用静态融合策略,无法适应不同LDDoS变体的不同特征。本文提出了一种新的两阶段检测框架,从根本上推进了自适应特征融合混合检测。第一阶段利用rsamnyi熵有效过滤了98.77%的良性流量,同时保留了潜在的攻击特征。第二阶段采用Trans-KAN,这是一种创新的混合模型,通过自适应门通机制将Kolmogorov-Arnold网络与Transformer架构集成在一起,该机制通过基于流量特征的可学习权重矩阵动态平衡它们的贡献。在自定义SDN数据集上,该框架的检测准确率为98.56%,精度为97.05%,召回率为99.12%,F1-Score为98.08%,假阳性率仅为1.23%,比独立Transformer提高了3.24%,比KAN提高了5.48%。基于熵的预滤波和自适应深度学习融合的协同结合为LDDoS检测建立了一个新的范式,为混合深度学习的动态特征融合和Kolmogorov-Arnold表示提供了理论见解,具有下一代网络安全系统的实际适用性。
{"title":"Detecting low-rate DDoS attacks using Rényi entropy and trans-KAN hybrid model in SDN","authors":"Shiyue Na,&nbsp;Lin Chen,&nbsp;Muyu Lin","doi":"10.1016/j.dsp.2025.105859","DOIUrl":"10.1016/j.dsp.2025.105859","url":null,"abstract":"<div><div>Software-Defined Networking (SDN) centralizes network control, enhancing management efficiency but increasing vulnerability to Distributed Denial-of-Service (DDoS) attacks. Low-rate DDoS (LDDoS) attacks are particularly challenging to detect, as their sporadic traffic closely mimics legitimate flows. Existing hybrid detection approaches often employ static fusion strategies that fail to adapt to the diverse characteristics of different LDDoS variants. This paper proposes a novel two-stage detection framework that fundamentally advances hybrid detection through adaptive feature fusion. The first stage utilizes Rényi entropy to efficiently filter 98.77% of benign traffic while retaining potential attack signatures. The second stage employs Trans-KAN, an innovative hybrid model that integrates Kolmogorov-Arnold Networks with Transformer architecture via an adaptive gating mechanism that dynamically balances their contributions through learnable weight matrices based on traffic characteristics. On custom SDN datasets, the proposed framework achieves 98.56% detection accuracy, 97.05% precision, 99.12% recall, and a 98.08% F1-Score with only 1.23% false positives, demonstrating improvements of 3.24% in accuracy over standalone Transformer and 5.48% over KAN. The synergistic combination of entropy-based pre-filtering and adaptive deep learning fusion establishes a new paradigm for LDDoS detection, offering theoretical insights into dynamic feature fusion and Kolmogorov-Arnold representations for hybrid deep learning, with practical applicability for next-generation network security systems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105859"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Outage probability and ergodic capacity of RIS-assisted RSMA communication system ris辅助RSMA通信系统的中断概率和遍历容量
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-08 DOI: 10.1016/j.dsp.2026.105900
Nguyen Hong Kiem , Bui Anh Duc , Nguyen Tuan Minh , Le T.T. Huyen , Tran Manh Hoang
This paper investigates outage probability (OP) and ergodic capacity (EC) of a reconfigurable intelligent surface (RIS) assisted two-user rate-splitting multiple access (RSMA) communication system. Closed-form expressions for OP and EC are derived over Rayleigh fading channels, and validated through extensive Monte Carlo simulations. A comprehensive performance comparison is conducted between the proposed RIS-assisted RSMA scheme and two benchmark systems: RIS-assisted non-orthogonal multiple access (NOMA) and relay-assisted RSMA. Simulation results demonstrate that the proposed scheme significantly outperforms both benchmarks in terms of OP and EC, regardless of fading conditions. The influence of the critical system parameters, including the number of RIS reflecting elements, transmit power, power allocation factors, and the required rate of the common stream, is thoroughly examined. The results reveal that optimal power allocation between streams is essential for minimizing OP. These findings confirm that integrating RSMA with RIS provides a robust and efficient solution for enhancing communication reliability and spectral efficiency in future 6G wireless networks, especially in challenging non-line-of-sight environments.
研究了可重构智能表面(RIS)辅助的双用户分速多址(RSMA)通信系统的中断概率(OP)和遍历容量(EC)。在瑞利衰落信道上推导了OP和EC的封闭表达式,并通过大量的蒙特卡罗模拟进行了验证。将本文提出的ris辅助RSMA方案与ris辅助非正交多址(NOMA)和中继辅助RSMA两种基准系统进行了性能比较。仿真结果表明,无论在何种衰落条件下,该方案在OP和EC方面都明显优于两个基准。对关键系统参数的影响,包括RIS反射元件的数量、发射功率、功率分配因素和公共流所需的速率,进行了全面的研究。结果表明,流之间的最佳功率分配对于最小化op至关重要。这些研究结果证实,将RSMA与RIS集成为提高未来6G无线网络的通信可靠性和频谱效率提供了一个强大而有效的解决方案,特别是在具有挑战性的非视距环境中。
{"title":"Outage probability and ergodic capacity of RIS-assisted RSMA communication system","authors":"Nguyen Hong Kiem ,&nbsp;Bui Anh Duc ,&nbsp;Nguyen Tuan Minh ,&nbsp;Le T.T. Huyen ,&nbsp;Tran Manh Hoang","doi":"10.1016/j.dsp.2026.105900","DOIUrl":"10.1016/j.dsp.2026.105900","url":null,"abstract":"<div><div>This paper investigates outage probability (OP) and ergodic capacity (EC) of a reconfigurable intelligent surface (RIS) assisted two-user rate-splitting multiple access (RSMA) communication system. Closed-form expressions for OP and EC are derived over Rayleigh fading channels, and validated through extensive Monte Carlo simulations. A comprehensive performance comparison is conducted between the proposed RIS-assisted RSMA scheme and two benchmark systems: RIS-assisted non-orthogonal multiple access (NOMA) and relay-assisted RSMA. Simulation results demonstrate that the proposed scheme significantly outperforms both benchmarks in terms of OP and EC, regardless of fading conditions. The influence of the critical system parameters, including the number of RIS reflecting elements, transmit power, power allocation factors, and the required rate of the common stream, is thoroughly examined. The results reveal that optimal power allocation between streams is essential for minimizing OP. These findings confirm that integrating RSMA with RIS provides a robust and efficient solution for enhancing communication reliability and spectral efficiency in future 6G wireless networks, especially in challenging non-line-of-sight environments.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105900"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1