首页 > 最新文献

Digital Signal Processing最新文献

英文 中文
RRDT-guided FPT point detection combined with DCDTN: a new strategy for reliable rolling bearing remaining useful life prediction rrdt引导的FPT点检测与DCDTN相结合:滚动轴承剩余使用寿命可靠预测的新策略
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2025-12-29 DOI: 10.1016/j.dsp.2025.105865
Zhigang Feng, Mengyuan Ren
Rolling bearing remaining useful life (RUL) prediction still faces major challenges, including the difficulty of reliably locating the initial degradation point under noisy and long stationary conditions, the lack of discriminative multi-scale features that capture both transient and steady-state degradation patterns, and the limited ability of existing models to fuse heterogeneous temporal information. To address these issues, this study proposes a framework integrating robust regression with a dynamic 3σ-based threshold (RRDT) for first prediction time (FPT) identification and a dual-branch coordinated deep temporal network (DCDTN) for RUL estimation. By accurately detecting the degradation onset amid noise, RRDT provides a more trustworthy starting point for RUL modeling. On the modeling side, DCDTN comprises two feature pathways: a wavelet scattering transform (WST) branch whose coefficients are adaptively refined by the scattering feature adaptive processor (SFAP) unit to suppress redundancy, and a raw-signal branch that employs multi-scale convolutions and a depthwise-separable feed-forward network based temporal convolutional network (FN-TCN) to mine multi-scale deep features. A coordinated temporal fusion module combining Peephole BiConvLSTM and efficient multi-scale attention (EMA) further enhances the representation of complex degradation dynamics, producing the final RUL estimate. Experiments on the XJTU-SY and PHM2012 datasets demonstrate that the proposed method achieves significantly higher prediction accuracy than conventional RUL models. Overall, the framework effectively addresses key obstacles in bearing RUL prediction and shows strong potential for industrial deployment in equipment health monitoring and early-warning scenarios.
滚动轴承剩余使用寿命(RUL)预测仍然面临重大挑战,包括难以在噪声和长平稳条件下可靠地定位初始退化点,缺乏捕获瞬态和稳态退化模式的判别多尺度特征,以及现有模型融合异构时间信息的能力有限。为了解决这些问题,本研究提出了一个框架,将鲁棒回归与基于动态3σ阈值(RRDT)的首次预测时间(FPT)识别和双分支协调深度时间网络(DCDTN)相结合,用于RUL估计。通过在噪声中准确检测退化的开始,RRDT为RUL建模提供了一个更可靠的起点。在建模方面,DCDTN包括两个特征路径:小波散射变换(WST)分支,其系数由散射特征自适应处理器(SFAP)单元自适应细化以抑制冗余;原始信号分支采用多尺度卷积和基于深度可分前馈网络的时间卷积网络(FN-TCN)挖掘多尺度深度特征。结合Peephole BiConvLSTM和高效多尺度注意(EMA)的协调时间融合模块进一步增强了复杂退化动力学的表征,从而产生最终的RUL估计。在XJTU-SY和PHM2012数据集上的实验表明,该方法的预测精度明显高于传统的RUL模型。总体而言,该框架有效地解决了轴承RUL预测中的关键障碍,并在设备健康监测和预警场景中显示出强大的工业部署潜力。
{"title":"RRDT-guided FPT point detection combined with DCDTN: a new strategy for reliable rolling bearing remaining useful life prediction","authors":"Zhigang Feng,&nbsp;Mengyuan Ren","doi":"10.1016/j.dsp.2025.105865","DOIUrl":"10.1016/j.dsp.2025.105865","url":null,"abstract":"<div><div>Rolling bearing remaining useful life (RUL) prediction still faces major challenges, including the difficulty of reliably locating the initial degradation point under noisy and long stationary conditions, the lack of discriminative multi-scale features that capture both transient and steady-state degradation patterns, and the limited ability of existing models to fuse heterogeneous temporal information. To address these issues, this study proposes a framework integrating robust regression with a dynamic 3σ-based threshold (RRDT) for first prediction time (FPT) identification and a dual-branch coordinated deep temporal network (DCDTN) for RUL estimation. By accurately detecting the degradation onset amid noise, RRDT provides a more trustworthy starting point for RUL modeling. On the modeling side, DCDTN comprises two feature pathways: a wavelet scattering transform (WST) branch whose coefficients are adaptively refined by the scattering feature adaptive processor (SFAP) unit to suppress redundancy, and a raw-signal branch that employs multi-scale convolutions and a depthwise-separable feed-forward network based temporal convolutional network (FN-TCN) to mine multi-scale deep features. A coordinated temporal fusion module combining Peephole BiConvLSTM and efficient multi-scale attention (EMA) further enhances the representation of complex degradation dynamics, producing the final RUL estimate. Experiments on the XJTU-SY and PHM2012 datasets demonstrate that the proposed method achieves significantly higher prediction accuracy than conventional RUL models. Overall, the framework effectively addresses key obstacles in bearing RUL prediction and shows strong potential for industrial deployment in equipment health monitoring and early-warning scenarios.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105865"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SBLR: A high-accuracy graph signal processing algorithm for nodal head reconstruction in water networks SBLR:一种用于水网节点头重建的高精度图形信号处理算法
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-05 DOI: 10.1016/j.dsp.2026.105888
Amin A. Maggang , David B. Tay , Jinzhe Gong
In water distribution networks, pressure monitoring in terms of the nodal head values, is essential to provide insights into the operational state of the network and to optimize performance. Installing pressure sensors at pipe junctions is usually practically challenging and costly. Nodal heads at most locations are usually estimated using measurements from the sensors at limited number of locations. This task can be formulated as a problem in signal reconstruction, where the nodal head is considered as the graph signal. In this work, we develop a graph signal processing based algorithm for nodal head reconstruction. The algorithm is developed in the graph spectral domain, but can be implemented in the vertex domain, without the need for performing eigendecomposition. The algorithm exploits the smoothness assumption that is usually observed with steady-state nodal heads, but is also able to deal with any low level high-frequency information that may be present in the signal. Extensive performance evaluation of the proposed algorithm using realistic water network model and comparison with other algorithms is presented in this work.
在配水网络中,根据节点水头值进行压力监测,对于了解网络的运行状态和优化性能至关重要。在管道连接处安装压力传感器通常具有挑战性且成本高昂。大多数位置的节点头通常是使用有限数量位置的传感器测量来估计的。这个任务可以表述为信号重构中的一个问题,其中节点头被认为是图信号。在这项工作中,我们开发了一种基于图信号处理的节点头重建算法。该算法是在图谱域开发的,但可以在顶点域实现,而不需要执行特征分解。该算法利用了通常在稳态节点头中观察到的平滑假设,但也能够处理信号中可能存在的任何低电平高频信息。本文使用现实水网模型对所提出的算法进行了广泛的性能评估,并与其他算法进行了比较。
{"title":"SBLR: A high-accuracy graph signal processing algorithm for nodal head reconstruction in water networks","authors":"Amin A. Maggang ,&nbsp;David B. Tay ,&nbsp;Jinzhe Gong","doi":"10.1016/j.dsp.2026.105888","DOIUrl":"10.1016/j.dsp.2026.105888","url":null,"abstract":"<div><div>In water distribution networks, pressure monitoring in terms of the nodal head values, is essential to provide insights into the operational state of the network and to optimize performance. Installing pressure sensors at pipe junctions is usually practically challenging and costly. Nodal heads at most locations are usually estimated using measurements from the sensors at limited number of locations. This task can be formulated as a problem in signal reconstruction, where the nodal head is considered as the graph signal. In this work, we develop a graph signal processing based algorithm for nodal head reconstruction. The algorithm is developed in the graph spectral domain, but can be implemented in the vertex domain, without the need for performing eigendecomposition. The algorithm exploits the smoothness assumption that is usually observed with steady-state nodal heads, but is also able to deal with any low level high-frequency information that may be present in the signal. Extensive performance evaluation of the proposed algorithm using realistic water network model and comparison with other algorithms is presented in this work.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105888"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UEM-Net: Unsupervised segmentation network for kidneys and renal tumors based on image enhancement and migration repair mechanisms UEM-Net:基于图像增强和迁移修复机制的肾脏和肾脏肿瘤无监督分割网络
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2025-12-31 DOI: 10.1016/j.dsp.2025.105862
Zhengyu Li , Yanjun Peng , Keqiong Wang , Nan Lv
Kidney cancer is one of the top ten cancers in the world. Early detection of renal tumors can significantly improve patient survival rates. Automatic segmentation of kidneys and renal lesions from CT images is crucial for the treatment of renal cancer. However, due to the irregular and uneven distribution of renal tumor growth, diverse physiological morphologies, and the extreme difficulty in detecting small tumors, achieving complete detection of tumors remains challenging.Therefore, this paper proposes an unsupervised kidney and renal tumor segmentation network that integrates image enhancement and transfer-based restoration mechanisms. First, a Markov image enhancement method is designed to generate synthetic tumor images with more realistic structures and textures through pixel-level dependency modeling. These synthetic images are used as pseudo-labels during training to enrich data diversity. Second, an unsupervised saliency segmentation network is constructed to adaptively extract salient regions by leveraging feature differences between foreground and background areas, enabling fine segmentation of tumors and cysts. Finally, a transfer restoration mechanism is introduced, which reconstructs overlapping regions of the kidney and lesions based on spatial consistency constraints between pixels and their neighborhoods, effectively completing incomplete labels and further improving the accuracy and integrity of kidney structure segmentation. Our method achieved high experimental results on the Kits2019, Kits2021, and Kits2023 datasets, with the results on Kits2021 and Kits2023 surpassing the current first-place entries in the competitions.
肾癌是世界十大癌症之一。早期发现肾肿瘤可显著提高患者生存率。从CT图像中自动分割肾脏和肾脏病变对于肾癌的治疗至关重要。然而,由于肾脏肿瘤生长分布不规则、不均匀,生理形态多样,小肿瘤的检测难度极大,实现肿瘤的完全检测仍然是一个挑战。因此,本文提出了一种结合图像增强和基于转移的恢复机制的无监督肾脏和肾脏肿瘤分割网络。首先,设计了一种马尔可夫图像增强方法,通过像素级依赖建模生成具有更逼真结构和纹理的合成肿瘤图像。这些合成图像在训练时用作伪标签,以丰富数据的多样性。其次,构建无监督显著性分割网络,利用前景和背景区域的特征差异自适应提取显著区域,实现肿瘤和囊肿的精细分割;最后,引入了一种传递恢复机制,基于像素及其邻域之间的空间一致性约束,重构肾脏和病变的重叠区域,有效地完成了不完整的标记,进一步提高了肾脏结构分割的准确性和完整性。我们的方法在Kits2019、Kits2021和Kits2023数据集上取得了很高的实验结果,其中Kits2021和Kits2023的结果超过了目前比赛中的第一名。
{"title":"UEM-Net: Unsupervised segmentation network for kidneys and renal tumors based on image enhancement and migration repair mechanisms","authors":"Zhengyu Li ,&nbsp;Yanjun Peng ,&nbsp;Keqiong Wang ,&nbsp;Nan Lv","doi":"10.1016/j.dsp.2025.105862","DOIUrl":"10.1016/j.dsp.2025.105862","url":null,"abstract":"<div><div>Kidney cancer is one of the top ten cancers in the world. Early detection of renal tumors can significantly improve patient survival rates. Automatic segmentation of kidneys and renal lesions from CT images is crucial for the treatment of renal cancer. However, due to the irregular and uneven distribution of renal tumor growth, diverse physiological morphologies, and the extreme difficulty in detecting small tumors, achieving complete detection of tumors remains challenging.Therefore, this paper proposes an unsupervised kidney and renal tumor segmentation network that integrates image enhancement and transfer-based restoration mechanisms. First, a Markov image enhancement method is designed to generate synthetic tumor images with more realistic structures and textures through pixel-level dependency modeling. These synthetic images are used as pseudo-labels during training to enrich data diversity. Second, an unsupervised saliency segmentation network is constructed to adaptively extract salient regions by leveraging feature differences between foreground and background areas, enabling fine segmentation of tumors and cysts. Finally, a transfer restoration mechanism is introduced, which reconstructs overlapping regions of the kidney and lesions based on spatial consistency constraints between pixels and their neighborhoods, effectively completing incomplete labels and further improving the accuracy and integrity of kidney structure segmentation. Our method achieved high experimental results on the Kits2019, Kits2021, and Kits2023 datasets, with the results on Kits2021 and Kits2023 surpassing the current first-place entries in the competitions.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105862"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint equalization and power allocation for UAV-assisted RSMA-OTFS transmission over doubly dispersive channels 双色散信道无人机辅助RSMA-OTFS传输的联合均衡与功率分配
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-03 DOI: 10.1016/j.dsp.2025.105870
Ghania Khraimech , Fatiha Merazka , Mustapha Benssalah
With the emergence of sixth-generation (6G) networks, unmanned aerial vehicles (UAVs) are expected to play a key role in enhancing coverage, connectivity, and capacity, particularly in highly dynamic environments. However, UAV-based communication systems face significant challenges such as high mobility and severe Doppler effects, interference management, and link reliability, due to the doubly selective nature of wireless channels, characterized by both time and frequency selective variations resulting from high mobility. To address these issues, this paper proposes a downlink transmission framework that integrates rate-splitting multiple access (RSMA) with orthogonal time frequency space (OTFS) modulation. This combination enhances the communication reliability and the spectral efficiency in UAV-assisted networks. The channel is modeled using integer-valued delay-Doppler parameters to reflect realistic high-mobility conditions. To reduce the inter-symbol interference (ISI) and enhance the detection performance, a low-complexity equalization algorithm based on QR decomposition with Givens rotations is introduced, tailored to the sparse structure of the OTFS channel matrix. Additionally, a bi-objective power allocation strategy for RSMA is formulated to simultaneously minimize bit error rate (BER) and maximize throughput. The non-dominated sorting genetic algorithm II (NSGA-II) is used to find Pareto-optimal solutions suited to varying system demands. Comprehensive simulations further compare the proposed OTFS-RSMA system with benchmark schemes, namely OTFS-NOMA and OFDMA-RSMA, under identical conditions, showing superior sum-rate and BER performance.
随着第六代(6G)网络的出现,无人驾驶飞行器(uav)预计将在增强覆盖、连通性和容量方面发挥关键作用,特别是在高动态环境中。然而,基于无人机的通信系统面临着重大挑战,如高移动性和严重的多普勒效应、干扰管理和链路可靠性,这是由于无线信道的双重选择性,其特点是高移动性导致的时间和频率选择性变化。为了解决这些问题,本文提出了一种集成了分频多址(RSMA)和正交时频空间(OTFS)调制的下行传输框架。这种组合提高了无人机辅助网络的通信可靠性和频谱效率。采用整数值延迟多普勒参数对信道进行建模,以反映实际的高迁移率条件。为了减少码间干扰,提高检测性能,针对OTFS信道矩阵的稀疏结构,提出了一种基于给定旋转QR分解的低复杂度均衡算法。此外,提出了RSMA的双目标功率分配策略,以实现误码率最小化和吞吐量最大化。非支配排序遗传算法II (NSGA-II)用于寻找适合不同系统需求的pareto最优解。综合仿真进一步将所提出的OTFS-RSMA系统与基准方案OTFS-NOMA和OFDMA-RSMA在相同条件下进行了比较,显示出优越的和速率和误码率性能。
{"title":"Joint equalization and power allocation for UAV-assisted RSMA-OTFS transmission over doubly dispersive channels","authors":"Ghania Khraimech ,&nbsp;Fatiha Merazka ,&nbsp;Mustapha Benssalah","doi":"10.1016/j.dsp.2025.105870","DOIUrl":"10.1016/j.dsp.2025.105870","url":null,"abstract":"<div><div>With the emergence of sixth-generation (6G) networks, unmanned aerial vehicles (UAVs) are expected to play a key role in enhancing coverage, connectivity, and capacity, particularly in highly dynamic environments. However, UAV-based communication systems face significant challenges such as high mobility and severe Doppler effects, interference management, and link reliability, due to the doubly selective nature of wireless channels, characterized by both time and frequency selective variations resulting from high mobility. To address these issues, this paper proposes a downlink transmission framework that integrates rate-splitting multiple access (RSMA) with orthogonal time frequency space (OTFS) modulation. This combination enhances the communication reliability and the spectral efficiency in UAV-assisted networks. The channel is modeled using integer-valued delay-Doppler parameters to reflect realistic high-mobility conditions. To reduce the inter-symbol interference (ISI) and enhance the detection performance, a low-complexity equalization algorithm based on QR decomposition with Givens rotations is introduced, tailored to the sparse structure of the OTFS channel matrix. Additionally, a bi-objective power allocation strategy for RSMA is formulated to simultaneously minimize bit error rate (BER) and maximize throughput. The non-dominated sorting genetic algorithm II (NSGA-II) is used to find Pareto-optimal solutions suited to varying system demands. Comprehensive simulations further compare the proposed OTFS-RSMA system with benchmark schemes, namely OTFS-NOMA and OFDMA-RSMA, under identical conditions, showing superior sum-rate and BER performance.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105870"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved reinforcement learning-based joint decision-making of detection modes and transmit power for LPI radar 基于改进强化学习的LPI雷达探测模式与发射功率联合决策
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-02 DOI: 10.1016/j.dsp.2025.105872
Huilong Tang , Wei Wang , Zhiwei Pu , Jianlin Wei , Wang Zhang
Modern airborne radar reconnaissance systems employ range-divided multi-stage operations (e.g., passive detection, active detection, and active identification). However, traditional low probability of intercept (LPI) radar designs focus on optimizing performance for individual reconnaissance stages, resulting in suboptimal overall detection capability. Meanwhile, multi-stage operations yield excessive invalid and suboptimal actions, creating action space redundancy that deteriorates learning efficiency. This paper proposes a reinforcement learning (RL)-based joint decision-making method for enhanced detection performance, incorporating improved RL exploration mechanisms to accelerate learning. Firstly, adversarial strategies from each stage are integrated to construct a joint decision-making framework for detection modes and transmit power (JD-DMTP). Based on this framework, the RL elements are designed to enhance detection performance under LPI constraints. Secondly, we propose the trainable suboptimal action mask (TSAM), equipped with suboptimal action elimination criteria, to filter out both invalid and suboptimal actions, thereby improving learning efficiency. Finally, the experimental results validate the effectiveness of the JD-DMTP, showing 6.46×/4.04× higher hit value ratio and 1.52×/1.32× better successful decision-making rate (ideal/non-ideal environment) compared to the minimum-transmit-power baseline. The TSAM achieves comparable performance to the trainable action mask (TAM) baseline with only 25% of the required training iterations.
现代机载雷达侦察系统采用距离分割多阶段操作(例如,被动探测、主动探测和主动识别)。然而,传统的低截获概率(LPI)雷达设计侧重于优化单个侦察阶段的性能,导致整体探测能力不理想。同时,多阶段操作会产生过多无效和次优动作,造成动作空间冗余,降低学习效率。本文提出了一种基于强化学习(RL)的联合决策方法来提高检测性能,并结合改进的RL探索机制来加速学习。首先,整合各阶段的对抗策略,构建探测模式和发射功率联合决策框架(JD-DMTP)。基于该框架,设计了RL元素,以提高LPI约束下的检测性能。其次,我们提出了可训练次优动作掩模(TSAM),该掩模具有次优动作消除准则,可以过滤掉无效动作和次优动作,从而提高学习效率。最后,实验结果验证了JD-DMTP的有效性,与最小发射功率基线相比,在理想/非理想环境下,JD-DMTP的命中率提高了6.46×/4.04×,成功决策率提高了1.52×/1.32×。TSAM只需要25%的训练迭代就可以达到与可训练动作掩码(TAM)基线相当的性能。
{"title":"Improved reinforcement learning-based joint decision-making of detection modes and transmit power for LPI radar","authors":"Huilong Tang ,&nbsp;Wei Wang ,&nbsp;Zhiwei Pu ,&nbsp;Jianlin Wei ,&nbsp;Wang Zhang","doi":"10.1016/j.dsp.2025.105872","DOIUrl":"10.1016/j.dsp.2025.105872","url":null,"abstract":"<div><div>Modern airborne radar reconnaissance systems employ range-divided multi-stage operations (e.g., passive detection, active detection, and active identification). However, traditional low probability of intercept (LPI) radar designs focus on optimizing performance for individual reconnaissance stages, resulting in suboptimal overall detection capability. Meanwhile, multi-stage operations yield excessive invalid and suboptimal actions, creating action space redundancy that deteriorates learning efficiency. This paper proposes a reinforcement learning (RL)-based joint decision-making method for enhanced detection performance, incorporating improved RL exploration mechanisms to accelerate learning. Firstly, adversarial strategies from each stage are integrated to construct a joint decision-making framework for detection modes and transmit power (JD-DMTP). Based on this framework, the RL elements are designed to enhance detection performance under LPI constraints. Secondly, we propose the trainable suboptimal action mask (TSAM), equipped with suboptimal action elimination criteria, to filter out both invalid and suboptimal actions, thereby improving learning efficiency. Finally, the experimental results validate the effectiveness of the JD-DMTP, showing 6.46×/4.04× higher hit value ratio and 1.52×/1.32× better successful decision-making rate (ideal/non-ideal environment) compared to the minimum-transmit-power baseline. The TSAM achieves comparable performance to the trainable action mask (TAM) baseline with only 25% of the required training iterations.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105872"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An enhanced traffic object detection network based on multi-point attention and sparse feature aggregation 基于多点关注和稀疏特征聚合的增强交通目标检测网络
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-07 DOI: 10.1016/j.dsp.2026.105897
Xun Li, Qidi Wang, Ruixue Shi, Ruibo Nui, Weizhong Chen
Urban traffic surveillance often faces challenges such as vehicle occlusion, scale variation, and complex background interference, leading to missed detections and false positives. To address these challenges, this paper presents YOLOv11-W, an enhanced object detection network built upon YOLOv11. The proposed model improves three key aspects: reducing detection errors, strengthening perception of small-angle objects, and enhancing feature extraction. A C3k2_SimAM module is introduced to apply secondary weighting during feature fusion, thereby improving target saliency. At the backbone endpoint, the GAM module forms a Multi-Point Attention Enhancement (MPAE) mechanism, enabling stronger perception of critical regions and more effective global modeling. For feature aggregation, the sparsity-constrained SPPCSPC module replaces the conventional SPPF, enhancing multi-scale contextual awareness while minimizing redundancy. In the detection head, WIoU v2 serves as the bounding-box regression loss, improving localization accuracy for occluded and small objects while accelerating convergence. To mitigate motion blur, the network employs an attention-based multi-scale reweighting mechanism that reinforces blurred edge and texture features across scales, effectively preserving structural details. By the cooperation of these modules, YOLOv11-W achieves greater representational power and adaptability in complex multi-object traffic scenarios. Experimental results confirm its effectiveness: recognition accuracies reach 97.5%, 96.2%, and 96.2% in free flow, synchronous flow, and blocking flow conditions, representing gains of 0.1%, 0.9%, and 2.4% over YOLOv11. Meanwhile, the optimized design preserves real-time performance, achieving 66.7, 64.4, and 67.3 FPS across different traffic states. These results demonstrate that YOLOv11-W provides a balanced solution for accuracy and efficiency in urban traffic detection.
城市交通监控经常面临车辆遮挡、尺度变化和复杂背景干扰等挑战,导致漏检和误报。为了解决这些挑战,本文提出了基于YOLOv11的增强目标检测网络YOLOv11- w。该模型在减少检测误差、增强小角度目标感知和增强特征提取三个方面进行了改进。引入C3k2_SimAM模块,在特征融合过程中应用二次加权,提高目标显著性。在骨干端点,GAM模块形成多点注意增强(Multi-Point Attention Enhancement, MPAE)机制,实现对关键区域的更强感知和更有效的全局建模。对于特征聚合,稀疏约束的SPPCSPC模块取代了传统的SPPF,增强了多尺度上下文感知,同时最小化了冗余。在检测头部,WIoU v2作为边界盒回归损失,提高了对遮挡和小目标的定位精度,同时加快了收敛速度。为了减轻运动模糊,该网络采用了一种基于注意力的多尺度重加权机制,增强了模糊的边缘和纹理特征,有效地保留了结构细节。通过这些模块的协同,YOLOv11-W在复杂的多目标交通场景中实现了更强的表现能力和适应性。实验结果证实了该算法的有效性:在自由流动、同步流动和阻塞流动条件下,识别准确率分别达到97.5%、96.2%和96.2%,比YOLOv11分别提高0.1%、0.9%和2.4%。同时,优化后的设计保持了实时性,在不同的流量状态下FPS分别达到66.7、64.4和67.3。这些结果表明,YOLOv11-W为城市交通检测提供了精度和效率的平衡解决方案。
{"title":"An enhanced traffic object detection network based on multi-point attention and sparse feature aggregation","authors":"Xun Li,&nbsp;Qidi Wang,&nbsp;Ruixue Shi,&nbsp;Ruibo Nui,&nbsp;Weizhong Chen","doi":"10.1016/j.dsp.2026.105897","DOIUrl":"10.1016/j.dsp.2026.105897","url":null,"abstract":"<div><div>Urban traffic surveillance often faces challenges such as vehicle occlusion, scale variation, and complex background interference, leading to missed detections and false positives. To address these challenges, this paper presents YOLOv11-W, an enhanced object detection network built upon YOLOv11. The proposed model improves three key aspects: reducing detection errors, strengthening perception of small-angle objects, and enhancing feature extraction. A C3k2_SimAM module is introduced to apply secondary weighting during feature fusion, thereby improving target saliency. At the backbone endpoint, the GAM module forms a Multi-Point Attention Enhancement (MPAE) mechanism, enabling stronger perception of critical regions and more effective global modeling. For feature aggregation, the sparsity-constrained SPPCSPC module replaces the conventional SPPF, enhancing multi-scale contextual awareness while minimizing redundancy. In the detection head, WIoU v2 serves as the bounding-box regression loss, improving localization accuracy for occluded and small objects while accelerating convergence. To mitigate motion blur, the network employs an attention-based multi-scale reweighting mechanism that reinforces blurred edge and texture features across scales, effectively preserving structural details. By the cooperation of these modules, YOLOv11-W achieves greater representational power and adaptability in complex multi-object traffic scenarios. Experimental results confirm its effectiveness: recognition accuracies reach 97.5%, 96.2%, and 96.2% in free flow, synchronous flow, and blocking flow conditions, representing gains of 0.1%, 0.9%, and 2.4% over YOLOv11. Meanwhile, the optimized design preserves real-time performance, achieving 66.7, 64.4, and 67.3 FPS across different traffic states. These results demonstrate that YOLOv11-W provides a balanced solution for accuracy and efficiency in urban traffic detection.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105897"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-supervised monocular depth estimation using a hierarchical decoder and relative cost volume 使用分层解码器和相对成本体积的自监督单目深度估计
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2025-12-31 DOI: 10.1016/j.dsp.2025.105871
Dan Xu, Jiaao Wang, Yang Zhou, Qiang Qian, Jinlong Shi
Self-supervised learning has emerged as a promising approach in monocular depth estimation due to its independence from ground-truth depth annotations. However, its reliance on photometric consistency between adjacent frames as the supervisory signal makes it particularly vulnerable to illumination changes, occlusions, and dynamic objects. These limitations often result in unstable supervision and blurred depth predictions, especially near object boundaries. Furthermore, the absence of geometric constraints-typically provided by stereo or multi-view systems-hinders the accurate modeling of scene structure, compromising spatial coherence and geometric fidelity. To address these challenges, we propose a novel monocular depth estimation framework that combines hierarchical decoder with a relative distance cost volume. The proposed hierarchical decoder employs Laplacian pyramid residuals to enhance high-frequency details, while a residual mean feature strengthens edge and texture representation during decoding, effectively reducing boundary blurring caused by photometric inconsistencies. Additionally, we introduce a global-local decoupling structure within a Transformer-based architecture to construct the relative distance cost volume. By integrating global depth representations with local query mechanisms, our method captures intricate geometric relationships and improves scene understanding. Extensive experiments on the KITTI and Make3D datasets demonstrate that our framework achieves state-of-the-art performance across all evaluation metrics, while preserving fine-grained details and exhibiting strong generalization capability. Our source codes are available at https://github.com/jiaaw1/LPD-DCV-depth.
自监督学习由于其独立于真值深度标注而成为单目深度估计的一种很有前途的方法。然而,它依赖于相邻帧之间的光度一致性作为监控信号,这使得它特别容易受到照明变化、遮挡和动态物体的影响。这些限制通常会导致不稳定的监督和模糊的深度预测,特别是在物体边界附近。此外,缺乏几何约束——通常由立体或多视图系统提供——阻碍了场景结构的准确建模,损害了空间一致性和几何保真度。为了解决这些挑战,我们提出了一种新的单目深度估计框架,该框架将分层解码器与相对距离成本体积相结合。本文提出的分层解码器采用拉普拉斯金字塔残差增强高频细节,残差均值特征增强解码过程中的边缘和纹理表征,有效降低因光度不一致导致的边界模糊。此外,我们在基于变压器的体系结构中引入了全局-局部解耦结构来构造相对距离成本体积。通过将全局深度表示与局部查询机制相结合,我们的方法捕获了复杂的几何关系,提高了对场景的理解。在KITTI和Make3D数据集上进行的大量实验表明,我们的框架在所有评估指标上都实现了最先进的性能,同时保留了细粒度的细节并表现出强大的泛化能力。我们的源代码可在https://github.com/jiaaw1/LPD-DCV-depth上获得。
{"title":"Self-supervised monocular depth estimation using a hierarchical decoder and relative cost volume","authors":"Dan Xu,&nbsp;Jiaao Wang,&nbsp;Yang Zhou,&nbsp;Qiang Qian,&nbsp;Jinlong Shi","doi":"10.1016/j.dsp.2025.105871","DOIUrl":"10.1016/j.dsp.2025.105871","url":null,"abstract":"<div><div>Self-supervised learning has emerged as a promising approach in monocular depth estimation due to its independence from ground-truth depth annotations. However, its reliance on photometric consistency between adjacent frames as the supervisory signal makes it particularly vulnerable to illumination changes, occlusions, and dynamic objects. These limitations often result in unstable supervision and blurred depth predictions, especially near object boundaries. Furthermore, the absence of geometric constraints-typically provided by stereo or multi-view systems-hinders the accurate modeling of scene structure, compromising spatial coherence and geometric fidelity. To address these challenges, we propose a novel monocular depth estimation framework that combines hierarchical decoder with a relative distance cost volume. The proposed hierarchical decoder employs Laplacian pyramid residuals to enhance high-frequency details, while a residual mean feature strengthens edge and texture representation during decoding, effectively reducing boundary blurring caused by photometric inconsistencies. Additionally, we introduce a global-local decoupling structure within a Transformer-based architecture to construct the relative distance cost volume. By integrating global depth representations with local query mechanisms, our method captures intricate geometric relationships and improves scene understanding. Extensive experiments on the KITTI and Make3D datasets demonstrate that our framework achieves state-of-the-art performance across all evaluation metrics, while preserving fine-grained details and exhibiting strong generalization capability. Our source codes are available at <span><span>https://github.com/jiaaw1/LPD-DCV-depth</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105871"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the convergence rate of regularized GANs training: A dissipative dynamical system perspective 正则化gan训练的收敛速度:耗散动力系统视角
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2025-12-29 DOI: 10.1016/j.dsp.2025.105860
Liwen Jiang , Li Chai
It is well-known that Generative Adversarial Networks (GANs) are difficult to train and great efforts have been devoted to analyze and stabilize the training dynamics of GANs. Recent works have analyzed the factors influencing regularized GAN’s convergence and revealed that the training dynamics tend to converge locally near equilibrium points. In this paper, we studied the convergence rate of regularized GANs by the theory of dissipative dynamical systems, which can be viewed as the generalization of passivity theory and the small-gain theory of nonlinear systems. We analyze the impact of learning rate on regularized GANs’ training process. We prove the convergence of the training process for regularized GANs without involving eigen-analysis of the Jacobian matrix. We are able to derive the system’s geometric convergence rate and identify the optimal learning rate that leads to the fastest convergence. We have conducted extensive experiments on several datasets to verify our theoretical results.
生成对抗网络(Generative Adversarial Networks, GANs)的训练难度很大,人们对其训练动态的分析和稳定进行了大量的研究。最近的研究分析了影响正则化GAN收敛性的因素,发现训练动态倾向于在平衡点附近局部收敛。本文利用耗散动力系统理论研究了正则化gan的收敛速度,耗散动力系统理论可以看作是非线性系统无源性理论和小增益理论的推广。我们分析了学习率对正则化gan训练过程的影响。在不涉及雅可比矩阵特征分析的情况下,证明了正则化gan训练过程的收敛性。我们能够推导出系统的几何收敛率,并确定导致最快收敛的最佳学习率。我们在几个数据集上进行了大量的实验来验证我们的理论结果。
{"title":"On the convergence rate of regularized GANs training: A dissipative dynamical system perspective","authors":"Liwen Jiang ,&nbsp;Li Chai","doi":"10.1016/j.dsp.2025.105860","DOIUrl":"10.1016/j.dsp.2025.105860","url":null,"abstract":"<div><div>It is well-known that Generative Adversarial Networks (GANs) are difficult to train and great efforts have been devoted to analyze and stabilize the training dynamics of GANs. Recent works have analyzed the factors influencing regularized GAN’s convergence and revealed that the training dynamics tend to converge locally near equilibrium points. In this paper, we studied the convergence rate of regularized GANs by the theory of dissipative dynamical systems, which can be viewed as the generalization of passivity theory and the small-gain theory of nonlinear systems. We analyze the impact of learning rate on regularized GANs’ training process. We prove the convergence of the training process for regularized GANs without involving eigen-analysis of the Jacobian matrix. We are able to derive the system’s geometric convergence rate and identify the optimal learning rate that leads to the fastest convergence. We have conducted extensive experiments on several datasets to verify our theoretical results.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105860"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MPDL-YOLO: A multidimensional attention and lightweight convolution framework for remote sensing object detection MPDL-YOLO:一个用于遥感目标检测的多维关注和轻量级卷积框架
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2025-12-31 DOI: 10.1016/j.dsp.2025.105873
Pengyu Chen, Lie Wang, Yuman Liang, Zunmin Hou, Guangbin He, Hongshuai Chen
The rapid advancement of satellite and sensor technologies has facilitated the acquisition of remote sensing images, yet the efficient and accurate extraction of valuable information from high-resolution data, particularly for object detection, remains challenging. While deep learning-based algorithms have shown promise in automatic feature learning, single-scale feature layers struggle with large-scale variations, and existing attention mechanisms and convolutional modules are insufficient for remote sensing tasks. To address these issues, this paper proposes a novel Multi-scale Perception and Detail Learning YOLO (MPDL-YOLO) model for remote sensing object detection. First, we propose a Median-pooling Space and Channel Attention Block (MPCS), integrating global average, max, and median pooling to create a multi-dimensional attention mechanism that reduces noise while preserving edge details. Second, we design a Depthwise Separable Lightweight Inception Convolution (DWInceptionLite) by combining depthwise separable convolutions and Inception structures, significantly reducing computational complexity. Finally, we employ an inverted Residual Mobile Block (iRMB) to construct a Hierarchical Feature Fusion Block (HFFB), enhancing feature extraction and detail precision. Experimental results demonstrate that compared to YOLOv8, MPDL-YOLO achieves improvements of 1.2% and 3.2% in [email protected], and 1.5% and 1.8% in [email protected]:0.95 on the DIOR and RSOD datasets, respectively, thus validating the effectiveness and superiority of the proposed algorithm.
卫星和传感器技术的迅速发展促进了遥感图像的获取,然而,从高分辨率数据中有效和准确地提取有价值的信息,特别是用于目标探测,仍然具有挑战性。虽然基于深度学习的算法在自动特征学习中显示出前景,但单尺度特征层难以应对大规模变化,现有的注意机制和卷积模块不足以满足遥感任务。为了解决这些问题,本文提出了一种新的多尺度感知和细节学习YOLO (MPDL-YOLO)遥感目标检测模型。首先,我们提出了一个中位数池化空间和通道注意块(MPCS),整合了全局平均池化、最大池化和中位数池化,以创建一个多维注意机制,在保留边缘细节的同时降低噪声。其次,我们设计了一个深度可分轻量级初始卷积(DWInceptionLite),将深度可分卷积与初始结构相结合,显著降低了计算复杂度。最后,利用倒转残差移动块(iRMB)构造层次化特征融合块(HFFB),提高特征提取和细节精度。实验结果表明,与YOLOv8相比,MPDL-YOLO在[email protected]数据集上分别提高了1.2%和3.2%,在[email protected]数据集上分别提高了1.5%和1.8%:0.95,验证了本文算法的有效性和优越性。
{"title":"MPDL-YOLO: A multidimensional attention and lightweight convolution framework for remote sensing object detection","authors":"Pengyu Chen,&nbsp;Lie Wang,&nbsp;Yuman Liang,&nbsp;Zunmin Hou,&nbsp;Guangbin He,&nbsp;Hongshuai Chen","doi":"10.1016/j.dsp.2025.105873","DOIUrl":"10.1016/j.dsp.2025.105873","url":null,"abstract":"<div><div>The rapid advancement of satellite and sensor technologies has facilitated the acquisition of remote sensing images, yet the efficient and accurate extraction of valuable information from high-resolution data, particularly for object detection, remains challenging. While deep learning-based algorithms have shown promise in automatic feature learning, single-scale feature layers struggle with large-scale variations, and existing attention mechanisms and convolutional modules are insufficient for remote sensing tasks. To address these issues, this paper proposes a novel Multi-scale Perception and Detail Learning YOLO (MPDL-YOLO) model for remote sensing object detection. First, we propose a Median-pooling Space and Channel Attention Block (MPCS), integrating global average, max, and median pooling to create a multi-dimensional attention mechanism that reduces noise while preserving edge details. Second, we design a Depthwise Separable Lightweight Inception Convolution (DWInceptionLite) by combining depthwise separable convolutions and Inception structures, significantly reducing computational complexity. Finally, we employ an inverted Residual Mobile Block (iRMB) to construct a Hierarchical Feature Fusion Block (HFFB), enhancing feature extraction and detail precision. Experimental results demonstrate that compared to YOLOv8, MPDL-YOLO achieves improvements of 1.2% and 3.2% in [email protected], and 1.5% and 1.8% in [email protected]:0.95 on the DIOR and RSOD datasets, respectively, thus validating the effectiveness and superiority of the proposed algorithm.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105873"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-enabled receiver for OFDM-Based time-domain generalized spatial modulation in optical MIMO systems 光学MIMO系统中基于ofdm的时域广义空间调制的深度学习接收机
IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-15 Epub Date: 2026-01-06 DOI: 10.1016/j.dsp.2026.105894
Yahya M. Al-Moliki , Ali H. Alqahtani , Yahya Al-Harthi , Mohammed T. Alresheedi
This paper presents a novel deep learning (DL)-enabled receiver architecture for orthogonal frequency-division multiplexing (OFDM)-based time-domain generalized spatial modulation (TD-GSM) in optical multiple-input multiple-output (MIMO) systems. Unlike prior deep learning studies that focused on frequency-domain GSM (FD-GSM) or pulse-amplitude-modulation-based GSM under perfect channel assumptions, this work is the first to integrate a conditional denoising autoencoder (DAE), a multilayer perceptron (MLP), and a convolutional neural network (CNN) into an OFDM-based TD-GSM framework under realistic intensity modulation/direct detection (IM/DD) and imperfect channel state information (CSI) conditions. The proposed architecture uniquely combines: (i) a conditional DAE for adaptive channel estimation across varying signal-to-noise ratios, (ii) an MLP classifier for accurate spatial index classification, and (iii) a CNN classifier for robust constellation symbol recovery under nonlinear optical distortions. This design not only improves robustness but also reduces hardware complexity compared to FD-GSM, since TD-GSM requires only a single OFDM chain while still embedding spatial information in the time domain. Simulation results confirm that the proposed DL-based receiver achieves bit-error-rate (BER) performance gains of up to 9 dB over maximum-likelihood detection at a BER of 3×103, demonstrating both the scalability and generalizability of the approach. By explicitly addressing IM/DD biasing constraints and channel estimation imperfections, this architecture advances the methodological capabilities of optical digital signal processing and provides a practical, high-efficiency solution for future optical wireless communication systems.
本文提出了一种基于正交频分复用(OFDM)的时域广义空间调制(TD-GSM)光多输入多输出(MIMO)系统的新型深度学习(DL)接收机结构。与之前专注于频域GSM (FD-GSM)或完美信道假设下基于脉冲幅度调制的GSM的深度学习研究不同,这项工作首次在现实强度调制/直接检测(IM/DD)和不完美信道状态信息(CSI)条件下将条件降噪自动编码器(DAE)、多层感知器(MLP)和卷积神经网络(CNN)集成到基于ofdm的TD-GSM框架中。所提出的架构独特地结合了:(i)用于不同信噪比的自适应信道估计的条件DAE, (ii)用于精确空间索引分类的MLP分类器,以及(iii)用于非线性光学畸变下鲁棒星座符号恢复的CNN分类器。与FD-GSM相比,这种设计不仅提高了鲁棒性,而且降低了硬件复杂性,因为TD-GSM只需要一个OFDM链,同时仍然在时域内嵌入空间信息。仿真结果证实,在误码率3×10−3的最大似然检测下,所提出的基于dl的接收器实现了高达9 dB的误码率(BER)性能增益,证明了该方法的可扩展性和通用性。通过明确地解决IM/DD偏压约束和信道估计缺陷,该体系结构提高了光学数字信号处理的方法能力,并为未来的光学无线通信系统提供了实用、高效的解决方案。
{"title":"Deep learning-enabled receiver for OFDM-Based time-domain generalized spatial modulation in optical MIMO systems","authors":"Yahya M. Al-Moliki ,&nbsp;Ali H. Alqahtani ,&nbsp;Yahya Al-Harthi ,&nbsp;Mohammed T. Alresheedi","doi":"10.1016/j.dsp.2026.105894","DOIUrl":"10.1016/j.dsp.2026.105894","url":null,"abstract":"<div><div>This paper presents a novel deep learning (DL)-enabled receiver architecture for orthogonal frequency-division multiplexing (OFDM)-based time-domain generalized spatial modulation (TD-GSM) in optical multiple-input multiple-output (MIMO) systems. Unlike prior deep learning studies that focused on frequency-domain GSM (FD-GSM) or pulse-amplitude-modulation-based GSM under perfect channel assumptions, this work is the first to integrate a conditional denoising autoencoder (DAE), a multilayer perceptron (MLP), and a convolutional neural network (CNN) into an OFDM-based TD-GSM framework under realistic intensity modulation/direct detection (IM/DD) and imperfect channel state information (CSI) conditions. The proposed architecture uniquely combines: (i) a conditional DAE for adaptive channel estimation across varying signal-to-noise ratios, (ii) an MLP classifier for accurate spatial index classification, and (iii) a CNN classifier for robust constellation symbol recovery under nonlinear optical distortions. This design not only improves robustness but also reduces hardware complexity compared to FD-GSM, since TD-GSM requires only a single OFDM chain while still embedding spatial information in the time domain. Simulation results confirm that the proposed DL-based receiver achieves bit-error-rate (BER) performance gains of up to 9 dB over maximum-likelihood detection at a BER of <span><math><mrow><mn>3</mn><mspace></mspace><mo>×</mo><msup><mrow><mn>10</mn></mrow><mrow><mo>−</mo><mn>3</mn></mrow></msup></mrow></math></span>, demonstrating both the scalability and generalizability of the approach. By explicitly addressing IM/DD biasing constraints and channel estimation imperfections, this architecture advances the methodological capabilities of optical digital signal processing and provides a practical, high-efficiency solution for future optical wireless communication systems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"172 ","pages":"Article 105894"},"PeriodicalIF":3.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1