Hang Zheng;Zhiguo Shi;Chengwei Zhou;Sergiy A. Vorobyov;Yujie Gu
{"title":"用于 URA 的深度张量 2-D DOA 估算","authors":"Hang Zheng;Zhiguo Shi;Chengwei Zhou;Sergiy A. Vorobyov;Yujie Gu","doi":"10.1109/TSP.2024.3449117","DOIUrl":null,"url":null,"abstract":"Direction-of-arrival (DOA) estimation using deep neural networks has shown great potential for applications in complicated environments. However, conventional matrix-based deep neural networks vectorize multi-dimensional signal statistics into an excessively long input, necessitating a large number of parameters in neural layers. These parameters require substantial computational resources for training. To address the problem, we propose a resource-efficient tensorized neural network for \n<italic>deep tensor two-dimensional DOA estimation</i>\n. In this network, the covariance tensor corresponding to the uniform rectangular array (URA) is propagated to hidden state tensors that encapsulate essential signal features. To reduce the number of trainable parameters, the feedforward propagation is formulated as inverse Tucker decomposition, compressing the parameters into inverse Tucker factors. An effective tensorized backpropagation procedure is then designed to train the compressed parameters, and the Tucker rank sequences are tuned through Bayesian optimization to ensure satisfactory network performance. Our simulation results demonstrate the superiority of the proposed tensorized deep neural network over its matrix-based counterpart. In a scenario with a \n<inline-formula><tex-math>$10\\times 10$</tex-math></inline-formula>\n URA and \n<inline-formula><tex-math>$2$</tex-math></inline-formula>\n sources, the proposed network reduces the number of trained parameters by more than \n<inline-formula><tex-math>$122,000$</tex-math></inline-formula>\n times. Consequently, it achieves faster training speed and utilizes less GPU memory, while maintains comparable estimation accuracy and angular resolution even under non-ideal conditions and in varying scenarios.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"72 ","pages":"4065-4080"},"PeriodicalIF":4.6000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Tensor 2-D DOA Estimation for URA\",\"authors\":\"Hang Zheng;Zhiguo Shi;Chengwei Zhou;Sergiy A. Vorobyov;Yujie Gu\",\"doi\":\"10.1109/TSP.2024.3449117\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Direction-of-arrival (DOA) estimation using deep neural networks has shown great potential for applications in complicated environments. However, conventional matrix-based deep neural networks vectorize multi-dimensional signal statistics into an excessively long input, necessitating a large number of parameters in neural layers. These parameters require substantial computational resources for training. To address the problem, we propose a resource-efficient tensorized neural network for \\n<italic>deep tensor two-dimensional DOA estimation</i>\\n. In this network, the covariance tensor corresponding to the uniform rectangular array (URA) is propagated to hidden state tensors that encapsulate essential signal features. To reduce the number of trainable parameters, the feedforward propagation is formulated as inverse Tucker decomposition, compressing the parameters into inverse Tucker factors. An effective tensorized backpropagation procedure is then designed to train the compressed parameters, and the Tucker rank sequences are tuned through Bayesian optimization to ensure satisfactory network performance. Our simulation results demonstrate the superiority of the proposed tensorized deep neural network over its matrix-based counterpart. In a scenario with a \\n<inline-formula><tex-math>$10\\\\times 10$</tex-math></inline-formula>\\n URA and \\n<inline-formula><tex-math>$2$</tex-math></inline-formula>\\n sources, the proposed network reduces the number of trained parameters by more than \\n<inline-formula><tex-math>$122,000$</tex-math></inline-formula>\\n times. Consequently, it achieves faster training speed and utilizes less GPU memory, while maintains comparable estimation accuracy and angular resolution even under non-ideal conditions and in varying scenarios.\",\"PeriodicalId\":13330,\"journal\":{\"name\":\"IEEE Transactions on Signal Processing\",\"volume\":\"72 \",\"pages\":\"4065-4080\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10644148/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10644148/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
利用深度神经网络进行到达方向(DOA)估算在复杂环境中的应用已显示出巨大潜力。然而,传统的基于矩阵的深度神经网络会将多维信号统计数据矢量化为过长的输入,这就需要在神经层中设置大量参数。这些参数的训练需要大量的计算资源。为解决这一问题,我们提出了一种用于深度张量二维 DOA 估计的资源节约型张量神经网络。在该网络中,与均匀矩形阵列(URA)相对应的协方差张量被传播到包含基本信号特征的隐藏状态张量中。为了减少可训练参数的数量,前馈传播被表述为反 Tucker 分解,将参数压缩为反 Tucker 因子。然后设计了一个有效的张量反向传播程序来训练压缩参数,并通过贝叶斯优化调整塔克秩序列,以确保令人满意的网络性能。我们的模拟结果表明,与基于矩阵的深度神经网络相比,所提出的张量深度神经网络更具优势。在一个 10 美元乘以 10 美元的 URA 和 2 美元来源的场景中,所提出的网络将训练参数的数量减少了超过 122,000 美元倍。因此,它的训练速度更快,使用的 GPU 内存更少,即使在非理想条件和不同场景下,也能保持相当的估计精度和角度分辨率。
Direction-of-arrival (DOA) estimation using deep neural networks has shown great potential for applications in complicated environments. However, conventional matrix-based deep neural networks vectorize multi-dimensional signal statistics into an excessively long input, necessitating a large number of parameters in neural layers. These parameters require substantial computational resources for training. To address the problem, we propose a resource-efficient tensorized neural network for
deep tensor two-dimensional DOA estimation
. In this network, the covariance tensor corresponding to the uniform rectangular array (URA) is propagated to hidden state tensors that encapsulate essential signal features. To reduce the number of trainable parameters, the feedforward propagation is formulated as inverse Tucker decomposition, compressing the parameters into inverse Tucker factors. An effective tensorized backpropagation procedure is then designed to train the compressed parameters, and the Tucker rank sequences are tuned through Bayesian optimization to ensure satisfactory network performance. Our simulation results demonstrate the superiority of the proposed tensorized deep neural network over its matrix-based counterpart. In a scenario with a
$10\times 10$
URA and
$2$
sources, the proposed network reduces the number of trained parameters by more than
$122,000$
times. Consequently, it achieves faster training speed and utilizes less GPU memory, while maintains comparable estimation accuracy and angular resolution even under non-ideal conditions and in varying scenarios.
期刊介绍:
The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.