首页 > 最新文献

IEEE Journal of Selected Topics in Signal Processing最新文献

英文 中文
Deep Unrolled Single Snapshot Phase Retrieval via Non-Convex Formulation and Phase Mask Design 通过非凸公式和相位掩码设计实现深度未卷积单快照相位检索
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-06 DOI: 10.1109/JSTSP.2024.3395979
Andrés Jerez;Juan Estupiñán;Jorge Bacca;Henry Arguello
Phase retrieval (PR) consists of recovering the phase information from captured intensity measurements, known as coded diffraction patterns (CDPs). Non-convex algorithms for addressing the PR problem require a proper initialization that is refined through a gradient descent approach. These PR algorithms have proven to be robust for different scenarios. Despite deep models showing surprising results in this area, these approaches lack interpretability in their neural architectures. This work proposes unrolling the initialization and iterative reconstruction algorithm for the PR problem using the near-field model based on a non-convex formulation; resulting in an interpretable deep neural network (DNN) that can be trained in an end-to-end (E2E) manner. Furthermore, the proposed method can jointly optimize the phase mask for the CDP acquisition and the DNN parameters. Simulation results demonstrate that the proposed E2E method provides high-quality reconstruction using a learned phase mask from a single projection. Also, the proposed method is tested over an experimental optical setup that incorporates the learned phase mask via an only-phase spatial light modulator.
相位检索(PR)包括从捕获的强度测量值(称为编码衍射图样(CDP))中恢复相位信息。解决相位检索问题的非凸算法需要适当的初始化,并通过梯度下降方法加以完善。事实证明,这些 PR 算法在不同情况下都很稳健。尽管深度模型在这一领域取得了令人惊喜的成果,但这些方法的神经架构缺乏可解释性。本研究提出了基于非凸表述的近场模型,针对 PR 问题展开初始化和迭代重建算法;从而产生了一种可解释的深度神经网络(DNN),可以端到端(E2E)的方式进行训练。此外,所提出的方法还能共同优化 CDP 采集的相位掩码和 DNN 参数。仿真结果表明,所提出的 E2E 方法能利用从单一投影中学习到的相位掩码提供高质量的重建。此外,还在实验光学装置上测试了所提出的方法,该装置通过单相空间光调制器整合了所学相位掩码。
{"title":"Deep Unrolled Single Snapshot Phase Retrieval via Non-Convex Formulation and Phase Mask Design","authors":"Andrés Jerez;Juan Estupiñán;Jorge Bacca;Henry Arguello","doi":"10.1109/JSTSP.2024.3395979","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3395979","url":null,"abstract":"Phase retrieval (PR) consists of recovering the phase information from captured intensity measurements, known as coded diffraction patterns (CDPs). Non-convex algorithms for addressing the PR problem require a proper initialization that is refined through a gradient descent approach. These PR algorithms have proven to be robust for different scenarios. Despite deep models showing surprising results in this area, these approaches lack interpretability in their neural architectures. This work proposes unrolling the initialization and iterative reconstruction algorithm for the PR problem using the near-field model based on a non-convex formulation; resulting in an interpretable deep neural network (DNN) that can be trained in an end-to-end (E2E) manner. Furthermore, the proposed method can jointly optimize the phase mask for the CDP acquisition and the DNN parameters. Simulation results demonstrate that the proposed E2E method provides high-quality reconstruction using a learned phase mask from a single projection. Also, the proposed method is tested over an experimental optical setup that incorporates the learned phase mask via an only-phase spatial light modulator.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 4","pages":"694-703"},"PeriodicalIF":8.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142587619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wideband Near-Field Integrated Sensing and Communication With Sparse Transceiver Design 采用稀疏收发器设计的宽带近场综合传感与通信技术
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-06 DOI: 10.1109/JSTSP.2024.3394970
Xiangrong Wang;Weitong Zhai;Xianghua Wang;Moeness G. Amin;Kaiquan Cai
With the deployment of extremely large-scale array (XL-array) operating at the high frequency bands in future wireless systems, integrated sensing and communication (ISAC) is expected to function in the electromagnetic near-field region with a potential distance of hundreds of meters. Also, a wide signal bandwidth is employed to benefit both communication capacity and sensing resolution. However, most existing works assume a far-field narrowband model, which has prohibited their practical applications in future ISAC systems. In this article, we propose a near-field wideband ISAC framework for concurrent multi-user downlink communications and multi-target localization. In particular, the expression of Cramer Rao Bound (CRB) of direction-of-arrival (DOA) and distance estimations for sensing multiple wideband sources is derived, which is minimized subject to the guaranteed communication quality of service (QoS) for each user. Based on the proposed ISAC framework, sparse transceiver array and the precoding matrix are jointly optimized to reduce mutual coupling and system overhead. The problem is relaxed to a convex optimization and solved iteratively. Simulation results demonstrate that the proposed wideband near-field ISAC framework can well support both modalities and that the sparse transceiver improves the sensing accuracy without sacrificing the communication performance.
随着超大规模阵列(XL-array)在未来无线系统的高频段部署,综合传感与通信(ISAC)有望在潜在距离达数百米的电磁近场区域发挥作用。此外,宽信号带宽也有利于提高通信能力和传感分辨率。然而,大多数现有研究都假设了远场窄带模型,这阻碍了它们在未来 ISAC 系统中的实际应用。在本文中,我们提出了一种近场宽带 ISAC 框架,用于同时进行多用户下行链路通信和多目标定位。特别是,我们推导出了感知多个宽带信号源的到达方向(DOA)和距离估计的 Cramer Rao 约束(CRB)表达式,并在保证每个用户的通信服务质量(QoS)的前提下将其最小化。基于所提出的 ISAC 框架,对稀疏收发器阵列和预编码矩阵进行了联合优化,以减少相互耦合和系统开销。该问题被放宽为凸优化,并进行迭代求解。仿真结果表明,所提出的宽带近场 ISAC 框架能很好地支持两种模式,而且稀疏收发器在不牺牲通信性能的情况下提高了传感精度。
{"title":"Wideband Near-Field Integrated Sensing and Communication With Sparse Transceiver Design","authors":"Xiangrong Wang;Weitong Zhai;Xianghua Wang;Moeness G. Amin;Kaiquan Cai","doi":"10.1109/JSTSP.2024.3394970","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3394970","url":null,"abstract":"With the deployment of extremely large-scale array (XL-array) operating at the high frequency bands in future wireless systems, integrated sensing and communication (ISAC) is expected to function in the electromagnetic near-field region with a potential distance of hundreds of meters. Also, a wide signal bandwidth is employed to benefit both communication capacity and sensing resolution. However, most existing works assume a far-field narrowband model, which has prohibited their practical applications in future ISAC systems. In this article, we propose a near-field wideband ISAC framework for concurrent multi-user downlink communications and multi-target localization. In particular, the expression of Cramer Rao Bound (CRB) of direction-of-arrival (DOA) and distance estimations for sensing multiple wideband sources is derived, which is minimized subject to the guaranteed communication quality of service (QoS) for each user. Based on the proposed ISAC framework, sparse transceiver array and the precoding matrix are jointly optimized to reduce mutual coupling and system overhead. The problem is relaxed to a convex optimization and solved iteratively. Simulation results demonstrate that the proposed wideband near-field ISAC framework can well support both modalities and that the sparse transceiver improves the sensing accuracy without sacrificing the communication performance.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 4","pages":"662-677"},"PeriodicalIF":8.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142587623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Universal Approximation of Linear Time-Invariant (LTI) Systems Through RNNs: Power of Randomness in Reservoir Computing 通过 RNNs 实现线性时不变 (LTI) 系统的通用近似:水库计算中的随机性力量
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-01 DOI: 10.1109/JSTSP.2024.3387274
Shashank Jere;Lizhong Zheng;Karim Said;Lingjia Liu
Recurrent neural networks (RNNs) are known to be universal approximators of dynamic systems under fairly mild and general assumptions. However, RNNs usually suffer from the issues of vanishing and exploding gradients in standard RNN training. Reservoir computing (RC), a special RNN where the recurrent weights are randomized and left untrained, has been introduced to overcome these issues and has demonstrated superior empirical performance especially in scenarios where training samples are extremely limited. On the other hand, the theoretical grounding to support this observed performance has yet been fully developed. In this article, we show that RC can universally approximate a general linear time-invariant (LTI) system. Specifically, we present a clear signal processing interpretation of RC and utilize this understanding in the problem of approximating a generic LTI system. Under this setup, we analytically characterize the optimum probability density function for configuring (instead of training and/or randomly generating) the recurrent weights of the underlying RNN of the RC. Extensive numerical evaluations are provided to validate the optimality of the derived distribution for configuring the recurrent weights of the RC to approximate a general LTI system. Our work results in clear signal processing-based model interpretability of RC and provides theoretical explanation/justification for the power of randomness in randomly generating instead of training RC's recurrent weights. Furthermore, it provides a complete optimum analytical characterization for configuring the untrained recurrent weights, marking an important step towards explainable machine learning (XML) to incorporate domain knowledge for efficient learning.
众所周知,递归神经网络(RNN)在相当温和和一般的假设条件下是动态系统的通用近似器。然而,在标准 RNN 训练中,RNN 通常会遇到梯度消失和爆炸的问题。水库计算(RC)是一种特殊的 RNN,其中的递归权重是随机的,且未经训练,这种 RNN 被引入以克服这些问题,尤其是在训练样本极其有限的情况下,它已显示出卓越的经验性能。另一方面,支持这种观察到的性能的理论基础尚未得到充分发展。在本文中,我们展示了 RC 可以普遍逼近一般线性时不变(LTI)系统。具体来说,我们对 RC 进行了清晰的信号处理解释,并将这一理解用于近似一般 LTI 系统的问题中。在这种设置下,我们分析了配置(而不是训练和/或随机生成)RC 的底层 RNN 循环权重的最佳概率密度函数。我们提供了广泛的数值评估,以验证用于配置 RC 循环权重以近似一般 LTI 系统的推导分布的最优性。我们的研究成果明确了基于信号处理的 RC 模型可解释性,并从理论上解释/说明了随机性在随机生成而非训练 RC 循环权重方面的作用。此外,它还为配置未经训练的递归权重提供了完整的最佳分析表征,标志着向可解释机器学习(XML)迈出了重要一步,以结合领域知识实现高效学习。
{"title":"Universal Approximation of Linear Time-Invariant (LTI) Systems Through RNNs: Power of Randomness in Reservoir Computing","authors":"Shashank Jere;Lizhong Zheng;Karim Said;Lingjia Liu","doi":"10.1109/JSTSP.2024.3387274","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3387274","url":null,"abstract":"Recurrent neural networks (RNNs) are known to be universal approximators of dynamic systems under fairly mild and general assumptions. However, RNNs usually suffer from the issues of vanishing and exploding gradients in standard RNN training. Reservoir computing (RC), a special RNN where the recurrent weights are randomized and left untrained, has been introduced to overcome these issues and has demonstrated superior empirical performance especially in scenarios where training samples are extremely limited. On the other hand, the theoretical grounding to support this observed performance has yet been fully developed. In this article, we show that RC can universally approximate a general linear time-invariant (LTI) system. Specifically, we present a clear signal processing interpretation of RC and utilize this understanding in the problem of approximating a generic LTI system. Under this setup, we analytically characterize the optimum probability density function for configuring (instead of training and/or randomly generating) the recurrent weights of the underlying RNN of the RC. Extensive numerical evaluations are provided to validate the optimality of the derived distribution for configuring the recurrent weights of the RC to approximate a general LTI system. Our work results in clear signal processing-based model interpretability of RC and provides theoretical explanation/justification for the power of randomness in randomly generating instead of training RC's recurrent weights. Furthermore, it provides a complete optimum analytical characterization for configuring the untrained recurrent weights, marking an important step towards explainable machine learning (XML) to incorporate domain knowledge for efficient learning.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"184-198"},"PeriodicalIF":8.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Signal Processing Society Information 电气和电子工程师学会信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-01 DOI: 10.1109/JSTSP.2024.3407390
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2024.3407390","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3407390","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"C2-C2"},"PeriodicalIF":8.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10584141","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corrections to “Energy Efficiency of Mmwave Massive MIMO Precoding With Low-Resolution DACs” 对 "使用低分辨率 DAC 的毫米波大规模 MIMO 精确编码的能效 "的更正
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-01 DOI: 10.1109/JSTSP.2024.3398833
Lucas N. Ribeiro;Stefan Schwarz;Markus Rupp;André L. F. de Almeida
Presents corrections to the paper, Energy Efficiency of Mmwave Massive MIMO Precoding With Low-Resolution DACs.
介绍对论文《使用低分辨率 DAC 的 Mmwave Massive MIMO Precoding 的能效》的更正。
{"title":"Corrections to “Energy Efficiency of Mmwave Massive MIMO Precoding With Low-Resolution DACs”","authors":"Lucas N. Ribeiro;Stefan Schwarz;Markus Rupp;André L. F. de Almeida","doi":"10.1109/JSTSP.2024.3398833","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3398833","url":null,"abstract":"Presents corrections to the paper, Energy Efficiency of Mmwave Massive MIMO Precoding With Low-Resolution DACs.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"259-260"},"PeriodicalIF":8.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10584340","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial Introduction for the Special Series (Part I) on AI in Signal & Data Science - Toward Explainable, Reliable, and Sustainable Machine Learning 信号与数据科学中的人工智能》特别丛书(第一部分)编辑导言--实现可解释、可靠和可持续的机器学习
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-01 DOI: 10.1109/JSTSP.2024.3417111
Xiao-Ping Zhang
Machine learning methods are the backbone of AI and data science. As we embrace the era of deep learning and foundation models, marked by improving performance due to enhanced computing hardware and data scaling, numerous challenges remain. In areas such as healthcare, government decision making and scientific fields, there is a pressing need to develop transparent models that generate interpretable results. Equally important is the reliability of these models, which ensures their robustness and their ability to generalize to new datasets. This is particularly crucial in sectors like healthcare and autonomous driving, where the stakes are high. The recent trend of increasing model sizes also raises concerns about the environmental and societal impacts of training and deploying large models. Therefore, it is essential to mitigate these impacts as much as possible.
机器学习方法是人工智能和数据科学的支柱。在我们迎接深度学习和基础模型时代的同时,由于计算硬件的增强和数据的扩展,性能也在不断提高,但仍然存在许多挑战。在医疗保健、政府决策和科学领域等领域,迫切需要开发能产生可解释结果的透明模型。同样重要的是这些模型的可靠性,它能确保模型的稳健性和对新数据集的泛化能力。这在医疗保健和自动驾驶等利害关系重大的领域尤为重要。最近,模型规模不断扩大的趋势也引发了人们对训练和部署大型模型的环境和社会影响的担忧。因此,尽可能减轻这些影响至关重要。
{"title":"Editorial Introduction for the Special Series (Part I) on AI in Signal & Data Science - Toward Explainable, Reliable, and Sustainable Machine Learning","authors":"Xiao-Ping Zhang","doi":"10.1109/JSTSP.2024.3417111","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3417111","url":null,"abstract":"Machine learning methods are the backbone of AI and data science. As we embrace the era of deep learning and foundation models, marked by improving performance due to enhanced computing hardware and data scaling, numerous challenges remain. In areas such as healthcare, government decision making and scientific fields, there is a pressing need to develop transparent models that generate interpretable results. Equally important is the reliability of these models, which ensures their robustness and their ability to generalize to new datasets. This is particularly crucial in sectors like healthcare and autonomous driving, where the stakes are high. The recent trend of increasing model sizes also raises concerns about the environmental and societal impacts of training and deploying large models. Therefore, it is essential to mitigate these impacts as much as possible.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"139-141"},"PeriodicalIF":8.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10584285","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Signal Processing Society Information 电气和电子工程师学会信号处理学会信息
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-01 DOI: 10.1109/JSTSP.2024.3407394
{"title":"IEEE Signal Processing Society Information","authors":"","doi":"10.1109/JSTSP.2024.3407394","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3407394","url":null,"abstract":"","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"C3-C3"},"PeriodicalIF":8.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10584341","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interpretable Deep Image Classification Using Rationally Inattentive Utility Maximization 利用理性无意识效用最大化实现可解释的深度图像分类
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-01 DOI: 10.1109/JSTSP.2024.3381335
Kunal Pattanayak;Vikram Krishnamurthy;Adit Jain
Can deep convolutional neural networks (CNNs) for image classification be interpreted as utility maximizers with information costs? By performing set-valued system identification for Bayesian decision systems, we demonstrate that deep CNNs behave equivalently (in terms of necessary and sufficient conditions) to rationally inattentive Bayesian utility maximizers, a generative model used extensively in economics for human decision-making. Our claim is based on approximately 500 numerical experiments on 5 widely used neural network architectures. The parameters of the resulting interpretable model are computed efficiently via convex feasibility algorithms. As a practical application, we also illustrate how the reconstructed interpretable model can predict the classification performance of deep CNNs with high accuracy. The theoretical foundation of our approach lies in Bayesian revealed preference studied in micro-economics. All our results are on GitHub and completely reproducible.
用于图像分类的深度卷积神经网络(CNN)能否被解释为具有信息成本的效用最大化器?通过对贝叶斯决策系统进行集值系统识别,我们证明了深度卷积神经网络的行为(在必要条件和充分条件方面)等同于理性不注意的贝叶斯效用最大化者,这是经济学中广泛用于人类决策的生成模型。我们的观点基于对 5 种广泛使用的神经网络架构进行的约 500 次数值实验。由此产生的可解释模型的参数是通过凸可行性算法高效计算出来的。在实际应用中,我们还说明了重建的可解释模型如何高精度地预测深度 CNN 的分类性能。我们的方法的理论基础是微观经济学中研究的贝叶斯显现偏好。我们的所有成果都在 GitHub 上,完全可重复。
{"title":"Interpretable Deep Image Classification Using Rationally Inattentive Utility Maximization","authors":"Kunal Pattanayak;Vikram Krishnamurthy;Adit Jain","doi":"10.1109/JSTSP.2024.3381335","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3381335","url":null,"abstract":"Can deep convolutional neural networks (CNNs) for image classification be interpreted as utility maximizers with information costs? By performing set-valued system identification for Bayesian decision systems, we demonstrate that deep CNNs behave equivalently (in terms of necessary and sufficient conditions) to rationally inattentive Bayesian utility maximizers, a generative model used extensively in economics for human decision-making. Our claim is based on approximately 500 numerical experiments on 5 widely used neural network architectures. The parameters of the resulting interpretable model are computed efficiently via convex feasibility algorithms. As a practical application, we also illustrate how the reconstructed interpretable model can predict the classification performance of deep CNNs with high accuracy. The theoretical foundation of our approach lies in Bayesian revealed preference studied in micro-economics. All our results are on GitHub and completely reproducible.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"168-183"},"PeriodicalIF":8.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cooperative Robotics Visible Light Positioning: An Intelligent Compressed Sensing and GAN-Enabled Framework 合作机器人可见光定位:智能压缩传感和广义泛函模型框架
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-02-22 DOI: 10.1109/JSTSP.2024.3368661
Sicong Liu;Xianyao Wang;Jian Song;Zhu Han
This article presents a compressed sensing (CS) based framework for visible light positioning (VLP), designed to achieve simultaneous and precise localization of multiple intelligent robots within an indoor factory. The framework leverages light-emitting diodes (LEDs) originally intended for illumination purposes as anchors, repurposing them for the localization of robots equipped with photodetectors. By predividing the plane encompassing the robot positions into a grid, with the number of robots being notably fewer than the grid points, the inherent sparsity of the arrangement is harnessed. To construct an effective sparse measurement model, a sequence of aggregation, autocorrelation, and cross-correlation operations are employed to the signals. Consequently, the complex task of localizing multiple targets is reformulated into a sparse recovery problem, amenable to resolution through CS-based algorithms. Notably, the localization precision is augmented by inter-target cooperation among the robots, and inter-anchor cooperation among distinct LEDs. Furthermore, to fortify the robustness of localization, a generative adversarial network (GAN) is introduced into the proposed localization framework. The simulation results affirm that the proposed framework can successfully achieve centimeter-level accuracy for simultaneous localization of multiple targets.
本文介绍了一种基于压缩传感(CS)的可见光定位(VLP)框架,旨在实现室内工厂内多个智能机器人的同时精确定位。该框架利用原本用于照明目的的发光二极管(LED)作为锚点,将其重新用于配备光电探测器的机器人的定位。通过将包含机器人位置的平面预先划分为网格,机器人的数量明显少于网格点,从而利用了布置的固有稀疏性。为了构建有效的稀疏测量模型,需要对信号进行一系列聚合、自相关和交叉相关操作。因此,定位多个目标的复杂任务被重新表述为一个稀疏恢复问题,可通过基于 CS 的算法加以解决。值得注意的是,机器人之间的目标间合作以及不同 LED 之间的锚点间合作提高了定位精度。此外,为了加强定位的鲁棒性,在拟议的定位框架中引入了生成对抗网络(GAN)。仿真结果表明,所提出的框架可成功实现厘米级精度的多目标同时定位。
{"title":"Cooperative Robotics Visible Light Positioning: An Intelligent Compressed Sensing and GAN-Enabled Framework","authors":"Sicong Liu;Xianyao Wang;Jian Song;Zhu Han","doi":"10.1109/JSTSP.2024.3368661","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3368661","url":null,"abstract":"This article presents a compressed sensing (CS) based framework for visible light positioning (VLP), designed to achieve simultaneous and precise localization of multiple intelligent robots within an indoor factory. The framework leverages light-emitting diodes (LEDs) originally intended for illumination purposes as anchors, repurposing them for the localization of robots equipped with photodetectors. By predividing the plane encompassing the robot positions into a grid, with the number of robots being notably fewer than the grid points, the inherent sparsity of the arrangement is harnessed. To construct an effective sparse measurement model, a sequence of aggregation, autocorrelation, and cross-correlation operations are employed to the signals. Consequently, the complex task of localizing multiple targets is reformulated into a sparse recovery problem, amenable to resolution through CS-based algorithms. Notably, the localization precision is augmented by inter-target cooperation among the robots, and inter-anchor cooperation among distinct LEDs. Furthermore, to fortify the robustness of localization, a generative adversarial network (GAN) is introduced into the proposed localization framework. The simulation results affirm that the proposed framework can successfully achieve centimeter-level accuracy for simultaneous localization of multiple targets.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 3","pages":"407-418"},"PeriodicalIF":8.7,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FairTL: A Transfer Learning Approach for Bias Mitigation in Deep Generative Models FairTL:减轻深度生成模型偏差的迁移学习方法
IF 8.7 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-02-14 DOI: 10.1109/JSTSP.2024.3363419
Christopher T. H. Teo;Milad Abdollahzadeh;Ngai-Man Cheung
This work studies fair generative models. We reveal and quantify the biases in state-of-the-art (SOTA) GANs w.r.t. different sensitive attributes. To address the biases, our main contribution is to propose novel methods to learn fair generative models via transfer learning. Specifically, first, we propose FairTL where we pre-train the generative model with a large biased dataset, then adapt the model using a small fair reference dataset. Second, to further improve sample diversity, we propose FairTL++, containing two additional innovations: 1) aligned feature adaptation, which preserves learned general knowledge while improving fairness by adapting only sensitive attribute-specific parameters, 2) multiple feedback discrimination, which introduces a frozen discriminator for quality feedback and another evolving discriminator for fairness feedback. Taking one step further, we consider an alternative challenging and practical setup. Here, only a pre-trained model is available but the dataset used to pre-train the model is inaccessible. We remark that previous work requires access to large, biased datasets and cannot handle this setup. Extensive experimental results show that FairTL and FairTL++ achieve state-of-the-art performance in quality, diversity and fairness in both setups.
这项工作研究的是公平生成模型。我们揭示并量化了最先进的(SOTA)GAN 在不同敏感属性方面的偏差。为了解决这些偏差,我们的主要贡献是提出了通过迁移学习来学习公平生成模型的新方法。具体来说,首先,我们提出了公平生成模型(FairTL),即用一个大型偏差数据集预先训练生成模型,然后用一个小型公平参考数据集调整模型。其次,为了进一步提高样本多样性,我们提出了 FairTL++,其中包含两项额外的创新:1)对齐特征适应,通过只适应敏感的特定属性参数,在保留已学常识的同时提高公平性;2)多重反馈判别,为质量反馈引入一个冻结判别器,为公平反馈引入另一个不断演化的判别器。在此基础上,我们考虑了另一种具有挑战性和实用性的设置。在这里,只有一个预训练模型可用,但用于预训练模型的数据集无法访问。我们注意到,以前的工作需要访问大型、有偏见的数据集,因此无法处理这种设置。广泛的实验结果表明,FairTL 和 FairTL++ 在质量、多样性和公平性方面都达到了这两种设置中最先进的性能。
{"title":"FairTL: A Transfer Learning Approach for Bias Mitigation in Deep Generative Models","authors":"Christopher T. H. Teo;Milad Abdollahzadeh;Ngai-Man Cheung","doi":"10.1109/JSTSP.2024.3363419","DOIUrl":"https://doi.org/10.1109/JSTSP.2024.3363419","url":null,"abstract":"This work studies fair generative models. We reveal and quantify the biases in state-of-the-art (SOTA) GANs \u0000<italic>w.r.t.</i>\u0000 different sensitive attributes. To address the biases, our main contribution is to propose novel methods to learn fair generative models via transfer learning. Specifically, first, we propose \u0000<sc>FairTL</small>\u0000 where we pre-train the generative model with a large biased dataset, then adapt the model using a small fair reference dataset. Second, to further improve sample diversity, we propose \u0000<sc>FairTL++</small>\u0000, containing two additional innovations: 1) \u0000<italic>aligned feature adaptation</i>\u0000, which preserves learned general knowledge while improving fairness by adapting only sensitive attribute-specific parameters, 2) \u0000<italic>multiple feedback discrimination</i>\u0000, which introduces a frozen discriminator for quality feedback and another evolving discriminator for fairness feedback. Taking one step further, we consider an alternative challenging and practical setup. Here, only a pre-trained model is available but the dataset used to pre-train the model is inaccessible. We remark that previous work requires access to large, biased datasets and cannot handle this setup. Extensive experimental results show that \u0000<sc>FairTL</small>\u0000 and \u0000<sc>FairTL++</small>\u0000 achieve state-of-the-art performance in quality, diversity and fairness in both setups.","PeriodicalId":13038,"journal":{"name":"IEEE Journal of Selected Topics in Signal Processing","volume":"18 2","pages":"155-167"},"PeriodicalIF":8.7,"publicationDate":"2024-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141500323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Selected Topics in Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1