首页 > 最新文献

IEEE open journal of signal processing最新文献

英文 中文
Efficient Key-Based Adversarial Defense for ImageNet by Using Pre-Trained Models 利用预训练模型为 ImageNet 提供基于密钥的高效对抗性防御
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-26 DOI: 10.1109/OJSP.2024.3419569
AprilPyone MaungMaung;Isao Echizen;Hitoshi Kiya
In this paper, we propose key-based defense model proliferation by leveraging pre-trained models and utilizing recent efficient fine-tuning techniques on ImageNet-1 k classification. First, we stress that deploying key-based models on edge devices is feasible with the latest model deployment advancements, such as Apple CoreML, although the mainstream enterprise edge artificial intelligence (Edge AI) has been focused on the Cloud. Then, we point out that the previous key-based defense on on-device image classification is impractical for two reasons: (1) training many classifiers from scratch is not feasible, and (2) key-based defenses still need to be thoroughly tested on large datasets like ImageNet. To this end, we propose to leverage pre-trained models and utilize efficient fine-tuning techniques to proliferate key-based models even on limited compute resources. Experiments were carried out on the ImageNet-1 k dataset using adaptive and non-adaptive attacks. The results show that our proposed fine-tuned key-based models achieve a superior classification accuracy (more than 10% increase) compared to the previous key-based models on classifying clean and adversarial examples.
在本文中,我们提出了基于密钥的防御模型扩散方案,即利用预训练模型和最近在 ImageNet-1 k 分类上采用的高效微调技术。首先,我们强调,虽然主流的企业边缘人工智能(Edge AI)都集中在云端,但随着苹果 CoreML 等最新模型部署技术的发展,在边缘设备上部署基于密钥的模型是可行的。然后,我们指出,之前基于密钥的设备上图像分类防御是不切实际的,原因有二:(1)从头开始训练许多分类器是不可行的;(2)基于密钥的防御仍需在大型数据集(如 ImageNet)上进行彻底测试。为此,我们建议利用预先训练好的模型,并利用高效的微调技术,即使在有限的计算资源上也能推广基于密钥的模型。我们使用自适应和非自适应攻击在 ImageNet-1 k 数据集上进行了实验。结果表明,与以前的基于密钥的模型相比,我们提出的基于密钥的微调模型在对干净和对抗性示例进行分类时实现了更高的分类准确率(提高 10%以上)。
{"title":"Efficient Key-Based Adversarial Defense for ImageNet by Using Pre-Trained Models","authors":"AprilPyone MaungMaung;Isao Echizen;Hitoshi Kiya","doi":"10.1109/OJSP.2024.3419569","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3419569","url":null,"abstract":"In this paper, we propose key-based defense model proliferation by leveraging pre-trained models and utilizing recent efficient fine-tuning techniques on ImageNet-1 k classification. First, we stress that deploying key-based models on edge devices is feasible with the latest model deployment advancements, such as Apple CoreML, although the mainstream enterprise edge artificial intelligence (Edge AI) has been focused on the Cloud. Then, we point out that the previous key-based defense on on-device image classification is impractical for two reasons: (1) training many classifiers from scratch is not feasible, and (2) key-based defenses still need to be thoroughly tested on large datasets like ImageNet. To this end, we propose to leverage pre-trained models and utilize efficient fine-tuning techniques to proliferate key-based models even on limited compute resources. Experiments were carried out on the ImageNet-1 k dataset using adaptive and non-adaptive attacks. The results show that our proposed fine-tuned key-based models achieve a superior classification accuracy (more than 10% increase) compared to the previous key-based models on classifying clean and adversarial examples.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"902-913"},"PeriodicalIF":2.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10572223","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141964708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Overview of the First Pathloss Radio Map Prediction Challenge 首届无线电路径损耗地图预测挑战赛概述
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-26 DOI: 10.1109/OJSP.2024.3419563
Çağkan Yapar;Fabian Jaensch;Ron Levie;Gitta Kutyniok;Giuseppe Caire
Pathloss quantifies the reduction in power density of a signal radiated from a transmitter. The attenuation is due to large-scale effects such as free-space propagation loss and interactions (e.g., penetration, reflection, and diffraction) of the signal with objects such as buildings, vehicles, trees, and pedestrians in the propagation environment. Many current or planned wireless communications applications require the knowledge (or a reliable approximation) of the pathloss on a dense grid (radio map) of the environment of interest. Deterministic simulation methods such as ray tracing are known to provide very good estimates of pathloss values. However, their high computational complexity makes them unsuitable for most of the applications envisaged. To promote research and facilitate a fair comparison among the recently proposed fast and accurate deep learning-based pathloss radio map prediction methods, we have organized the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this overview paper, we describe the pathloss radio map prediction problem, provide a literature survey of the current state of the art, describe the challenge datasets, the challenge task, and the challenge evaluation methodology. Finally, we provide a brief overview of the submitted methods and present the results of the challenge.
路径损耗量化了发射机辐射信号功率密度的降低。衰减是由于自由空间传播损耗和信号与传播环境中的建筑物、车辆、树木和行人等物体的相互作用(如穿透、反射和衍射)等大规模效应造成的。许多当前或计划中的无线通信应用都需要了解相关环境密集网格(无线电地图)上的路径损耗(或可靠的近似值)。众所周知,光线跟踪等确定性模拟方法可以提供非常好的路径损耗值估计。然而,由于其计算复杂度高,不适合大多数设想的应用。为了推动研究,促进对最近提出的基于深度学习的快速、准确无线电路径损耗地图预测方法进行公平比较,我们组织了 ICASSP 2023 首届路径损耗无线电地图预测挑战赛。在这篇综述论文中,我们将介绍路径损耗无线电地图预测问题,提供当前技术水平的文献调查,描述挑战数据集、挑战任务和挑战评估方法。最后,我们对提交的方法进行了简要概述,并介绍了挑战赛的结果。
{"title":"Overview of the First Pathloss Radio Map Prediction Challenge","authors":"Çağkan Yapar;Fabian Jaensch;Ron Levie;Gitta Kutyniok;Giuseppe Caire","doi":"10.1109/OJSP.2024.3419563","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3419563","url":null,"abstract":"Pathloss quantifies the reduction in power density of a signal radiated from a transmitter. The attenuation is due to large-scale effects such as free-space propagation loss and interactions (e.g., penetration, reflection, and diffraction) of the signal with objects such as buildings, vehicles, trees, and pedestrians in the propagation environment. Many current or planned wireless communications applications require the knowledge (or a reliable approximation) of the pathloss on a dense grid (radio map) of the environment of interest. Deterministic simulation methods such as ray tracing are known to provide very good estimates of pathloss values. However, their high computational complexity makes them unsuitable for most of the applications envisaged. To promote research and facilitate a fair comparison among the recently proposed fast and accurate deep learning-based pathloss radio map prediction methods, we have organized the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this overview paper, we describe the pathloss radio map prediction problem, provide a literature survey of the current state of the art, describe the challenge datasets, the challenge task, and the challenge evaluation methodology. Finally, we provide a brief overview of the submitted methods and present the results of the challenge.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"948-963"},"PeriodicalIF":2.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10572278","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141993995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SINR Analysis of Windowed OFDM in Power Line Communication Systems 电力线通信系统中窗口 OFDM 的 SINR 分析
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-26 DOI: 10.1109/OJSP.2024.3419448
Fausto García-Gangoso;Fernando Cruz-Roldán
The use of an accurate cyclic prefix length is crucial in orthogonal frequency division multiplexing (OFDM) to avoid intercarrier and intersymbol interference. Although there have been many works that analyse the interference of windowed OFDM, this study remains open in the context of power line communications (PLCs) taking into account the physical-layer (PHY) specifications of the standards. This paper focuses on obtaining a closed-form expression of the input-output relationship in windowed OFDM power line communication (PLC) systems under the condition of insufficient cyclic prefix, while incorporating various blocks deployed in the PHY under IEEE 1901 standards. The derived analysis is important for quantifying the undesired signal component in each subcarrier at a specific time, which renders the detection of the corresponding symbol more difficult. Moreover, a novel procedure is proposed that allows the use of a smaller number of redundant samples to avoid interference. This novel procedure, performed in the receiver after the windowing stage, replaces the overlap-and-add operations with multiplications, offering the advantage of requiring fewer samples from the time-domain received signal to recover each transmitted data symbol. Numerical results demonstrate the feasibility of interference-free transmission on channels with a larger number of samples, thereby yielding better results across various PLC scenarios.
在正交频分复用(OFDM)中,使用精确的循环前缀长度对避免载波间和符号间干扰至关重要。尽管已有许多著作分析了带窗口 OFDM 的干扰问题,但在电力线通信(PLC)的背景下,考虑到标准的物理层(PHY)规范,这项研究仍未完成。本文的重点是在循环前缀不足的条件下,结合 IEEE 1901 标准物理层中部署的各种模块,获得带窗 OFDM 电力线通信(PLC)系统中输入输出关系的闭式表达式。得出的分析结果对于量化特定时间内每个子载波中的非期望信号分量非常重要,这将增加检测相应符号的难度。此外,还提出了一种新程序,允许使用较少数量的冗余样本来避免干扰。这种新程序在窗口阶段之后的接收器中执行,用乘法取代了重叠加法运算,其优点是从时域接收信号中提取较少的样本来恢复每个传输的数据符号。数值结果表明,在采样数量较多的信道上进行无干扰传输是可行的,从而在各种 PLC 方案中取得更好的结果。
{"title":"SINR Analysis of Windowed OFDM in Power Line Communication Systems","authors":"Fausto García-Gangoso;Fernando Cruz-Roldán","doi":"10.1109/OJSP.2024.3419448","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3419448","url":null,"abstract":"The use of an accurate cyclic prefix length is crucial in orthogonal frequency division multiplexing (OFDM) to avoid intercarrier and intersymbol interference. Although there have been many works that analyse the interference of windowed OFDM, this study remains open in the context of power line communications (PLCs) taking into account the physical-layer (PHY) specifications of the standards. This paper focuses on obtaining a closed-form expression of the input-output relationship in windowed OFDM power line communication (PLC) systems under the condition of insufficient cyclic prefix, while incorporating various blocks deployed in the PHY under IEEE 1901 standards. The derived analysis is important for quantifying the undesired signal component in each subcarrier at a specific time, which renders the detection of the corresponding symbol more difficult. Moreover, a novel procedure is proposed that allows the use of a smaller number of redundant samples to avoid interference. This novel procedure, performed in the receiver after the windowing stage, replaces the overlap-and-add operations with multiplications, offering the advantage of requiring fewer samples from the time-domain received signal to recover each transmitted data symbol. Numerical results demonstrate the feasibility of interference-free transmission on channels with a larger number of samples, thereby yielding better results across various PLC scenarios.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"1052-1060"},"PeriodicalIF":2.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10572224","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142518177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Special Issue on the ICASSP 2023 Signal Processing Grand Challenges 编辑:ICASSP 2023 信号处理大挑战特刊
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-24 DOI: 10.1109/OJSP.2024.3397168
Alexander Bertrand;Ozlem Kalinli
The 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023) took place in Rhodos, Greece, running from June 4th to June 10th, with a record number of paper submissions and attendees. Since 2021, ICASSP has featured the “Signal Processing Grand Challenges” (SPGC) program, which has become an annual highlight at the conference. ICASSP 2023 featured a record number of 15 SPGCs, carefully selected from a large number of submissions, and covering a wide variety of application domains, including audio, acoustics, speech, biomedical signals, communications, and image processing. A list of accepted SPGCs can be found at https://2023.ieeeicassp.org/signal-processing-grand-challenges/, which also includes links to detailed information for each challenge.
2023 年电气和电子工程师学会声学、语音和信号处理国际会议(ICASSP 2023)于 6 月 4 日至 6 月 10 日在希腊罗多斯举行,提交论文和参会人数均创历史新高。自2021年以来,ICASSP一直设有 "信号处理大挑战"(SPGC)项目,该项目已成为该会议的年度亮点。2023 年的 ICASSP 大会从大量提交的论文中精挑细选出 15 个 SPGC,涵盖音频、声学、语音、生物医学信号、通信和图像处理等多个应用领域,数量创下历史新高。已接受的 SPGCs 列表可在 https://2023.ieeeicassp.org/signal-processing-grand-challenges/ 上找到,其中还包括每个挑战的详细信息链接。
{"title":"Editorial: Special Issue on the ICASSP 2023 Signal Processing Grand Challenges","authors":"Alexander Bertrand;Ozlem Kalinli","doi":"10.1109/OJSP.2024.3397168","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3397168","url":null,"abstract":"The 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023) took place in Rhodos, Greece, running from June 4th to June 10th, with a record number of paper submissions and attendees. Since 2021, ICASSP has featured the “Signal Processing Grand Challenges” (SPGC) program, which has become an annual highlight at the conference. ICASSP 2023 featured a record number of 15 SPGCs, carefully selected from a large number of submissions, and covering a wide variety of application domains, including audio, acoustics, speech, biomedical signals, communications, and image processing. A list of accepted SPGCs can be found at \u0000<uri>https://2023.ieeeicassp.org/signal-processing-grand-challenges/</uri>\u0000, which also includes links to detailed information for each challenge.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"630-631"},"PeriodicalIF":2.9,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10570160","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141447970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Universal Fourier Attack for Time Series 时间序列的通用傅里叶攻击
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-14 DOI: 10.1109/OJSP.2024.3402154
Elizabeth Coda;Brad Clymer;Chance DeSmet;Yijing Watkins;Michael Girard
A wide variety of adversarial attacks have been proposed and explored using image and audio data. These attacks are notoriously easy to generate digitally when the attacker can directly manipulate the input to a model, but are much more difficult to implement in the real world. In this paper we present a universal, time invariant attack for general time series data such that the attack has a frequency spectrum primarily composed of the frequencies present in the original data. The universality of the attack makes it fast and easy to implement as no computation is required to add it to an input, while time invariance is useful for real world deployment. Additionally, the frequency constraint ensures the attack can withstand filtering defenses. We demonstrate the effectiveness of the attack on two different classification tasks through both digital and real world experiments, and show that the attack is robust against common transform-and-compare defense pipelines.
人们利用图像和音频数据提出并探索了各种各样的对抗性攻击。众所周知,当攻击者可以直接操纵模型的输入时,这些攻击很容易以数字方式生成,但在现实世界中却很难实现。在本文中,我们针对一般时间序列数据提出了一种通用的时间不变攻击,这种攻击的频谱主要由原始数据中的频率组成。这种攻击的普遍性使其能够快速、轻松地实现,因为将其添加到输入中不需要计算,而时间不变性则有助于现实世界的部署。此外,频率限制确保攻击能抵御过滤防御。我们通过数字和真实世界的实验证明了该攻击在两种不同分类任务中的有效性,并表明该攻击对常见的变换和比较防御管道具有很强的抵御能力。
{"title":"Universal Fourier Attack for Time Series","authors":"Elizabeth Coda;Brad Clymer;Chance DeSmet;Yijing Watkins;Michael Girard","doi":"10.1109/OJSP.2024.3402154","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3402154","url":null,"abstract":"A wide variety of adversarial attacks have been proposed and explored using image and audio data. These attacks are notoriously easy to generate digitally when the attacker can directly manipulate the input to a model, but are much more difficult to implement in the real world. In this paper we present a universal, time invariant attack for general time series data such that the attack has a frequency spectrum primarily composed of the frequencies present in the original data. The universality of the attack makes it fast and easy to implement as no computation is required to add it to an input, while time invariance is useful for real world deployment. Additionally, the frequency constraint ensures the attack can withstand filtering defenses. We demonstrate the effectiveness of the attack on two different classification tasks through both digital and real world experiments, and show that the attack is robust against common transform-and-compare defense pipelines.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"858-866"},"PeriodicalIF":2.9,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10557789","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141725683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse DOA Estimation With Polarimetric Arrays 利用极坐标阵列进行稀疏 DOA 估算
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-07 DOI: 10.1109/OJSP.2024.3411468
Augusto Aubry;Marco Boddi;Antonio De Maio;Massimo Rosamilia
This paper addresses the Direction-of-Arrival (DOA) estimation problem using a narrowband polarimetric array sensing system. The considered receiving equipment is composed of two sub-arrays of sensors with orthogonal polarizations. By suitably modeling the received signal via a sparse representation (accounting for the multiple snapshots and the polarimetric array manifold structure), two iterative algorithms, namely Polarimetric Sparse Learning via Iterative Minimization (POL-SLIM) and Polarimetric Sparse Iterative Covariance-based Estimation (POL-SPICE), are devised to accomplish the estimation task. The proposed algorithms provide accurate DOA estimates while enjoying nice (rigorously proven) convergence properties. Numerical analysis shows the effectiveness of POL-SLIM and POL-SPICE to successfully locate signal sources in both passive sensing applications (with large numbers of collected snapshots) and radar spatial processing, also in comparison with single-polarization counterparts as well as theoretical benchmarks.
本文利用窄带偏振阵列传感系统解决到达方向(DOA)估计问题。所考虑的接收设备由两个具有正交极化的传感器子阵列组成。通过稀疏表示对接收信号进行适当建模(考虑到多个快照和偏振阵列流形结构),设计了两种迭代算法,即通过迭代最小化进行偏振稀疏学习(POL-SLIM)和基于协方差的偏振稀疏迭代估计(POL-SPICE),以完成估计任务。所提出的算法可提供精确的 DOA 估计值,同时具有良好的(经严格证明的)收敛特性。数值分析表明,POL-SLIM 和 POL-SPICE 在被动传感应用(收集大量快照)和雷达空间处理中成功定位信号源的有效性,与单极化对应算法和理论基准相比也是如此。
{"title":"Sparse DOA Estimation With Polarimetric Arrays","authors":"Augusto Aubry;Marco Boddi;Antonio De Maio;Massimo Rosamilia","doi":"10.1109/OJSP.2024.3411468","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3411468","url":null,"abstract":"This paper addresses the Direction-of-Arrival (DOA) estimation problem using a narrowband polarimetric array sensing system. The considered receiving equipment is composed of two sub-arrays of sensors with orthogonal polarizations. By suitably modeling the received signal via a sparse representation (accounting for the multiple snapshots and the polarimetric array manifold structure), two iterative algorithms, namely Polarimetric Sparse Learning via Iterative Minimization (POL-SLIM) and Polarimetric Sparse Iterative Covariance-based Estimation (POL-SPICE), are devised to accomplish the estimation task. The proposed algorithms provide accurate DOA estimates while enjoying nice (rigorously proven) convergence properties. Numerical analysis shows the effectiveness of POL-SLIM and POL-SPICE to successfully locate signal sources in both passive sensing applications (with large numbers of collected snapshots) and radar spatial processing, also in comparison with single-polarization counterparts as well as theoretical benchmarks.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"886-901"},"PeriodicalIF":2.9,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10552043","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141725564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Channel Low-Rank Convolution of Jointly Compressed Room Impulse Responses 联合压缩室内脉冲响应的多通道低库卷积
IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-06-05 DOI: 10.1109/OJSP.2024.3410089
Martin Jälmby;Filip Elvander;Toon van Waterschoot
The room impulse response (RIR) describes the response of a room to an acoustic excitation signal and models the acoustic channel between a point source and receiver. RIRs are used in a wide range of applications, e.g., virtual reality. In such an application, the availability of closely spaced RIRs and the capability to achieve low latency are imperative to provide an immersive experience. However, representing a complete acoustic environment using a fine grid of RIRs is prohibitive from a storage point of view and without exploiting spatial proximity, acoustic rendering becomes computationally expensive. We therefore propose two methods for the joint compression of multiple RIRs, based on the generalized low-rank approximation of matrices (GLRAM), for the purpose of efficiently storing RIRs and allowing for low-latency convolution. We show how one of the components of the GLRAM decomposition is virtually invariant to the change of position of the source throughout the room and how this can be exploited in the modeling and convolution. In simulations we show how this offers high compression, with less quality degradation than comparable benchmark methods.
房间脉冲响应(RIR)描述了房间对声学激励信号的响应,并模拟了点声源和接收器之间的声学通道。RIR 的应用范围很广,例如虚拟现实。在这种应用中,要提供身临其境的体验,就必须要有紧密间隔的 RIR 和实现低延迟的能力。然而,从存储的角度来看,使用精细的 RIR 网格来表示完整的声学环境是令人望而却步的,而且如果不利用空间上的接近性,声学渲染的计算成本就会变得很高。因此,我们提出了两种基于广义低阶矩阵近似(GLRAM)的多 RIR 联合压缩方法,以高效存储 RIR 并实现低延迟卷积。我们展示了 GLRAM 分解的一个分量如何对整个房间内声源位置的变化保持不变,以及如何在建模和卷积中利用这一点。在模拟中,我们展示了这种方法如何在提供高压缩的同时,比同类基准方法的质量下降更少。
{"title":"Multi-Channel Low-Rank Convolution of Jointly Compressed Room Impulse Responses","authors":"Martin Jälmby;Filip Elvander;Toon van Waterschoot","doi":"10.1109/OJSP.2024.3410089","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3410089","url":null,"abstract":"The room impulse response (RIR) describes the response of a room to an acoustic excitation signal and models the acoustic channel between a point source and receiver. RIRs are used in a wide range of applications, e.g., virtual reality. In such an application, the availability of closely spaced RIRs and the capability to achieve low latency are imperative to provide an immersive experience. However, representing a complete acoustic environment using a fine grid of RIRs is prohibitive from a storage point of view and without exploiting spatial proximity, acoustic rendering becomes computationally expensive. We therefore propose two methods for the joint compression of multiple RIRs, based on the generalized low-rank approximation of matrices (GLRAM), for the purpose of efficiently storing RIRs and allowing for low-latency convolution. We show how one of the components of the GLRAM decomposition is virtually invariant to the change of position of the source throughout the room and how this can be exploited in the modeling and convolution. In simulations we show how this offers high compression, with less quality degradation than comparable benchmark methods.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"850-857"},"PeriodicalIF":2.9,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10549780","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141543936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SDAT: Sub-Dataset Alternation Training for Improved Image Demosaicing SDAT: 用于改进图像去马赛克的子数据集交替训练
Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-30 DOI: 10.1109/OJSP.2024.3395179
Yuval Becker;Raz Z. Nossek;Tomer Peleg
Image demosaicing is an important step in the image processing pipeline for digital cameras. In data centric approaches, such as deep learning, the distribution of the dataset used for training can impose a bias on the networks' outcome. For example, in natural images most patches are smooth, and high-content patches are much rarer. This can lead to a bias in the performance of demosaicing algorithms. Most deep learning approaches address this challenge by utilizing specific losses or designing special network architectures. We propose a novel approach SDAT, Sub-Dataset Alternation Training, that tackles the problem from a training protocol perspective. SDAT is comprised of two essential phases. In the initial phase, we employ a method to create sub-datasets from the entire dataset, each inducing a distinct bias. The subsequent phase involves an alternating training process, which uses the derived sub-datasets in addition to training also on the entire dataset. SDAT can be applied regardless of the chosen architecture as demonstrated by various experiments we conducted for the demosaicing task. The experiments are performed across a range of architecture sizes and types, namely CNNs and transformers. We show improved performance in all cases. We are also able to achieve state-of-the-art results on three highly popular image demosaicing benchmarks.
图像去马赛克是数码相机图像处理流程中的一个重要步骤。在深度学习等以数据为中心的方法中,用于训练的数据集的分布会对网络的结果造成偏差。例如,在自然图像中,大多数斑块都是平滑的,而高内容斑块则更为罕见。这会导致去马赛克算法的性能出现偏差。大多数深度学习方法通过利用特定损失或设计特殊网络架构来应对这一挑战。我们提出了一种新方法 SDAT,即子数据集交替训练(Sub-Dataset Alternation Training),从训练协议的角度来解决这个问题。SDAT 包括两个基本阶段。在初始阶段,我们采用一种方法从整个数据集中创建子数据集,每个子数据集都会产生不同的偏差。随后的阶段涉及交替训练过程,除了在整个数据集上进行训练外,还使用衍生的子数据集。正如我们在去马赛克任务中进行的各种实验所证明的那样,无论选择何种架构,都可以应用 SDAT。实验跨越了一系列架构规模和类型,即 CNN 和变换器。我们在所有情况下都证明了性能的提高。我们还能在三个非常流行的图像去马赛克基准测试中取得最先进的结果。
{"title":"SDAT: Sub-Dataset Alternation Training for Improved Image Demosaicing","authors":"Yuval Becker;Raz Z. Nossek;Tomer Peleg","doi":"10.1109/OJSP.2024.3395179","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3395179","url":null,"abstract":"Image demosaicing is an important step in the image processing pipeline for digital cameras. In data centric approaches, such as deep learning, the distribution of the dataset used for training can impose a bias on the networks' outcome. For example, in natural images most patches are smooth, and high-content patches are much rarer. This can lead to a bias in the performance of demosaicing algorithms. Most deep learning approaches address this challenge by utilizing specific losses or designing special network architectures. We propose a novel approach \u0000<bold>SDAT</b>\u0000, Sub-Dataset Alternation Training, that tackles the problem from a training protocol perspective. SDAT is comprised of two essential phases. In the initial phase, we employ a method to create sub-datasets from the entire dataset, each inducing a distinct bias. The subsequent phase involves an alternating training process, which uses the derived sub-datasets in addition to training also on the entire dataset. SDAT can be applied regardless of the chosen architecture as demonstrated by various experiments we conducted for the demosaicing task. The experiments are performed across a range of architecture sizes and types, namely CNNs and transformers. We show improved performance in all cases. We are also able to achieve state-of-the-art results on three highly popular image demosaicing benchmarks.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"611-620"},"PeriodicalIF":0.0,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10510581","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140919094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Denoiser-Based Projections for 2D Super-Resolution MRA 基于去噪器的二维超分辨率 MRA 投影
Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-26 DOI: 10.1109/OJSP.2024.3394369
Jonathan Shani;Tom Tirer;Raja Giryes;Tamir Bendory
We study the 2D super-resolution multi-reference alignment (SR-MRA) problem: estimating an image from its down-sampled, circularly translated, and noisy copies. The SR-MRA problem serves as a mathematical abstraction of the structure determination problem for biological molecules. Since the SR-MRA problem is ill-posed without prior knowledge, accurate image estimation relies on designing priors that describe the statistics of the images of interest. In this work, we build on recent advances in image processing and harness the power of denoisers as priors for images. To estimate an image, we propose utilizing denoisers as projections and using them within two computational frameworks that we propose: projected expectation-maximization and projected method of moments. We provide an efficient GPU implementation and demonstrate the effectiveness of these algorithms through extensive numerical experiments on a wide range of parameters and images.
我们研究的是二维超分辨率多参考配准(SR-MRA)问题:从图像的下采样、圆周平移和噪声副本中估计图像。SR-MRA 问题是生物分子结构确定问题的数学抽象。由于 SR-MRA 问题在没有先验知识的情况下存在问题,因此准确的图像估计依赖于设计描述相关图像统计信息的先验知识。在这项工作中,我们以图像处理领域的最新进展为基础,利用去噪器的强大功能作为图像的先验知识。为了估算图像,我们建议利用去噪器作为投影,并在我们提出的两个计算框架内使用它们:投影期望最大化和投影矩方法。我们提供了一种高效的 GPU 实现方法,并通过对各种参数和图像的大量数值实验证明了这些算法的有效性。
{"title":"Denoiser-Based Projections for 2D Super-Resolution MRA","authors":"Jonathan Shani;Tom Tirer;Raja Giryes;Tamir Bendory","doi":"10.1109/OJSP.2024.3394369","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3394369","url":null,"abstract":"We study the 2D super-resolution multi-reference alignment (SR-MRA) problem: estimating an image from its down-sampled, circularly translated, and noisy copies. The SR-MRA problem serves as a mathematical abstraction of the structure determination problem for biological molecules. Since the SR-MRA problem is ill-posed without prior knowledge, accurate image estimation relies on designing priors that describe the statistics of the images of interest. In this work, we build on recent advances in image processing and harness the power of denoisers as priors for images. To estimate an image, we propose utilizing denoisers as projections and using them within two computational frameworks that we propose: projected expectation-maximization and projected method of moments. We provide an efficient GPU implementation and demonstrate the effectiveness of these algorithms through extensive numerical experiments on a wide range of parameters and images.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"621-629"},"PeriodicalIF":0.0,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10508939","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141078833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of Spectral Estimation Parameters for Direct Sampling FFT-Based Measuring Receivers 评估基于直接采样 FFT 测量接收机的频谱估计参数
Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-04-16 DOI: 10.1109/OJSP.2024.3389825
Marc García-Bermúdez;Jordi Solé-Lloveras;Martin Hudlička;Marco A. Azpúrua
The standard CISPR 16-1-1 defines the measuring receiver using a black-box approach and sets requirements for its accuracy and spectral properties. Traditionally, such test receivers were developed using a superheterodyne architecture. Recently, time-domain electromagnetic emission measurement systems have been built employing direct sampling instruments, mainly oscilloscopes, and relying on specific signal processing to emulate the performance of compliant instruments. In these cases, the short-time Fourier transform is used for spectral estimation, but the corresponding electromagnetic compatibility standards lack details for its correct use with respect to parameters such as windowing function, overlapping factor, and frequency interpolation. Moreover, it is unclear which combination of spectral estimation parameters is best fit for this purpose. Obtaining reliable, consistent and low uncertainty spectral estimates of electromagnetic emissions measured in time-domain needs appropriate configuration and tuning of the signal processing algorithms. This paper investigates the error in the calculated spectrum for various reference signals: multitone, chirp pulses and rectangular pulses. The analysis is carried out for each CISPR band from A to D, that is, between 9 kHz and 1 GHz. After $489.6times 10^{3}$ iterations, distributed in 1700 different digital implementations of the CISPR 16-1-1 measuring receiver, the simulations outcomes point to certain sets of parameters that showed satisfactory performance overall, being the Nutall, Kaiser, and Parzen windows with more than 75% of overlapping and using interpolation factor higher than 5, generally suitable. Calibration results are used to experimentally verify that a valid set of parameters is adequate to fulfil CISPR 16-1-1 requirements.
CISPR 16-1-1 标准采用黑盒方法定义了测量接收机,并对其精度和频谱特性提出了要求。传统上,此类测试接收器采用超外差结构。最近,时域电磁辐射测量系统的建立采用了直接采样仪器,主要是示波器,并依靠特定的信号处理来模拟兼容仪器的性能。在这些情况下,短时傅里叶变换被用于频谱估计,但相应的电磁兼容性标准缺乏关于正确使用短时傅里叶变换的详细信息,如窗口函数、重叠因子和频率插值等参数。此外,还不清楚哪种频谱估计参数组合最适合这一目的。要获得可靠、一致和低不确定性的时域电磁辐射频谱估计值,需要对信号处理算法进行适当的配置和调整。本文研究了多音、啁啾脉冲和矩形脉冲等各种参考信号的频谱计算误差。分析针对从 A 到 D 的每个 CISPR 频段,即 9 kHz 到 1 GHz。经过 489.6 次 10^{3}$ 的迭代(分布在 1700 个不同的 CISPR 16-1-1 测量接收器的数字实现中),模拟结果表明某些参数集显示出令人满意的整体性能,其中 Nutall、Kaiser 和 Parzen 窗口的重叠率超过 75%,且使用的插值因子大于 5,通常是合适的。校准结果用于实验验证有效参数集是否足以满足 CISPR 16-1-1 的要求。
{"title":"Evaluation of Spectral Estimation Parameters for Direct Sampling FFT-Based Measuring Receivers","authors":"Marc García-Bermúdez;Jordi Solé-Lloveras;Martin Hudlička;Marco A. Azpúrua","doi":"10.1109/OJSP.2024.3389825","DOIUrl":"https://doi.org/10.1109/OJSP.2024.3389825","url":null,"abstract":"The standard CISPR 16-1-1 defines the measuring receiver using a black-box approach and sets requirements for its accuracy and spectral properties. Traditionally, such test receivers were developed using a superheterodyne architecture. Recently, time-domain electromagnetic emission measurement systems have been built employing direct sampling instruments, mainly oscilloscopes, and relying on specific signal processing to emulate the performance of compliant instruments. In these cases, the short-time Fourier transform is used for spectral estimation, but the corresponding electromagnetic compatibility standards lack details for its correct use with respect to parameters such as windowing function, overlapping factor, and frequency interpolation. Moreover, it is unclear which combination of spectral estimation parameters is best fit for this purpose. Obtaining reliable, consistent and low uncertainty spectral estimates of electromagnetic emissions measured in time-domain needs appropriate configuration and tuning of the signal processing algorithms. This paper investigates the error in the calculated spectrum for various reference signals: multitone, chirp pulses and rectangular pulses. The analysis is carried out for each CISPR band from A to D, that is, between 9 kHz and 1 GHz. After \u0000<inline-formula><tex-math>$489.6times 10^{3}$</tex-math></inline-formula>\u0000 iterations, distributed in 1700 different digital implementations of the CISPR 16-1-1 measuring receiver, the simulations outcomes point to certain sets of parameters that showed satisfactory performance overall, being the Nutall, Kaiser, and Parzen windows with more than 75% of overlapping and using interpolation factor higher than 5, generally suitable. Calibration results are used to experimentally verify that a valid set of parameters is adequate to fulfil CISPR 16-1-1 requirements.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"588-598"},"PeriodicalIF":0.0,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10502171","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140818796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of signal processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1