首页 > 最新文献

arXiv - EE - Signal Processing最新文献

英文 中文
Graph-Based Signal Sampling with Adaptive Subspace Reconstruction for Spatially-Irregular Sensor Data 基于图的信号采样与自适应子空间重构,用于空间不规则传感器数据
Pub Date : 2024-09-14 DOI: arxiv-2409.09526
Darukeesan Pakiyarajah, Eduardo Pavez, Antonio Ortega
Choosing an appropriate frequency definition and norm is critical in graphsignal sampling and reconstruction. Most previous works define frequenciesbased on the spectral properties of the graph and use the same frequencydefinition and $ell_2$-norm for optimization for all sampling sets. Ourprevious work demonstrated that using a sampling set-adaptive norm andfrequency definition can address challenges in classical bandlimitedapproximation, particularly with model mismatches and irregularly distributeddata. In this work, we propose a method for selecting sampling sets tailored tothe sampling set adaptive GFT-based interpolation. When the graph models theinverse covariance of the data, we show that this adaptive GFT enableslocalizing the bandlimited model mismatch error to high frequencies, and thespectral folding property allows us to track this error in reconstruction.Based on this, we propose a sampling set selection algorithm to minimize theworst-case bandlimited model mismatch error. We consider partitioning thesensors in a sensor network sampling a continuous spatial process as anapplication. Our experiments show that sampling and reconstruction usingsampling set adaptive GFT significantly outperform methods that used fixed GFTsand bandwidth-based criterion.
在图形信号采样和重建中,选择合适的频率定义和规范至关重要。之前的大多数工作都是根据图的频谱特性定义频率,并使用相同的频率定义和 $ell_2$ 准则对所有采样集进行优化。我们之前的工作表明,使用采样集自适应规范和频率定义可以解决经典带限逼近中的难题,尤其是在模型不匹配和数据不规则分布的情况下。在这项工作中,我们提出了一种为基于 GFT 的采样集自适应插值量身定制的采样集选择方法。当图形对数据的逆协方差进行建模时,我们发现这种自适应 GFT 能够将带限模型失配误差定位到高频率,而光谱折叠特性允许我们在重建中跟踪这种误差。我们将传感器网络中对连续空间过程进行采样的传感器分区视为一种应用。实验表明,使用采样集自适应 GFT 进行采样和重建的效果明显优于使用固定 GFT 和基于带宽准则的方法。
{"title":"Graph-Based Signal Sampling with Adaptive Subspace Reconstruction for Spatially-Irregular Sensor Data","authors":"Darukeesan Pakiyarajah, Eduardo Pavez, Antonio Ortega","doi":"arxiv-2409.09526","DOIUrl":"https://doi.org/arxiv-2409.09526","url":null,"abstract":"Choosing an appropriate frequency definition and norm is critical in graph\u0000signal sampling and reconstruction. Most previous works define frequencies\u0000based on the spectral properties of the graph and use the same frequency\u0000definition and $ell_2$-norm for optimization for all sampling sets. Our\u0000previous work demonstrated that using a sampling set-adaptive norm and\u0000frequency definition can address challenges in classical bandlimited\u0000approximation, particularly with model mismatches and irregularly distributed\u0000data. In this work, we propose a method for selecting sampling sets tailored to\u0000the sampling set adaptive GFT-based interpolation. When the graph models the\u0000inverse covariance of the data, we show that this adaptive GFT enables\u0000localizing the bandlimited model mismatch error to high frequencies, and the\u0000spectral folding property allows us to track this error in reconstruction.\u0000Based on this, we propose a sampling set selection algorithm to minimize the\u0000worst-case bandlimited model mismatch error. We consider partitioning the\u0000sensors in a sensor network sampling a continuous spatial process as an\u0000application. Our experiments show that sampling and reconstruction using\u0000sampling set adaptive GFT significantly outperform methods that used fixed GFTs\u0000and bandwidth-based criterion.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cislunar Communication Performance and System Analysis with Uncharted Phenomena 星际通信性能与未知现象系统分析
Pub Date : 2024-09-14 DOI: arxiv-2409.09426
Selen Gecgel Cetin, Angeles Vazquez-Castro, Gunes Karabulut Kurt
The Moon and its surrounding cislunar space have numerous unknowns,uncertainties, or partially charted phenomena that need to be investigated todetermine the extent to which they affect cislunar communication. These includetemperature fluctuations, spacecraft distance and velocity dynamics, surfaceroughness, and the diversity of propagation mechanisms. To develop robust anddynamically operative Cislunar space networks (CSNs), we need to analyze thecommunication system by incorporating inclusive models that account for thewide range of possible propagation environments and noise characteristics. Inthis paper, we consider that the communication signal can be subjected to bothGaussian and non-Gaussian noise, but also to different fading conditions.First, we analyze the communication link by showing the relationship betweenthe brightness temperatures of the Moon and the equivalent noise temperature atthe receiver of the Lunar Gateway. We propose to analyze the ergodic capacityand the outage probability, as they are essential metrics for the developmentof reliable communication. In particular, we model the noise with the additivesymmetric alpha-stable distribution, which allows a generic analysis forGaussian and non-Gaussian signal characteristics. Then, we present theclosed-form bounds for the ergodic capacity and the outage probability.Finally, the results show the theoretically and operationally achievableperformance bounds for the cislunar communication. To give insight into furtherdesigns, we also provide our results with comprehensive system settings thatinclude mission objectives as well as orbital and system dynamics.
月球及其周围的半月空间有许多未知数、不确定性或部分绘制的现象,需要对其进行研究,以确定它们对半月通信的影响程度。这些现象包括温度波动、航天器距离和速度动态、表面粗糙度以及传播机制的多样性。为了开发稳健且具有动力操作性的半月空间网络(CSN),我们需要结合包容性模型来分析通信系统,这些模型应考虑到各种可能的传播环境和噪声特性。首先,我们分析了月球亮度温度与月球网关接收器等效噪声温度之间的关系,从而分析了通信链路。我们建议分析遍历容量和中断概率,因为它们是开发可靠通信的重要指标。特别是,我们用添加剂不对称阿尔法稳定分布来模拟噪声,从而可以对高斯和非高斯信号特征进行通用分析。然后,我们给出了遍历容量和中断概率的闭式边界。最后,结果显示了同轴星通信理论上和操作上可实现的性能边界。为了深入了解进一步的设计,我们还提供了综合系统设置的结果,其中包括任务目标以及轨道和系统动态。
{"title":"Cislunar Communication Performance and System Analysis with Uncharted Phenomena","authors":"Selen Gecgel Cetin, Angeles Vazquez-Castro, Gunes Karabulut Kurt","doi":"arxiv-2409.09426","DOIUrl":"https://doi.org/arxiv-2409.09426","url":null,"abstract":"The Moon and its surrounding cislunar space have numerous unknowns,\u0000uncertainties, or partially charted phenomena that need to be investigated to\u0000determine the extent to which they affect cislunar communication. These include\u0000temperature fluctuations, spacecraft distance and velocity dynamics, surface\u0000roughness, and the diversity of propagation mechanisms. To develop robust and\u0000dynamically operative Cislunar space networks (CSNs), we need to analyze the\u0000communication system by incorporating inclusive models that account for the\u0000wide range of possible propagation environments and noise characteristics. In\u0000this paper, we consider that the communication signal can be subjected to both\u0000Gaussian and non-Gaussian noise, but also to different fading conditions.\u0000First, we analyze the communication link by showing the relationship between\u0000the brightness temperatures of the Moon and the equivalent noise temperature at\u0000the receiver of the Lunar Gateway. We propose to analyze the ergodic capacity\u0000and the outage probability, as they are essential metrics for the development\u0000of reliable communication. In particular, we model the noise with the additive\u0000symmetric alpha-stable distribution, which allows a generic analysis for\u0000Gaussian and non-Gaussian signal characteristics. Then, we present the\u0000closed-form bounds for the ergodic capacity and the outage probability.\u0000Finally, the results show the theoretically and operationally achievable\u0000performance bounds for the cislunar communication. To give insight into further\u0000designs, we also provide our results with comprehensive system settings that\u0000include mission objectives as well as orbital and system dynamics.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hyperedge Representations with Hypergraph Wavelets: Applications to Spatial Transcriptomics 使用超图小波的 Hyperedge 表示法:空间转录组学应用
Pub Date : 2024-09-14 DOI: arxiv-2409.09469
Xingzhi Sun, Charles Xu, João F. Rocha, Chen Liu, Benjamin Hollander-Bodie, Laney Goldman, Marcello DiStasio, Michael Perlmutter, Smita Krishnaswamy
In many data-driven applications, higher-order relationships among multipleobjects are essential in capturing complex interactions. Hypergraphs, whichgeneralize graphs by allowing edges to connect any number of nodes, provide aflexible and powerful framework for modeling such higher-order relationships.In this work, we introduce hypergraph diffusion wavelets and describe theirfavorable spectral and spatial properties. We demonstrate their utility forbiomedical discovery in spatially resolved transcriptomics by applying themethod to represent disease-relevant cellular niches for Alzheimer's disease.
在许多数据驱动型应用中,多个对象之间的高阶关系对于捕捉复杂的相互作用至关重要。超图允许边连接任意数量的节点,从而对图进行了泛化,为此类高阶关系的建模提供了一个灵活而强大的框架。在这项工作中,我们介绍了超图扩散小波,并描述了其有利的频谱和空间特性。通过应用这种方法来表示阿尔茨海默病的疾病相关细胞龛,我们展示了它们在空间解析转录组学的生物医学发现中的实用性。
{"title":"Hyperedge Representations with Hypergraph Wavelets: Applications to Spatial Transcriptomics","authors":"Xingzhi Sun, Charles Xu, João F. Rocha, Chen Liu, Benjamin Hollander-Bodie, Laney Goldman, Marcello DiStasio, Michael Perlmutter, Smita Krishnaswamy","doi":"arxiv-2409.09469","DOIUrl":"https://doi.org/arxiv-2409.09469","url":null,"abstract":"In many data-driven applications, higher-order relationships among multiple\u0000objects are essential in capturing complex interactions. Hypergraphs, which\u0000generalize graphs by allowing edges to connect any number of nodes, provide a\u0000flexible and powerful framework for modeling such higher-order relationships.\u0000In this work, we introduce hypergraph diffusion wavelets and describe their\u0000favorable spectral and spatial properties. We demonstrate their utility for\u0000biomedical discovery in spatially resolved transcriptomics by applying the\u0000method to represent disease-relevant cellular niches for Alzheimer's disease.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"116 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility 联合语义知识提炼和掩蔽声学建模,实现具有更高可懂度的全频段语音修复
Pub Date : 2024-09-14 DOI: arxiv-2409.09357
Xiaoyu Liu, Xu Li, Joan Serrà, Santiago Pascual
Speech restoration aims at restoring full-band speech with high quality andintelligibility, considering a diverse set of distortions. MaskSR is a recentlyproposed generative model for this task. As other models of its kind, MaskSRattains high quality but, as we show, intelligibility can be substantiallyimproved. We do so by boosting the speech encoder component of MaskSR withpredictions of semantic representations of the target speech, using apre-trained self-supervised teacher model. Then, a masked language model isconditioned on the learned semantic features to predict acoustic tokens thatencode low level spectral details of the target speech. We show that, with thesame MaskSR model capacity and inference time, the proposed model, MaskSR2,significantly reduces the word error rate, a typical metric forintelligibility. MaskSR2 also achieves competitive word error rate among othermodels, while providing superior quality. An ablation study shows theeffectiveness of various semantic representations.
语音修复的目的是在考虑各种失真的情况下,恢复高质量和可理解的全频段语音。MaskSR 是最近针对这一任务提出的生成模型。与其他同类模型一样,MaskSR 可获得高质量,但正如我们所展示的,其可懂度也可大幅提高。为此,我们使用预先训练好的自监督教师模型,通过预测目标语音的语义表征来增强 MaskSR 的语音编码器部分。然后,以学习到的语义特征为条件建立掩码语言模型,预测编码目标语音低级频谱细节的声学标记。我们的研究表明,在 MaskSR 模型容量和推理时间相同的情况下,所提出的模型 MaskSR2 显著降低了单词错误率,而单词错误率是衡量语音可理解性的典型指标。MaskSR2 在提供卓越质量的同时,还在其他模型中实现了具有竞争力的词错误率。一项消融研究显示了各种语义表征的有效性。
{"title":"Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility","authors":"Xiaoyu Liu, Xu Li, Joan Serrà, Santiago Pascual","doi":"arxiv-2409.09357","DOIUrl":"https://doi.org/arxiv-2409.09357","url":null,"abstract":"Speech restoration aims at restoring full-band speech with high quality and\u0000intelligibility, considering a diverse set of distortions. MaskSR is a recently\u0000proposed generative model for this task. As other models of its kind, MaskSR\u0000attains high quality but, as we show, intelligibility can be substantially\u0000improved. We do so by boosting the speech encoder component of MaskSR with\u0000predictions of semantic representations of the target speech, using a\u0000pre-trained self-supervised teacher model. Then, a masked language model is\u0000conditioned on the learned semantic features to predict acoustic tokens that\u0000encode low level spectral details of the target speech. We show that, with the\u0000same MaskSR model capacity and inference time, the proposed model, MaskSR2,\u0000significantly reduces the word error rate, a typical metric for\u0000intelligibility. MaskSR2 also achieves competitive word error rate among other\u0000models, while providing superior quality. An ablation study shows the\u0000effectiveness of various semantic representations.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"65 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Foundations of Vision-Based Localization: A New Approach to Localizability Analysis Using Stochastic Geometry 基于视觉的定位基础:利用随机几何进行可定位性分析的新方法
Pub Date : 2024-09-14 DOI: arxiv-2409.09525
Haozhou Hu, Harpreet S. Dhillon, R. Michael Buehrer
Despite significant algorithmic advances in vision-based positioning, acomprehensive probabilistic framework to study its performance has remainedunexplored. The main objective of this paper is to develop such a frameworkusing ideas from stochastic geometry. Due to limitations in sensor resolution,the level of detail in prior information, and computational resources, we maynot be able to differentiate between landmarks with similar appearances in thevision data, such as trees, lampposts, and bus stops. While one cannotaccurately determine the absolute target position using a singleindistinguishable landmark, obtaining an approximate position fix is possibleif the target can see multiple landmarks whose geometric placement on the mapis unique. Modeling the locations of these indistinguishable landmarks as aPoisson point process (PPP) $Phi$ on $mathbb{R}^2$, we develop a new approachto analyze the localizability in this setting. From the target location$mathbb{x}$, the measurements are obtained from landmarks within thevisibility region. These measurements, including ranges and angles to thelandmarks, denoted as $f(mathbb{x})$, can be treated as mappings from thetarget location. We are interested in understanding the probability that themeasurements $f(mathbb{x})$ are sufficiently distinct from the measurement$f(mathbb{x}_0)$ at the given location, which we term localizability.Expressions of localizability probability are derived for specificvision-inspired measurements, such as ranges to landmarks and snapshots oftheir locations. Our analysis reveals that the localizability probabilityapproaches one when the landmark intensity tends to infinity, which means thaterror-free localization is achievable in this limiting regime.
尽管基于视觉的定位在算法上取得了重大进展,但研究其性能的综合概率框架仍有待探索。本文的主要目的就是利用随机几何的思想来开发这样一个框架。由于传感器分辨率、先验信息的详细程度和计算资源的限制,我们可能无法区分视觉数据中外观相似的地标,如树木、灯柱和公交车站。虽然我们无法通过单个难以区分的地标准确确定目标的绝对位置,但如果目标能看到多个地标,而这些地标在地图上的几何位置又是独一无二的,那么我们就有可能获得大致的位置固定。我们将这些不可分辨地标的位置建模为$mathbb{R}^2$上的泊松点过程(PPP)$Phi$,并开发了一种新方法来分析这种情况下的可定位性。从目标位置$mathbb{x}$出发,从可视区域内的地标获取测量值。这些测量值,包括与地标的距离和角度(表示为 $f(mathbb{x})$),可被视为来自目标位置的映射。我们感兴趣的是了解主题测量值$f(mathbb{x})$与给定位置的测量值$f(mathbb{x}_0)$有足够区别的概率,我们称之为本地化概率。我们的分析表明,当地标强度趋于无穷大时,定位概率接近于 1,这意味着在这种极限状态下可以实现无误差定位。
{"title":"Foundations of Vision-Based Localization: A New Approach to Localizability Analysis Using Stochastic Geometry","authors":"Haozhou Hu, Harpreet S. Dhillon, R. Michael Buehrer","doi":"arxiv-2409.09525","DOIUrl":"https://doi.org/arxiv-2409.09525","url":null,"abstract":"Despite significant algorithmic advances in vision-based positioning, a\u0000comprehensive probabilistic framework to study its performance has remained\u0000unexplored. The main objective of this paper is to develop such a framework\u0000using ideas from stochastic geometry. Due to limitations in sensor resolution,\u0000the level of detail in prior information, and computational resources, we may\u0000not be able to differentiate between landmarks with similar appearances in the\u0000vision data, such as trees, lampposts, and bus stops. While one cannot\u0000accurately determine the absolute target position using a single\u0000indistinguishable landmark, obtaining an approximate position fix is possible\u0000if the target can see multiple landmarks whose geometric placement on the map\u0000is unique. Modeling the locations of these indistinguishable landmarks as a\u0000Poisson point process (PPP) $Phi$ on $mathbb{R}^2$, we develop a new approach\u0000to analyze the localizability in this setting. From the target location\u0000$mathbb{x}$, the measurements are obtained from landmarks within the\u0000visibility region. These measurements, including ranges and angles to the\u0000landmarks, denoted as $f(mathbb{x})$, can be treated as mappings from the\u0000target location. We are interested in understanding the probability that the\u0000measurements $f(mathbb{x})$ are sufficiently distinct from the measurement\u0000$f(mathbb{x}_0)$ at the given location, which we term localizability.\u0000Expressions of localizability probability are derived for specific\u0000vision-inspired measurements, such as ranges to landmarks and snapshots of\u0000their locations. Our analysis reveals that the localizability probability\u0000approaches one when the landmark intensity tends to infinity, which means that\u0000error-free localization is achievable in this limiting regime.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"51 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph 快速 DCT+:基于路径图一级更新的快速变换系列
Pub Date : 2024-09-13 DOI: arxiv-2409.08970
Samuel Fernández-Menduiña, Eduardo Pavez, Antonio Ortega
This paper develops fast graph Fourier transform (GFT) algorithms with O(nlog n) runtime complexity for rank-one updates of the path graph. We first showthat several commonly-used audio and video coding transforms belong to thisclass of GFTs, which we denote by DCT+. Next, starting from an arbitrarygeneralized graph Laplacian and using rank-one perturbation theory, we providea factorization for the GFT after perturbation. This factorization is ourcentral result and reveals a progressive structure: we first apply theunperturbed Laplacian's GFT and then multiply the result by a Cauchy matrix. Byspecializing this decomposition to path graphs and exploiting the properties ofCauchy matrices, we show that Fast DCT+ algorithms exist. We also demonstratethat progressivity can speed up computations in applications involving multipletransforms related by rank-one perturbations (e.g., video coding) when combinedwith pruning strategies. Our results can be extended to other graphs and rank-kperturbations. Runtime analyses show that Fast DCT+ provides computationalgains over the naive method for graph sizes larger than 64, with runtimeapproximately equal to that of 8 DCTs.
本文针对路径图的秩一更新,开发了运行复杂度为 O(nlog n) 的快速图傅立叶变换(GFT)算法。我们首先证明,几种常用的音频和视频编码变换属于这一类 GFT,我们将其称为 DCT+。接下来,我们从任意广义图拉普拉斯开始,利用秩一扰动理论,提供了扰动后 GFT 的因式分解。这种因式分解是我们的核心成果,揭示了一种渐进结构:我们首先应用未扰动拉普拉斯的 GFT,然后将结果乘以考奇矩阵。通过将这种分解特殊化为路径图并利用考奇矩阵的特性,我们证明了快速 DCT+ 算法的存在。我们还证明,当结合剪枝策略时,渐进性可以加快涉及秩一扰动相关多变换(如视频编码)的应用中的计算速度。我们的结果可以扩展到其他图和阶一扰动。运行时间分析表明,当图的大小大于 64 时,快速 DCT+ 比传统方法带来了计算上的优势,其运行时间约等于 8 个 DCT 的运行时间。
{"title":"Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph","authors":"Samuel Fernández-Menduiña, Eduardo Pavez, Antonio Ortega","doi":"arxiv-2409.08970","DOIUrl":"https://doi.org/arxiv-2409.08970","url":null,"abstract":"This paper develops fast graph Fourier transform (GFT) algorithms with O(n\u0000log n) runtime complexity for rank-one updates of the path graph. We first show\u0000that several commonly-used audio and video coding transforms belong to this\u0000class of GFTs, which we denote by DCT+. Next, starting from an arbitrary\u0000generalized graph Laplacian and using rank-one perturbation theory, we provide\u0000a factorization for the GFT after perturbation. This factorization is our\u0000central result and reveals a progressive structure: we first apply the\u0000unperturbed Laplacian's GFT and then multiply the result by a Cauchy matrix. By\u0000specializing this decomposition to path graphs and exploiting the properties of\u0000Cauchy matrices, we show that Fast DCT+ algorithms exist. We also demonstrate\u0000that progressivity can speed up computations in applications involving multiple\u0000transforms related by rank-one perturbations (e.g., video coding) when combined\u0000with pruning strategies. Our results can be extended to other graphs and rank-k\u0000perturbations. Runtime analyses show that Fast DCT+ provides computational\u0000gains over the naive method for graph sizes larger than 64, with runtime\u0000approximately equal to that of 8 DCTs.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"184 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Efficient Low-Complexity RSMA Scheme for Multi-User Decode-and-Forward Relay Systems 多用户解码前向中继系统的高效低复杂度 RSMA 方案
Pub Date : 2024-09-13 DOI: arxiv-2409.08880
Ahmet Sacid Sümer, Mehmet Mert Şahin, Hüseyin Arslan
Rate-Splitting Multiple Access (RSMA) is a promising strategy for ensuringrobust transmission in multi-antenna wireless systems. In this paper, weinvestigate the performance of RSMA in a downlink Decode-and-Forward (DF) relayscenario under two phases with imperfect Channel State Information (CSI) at thetransmitter and the relay. In particular, in the first phase, the Base Station(BS) initially transmits to both BS Users (BUs) and the relay. In the secondphase, the relay decodes and forwards the received signals to Relay Users (RUs)outside the BS coverage area. Furthermore, we investigate a scenario where therelay broadcasts a common stream intended for the RUs in the second phase. Dueto the broadcast nature of the transmission, this stream is inadvertentlyreceived by both the RUs and the BUs. Concurrently, the BS utilizes SpatialDivision Multiple Access (SDMA) to transmit private streams to the BUs,resulting in BUs experiencing residual interference from the common streamtransmitted from relay. Incorporating this residual common stream interferenceinto our model results in a significant enhancement of the overall sum-rateachieved at the BUs. We derive a tractable lower bound on the ergodicsum-rates, enables us to develop closed-form solutions for power allocationthat maximize the overall sum-rate in both phases. Extensive simulationsvalidate that our proposed power allocation algorithm, in conjunction with alow-complexity precoder, significantly improves the sum-rate performance of DFrelay RSMA networks compared to the SDMA-based benchmark designs underimperfect CSI at the transmitter and relay.
速率分割多路访问(RSMA)是确保多天线无线系统中稳健传输的一种有前途的策略。在本文中,我们研究了 RSMA 在下行链路解码前向(DF)中继场景下的性能,该场景分为两个阶段,发射机和中继站的信道状态信息(CSI)均不完善。具体来说,在第一阶段,基站(BS)首先向 BS 用户(BU)和中继发送信息。在第二阶段,中继解码并将接收到的信号转发给基站覆盖区域外的中继用户(RU)。此外,我们还研究了一种情况,即中继在第二阶段为 RU 广播一个公共流。由于传输的广播性质,RU 和 BU 都会无意中接收到该数据流。与此同时,BS 利用空间分割多路访问 (SDMA) 向 BU 传输专用流,导致 BU 受到中继传输的公共流的残余干扰。将这种残余公共流干扰纳入我们的模型,可显著提高 BU 达到的总和速率。我们推导出了一个可控的遍历总和率下限,使我们能够开发出功率分配的闭式解决方案,最大限度地提高两个阶段的总和率。大量仿真验证了我们提出的功率分配算法与低复杂度前置编码器相结合,与基于 SDMA 的基准设计相比,在发射端和中继端 CSI 不完美的情况下,能显著提高 DFrelay RSMA 网络的总和速率性能。
{"title":"An Efficient Low-Complexity RSMA Scheme for Multi-User Decode-and-Forward Relay Systems","authors":"Ahmet Sacid Sümer, Mehmet Mert Şahin, Hüseyin Arslan","doi":"arxiv-2409.08880","DOIUrl":"https://doi.org/arxiv-2409.08880","url":null,"abstract":"Rate-Splitting Multiple Access (RSMA) is a promising strategy for ensuring\u0000robust transmission in multi-antenna wireless systems. In this paper, we\u0000investigate the performance of RSMA in a downlink Decode-and-Forward (DF) relay\u0000scenario under two phases with imperfect Channel State Information (CSI) at the\u0000transmitter and the relay. In particular, in the first phase, the Base Station\u0000(BS) initially transmits to both BS Users (BUs) and the relay. In the second\u0000phase, the relay decodes and forwards the received signals to Relay Users (RUs)\u0000outside the BS coverage area. Furthermore, we investigate a scenario where the\u0000relay broadcasts a common stream intended for the RUs in the second phase. Due\u0000to the broadcast nature of the transmission, this stream is inadvertently\u0000received by both the RUs and the BUs. Concurrently, the BS utilizes Spatial\u0000Division Multiple Access (SDMA) to transmit private streams to the BUs,\u0000resulting in BUs experiencing residual interference from the common stream\u0000transmitted from relay. Incorporating this residual common stream interference\u0000into our model results in a significant enhancement of the overall sum-rate\u0000achieved at the BUs. We derive a tractable lower bound on the ergodic\u0000sum-rates, enables us to develop closed-form solutions for power allocation\u0000that maximize the overall sum-rate in both phases. Extensive simulations\u0000validate that our proposed power allocation algorithm, in conjunction with a\u0000low-complexity precoder, significantly improves the sum-rate performance of DF\u0000relay RSMA networks compared to the SDMA-based benchmark designs under\u0000imperfect CSI at the transmitter and relay.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"47 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SIMRP: Self-Interference Mitigation Using RIS and Phase Shifter Network SIMRP:利用 RIS 和移相器网络缓解自干扰
Pub Date : 2024-09-13 DOI: arxiv-2409.08600
Zhang Wei, Chen Ding, Bin Zhou, Yi Jiang, Zhiyong Bu
Strong self-interference due to the co-located transmitter is the bottleneckfor implementing an in-band full-duplex (IBFD) system. If not adequatelymitigated, the strong interference can saturate the receiver's analog-digitalconverters (ADCs) and hence void the digital processing. This paper considersutilizing a reconfigurable intelligent surface (RIS), together with a receiving(Rx) phase shifter network (PSN), to mitigate the strong self-interferencethrough jointly optimizing their phases. This method, named self-interferencemitigation using RIS and PSN (SIMRP), can suppress self-interference to avoidADC saturation effectively and therefore improve the sum rate performance ofcommunication systems, as verified by the simulation studies.
同地发射机造成的强烈自干扰是实现带内全双工(IBFD)系统的瓶颈。如果不加以充分缓解,强干扰会使接收器的模数转换器(ADC)饱和,从而使数字处理失效。本文考虑利用可重构智能表面(RIS)和接收(Rx)移相器网络(PSN),通过联合优化它们的相位来缓解强自干扰。这种方法被命名为利用 RIS 和 PSN 缓解自干扰(SIMRP),它能有效抑制自干扰,避免 ADC 饱和,从而提高通信系统的总和速率性能。
{"title":"SIMRP: Self-Interference Mitigation Using RIS and Phase Shifter Network","authors":"Zhang Wei, Chen Ding, Bin Zhou, Yi Jiang, Zhiyong Bu","doi":"arxiv-2409.08600","DOIUrl":"https://doi.org/arxiv-2409.08600","url":null,"abstract":"Strong self-interference due to the co-located transmitter is the bottleneck\u0000for implementing an in-band full-duplex (IBFD) system. If not adequately\u0000mitigated, the strong interference can saturate the receiver's analog-digital\u0000converters (ADCs) and hence void the digital processing. This paper considers\u0000utilizing a reconfigurable intelligent surface (RIS), together with a receiving\u0000(Rx) phase shifter network (PSN), to mitigate the strong self-interference\u0000through jointly optimizing their phases. This method, named self-interference\u0000mitigation using RIS and PSN (SIMRP), can suppress self-interference to avoid\u0000ADC saturation effectively and therefore improve the sum rate performance of\u0000communication systems, as verified by the simulation studies.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"25 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge 射频挑战:数据驱动的射频信号分离挑战
Pub Date : 2024-09-13 DOI: arxiv-2409.08839
Alejandro Lancho, Amir Weiss, Gary C. F. Lee, Tejas Jayashankar, Binoy Kurien, Yury Polyanskiy, Gregory W. Wornell
This paper addresses the critical problem of interference rejection inradio-frequency (RF) signals using a novel, data-driven approach that leveragesstate-of-the-art AI models. Traditionally, interference rejection algorithmsare manually tailored to specific types of interference. This work introduces amore scalable data-driven solution and contains the following contributions.First, we present an insightful signal model that serves as a foundation fordeveloping and analyzing interference rejection algorithms. Second, weintroduce the RF Challenge, a publicly available dataset featuring diverse RFsignals along with code templates, which facilitates data-driven analysis of RFsignal problems. Third, we propose novel AI-based rejection algorithms,specifically architectures like UNet and WaveNet, and evaluate theirperformance across eight different signal mixture types. These modelsdemonstrate superior performance exceeding traditional methods like matchedfiltering and linear minimum mean square error estimation by up to two ordersof magnitude in bit-error rate. Fourth, we summarize the results from an opencompetition hosted at 2024 IEEE International Conference on Acoustics, Speech,and Signal Processing (ICASSP 2024) based on the RF Challenge, highlighting thesignificant potential for continued advancements in this area. Our findingsunderscore the promise of deep learning algorithms in mitigating interference,offering a strong foundation for future research.
本文采用一种新颖的数据驱动方法,利用最先进的人工智能模型来解决射频(RF)信号中的干扰抑制这一关键问题。传统上,干扰抑制算法是针对特定干扰类型手动定制的。首先,我们提出了一个具有洞察力的信号模型,作为开发和分析干扰抑制算法的基础。其次,我们引入了射频挑战赛(RF Challenge),这是一个公开可用的数据集,包含各种射频信号和代码模板,有助于对射频信号问题进行数据驱动分析。第三,我们提出了基于人工智能的新型剔除算法,特别是 UNet 和 WaveNet 等架构,并评估了它们在八种不同信号混合物类型中的性能。这些模型表现出优越的性能,比匹配过滤和线性最小均方误差估计等传统方法的误码率高出两个数量级。第四,我们总结了在 2024 年 IEEE 国际声学、语音和信号处理大会(ICASSP 2024)上举办的基于射频挑战赛的公开竞赛的结果,强调了该领域持续进步的巨大潜力。我们的研究结果进一步证实了深度学习算法在缓解干扰方面的前景,为未来的研究奠定了坚实的基础。
{"title":"RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge","authors":"Alejandro Lancho, Amir Weiss, Gary C. F. Lee, Tejas Jayashankar, Binoy Kurien, Yury Polyanskiy, Gregory W. Wornell","doi":"arxiv-2409.08839","DOIUrl":"https://doi.org/arxiv-2409.08839","url":null,"abstract":"This paper addresses the critical problem of interference rejection in\u0000radio-frequency (RF) signals using a novel, data-driven approach that leverages\u0000state-of-the-art AI models. Traditionally, interference rejection algorithms\u0000are manually tailored to specific types of interference. This work introduces a\u0000more scalable data-driven solution and contains the following contributions.\u0000First, we present an insightful signal model that serves as a foundation for\u0000developing and analyzing interference rejection algorithms. Second, we\u0000introduce the RF Challenge, a publicly available dataset featuring diverse RF\u0000signals along with code templates, which facilitates data-driven analysis of RF\u0000signal problems. Third, we propose novel AI-based rejection algorithms,\u0000specifically architectures like UNet and WaveNet, and evaluate their\u0000performance across eight different signal mixture types. These models\u0000demonstrate superior performance exceeding traditional methods like matched\u0000filtering and linear minimum mean square error estimation by up to two orders\u0000of magnitude in bit-error rate. Fourth, we summarize the results from an open\u0000competition hosted at 2024 IEEE International Conference on Acoustics, Speech,\u0000and Signal Processing (ICASSP 2024) based on the RF Challenge, highlighting the\u0000significant potential for continued advancements in this area. Our findings\u0000underscore the promise of deep learning algorithms in mitigating interference,\u0000offering a strong foundation for future research.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Why some audio signal short-time Fourier transform coefficients have nonuniform phase distributions 为什么某些音频信号的短时傅里叶变换系数具有不均匀的相位分布
Pub Date : 2024-09-13 DOI: arxiv-2409.08981
Stephen D. Voran
The short-time Fourier transform (STFT) represents a window of audio samplesas a set of complex coefficients. These are advantageously viewed as magnitudesand phases and the overall distribution of phases is very often assumed to beuniform. We show that when audio signal STFT phase distributions are analyzedper-frequency or per-magnitude range, they can be far from uniform. That is,the uniform phase distribution assumption obscures significant importantdetails. We explain the significance of the nonuniform phase distributions andhow they might be exploited, derive their source, and explain why the choice ofthe STFT window shape influences the nonuniformity of the resulting phasedistributions.
短时傅立叶变换(STFT)将音频样本窗口表示为一组复系数。这些系数被视为幅度和相位,而相位的整体分布通常被假定为均匀的。我们的研究表明,当按频率或幅度范围分析音频信号 STFT 相位分布时,它们可能远非均匀。也就是说,均匀相位分布假设掩盖了重要的细节。我们解释了不均匀相位分布的意义以及如何利用它们,得出了它们的来源,并解释了为什么 STFT 窗口形状的选择会影响所得到的相位分布的不均匀性。
{"title":"Why some audio signal short-time Fourier transform coefficients have nonuniform phase distributions","authors":"Stephen D. Voran","doi":"arxiv-2409.08981","DOIUrl":"https://doi.org/arxiv-2409.08981","url":null,"abstract":"The short-time Fourier transform (STFT) represents a window of audio samples\u0000as a set of complex coefficients. These are advantageously viewed as magnitudes\u0000and phases and the overall distribution of phases is very often assumed to be\u0000uniform. We show that when audio signal STFT phase distributions are analyzed\u0000per-frequency or per-magnitude range, they can be far from uniform. That is,\u0000the uniform phase distribution assumption obscures significant important\u0000details. We explain the significance of the nonuniform phase distributions and\u0000how they might be exploited, derive their source, and explain why the choice of\u0000the STFT window shape influences the nonuniformity of the resulting phase\u0000distributions.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"189 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - EE - Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1