arXiv - EE - Signal Processing: Latest Papers

Atomic Norm Minimization-based DoA Estimation for IRS-assisted Sensing Systems
Pub Date : 2024-09-16 DOI: arxiv-2409.09982
Renwang Li, Shu Sun, Meixia Tao
Intelligent reflecting surface (IRS) is expected to play a pivotal role in future wireless sensing networks owing to its potential for high-resolution and high-accuracy sensing. In this work, we investigate a multi-target direction-of-arrival (DoA) estimation problem in a semi-passive IRS-assisted sensing system, where IRS reflecting elements (REs) reflect signals from the base station to targets, and IRS sensing elements (SEs) estimate DoA based on echo signals reflected by the targets. First, instead of relying solely on IRS SEs for DoA estimation as done in the existing literature, this work fully exploits the DoA information embedded in both the IRS RE and SE matrices via the atomic norm minimization (ANM) scheme. Subsequently, the Cramér-Rao bound for DoA estimation is derived, revealing an inverse proportionality to $MN^3+NM^3$ in the case of an identity covariance matrix of the IRS measurement matrix and a single target, where $M$ and $N$ are the numbers of IRS SEs and REs, respectively. Finally, extensive numerical results substantiate the superior accuracy and resolution performance of the proposed ANM-based DoA estimation method over representative baselines.
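The scaling stated in the abstract, a Cramér-Rao bound inversely proportional to $MN^3+NM^3$, can be illustrated with a minimal sketch. The function name `crb_scaling` is hypothetical and the proportionality constant (which would depend on SNR, geometry, and snapshots) is deliberately omitted, so only ratios between configurations are meaningful.

```python
def crb_scaling(num_se: int, num_re: int) -> float:
    """Relative CRB level, proportional to 1 / (M*N^3 + N*M^3).

    num_se is M (IRS sensing elements), num_re is N (IRS reflecting elements).
    The unknown constant factor is left out (assumption: it does not depend
    on M or N), so compare values only as ratios.
    """
    m, n = num_se, num_re
    if m <= 0 or n <= 0:
        raise ValueError("element counts must be positive")
    return 1.0 / (m * n**3 + n * m**3)


# Doubling both M and N multiplies M*N^3 + N*M^3 by 2^4 = 16.
base = crb_scaling(8, 16)
doubled = crb_scaling(16, 32)
print(base / doubled)
```

Because every term in the denominator has total degree four in $(M, N)$, scaling both element counts by a factor $k$ lowers the relative bound by $k^4$ under this single-target, identity-covariance case.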
Wavenumber-Domain Near-Field Channel Estimation: Beyond the Fresnel Bound
Pub Date : 2024-09-16 DOI: arxiv-2409.10123
Xufeng Guo, Yuanbin Chen, Ying Wang, Zhaocheng Wang, Chau Yuen
In the near-field context, the Fresnel approximation is typically employed to mathematically represent solvable functions of spherical waves. However, these efforts may fail to take into account the significant increase in the lower limit of the Fresnel approximation, known as the Fresnel distance. The lower bound of the Fresnel approximation imposes a constraint that becomes more pronounced as the array size grows. Beyond this constraint, the Fresnel approximation is no longer valid. As a potential solution, the wavenumber-domain paradigm characterizes the spherical wave using a spectrum composed of a series of linear orthogonal bases. However, this approach falls short of covering the effects of the array geometry, especially when using Gaussian-mixed-model (GMM)-based von Mises-Fisher distributions to approximate all spectra. To fill this gap, this paper introduces a novel wavenumber-domain ellipse fitting (WDEF) method to tackle these challenges. In particular, the channel is accurately estimated in the near-field region by maximizing the closed-form likelihood function of the wavenumber-domain spectrum conditioned on the scatterers' geometric parameters. Simulation results are provided to demonstrate the robustness of the proposed scheme against both the distance and the angles of arrival.
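To make the array-size dependence of the Fresnel distance concrete, the following sketch uses the textbook antenna-theory expressions $0.62\sqrt{D^3/\lambda}$ for the Fresnel distance and $2D^2/\lambda$ for the Rayleigh distance. Whether the paper adopts exactly these definitions is an assumption; the function names and the example aperture and wavelength values are hypothetical.

```python
import math


def fresnel_distance(aperture_m: float, wavelength_m: float) -> float:
    """Textbook lower limit of the Fresnel region: 0.62 * sqrt(D^3 / lambda)."""
    return 0.62 * math.sqrt(aperture_m**3 / wavelength_m)


def rayleigh_distance(aperture_m: float, wavelength_m: float) -> float:
    """Textbook near-field / far-field boundary: 2 * D^2 / lambda."""
    return 2.0 * aperture_m**2 / wavelength_m


# Illustrative values only: 1 cm wavelength (roughly 30 GHz), two aperture sizes.
wavelength = 0.01
for aperture in (0.5, 1.0):
    print(
        f"D = {aperture} m: Fresnel ~ {fresnel_distance(aperture, wavelength):.2f} m, "
        f"Rayleigh ~ {rayleigh_distance(aperture, wavelength):.1f} m"
    )
```

Under these expressions the Fresnel distance grows as $D^{3/2}$, so doubling the aperture pushes the lower limit outward by a factor of about 2.83, which is the growing constraint the abstract refers to.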
Rate-Splitting Multiple Access for Coexistence of Semantic and Bit Communications
Pub Date : 2024-09-16 DOI: arxiv-2409.10314
Yuanwen Liu, Bruno Clerckx
In the sixth generation (6G) of cellular networks, the demands for capacity and connectivity will increase dramatically to meet the requirements of emerging services for both humans and machines. Semantic communication has shown great potential because of its efficiency and its suitability for users who only care about the semantic meaning. However, bit communication is still needed for users requiring the original messages. Therefore, semantic and bit communications will coexist in future networks. This motivates us to explore how to allocate resources in such a coexistence scenario. We investigate different uplink multiple access (MA) schemes for the coexistence of semantic users and a bit user, namely orthogonal multiple access (OMA), non-orthogonal multiple access (NOMA), and rate-splitting multiple access (RSMA). We characterize the rate regions achieved by those MA schemes. The simulation results show that RSMA always outperforms NOMA and performs better than OMA in high semantic rate regimes. We find that RSMA scheme design, rate region, and power allocation are quite different in the coexistence scenario compared to bit-only communication, primarily due to the need to consider understandability in semantic communications. Interestingly, in contrast to bit-only communications, where RSMA is capacity-achieving without any need for time sharing, in the coexistence scenario time sharing helps enlarge the RSMA rate region.
Learning Latent Wireless Dynamics from Channel State Information
Pub Date : 2024-09-16 DOI: arxiv-2409.10045
Charbel Bou Chaaya, Abanoub M. Girgis, Mehdi Bennis
In this work, we propose a novel data-driven machine learning (ML) technique to model and predict the dynamics of the wireless propagation environment in latent space. Leveraging the idea of channel charting, which learns compressed representations of high-dimensional channel state information (CSI), we incorporate a predictive component to capture the dynamics of the wireless system. Hence, we jointly learn a channel encoder that maps the estimated CSI to an appropriate latent space, and a predictor that models the relationships between such representations. Accordingly, our problem boils down to training a joint-embedding predictive architecture (JEPA) that simulates the latent dynamics of a wireless network from CSI. We present numerical evaluations on measured data and show that the proposed JEPA displays a two-fold increase in accuracy over benchmarks for longer look-ahead prediction tasks.
Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization
Pub Date : 2024-09-16 DOI: arxiv-2409.10157
Xiaoxue Gao, Chen Zhang, Yiming Chen, Huayun Zhang, Nancy F. Chen
Current emotional text-to-speech (TTS) models predominantly conduct supervised training to learn the conversion from text and desired emotion to emotional speech, focusing on a single emotion per text-speech pair. These models only learn the correct emotional outputs without fully comprehending other emotion characteristics, which limits their ability to capture the nuances between different emotions. We propose a controllable Emo-DPO approach, which employs direct preference optimization to differentiate subtle nuances between emotions by optimizing towards preferred emotions over less preferred ones. Instead of relying on traditional neural architectures used in existing emotional TTS models, we propose utilizing an emotion-aware LLM-TTS neural architecture to leverage LLMs' in-context learning and instruction-following capabilities. Comprehensive experiments confirm that our proposed method outperforms the existing baselines.
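For readers unfamiliar with direct preference optimization, the sketch below evaluates the generic DPO loss for one preference pair, $-\log\sigma\big(\beta[(\log\pi(y_w) - \log\pi_{\mathrm{ref}}(y_w)) - (\log\pi(y_l) - \log\pi_{\mathrm{ref}}(y_l))]\big)$. This is the standard DPO objective, not necessarily the exact Emo-DPO training loss; the function name, argument names, and the default `beta` are assumptions made for illustration.

```python
import math


def dpo_loss(
    logp_pref: float,
    logp_rej: float,
    ref_logp_pref: float,
    ref_logp_rej: float,
    beta: float = 0.1,  # hypothetical default, not taken from the paper
) -> float:
    """Generic DPO loss for a single (preferred, rejected) pair.

    logp_* are sequence log-likelihoods under the policy being trained;
    ref_logp_* are the same quantities under the frozen reference model.
    """
    margin = beta * ((logp_pref - ref_logp_pref) - (logp_rej - ref_logp_rej))
    # -log(sigmoid(margin)) = log(1 + exp(-margin)), computed stably.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# The loss falls when the policy favours the preferred sample more than the reference does.
print(dpo_loss(-1.0, -3.0, -2.0, -2.0))  # policy prefers the preferred output
print(dpo_loss(-3.0, -1.0, -2.0, -2.0))  # policy prefers the rejected output
```

In an emotional TTS setting, the preferred sample would be speech in the target emotion and the rejected sample speech in a less preferred emotion for the same text; how the paper constructs these pairs is not specified in the abstract.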
Self-Updating Vehicle Monitoring Framework Employing Distributed Acoustic Sensing towards Real-World Settings
Pub Date : 2024-09-16 DOI: arxiv-2409.10259
Xi Wang, Xin Liu, Songming Zhu, Zhanwen Li, Lina Gao
The recent emergence of Distributed Acoustic Sensing (DAS) technology has facilitated the effective capture of traffic-induced seismic data. The traffic-induced seismic wave is a prominent contributor to urban vibrations and contains crucial information to advance urban exploration and governance. However, identifying vehicular movements within massive noisy data poses a significant challenge. In this study, we introduce a real-time semi-supervised vehicle monitoring framework tailored to urban settings. It requires only a small fraction of manual labels for initial training and exploits unlabeled data for model improvement. Additionally, the framework can autonomously adapt to newly collected unlabeled data. Before DAS data undergo object detection as two-dimensional images to preserve spatial information, we leverage comprehensive one-dimensional signal preprocessing to mitigate noise. Furthermore, we propose a novel prior loss that incorporates the shapes of vehicular traces to track a single vehicle with varying speeds. To evaluate our model, we conducted experiments with seismic data from the Stanford 2 DAS Array. The results showed that our model outperformed the baseline model Efficient Teacher and its supervised counterpart, YOLO (You Only Look Once), in both accuracy and robustness. With only 35 labeled images, our model surpassed YOLO on the mAP 0.5:0.95 criterion by 18% and showed a 7% increase over Efficient Teacher. We conducted comparative experiments with multiple update strategies for self-updating and identified an optimal approach. This approach surpasses the performance of non-overfitting training conducted with all data in a single pass.
Joint Beamforming and Illumination Pattern Design for Beam-Hopping LEO Satellite Communications
Pub Date : 2024-09-16 DOI: arxiv-2409.10127
Jing Wang, Chenhao Qi, Shui Yu, Shiwen Mao
Since hybrid beamforming (HBF) can approach the performance of fully-digital beamforming (FDBF) with much lower hardware complexity, we investigate the HBF design for beam-hopping (BH) low earth orbit (LEO) satellite communications (SatComs). Aiming at maximizing the sum-rate of all illuminated beam positions during the whole BH period, we consider joint beamforming and illumination pattern design subject to the HBF constraints and sum-rate requirements. To address the non-convexity of the HBF constraints, we temporarily replace the HBF constraints with the FDBF constraints. Then we propose an FDBF and illumination pattern random search (FDBF-IPRS) scheme to optimize illumination patterns and fully-digital beamformers using constrained random search and fractional programming methods. To further reduce the computational complexity, we propose an FDBF and illumination pattern alternating optimization (FDBF-IPAO) scheme, where we relax the integer illumination pattern to continuous variables and, after finishing all the iterations, quantize the continuous variables into integer ones. Based on the fully-digital beamformers designed by the FDBF-IPRS or FDBF-IPAO scheme, we propose an HBF alternating minimization algorithm to design the hybrid beamformers. Simulation results show that the proposed schemes can achieve satisfactory sum-rate performance for BH LEO SatComs.
Self-supervised Multimodal Speech Representations for the Assessment of Schizophrenia Symptoms
Pub Date : 2024-09-15 DOI: arxiv-2409.09733
Gowtham Premananth, Carol Espy-Wilson
Multimodal schizophrenia assessment systems have gained traction over the last few years. This work introduces a schizophrenia assessment system to discern between prominent symptom classes of schizophrenia and predict an overall schizophrenia severity score. We develop a Vector Quantized Variational Auto-Encoder (VQ-VAE) based Multimodal Representation Learning (MRL) model to produce task-agnostic speech representations from vocal Tract Variables (TVs) and Facial Action Units (FAUs). These representations are then used in a Multi-Task Learning (MTL) based downstream prediction model to obtain class labels and an overall severity score. The proposed framework outperforms previous works on the multi-class classification task across all evaluation metrics (Weighted F1 score, AUC-ROC score, and Weighted Accuracy). Additionally, it estimates the schizophrenia severity score, a task not addressed by earlier approaches.
Multicarrier Spread Spectrum Communications with Noncontiguous Subcarrier Bands for HF Skywave Links
Pub Date : 2024-09-15 DOI: arxiv-2409.09723
Brandon T. Hunt (Montana Technological University), Hussein Moradi (Idaho National Laboratory), Behrouz Farhang-Boroujeny (The University of Utah)
Growing traffic over the high-frequency (HF) band poses significant challenges to establishing robust communication links. While existing spread-spectrum HF transceivers are, to some degree, robust against harsh HF channel conditions, their performance significantly degrades in the presence of strong co-channel interference. To improve performance in congested channel conditions, we propose a filter-bank based multicarrier spread-spectrum waveform with noncontiguous subcarrier bands. The use of noncontiguous subcarriers allows the system to at once leverage the robustness of a wideband system while retaining the frequency agility of a narrowband system. In this study, we explore differences between contiguous and noncontiguous systems by considering their respective peak-to-average power ratios (PAPRs) and matched-filter responses. Additionally, we develop a modified filter-bank receiver structure to facilitate both efficient signal processing and noncontiguous channel estimation. We conclude by presenting simulated and over-the-air results of the noncontiguous waveform, demonstrating both its robustness in harsh HF channels and its enhanced performance in congested spectral conditions.
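The abstract compares contiguous and noncontiguous systems by their peak-to-average power ratios. As a rough illustration of that metric only, the following Python sketch computes the PAPR of one multicarrier symbol for two band plans. It is not the authors' filter-bank waveform: the subcarrier indices, the 64-sample symbol length, and the ±1 gains are all hypothetical values chosen for the example, and the symbol is modeled as a plain sum of complex exponentials.

```python
import cmath
import math


def papr_db(samples):
    """Return the peak-to-average power ratio of complex samples, in dB."""
    powers = [abs(s) ** 2 for s in samples]
    mean_power = sum(powers) / len(powers)
    return 10.0 * math.log10(max(powers) / mean_power)


def multicarrier_symbol(subcarriers, gains, n_samples):
    """Sum of complex exponentials, one per active subcarrier index."""
    return [
        sum(g * cmath.exp(2j * math.pi * k * n / n_samples)
            for k, g in zip(subcarriers, gains))
        for n in range(n_samples)
    ]


# Hypothetical band plans: 8 active subcarriers out of 64 bins.
contiguous = list(range(8, 16))
noncontiguous = [3, 4, 5, 20, 21, 40, 41, 42]
# Hypothetical +/-1 spreading gains, shared by both plans.
gains = [1, -1, 1, 1, -1, -1, 1, -1]

for name, plan in [("contiguous", contiguous), ("noncontiguous", noncontiguous)]:
    x = multicarrier_symbol(plan, gains, n_samples=64)
    print(f"{name}: PAPR = {papr_db(x):.2f} dB")
```

With unit-magnitude gains on K distinct subcarriers, the PAPR of such a symbol cannot exceed 10·log10(K) dB, so both plans here are bounded by about 9.03 dB. Where each plan falls below that bound depends on the chosen indices and gains.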
Citations: 0
Ten Years of Research Advances in Full-Duplex Massive MIMO
Pub Date : 2024-09-15 DOI: arxiv-2409.09732
Mohammadali Mohammadi, Zahra Mobini, Hien Quoc Ngo, Michail Matthaiou
We present an overview of ongoing research endeavors focused on in-band full-duplex (IBFD) massive multiple-input multiple-output (MIMO) systems and their applications. In response to the unprecedented demands for mobile traffic in concurrent and upcoming wireless networks, a paradigm shift from conventional cellular networks to distributed communication systems becomes imperative. Cell-free massive MIMO (CF-mMIMO) emerges as a practical and scalable implementation of distributed/network MIMO systems, serving as a crucial physical layer technology for the advancement of next-generation wireless networks. This architecture inherits benefits from co-located massive MIMO and distributed systems and provides the flexibility for integration with the IBFD technology. We delineate the evolutionary trajectory of cellular networks, transitioning from conventional half-duplex multi-user MIMO networks to IBFD CF-mMIMO. The discussion extends further to the emerging paradigm of network-assisted IBFD CF-mMIMO (NAFD CF-mMIMO), serving as an energy-efficient prototype for asymmetric uplink and downlink communication services. This novel approach finds applications in dual-functionality scenarios, including simultaneous wireless power and information transmission, wireless surveillance, and integrated sensing and communications. We highlight various current use case applications, discuss open challenges, and outline future research directions aimed at fully realizing the potential of NAFD CF-mMIMO systems to meet the evolving demands of future wireless networks.
Citations: 0