首页 > 最新文献

2022 IEEE International Conference on Signal Processing and Communications (SPCOM)最新文献

英文 中文
Millimeter Wave Hybrid MIMO System Channel Estimation Using Variable Step Size Zero Attracting LMS 基于变步长零吸引LMS的毫米波混合MIMO系统信道估计
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840854
Vidya Bhasker Shukla, R. Mitra, V. Bhatia
Millimeter-wave multiple-input multiple-output (mmWave MIMO) has emerged as a viable technique for 5G and beyond 5G(B5G) wireless networks, promising higher spectral efficiency and increased data speeds. However, achieving high spectral efficiency and data rates requires precise channel estimation, which is difficult for mmWave MIMO due to scattering and blockages in general. Because of scattering and blockages, mmWave MIMO channels have intrinsic sparsity, which needs sparse-aware channel estimation algorithms. As a result, this work propose a variable step-size zero-attracting least mean squares (VSSZALMS) based channel-estimator. In VSSZALMS the step-size increases (or decreases) as the mean-square error (MSE) increases (or decreases) that’s result adaptive estimator based on VSSZALMS achieves better tracking and faster convergence rate. Convergence and steady-state behavior of estimator is analyzed. Simulations for a typical mmWave MIMO channels demonstrate the benefits of the proposed sparse channel-estimation approach and its convergence.
毫米波多输入多输出(mmWave MIMO)已经成为5G及5G以上(B5G)无线网络的可行技术,有望提高频谱效率和数据速度。然而,实现高频谱效率和数据速率需要精确的信道估计,这对于毫米波MIMO来说是困难的,因为通常存在散射和阻塞。由于散射和阻塞,毫米波MIMO信道具有固有的稀疏性,需要稀疏感知信道估计算法。因此,本工作提出了一种基于可变步长零吸引最小均方(VSSZALMS)的信道估计器。在VSSZALMS中,步长随均方误差(MSE)的增大(或减小)而增大(或减小),使得基于VSSZALMS的自适应估计器实现了更好的跟踪和更快的收敛速度。分析了估计器的收敛性和稳态性。对典型毫米波MIMO信道的仿真表明了稀疏信道估计方法的优点及其收敛性。
{"title":"Millimeter Wave Hybrid MIMO System Channel Estimation Using Variable Step Size Zero Attracting LMS","authors":"Vidya Bhasker Shukla, R. Mitra, V. Bhatia","doi":"10.1109/SPCOM55316.2022.9840854","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840854","url":null,"abstract":"Millimeter-wave multiple-input multiple-output (mmWave MIMO) has emerged as a viable technique for 5G and beyond 5G(B5G) wireless networks, promising higher spectral efficiency and increased data speeds. However, achieving high spectral efficiency and data rates requires precise channel estimation, which is difficult for mmWave MIMO due to scattering and blockages in general. Because of scattering and blockages, mmWave MIMO channels have intrinsic sparsity, which needs sparse-aware channel estimation algorithms. As a result, this work propose a variable step-size zero-attracting least mean squares (VSSZALMS) based channel-estimator. In VSSZALMS the step-size increases (or decreases) as the mean-square error (MSE) increases (or decreases) that’s result adaptive estimator based on VSSZALMS achieves better tracking and faster convergence rate. Convergence and steady-state behavior of estimator is analyzed. Simulations for a typical mmWave MIMO channels demonstrate the benefits of the proposed sparse channel-estimation approach and its convergence.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132594962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Artificial Bandwidth Extension Using H∞ Optimization, Deep Neural Network, and Speech Production Model 利用H∞优化、深度神经网络和语音产生模型的人工带宽扩展
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840805
Deepika Gupta, H. S. Shekhawat
Artificial bandwidth extension is applied to speech signals to improve their quality in narrowband telephonic communication. For accomplishing this, the missing high-frequency components of speech signals are recovered by utilizing an extrapolation process. In this context, we propose another structure wherein we apply the gain adjustment as well as the discrete Fourier transform addition for adding the narrowband signal and corresponding estimated high-band signal. The high-band signal is evaluated by using a synthesis filter, which is acquired by utilizing the $H^{infty}$ optimization and speech production model. Non-stationary (time-varying) characteristics of speech signals produce assorted variety in the synthesis filters. So, we use a feed-forward deep neural network (DNN) to estimate the synthesis filter information and gain factor for a given narrowband feature of the signal. Objective analysis is done on the RSR15 and TIMIT datasets. Additionally, objective analysis is performed separately for the voiced speech as well as for the unvoiced speech. Subjective evaluation is conducted on the RSR15 dataset.
在窄带电话通信中,为了提高语音信号的质量,对语音信号进行了人工带宽扩展。为了实现这一点,通过利用外推过程恢复语音信号中缺失的高频成分。在这种情况下,我们提出了另一种结构,其中我们应用增益调整以及离散傅立叶变换加法来添加窄带信号和相应的估计高频带信号。利用$H^{infty}$优化和语音产生模型获得的合成滤波器对高频带信号进行评估。语音信号的非平稳(时变)特性导致了合成滤波器的各种变化。因此,我们使用前馈深度神经网络(DNN)来估计给定信号窄带特征的合成滤波器信息和增益因子。对RSR15和TIMIT数据集进行客观分析。此外,对浊音和不浊音分别进行客观分析。对RSR15数据集进行主观评价。
{"title":"Artificial Bandwidth Extension Using H∞ Optimization, Deep Neural Network, and Speech Production Model","authors":"Deepika Gupta, H. S. Shekhawat","doi":"10.1109/SPCOM55316.2022.9840805","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840805","url":null,"abstract":"Artificial bandwidth extension is applied to speech signals to improve their quality in narrowband telephonic communication. For accomplishing this, the missing high-frequency components of speech signals are recovered by utilizing an extrapolation process. In this context, we propose another structure wherein we apply the gain adjustment as well as the discrete Fourier transform addition for adding the narrowband signal and corresponding estimated high-band signal. The high-band signal is evaluated by using a synthesis filter, which is acquired by utilizing the $H^{infty}$ optimization and speech production model. Non-stationary (time-varying) characteristics of speech signals produce assorted variety in the synthesis filters. So, we use a feed-forward deep neural network (DNN) to estimate the synthesis filter information and gain factor for a given narrowband feature of the signal. Objective analysis is done on the RSR15 and TIMIT datasets. Additionally, objective analysis is performed separately for the voiced speech as well as for the unvoiced speech. Subjective evaluation is conducted on the RSR15 dataset.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124863883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Significance of Distance Measures for Speaker Anonymization 距离度量对说话人匿名化的意义
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840515
Gauri P. Prajapati, D. Singh, H. Patil
Privacy preservation methods for voice data are evolving day by day. A recent state-of-the-art voice privacy algorithm uses an x-vector and neural source-filter (NSF)- based anonymization approach that converts the original input voice into a pseudo speaker’s voice. The method uses an affinity propagation clustering (APC) algorithm to choose a pseudo speaker’s x-vector. Finding a set of distance measures for this clustering technique is important to get optimal anonymization. To that effect, in this paper, an attempt has been made to investigate the effect of six distance measures, namely, Euclidean, cosine, probabilistic linear discriminant analysis (PLDA), correlation, Manhattan, and Mahalanobis for voice privacy preservation using an x-vector-based anonymization system. This approach gave a 4.75% relative improvement in Equal Error Rate(EER) for original enrolls and anonymized trials. In addition, 11.49% relative improvement in EER is observed for anonymized enrolls and trials. Experimental results show that Mahalanobis and Pearson correlation coefficient-based distance are better choices for anonymization tasks. It provides better speaker de-identification and good speech intelligibility without increasing system complexity.
语音数据的隐私保护方法日益发展。最近的一种最先进的语音隐私算法使用基于x向量和神经源滤波器(NSF)的匿名化方法,将原始输入语音转换为伪说话者的语音。该方法采用亲和传播聚类(APC)算法选择伪说话人的x向量。为这种聚类技术找到一组距离度量对于获得最佳匿名化非常重要。为此,本文尝试使用基于x向量的匿名化系统来研究欧几里得、余弦、概率线性判别分析(PLDA)、相关性、曼哈顿和马氏等六种距离度量对语音隐私保护的影响。对于原始受试者和匿名试验,该方法在相等错误率(EER)方面的相对改进为4.75%。此外,在匿名入组和试验中,观察到11.49%的EER相对改善。实验结果表明,基于Mahalanobis和Pearson相关系数的距离是匿名化任务的较好选择。它在不增加系统复杂性的情况下提供了更好的说话人去识别和良好的语音清晰度。
{"title":"Significance of Distance Measures for Speaker Anonymization","authors":"Gauri P. Prajapati, D. Singh, H. Patil","doi":"10.1109/SPCOM55316.2022.9840515","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840515","url":null,"abstract":"Privacy preservation methods for voice data are evolving day by day. A recent state-of-the-art voice privacy algorithm uses an x-vector and neural source-filter (NSF)- based anonymization approach that converts the original input voice into a pseudo speaker’s voice. The method uses an affinity propagation clustering (APC) algorithm to choose a pseudo speaker’s x-vector. Finding a set of distance measures for this clustering technique is important to get optimal anonymization. To that effect, in this paper, an attempt has been made to investigate the effect of six distance measures, namely, Euclidean, cosine, probabilistic linear discriminant analysis (PLDA), correlation, Manhattan, and Mahalanobis for voice privacy preservation using an x-vector-based anonymization system. This approach gave a 4.75% relative improvement in Equal Error Rate(EER) for original enrolls and anonymized trials. In addition, 11.49% relative improvement in EER is observed for anonymized enrolls and trials. Experimental results show that Mahalanobis and Pearson correlation coefficient-based distance are better choices for anonymization tasks. It provides better speaker de-identification and good speech intelligibility without increasing system complexity.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125588949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Beamforming with Digital Predistortion using Xilinx RF SoC ZCU216 基于Xilinx射频SoC ZCU216的数字预失真数字波束形成
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840819
Nageswara Rao Dusari, Shipra, M. Rawat
This paper presents the implementation of digital beamforming (DBF) and linearization of power amplifiers (PAs) using Xilinx RF SoC ZCU216 software-defined radio (SDR). PAs in all the transmitting paths are linearized using the Memory polynomial (MP) model of the digital predistortion (DPD) technique to increase PA's efficiency and reduce the overall cost of the beamforming system. This paper also presents the methodology for phase calibration of each SDR transmitter path before applying DPD to provide an accurate phase to each antenna element. A 1x4 uniform microstrip antenna array is designed to operate at 3.5 GHz to perform DBF. The 5G NR signal with 20 MHz bandwidth is used for testing. The experimental results show that the beam is formed in the desired direction as per the applied phase shift between the antenna elements. An ACPR of -48 dB is obtained after DPD in each channel.
本文介绍了利用Xilinx RF SoC ZCU216软件定义无线电(SDR)实现数字波束形成(DBF)和功率放大器(PAs)的线性化。采用数字预失真(DPD)技术的记忆多项式(MP)模型对所有发射路径上的波束形成源进行线性化处理,提高了波束形成源的效率,降低了波束形成系统的总体成本。本文还介绍了在应用DPD为每个天线单元提供精确相位之前,对每个SDR发射机路径进行相位校准的方法。设计了一个1x4均匀微带天线阵列,工作在3.5 GHz以执行DBF。测试使用20mhz带宽的5G NR信号。实验结果表明,根据天线单元之间施加的相移,波束在期望的方向上形成。各通道DPD后的ACPR为- 48db。
{"title":"Digital Beamforming with Digital Predistortion using Xilinx RF SoC ZCU216","authors":"Nageswara Rao Dusari, Shipra, M. Rawat","doi":"10.1109/SPCOM55316.2022.9840819","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840819","url":null,"abstract":"This paper presents the implementation of digital beamforming (DBF) and linearization of power amplifiers (PAs) using Xilinx RF SoC ZCU216 software-defined radio (SDR). PAs in all the transmitting paths are linearized using the Memory polynomial (MP) model of the digital predistortion (DPD) technique to increase PA's efficiency and reduce the overall cost of the beamforming system. This paper also presents the methodology for phase calibration of each SDR transmitter path before applying DPD to provide an accurate phase to each antenna element. A 1x4 uniform microstrip antenna array is designed to operate at 3.5 GHz to perform DBF. The 5G NR signal with 20 MHz bandwidth is used for testing. The experimental results show that the beam is formed in the desired direction as per the applied phase shift between the antenna elements. An ACPR of -48 dB is obtained after DPD in each channel.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130296084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Channel Estimation Techniques for CP-Aided OTFS Systems Relying on Practical Pulse Shapes 基于实际脉冲形状的cp辅助OTFS系统信道估计技术
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840771
Anand Mehrotra, R. Singh, Suraj Srivastava, A. Jagannatham
This paper develops delay-Doppler domain CSI estimation techniques for cyclic prefix (CP)-aided OTFS systems that employ arbitrary pulse shapes at the transmitter/receiver. In this context, the vectorized model and an equivalent 2D-circular convolution relationship is derived between the input and output symbols for a CP-aided OTFS system. Initially, a pilot impulsebased CSI estimation technique is derived that exploits only the 2D-circular convolution relationship between the input and output symbols. Subsequently, an enhanced data-embedded pilot frame is proposed, where the data symbols are appropriately placed in the pilot frame, while being separated by delay-Doppler domain guard intervals from the pilot impulse, which eliminates the interference between the resulting data and pilot outputs. The proposed data-embedded channel estimation scheme exploits the pilot impulse for channel estimation. Subsequently, a vectorized input-output relationship is determined for data detection, followed by a linear MMSE (LMMSE) receiver that employs the estimated channel state information (CSI). Finally, simulation results are presented to demonstrate the performance of the proposed CSI estimation techniques in various settings and also to benchmark their performance with respect to an ideal system with perfect CSI.
本文开发了循环前缀(CP)辅助OTFS系统的延迟多普勒域CSI估计技术,该系统在发送/接收端采用任意脉冲形状。在此背景下,推导了cp辅助OTFS系统输入和输出符号之间的矢量化模型和等效2d -圆卷积关系。首先,导出了一种基于导频脉冲的CSI估计技术,该技术仅利用输入和输出符号之间的二维圆卷积关系。随后,提出了一种增强的数据嵌入式导频帧,其中数据符号适当地放置在导频帧中,同时通过延迟多普勒域保护间隔与导频脉冲分离,从而消除了结果数据与导频输出之间的干扰。提出的数据嵌入信道估计方案利用导频脉冲进行信道估计。随后,确定用于数据检测的矢量化输入-输出关系,然后使用估计的信道状态信息(CSI)的线性MMSE (LMMSE)接收器。最后,给出了仿真结果,以证明所提出的CSI估计技术在各种设置下的性能,并对具有完美CSI的理想系统的性能进行了基准测试。
{"title":"Channel Estimation Techniques for CP-Aided OTFS Systems Relying on Practical Pulse Shapes","authors":"Anand Mehrotra, R. Singh, Suraj Srivastava, A. Jagannatham","doi":"10.1109/SPCOM55316.2022.9840771","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840771","url":null,"abstract":"This paper develops delay-Doppler domain CSI estimation techniques for cyclic prefix (CP)-aided OTFS systems that employ arbitrary pulse shapes at the transmitter/receiver. In this context, the vectorized model and an equivalent 2D-circular convolution relationship is derived between the input and output symbols for a CP-aided OTFS system. Initially, a pilot impulsebased CSI estimation technique is derived that exploits only the 2D-circular convolution relationship between the input and output symbols. Subsequently, an enhanced data-embedded pilot frame is proposed, where the data symbols are appropriately placed in the pilot frame, while being separated by delay-Doppler domain guard intervals from the pilot impulse, which eliminates the interference between the resulting data and pilot outputs. The proposed data-embedded channel estimation scheme exploits the pilot impulse for channel estimation. Subsequently, a vectorized input-output relationship is determined for data detection, followed by a linear MMSE (LMMSE) receiver that employs the estimated channel state information (CSI). Finally, simulation results are presented to demonstrate the performance of the proposed CSI estimation techniques in various settings and also to benchmark their performance with respect to an ideal system with perfect CSI.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129426802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Multi-Stage Constant False-Alarm Rate Detector for Millimeter Wave Radars 一种用于毫米波雷达的多级恒虚警率检测器
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840827
G. Thiagarajan, Shilpa Hosur, Sanjeev Gurugopinath
We propose a novel multi-stage constant false alarm rate (CFAR) detector for millimeter wave radars. In particular, we consider the frequency modulated continuous wave (FMCW) radars. First, we employ order statistics-based detector (OSD) on the range and Doppler dimensions, to obtain potential target locations as a coarse detection procedure. Next, we propose a weighted centroid detector (WCD) for fine detection on the range-Doppler matrix obtained from OSD, which is agnostic to the knowledge of noise variance. We obtain analytical expressions for the probabilities of false-alarm and detection threshold for both OSD and WCD, which are validated using Monte Carlo simulations. Through synthetic data and real-world experimental data, we highlight the efficacy of the proposed detectors in terms of the receiver operating characteristics and detection probability.
提出了一种新型的毫米波雷达多级恒虚警率检测器。特别地,我们考虑调频连续波(FMCW)雷达。首先,我们在距离和多普勒维度上采用基于阶数统计量的检测器(OSD),作为粗检测过程获得潜在目标位置。其次,我们提出一种加权质心检测器(WCD),用于对OSD得到的距离-多普勒矩阵进行精细检测,该检测与噪声方差的知识无关。我们得到了OSD和WCD的虚警概率和检测阈值的解析表达式,并通过蒙特卡罗仿真对其进行了验证。通过合成数据和现实世界的实验数据,我们从接收机的工作特性和检测概率方面突出了所提出的探测器的有效性。
{"title":"A Multi-Stage Constant False-Alarm Rate Detector for Millimeter Wave Radars","authors":"G. Thiagarajan, Shilpa Hosur, Sanjeev Gurugopinath","doi":"10.1109/SPCOM55316.2022.9840827","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840827","url":null,"abstract":"We propose a novel multi-stage constant false alarm rate (CFAR) detector for millimeter wave radars. In particular, we consider the frequency modulated continuous wave (FMCW) radars. First, we employ order statistics-based detector (OSD) on the range and Doppler dimensions, to obtain potential target locations as a coarse detection procedure. Next, we propose a weighted centroid detector (WCD) for fine detection on the range-Doppler matrix obtained from OSD, which is agnostic to the knowledge of noise variance. We obtain analytical expressions for the probabilities of false-alarm and detection threshold for both OSD and WCD, which are validated using Monte Carlo simulations. Through synthetic data and real-world experimental data, we highlight the efficacy of the proposed detectors in terms of the receiver operating characteristics and detection probability.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127566141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Complexity Compression with Random Access 随机访问的低复杂度压缩
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840790
Srikanth Kamparaju, Shaik Mastan, Shashank Vatedka
We investigate the problem of variable-length compression with random access for stationary and ergodic sources, wherein short substrings of the raw file can be extracted from the compressed file without decompressing the entire file. It is possible to design compressors for sequences of length n that achieve compression rates close to the entropy rate of the source, and still be able to extract individual source symbols in time $theta(1)$ under the word-RAM model. In this article, we analyze a simple well-known approach used for compression with random access. We theoretically show that this is suboptimal, and design two simple compressors that simultaneously achieve entropy rate and constant-time random access. We then propose dictionary compression as a means to further improve performance, and experimentally validate this on various datasets.
我们研究了随机访问固定和遍历源的变长压缩问题,其中原始文件的短子字符串可以从压缩文件中提取,而无需解压缩整个文件。有可能为长度为n的序列设计压缩器,使其压缩率接近源的熵率,并且仍然能够在time $theta(1)$中提取单个源符号。在本文中,我们将分析一种用于随机访问压缩的简单方法。我们从理论上证明这是次优的,并设计了两个简单的压缩器,同时实现熵率和恒定时间随机访问。然后,我们提出字典压缩作为进一步提高性能的一种手段,并在各种数据集上进行实验验证。
{"title":"Low-Complexity Compression with Random Access","authors":"Srikanth Kamparaju, Shaik Mastan, Shashank Vatedka","doi":"10.1109/SPCOM55316.2022.9840790","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840790","url":null,"abstract":"We investigate the problem of variable-length compression with random access for stationary and ergodic sources, wherein short substrings of the raw file can be extracted from the compressed file without decompressing the entire file. It is possible to design compressors for sequences of length n that achieve compression rates close to the entropy rate of the source, and still be able to extract individual source symbols in time $theta(1)$ under the word-RAM model. In this article, we analyze a simple well-known approach used for compression with random access. We theoretically show that this is suboptimal, and design two simple compressors that simultaneously achieve entropy rate and constant-time random access. We then propose dictionary compression as a means to further improve performance, and experimentally validate this on various datasets.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124538655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modified U-Net Based Covid-19 Lesion Segmentation Using CT Scans 基于CT扫描改进U-Net的Covid-19病灶分割
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840780
K. G. Gopan, Pavan Sudeesh Peruru, N. Sinha
Computed Tomography (CT) based analysis will assist doctors in a prompt diagnosis of the Covid-19 infection. Automated segmentation of lesions in chest CT scans helps in determining the severity of the infection. The presented work addresses the task of automated segmentation of Covid-19 lesions. A U-Net framework incorporated with spatial-channel attention modules (contextual relationships), Atrous Spatial Pyramid Pooling module (a wider receptive field) and Deep Supervision (lesion focus, less error propagation) is proposed. Focal Tversky Loss is used to evaluate the outputs at coarser scales while Tversky loss evaluates the final segmentation output. This combination of losses is used to enhance segmentation of the small lesions. The framework is trained on CT scans of 20 subjects of COVID19 CT Lung and Infection Segmentation Dataset and tested on Mosmed dataset of 50 subjects, where infection has affected less than 25% of lung parenchyma. The experimental results show that the proposed method is effective in segmenting the hard ROIs in Mosmed data resulting in a mean Dice score of 0.57 (9% more than the state-of-the-art).
基于计算机断层扫描(CT)的分析将帮助医生及时诊断Covid-19感染。胸部CT扫描中病灶的自动分割有助于确定感染的严重程度。提出的工作解决了自动分割Covid-19病变的任务。提出了一个包含空间通道注意模块(上下文关系)、空间金字塔池模块(更宽的接受野)和深度监督(病灶聚焦,更少的错误传播)的U-Net框架。焦点Tversky Loss用于评估粗尺度下的输出,而Tversky Loss用于评估最终的分割输出。这种损失组合用于增强对小病变的分割。该框架在covid - 19 CT肺部和感染分割数据集的20名受试者的CT扫描上进行了训练,并在50名受试者的Mosmed数据集上进行了测试,其中感染影响的肺实质不到25%。实验结果表明,该方法在Mosmed数据中分割硬roi是有效的,平均Dice得分为0.57(比目前的方法高9%)。
{"title":"Modified U-Net Based Covid-19 Lesion Segmentation Using CT Scans","authors":"K. G. Gopan, Pavan Sudeesh Peruru, N. Sinha","doi":"10.1109/SPCOM55316.2022.9840780","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840780","url":null,"abstract":"Computed Tomography (CT) based analysis will assist doctors in a prompt diagnosis of the Covid-19 infection. Automated segmentation of lesions in chest CT scans helps in determining the severity of the infection. The presented work addresses the task of automated segmentation of Covid-19 lesions. A U-Net framework incorporated with spatial-channel attention modules (contextual relationships), Atrous Spatial Pyramid Pooling module (a wider receptive field) and Deep Supervision (lesion focus, less error propagation) is proposed. Focal Tversky Loss is used to evaluate the outputs at coarser scales while Tversky loss evaluates the final segmentation output. This combination of losses is used to enhance segmentation of the small lesions. The framework is trained on CT scans of 20 subjects of COVID19 CT Lung and Infection Segmentation Dataset and tested on Mosmed dataset of 50 subjects, where infection has affected less than 25% of lung parenchyma. The experimental results show that the proposed method is effective in segmenting the hard ROIs in Mosmed data resulting in a mean Dice score of 0.57 (9% more than the state-of-the-art).","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130904858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive Analysis on Apodized FBG for Quasi-Distributed Temperature-Strain Sensing 准分布温度应变传感的apozed FBG预测分析
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840764
Himadri Nirjhar Mandal, Soumya Sidhishwari
An apodized fiber Bragg grating (FBG) is designed for quasi-distributed sensing of temperature and strain due its various advantages particularly in hazardous environment. The main purpose of apodized FBG is to attain maximum reflectivity, narrow bandwidth and low level of side lobes, which are crucial for quasi-distributed sensing applications. Relationship between FBG properties and grating length have been explored to enhance and optimize the FBG. K Nearest Neighbors (KNN) algorithm is introduced for predictive analysis of FBG properties with different K values for the reliability of apodized FBG particularly for sensing applications. The optimal value of K has been identified for KNN by using various statistical techniques such as Mean Squared Error and Mean Absolute Error. Strong linearity has been obtained for both temperature and strain sensitivity of the designed apodized FBG. The optimized apodized FBG is utilized on wavelength division multiplexing (WDM) based quasi-distributed sensing system of four FBG signifying high reliability. High temperature and strain sensitivity ranges have been achieved in quasi-distributed sensing. The obtained ranges can be imposed in FBG-based sensing applications for monitoring of civil structure in hazardous environment.
apozed光纤布拉格光栅(FBG)由于其在危险环境下的各种优点,被设计用于温度和应变的准分布式传感。消光光纤光栅的主要目的是获得最大反射率,窄带宽和低电平的旁瓣,这是准分布式传感应用的关键。探讨了光纤光栅特性与光栅长度之间的关系,以增强和优化光纤光栅。引入K近邻(KNN)算法对不同K值的光纤光栅特性进行预测分析,以提高光纤光栅的可靠性,特别是在传感应用中。通过使用各种统计技术,如均方误差和平均绝对误差,已经确定了KNN的最佳K值。所设计的光电化光纤光栅的温度灵敏度和应变灵敏度均具有较强的线性关系。将优化后的光纤光栅应用于基于波分复用(WDM)的四光纤光栅准分布式传感系统,具有较高的可靠性。在准分布式传感中实现了较高的温度和应变灵敏度范围。所获得的范围可用于基于fbg的传感应用,用于监测危险环境中的土木结构。
{"title":"Predictive Analysis on Apodized FBG for Quasi-Distributed Temperature-Strain Sensing","authors":"Himadri Nirjhar Mandal, Soumya Sidhishwari","doi":"10.1109/SPCOM55316.2022.9840764","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840764","url":null,"abstract":"An apodized fiber Bragg grating (FBG) is designed for quasi-distributed sensing of temperature and strain due its various advantages particularly in hazardous environment. The main purpose of apodized FBG is to attain maximum reflectivity, narrow bandwidth and low level of side lobes, which are crucial for quasi-distributed sensing applications. Relationship between FBG properties and grating length have been explored to enhance and optimize the FBG. K Nearest Neighbors (KNN) algorithm is introduced for predictive analysis of FBG properties with different K values for the reliability of apodized FBG particularly for sensing applications. The optimal value of K has been identified for KNN by using various statistical techniques such as Mean Squared Error and Mean Absolute Error. Strong linearity has been obtained for both temperature and strain sensitivity of the designed apodized FBG. The optimized apodized FBG is utilized on wavelength division multiplexing (WDM) based quasi-distributed sensing system of four FBG signifying high reliability. High temperature and strain sensitivity ranges have been achieved in quasi-distributed sensing. The obtained ranges can be imposed in FBG-based sensing applications for monitoring of civil structure in hazardous environment.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128373001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Speaker Anonymization for Machines using Sinusoidal Model 基于正弦模型的机器说话人匿名化
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840792
Ayush Agarwal, Amitabh Swain, S. Prasanna
With the widespread use of speech technologies, speaker identity/voiceprint protection has become very important. Many methods have been proposed in the literature that protects the speaker’s identity either by modifying the voice or replacing it with another speaker’s identity. Both authentication systems and humans cannot recognize the speaker’s identity in those approaches. Changing the speaker identity of original speech cannot be used for the applications in which we want to conceal speaker identity from machine authentication and, at the same time, keep the speaker’s voice as it is. Noise addition methods have been proposed in the literature to address this issue. However, adding noise to the signal increases the irritation effect on speech perception. This paper proposes a sinusoidal model-based approach that solves this issue. The proposed method does not interfere with the originality of speech but, at the same time, protects the speaker’s identity for the automatic speaker verification (ASV) system by degrading its performance. The proposed approach’s anonymized speech is tested on the ASV system for TIMIT and IITG-MV datasets, and an equal error rate (EER) is reported. Intelligence tests like short-time objective intelligibility (STOI) and mean opinion score (MOS) is also done. By taking both EER and intelligibility tests together into consideration, it is shown that the proposed approach can solve the discussed issue.
随着语音技术的广泛应用,说话人身份/声纹保护变得非常重要。文献中提出了许多方法,通过修改声音或用另一个说话人的身份替换声音来保护说话人的身份。在这些方法中,认证系统和人类都无法识别说话者的身份。改变原始语音的说话人身份不能用于我们想要在机器认证中隐藏说话人身份的同时保持说话人的声音不变的应用。为了解决这个问题,文献中已经提出了添加噪声的方法。然而,在信号中加入噪声会增加对语音感知的刺激作用。本文提出了一种基于正弦模型的方法来解决这一问题。该方法在不影响语音原创性的同时,降低了ASV系统的性能,保护了说话人的身份。在针对TIMIT和IITG-MV数据集的ASV系统上对该方法的匿名语音进行了测试,得到了相等的错误率(EER)。智力测试如短时客观可理解性(STOI)和平均意见得分(MOS)也进行了。通过同时考虑EER和可理解性测试,表明该方法可以解决所讨论的问题。
{"title":"Speaker Anonymization for Machines using Sinusoidal Model","authors":"Ayush Agarwal, Amitabh Swain, S. Prasanna","doi":"10.1109/SPCOM55316.2022.9840792","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840792","url":null,"abstract":"With the widespread use of speech technologies, speaker identity/voiceprint protection has become very important. Many methods have been proposed in the literature that protects the speaker’s identity either by modifying the voice or replacing it with another speaker’s identity. Both authentication systems and humans cannot recognize the speaker’s identity in those approaches. Changing the speaker identity of original speech cannot be used for the applications in which we want to conceal speaker identity from machine authentication and, at the same time, keep the speaker’s voice as it is. Noise addition methods have been proposed in the literature to address this issue. However, adding noise to the signal increases the irritation effect on speech perception. This paper proposes a sinusoidal model-based approach that solves this issue. The proposed method does not interfere with the originality of speech but, at the same time, protects the speaker’s identity for the automatic speaker verification (ASV) system by degrading its performance. The proposed approach’s anonymized speech is tested on the ASV system for TIMIT and IITG-MV datasets, and an equal error rate (EER) is reported. Intelligence tests like short-time objective intelligibility (STOI) and mean opinion score (MOS) is also done. By taking both EER and intelligibility tests together into consideration, it is shown that the proposed approach can solve the discussed issue.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"194 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131167180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Conference on Signal Processing and Communications (SPCOM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1