首页 > 最新文献

2022 IEEE International Conference on Signal Processing and Communications (SPCOM)最新文献

英文 中文
Joint DOA Estimation in Spherical Harmonics Domain using Low Complexity CNN 基于低复杂度CNN的球谐波域联合DOA估计
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840853
Priyadarshini Dwivedi, Raj Prakash Gohil, Gyanajyoti Routray, Vishnuvardhan Varanasi, R. Hegde
Direction of arrival (DOA) estimation for multi-channel speech enhancement is a challenging problem. In this context, this paper proposes a new method for joint DOA estimation using a low complexity convolutional neural network (CNN) architecture. The spherical harmonic (SH) coefficients of the received speech signal are obtained from the spherical harmonics decomposition (SHD). The magnitude and phase features are extracted from these SH coefficients and combined as a single feature for training the CNN. A single CNN model is trained using these combined features in contrast to two CNN models used in earlier work. Both azimuth and elevation are then obtained for estimation of DOA from this single CNN. Extensive simulations are also conducted for the performance evaluation of the proposed low complexity CNN model. It is observed that the proposed CNN model provides robust DOA estimates at the various signal to noise ratios (SNR) and reverberation times with reduced computational complexity. Performance evaluated in terms of the gross error (GE) and run-time complexity also provides interesting results motivating the use of the proposed model in practical applications.
多通道语音增强的到达方向估计是一个具有挑战性的问题。在此背景下,本文提出了一种基于低复杂度卷积神经网络(CNN)架构的联合DOA估计新方法。对接收到的语音信号进行球谐波分解,得到其球谐波系数。从这些SH系数中提取幅值和相位特征,并将其合并为一个特征来训练CNN。与早期工作中使用的两个CNN模型相比,使用这些组合特征训练单个CNN模型。然后获得方位角和仰角,用于从该单个CNN估计DOA。本文还对所提出的低复杂度CNN模型进行了大量的仿真,以评估其性能。观察到,所提出的CNN模型在各种信噪比(SNR)和混响时间下提供了鲁棒的DOA估计,并且降低了计算复杂度。根据总误差(GE)和运行时复杂性评估的性能也提供了有趣的结果,激励在实际应用程序中使用所建议的模型。
{"title":"Joint DOA Estimation in Spherical Harmonics Domain using Low Complexity CNN","authors":"Priyadarshini Dwivedi, Raj Prakash Gohil, Gyanajyoti Routray, Vishnuvardhan Varanasi, R. Hegde","doi":"10.1109/SPCOM55316.2022.9840853","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840853","url":null,"abstract":"Direction of arrival (DOA) estimation for multi-channel speech enhancement is a challenging problem. In this context, this paper proposes a new method for joint DOA estimation using a low complexity convolutional neural network (CNN) architecture. The spherical harmonic (SH) coefficients of the received speech signal are obtained from the spherical harmonics decomposition (SHD). The magnitude and phase features are extracted from these SH coefficients and combined as a single feature for training the CNN. A single CNN model is trained using these combined features in contrast to two CNN models used in earlier work. Both azimuth and elevation are then obtained for estimation of DOA from this single CNN. Extensive simulations are also conducted for the performance evaluation of the proposed low complexity CNN model. It is observed that the proposed CNN model provides robust DOA estimates at the various signal to noise ratios (SNR) and reverberation times with reduced computational complexity. Performance evaluated in terms of the gross error (GE) and run-time complexity also provides interesting results motivating the use of the proposed model in practical applications.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116808919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Semi-Blind Technique for Frequency Selective Channel Estimation in Millimeter-Wave MIMO Coded FBMC System 毫米波MIMO编码FBMC系统中频率选择信道估计的半盲技术
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840787
Murali Krishna Pavuluri, Prem Singh, A. Jagannatham, V. Gadre
In this paper, a novel millimeter wave (mmWave) multiple-input multiple-output (MIMO) system based on coded filter bank multicarrier (FBMC) waveform is proposed. The coded FBMC modifies FBMC waveform such that it mitigates intrinsic interference by spreading the symbols in time. Next, a semi-blind channel estimation scheme is proposed for the mmWave MIMO coded FBMC system. The proposed semiblind estimation exploits the data symbols along with pilots to substantially enhance the mean square error (MSE) performance. Therefore, the proposed work is superior to the existing works in terms of accuracy of channel estimation. The simulation results indicate improved MSE performance of the proposed semi-blind scheme in comparison to the existing conventional training based channel estimation techniques.
本文提出了一种基于编码滤波器组多载波(FBMC)波形的毫米波多输入多输出(MIMO)系统。编码的FBMC修改了FBMC波形,从而通过在时间上扩展符号来减轻内在干扰。其次,针对毫米波MIMO编码FBMC系统,提出了一种半盲信道估计方案。所提出的半盲估计利用数据符号和导频,大大提高了均方误差(MSE)的性能。因此,本文提出的方法在信道估计精度方面优于现有的方法。仿真结果表明,与传统的基于训练的信道估计技术相比,所提半盲方案的MSE性能有所提高。
{"title":"Semi-Blind Technique for Frequency Selective Channel Estimation in Millimeter-Wave MIMO Coded FBMC System","authors":"Murali Krishna Pavuluri, Prem Singh, A. Jagannatham, V. Gadre","doi":"10.1109/SPCOM55316.2022.9840787","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840787","url":null,"abstract":"In this paper, a novel millimeter wave (mmWave) multiple-input multiple-output (MIMO) system based on coded filter bank multicarrier (FBMC) waveform is proposed. The coded FBMC modifies FBMC waveform such that it mitigates intrinsic interference by spreading the symbols in time. Next, a semi-blind channel estimation scheme is proposed for the mmWave MIMO coded FBMC system. The proposed semiblind estimation exploits the data symbols along with pilots to substantially enhance the mean square error (MSE) performance. Therefore, the proposed work is superior to the existing works in terms of accuracy of channel estimation. The simulation results indicate improved MSE performance of the proposed semi-blind scheme in comparison to the existing conventional training based channel estimation techniques.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"184 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114402216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secrecy Performance with Optimal Relay and Antenna Selection in Spectrum-Sharing Networks 频谱共享网络中中继和天线选择最优的保密性能
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840825
P. Das, Pradyumna Hegade
This paper investigates the physical layer secrecy performance for an underlay wiretap spectrum sharing network with optimal relay and antenna selection (ORAS). It jointly selects a transmit antenna at a secondary source, a receive antenna at its destination, and a relay between them to maximize the secrecy rate at the destination under a passive eavesdropping scenario. We derive novel, exact expressions for secrecy outage probability (SOP) and secrecy throughput (ST) in a single integral-form. We also provide their accurate approximations in closed-form. Considering three distinct scenarios depending on whether the main link is stronger than the wiretap link and the nature of the primary interference constraint, we obtain different useful system design insights. Under a proportional interference constraint, secrecy diversity gain is achieved only when the main link is stronger than the wiretap link, otherwise the diversity gain is lost and leads to an SOP floor. Under a fixed interference constraint, the SOP floor is further increased. Through numerical results, we show the impact of different signal combining strategies by the eavesdropper on the SOP and ST. We also illustrate the efficacy of the ORAS scheme as compared to other relay and antenna selection schemes.
研究了一种具有最优中继和天线选择(ORAS)的底层窃听频谱共享网络的物理层保密性能。在被动窃听情况下,联合选择二次源处的发射天线和目的处的接收天线,并在两者之间选择中继,以最大限度地提高目的处的保密率。我们在单一积分形式中导出了保密中断概率(SOP)和保密吞吐量(ST)的新颖精确表达式。我们还以封闭形式提供了它们的精确近似值。根据主链路是否比窃听链路强以及主干扰约束的性质,考虑三种不同的场景,我们获得了不同的有用的系统设计见解。在比例干扰约束下,只有当主链路比窃听链路强时才能获得保密分集增益,否则分集增益将丢失并导致SOP层。在一定干扰约束下,SOP下限进一步增大。通过数值结果,我们显示了窃听者不同的信号组合策略对SOP和st的影响,并说明了与其他中继和天线选择方案相比,ORAS方案的有效性。
{"title":"Secrecy Performance with Optimal Relay and Antenna Selection in Spectrum-Sharing Networks","authors":"P. Das, Pradyumna Hegade","doi":"10.1109/SPCOM55316.2022.9840825","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840825","url":null,"abstract":"This paper investigates the physical layer secrecy performance for an underlay wiretap spectrum sharing network with optimal relay and antenna selection (ORAS). It jointly selects a transmit antenna at a secondary source, a receive antenna at its destination, and a relay between them to maximize the secrecy rate at the destination under a passive eavesdropping scenario. We derive novel, exact expressions for secrecy outage probability (SOP) and secrecy throughput (ST) in a single integral-form. We also provide their accurate approximations in closed-form. Considering three distinct scenarios depending on whether the main link is stronger than the wiretap link and the nature of the primary interference constraint, we obtain different useful system design insights. Under a proportional interference constraint, secrecy diversity gain is achieved only when the main link is stronger than the wiretap link, otherwise the diversity gain is lost and leads to an SOP floor. Under a fixed interference constraint, the SOP floor is further increased. Through numerical results, we show the impact of different signal combining strategies by the eavesdropper on the SOP and ST. We also illustrate the efficacy of the ORAS scheme as compared to other relay and antenna selection schemes.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114709225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Noisy Student Teacher Training with Self Supervised Learning for Children ASR 基于自监督学习的吵闹学生教师对儿童ASR的培训
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840763
Shreya S. Chaturvedi, Hardik B. Sailor, H. Patil
Automatic Speech Recognition (ASR) is a fast-growing field, where reliable systems are made for high resource languages and for adult’s speech. However, performance of such ASR system is inefficient for children speech, due to numerous acoustic variability in children speech and scarcity of resources. In this paper, we propose to use the unlabeled data extensively to develop ASR system for low resourced children speech. State-of-the-art wav2vec 2.0 is the baseline ASR technique used here. The baseline’s performance is further enhanced with the intuition of Noisy Student Teacher (NST) learning. The proposed technique is not only limited to introducing the use of soft labels (i.e., word-level transcription) of unlabeled data, but also adapts the learning of teacher model or preceding student model, which results in reduction of the redundant training significantly. To that effect, a detailed analysis is reported in this paper, as there is a difference in teacher and student learning. In ASR experiments, character-level tokenization was used and hence, Connectionist Temporal Classification (CTC) loss was used for fine-tuning. Due to computational limitations, experiments are performed with approximately 12 hours of training, and 5 hours of development and test data was used from standard My Science Tutor (MyST) corpus. The baseline wav2vec 2.0 achieves 34% WER, while relatively 10% of performance was improved using the proposed approach. Further, the analysis of performance loss and effect of language model is discussed in details.
自动语音识别(ASR)是一个快速发展的领域,它为高资源语言和成人语音建立了可靠的系统。然而,由于儿童语音存在大量的声学变异性和资源的稀缺性,这种ASR系统对儿童语音的表现效率较低。在本文中,我们建议广泛使用未标记的数据来开发低资源儿童语音的ASR系统。最先进的wav2vec 2.0是这里使用的基线ASR技术。噪声学生教师(NST)学习的直觉进一步增强了基线的性能。所提出的技术不仅局限于引入对未标记数据的软标签(即词级转录)的使用,而且还适应了教师模型或前学生模型的学习,从而显著减少了冗余训练。鉴于教师和学生的学习存在差异,本文对此进行了详细的分析。在ASR实验中,使用了字符级标记化,因此,使用连接时间分类(CTC)损失进行微调。由于计算的限制,实验进行了大约12小时的训练,5小时的开发和测试数据来自标准的My Science Tutor (MyST)语料库。基线wav2vec 2.0实现了34%的WER,而使用所提出的方法提高了相对10%的性能。在此基础上,详细分析了语言模型的性能损失和影响。
{"title":"Noisy Student Teacher Training with Self Supervised Learning for Children ASR","authors":"Shreya S. Chaturvedi, Hardik B. Sailor, H. Patil","doi":"10.1109/SPCOM55316.2022.9840763","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840763","url":null,"abstract":"Automatic Speech Recognition (ASR) is a fast-growing field, where reliable systems are made for high resource languages and for adult’s speech. However, performance of such ASR system is inefficient for children speech, due to numerous acoustic variability in children speech and scarcity of resources. In this paper, we propose to use the unlabeled data extensively to develop ASR system for low resourced children speech. State-of-the-art wav2vec 2.0 is the baseline ASR technique used here. The baseline’s performance is further enhanced with the intuition of Noisy Student Teacher (NST) learning. The proposed technique is not only limited to introducing the use of soft labels (i.e., word-level transcription) of unlabeled data, but also adapts the learning of teacher model or preceding student model, which results in reduction of the redundant training significantly. To that effect, a detailed analysis is reported in this paper, as there is a difference in teacher and student learning. In ASR experiments, character-level tokenization was used and hence, Connectionist Temporal Classification (CTC) loss was used for fine-tuning. Due to computational limitations, experiments are performed with approximately 12 hours of training, and 5 hours of development and test data was used from standard My Science Tutor (MyST) corpus. The baseline wav2vec 2.0 achieves 34% WER, while relatively 10% of performance was improved using the proposed approach. Further, the analysis of performance loss and effect of language model is discussed in details.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127832116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Effective Sample Complexity for Exact Sparse Recovery from Sequential Linear Measurements 序列线性测量精确稀疏恢复的有效样本复杂度
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840826
S. Mukhopadhyay
In this paper we consider the problem of exact recovery of a fixed sparse vector from sequentially arriving measurements. We assume that the measurements are generated by a linear model with time varying matrices and both the measurement vector as well as the matrix at each time are made available. However, we assume that the underlying unknown sparse vector is fixed during the time of interest. We prove that if the measurement matrices are i.i.d. subGaussian, the iterates produced by the popular iterative hard thresholding (IHT) algorithm can converge to the exact sparse vector with high probability if a certain function of the sample complexities of the time varying measurements, which we call effective sample complexity satisfies certain lower bound dependent on K,N, the sparsity and the length of the unknown vector, respectively. Interestingly, this bound reveals that the probability that the estimation error at the end of some instant is small enough, is hardly affected even if very small number measurements are used at sporadically chosen time instances. We also corroborate this theoretical result with numerical experiments which demonstrate that the conventional IHT can enjoy greater probability of recovery by occasionally using far lesser number of measurements than that required for successful recovery with offline IHT with fixed measurement matrix.
本文研究了从顺序到达的测量值中精确恢复固定稀疏向量的问题。我们假设测量是由具有时变矩阵的线性模型产生的,并且每次的测量向量和矩阵都是可用的。然而,我们假设底层未知稀疏向量在感兴趣的时间内是固定的。我们证明了当测量矩阵是iid次高斯矩阵时,如果时变测量的样本复杂度的某个函数(我们称之为有效样本复杂度)分别满足与未知向量的K、N、稀疏度和长度相关的某个下界,那么流行的迭代硬阈值(IHT)算法产生的迭代可以高概率收敛到精确稀疏向量。有趣的是,这个界限表明,即使在零星选择的时间实例中使用非常少量的测量,在某些瞬间结束时估计误差足够小的概率也几乎不受影响。我们还用数值实验证实了这一理论结果,结果表明,与使用固定测量矩阵的离线IHT成功恢复所需的测量次数相比,常规IHT偶尔使用的测量次数要少得多,可以获得更大的恢复概率。
{"title":"On the Effective Sample Complexity for Exact Sparse Recovery from Sequential Linear Measurements","authors":"S. Mukhopadhyay","doi":"10.1109/SPCOM55316.2022.9840826","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840826","url":null,"abstract":"In this paper we consider the problem of exact recovery of a fixed sparse vector from sequentially arriving measurements. We assume that the measurements are generated by a linear model with time varying matrices and both the measurement vector as well as the matrix at each time are made available. However, we assume that the underlying unknown sparse vector is fixed during the time of interest. We prove that if the measurement matrices are i.i.d. subGaussian, the iterates produced by the popular iterative hard thresholding (IHT) algorithm can converge to the exact sparse vector with high probability if a certain function of the sample complexities of the time varying measurements, which we call effective sample complexity satisfies certain lower bound dependent on K,N, the sparsity and the length of the unknown vector, respectively. Interestingly, this bound reveals that the probability that the estimation error at the end of some instant is small enough, is hardly affected even if very small number measurements are used at sporadically chosen time instances. We also corroborate this theoretical result with numerical experiments which demonstrate that the conventional IHT can enjoy greater probability of recovery by occasionally using far lesser number of measurements than that required for successful recovery with offline IHT with fixed measurement matrix.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125979931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring High Spectro-Temporal Resolution for Alzheimer’s Dementia Detection 探索阿尔茨海默氏痴呆症检测的高光谱时间分辨率
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840847
Nayan Anand Vats, Purva Barche, Mirishkar Sai Ganesh, A. Vuppala
Alzheimer’s Dementia is a progressive neurological disorder characterized by cognitive impairment. It affects memory, thinking skills, language, and the ability to perform simple tasks. Detection of Alzheimer’s Dementia from the speech is considered a primitive task, as most speech cues are preserved in it. Studies in the literature focused mainly on the lexical features and few acoustic features for detecting Alzheimer’s disease. The present work explores the single frequency filtering cepstral coefficients (SFCC) for the automatic detection of Alzheimer’s disease. In contrast to STFTs, the proposed feature has better temporal and spectral resolution and captures the transient part more appropriately. This offers a very compact and efficient way to derive the formant structure in the speech signal. The experiments were conducted on the ADReSSo dataset, using the support vector machine classifier. The classification performance was compared with several baseline features like Mel-frequency cepstral coefficients (MFCC), perceptual linear prediction (PLP), linear prediction cepstral coefficient (LPCC), Mel frequency cepstral coefficients of LP-residual (MFCC-WR), ZFF signal (MFCC-ZF) and eGeMAPS (openSMILE). The experiments conducted on Alzheimer’s Dementia classification task show that the proposed feature performs better than conventional MFCCs. Among all the features, SFCC offers the best classification accuracy of 65.1% and 60.6% for dementia detection on cross-validation and test data, respectively. The combination of baseline features with SFCC features further improved the performance.
老年痴呆症是一种以认知障碍为特征的进行性神经系统疾病。它会影响记忆力、思维能力、语言和执行简单任务的能力。从言语中检测阿尔茨海默氏症被认为是一项原始任务,因为大多数言语线索都保存在言语中。文献研究主要集中在阿尔茨海默病的词汇特征,很少有声学特征。本研究探讨了用于阿尔茨海默病自动检测的单频滤波倒谱系数(SFCC)。与stft相比,所提出的特征具有更好的时间和光谱分辨率,并且更合适地捕获瞬态部分。这提供了一种非常紧凑和有效的方法来推导语音信号的形成峰结构。实验在ADReSSo数据集上进行,使用支持向量机分类器。与Mel-频率倒谱系数(MFCC)、感知线性预测(PLP)、线性预测倒谱系数(LPCC)、Mel- lp -残差频率倒谱系数(MFCC- wr)、ZFF信号(MFCC- zf)和eGeMAPS (openSMILE)等基线特征进行分类性能比较。在阿尔茨海默氏痴呆症分类任务上进行的实验表明,所提出的特征比传统的MFCCs具有更好的性能。其中,在交叉验证和测试数据上,SFCC对痴呆的检测准确率最高,分别为65.1%和60.6%。基线特征与SFCC特征的结合进一步提高了性能。
{"title":"Exploring High Spectro-Temporal Resolution for Alzheimer’s Dementia Detection","authors":"Nayan Anand Vats, Purva Barche, Mirishkar Sai Ganesh, A. Vuppala","doi":"10.1109/SPCOM55316.2022.9840847","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840847","url":null,"abstract":"Alzheimer’s Dementia is a progressive neurological disorder characterized by cognitive impairment. It affects memory, thinking skills, language, and the ability to perform simple tasks. Detection of Alzheimer’s Dementia from the speech is considered a primitive task, as most speech cues are preserved in it. Studies in the literature focused mainly on the lexical features and few acoustic features for detecting Alzheimer’s disease. The present work explores the single frequency filtering cepstral coefficients (SFCC) for the automatic detection of Alzheimer’s disease. In contrast to STFTs, the proposed feature has better temporal and spectral resolution and captures the transient part more appropriately. This offers a very compact and efficient way to derive the formant structure in the speech signal. The experiments were conducted on the ADReSSo dataset, using the support vector machine classifier. The classification performance was compared with several baseline features like Mel-frequency cepstral coefficients (MFCC), perceptual linear prediction (PLP), linear prediction cepstral coefficient (LPCC), Mel frequency cepstral coefficients of LP-residual (MFCC-WR), ZFF signal (MFCC-ZF) and eGeMAPS (openSMILE). The experiments conducted on Alzheimer’s Dementia classification task show that the proposed feature performs better than conventional MFCCs. Among all the features, SFCC offers the best classification accuracy of 65.1% and 60.6% for dementia detection on cross-validation and test data, respectively. The combination of baseline features with SFCC features further improved the performance.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122720837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy-Based Maximum Likelihood Detector for GSSK in MIMO-ABC Systems 基于能量的MIMO-ABC系统GSSK极大似然检测器
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840815
Ashwini H. Raghavendra, Anagha K. Kowshik, Sanjeev Gurugopinath, S. Muhaidat, C. Tellambura
We propose a novel, low complexity energy-based maximum likelihood (EML) detector for a generalized space shift keying (GSSK)-enabled multiple-input multiple-output (MIMO) ambient backscatter communication (ABC) system. The proposed scheme exploits the multiple antenna structure of the system to achieve a lower error rate performance than the conventional single-antenna ABC systems. The proposed EML GSSK detector does not require the perfect knowledge of the ambient source signal. To gain insights into the performance of the proposed scheme, we derive the exact pairwise error probability (PEP) of the EML detector, and further obtain an upper bound on the probability of error. We also derive a simple asymptotic PEP expression, as the number of antennas of the reader becomes large. We validate our analysis through Monte Carlo simulations, and show that the performance loss due to the approximations employed in our analysis is small. The performance of EML detector is also compared with the conventional ML detector and the loss in performance is studied.
我们提出了一种新颖的、低复杂度的基于能量的最大似然(EML)探测器,用于支持广义空间移位键控(GSSK)的多输入多输出(MIMO)环境反向散射通信(ABC)系统。该方案利用系统的多天线结构,实现了比传统单天线ABC系统更低的误码率性能。提出的EML GSSK检测器不需要完全了解环境源信号。为了深入了解该方案的性能,我们推导了EML检测器的精确成对错误概率(PEP),并进一步得到了错误概率的上界。我们还推导了一个简单的渐近PEP表达式,当阅读器的天线数量变大时。我们通过蒙特卡罗模拟验证了我们的分析,并表明由于我们的分析中采用的近似导致的性能损失很小。将EML检测器的性能与传统的ML检测器进行了比较,并研究了其性能损失。
{"title":"Energy-Based Maximum Likelihood Detector for GSSK in MIMO-ABC Systems","authors":"Ashwini H. Raghavendra, Anagha K. Kowshik, Sanjeev Gurugopinath, S. Muhaidat, C. Tellambura","doi":"10.1109/SPCOM55316.2022.9840815","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840815","url":null,"abstract":"We propose a novel, low complexity energy-based maximum likelihood (EML) detector for a generalized space shift keying (GSSK)-enabled multiple-input multiple-output (MIMO) ambient backscatter communication (ABC) system. The proposed scheme exploits the multiple antenna structure of the system to achieve a lower error rate performance than the conventional single-antenna ABC systems. The proposed EML GSSK detector does not require the perfect knowledge of the ambient source signal. To gain insights into the performance of the proposed scheme, we derive the exact pairwise error probability (PEP) of the EML detector, and further obtain an upper bound on the probability of error. We also derive a simple asymptotic PEP expression, as the number of antennas of the reader becomes large. We validate our analysis through Monte Carlo simulations, and show that the performance loss due to the approximations employed in our analysis is small. The performance of EML detector is also compared with the conventional ML detector and the loss in performance is studied.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133577572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Wideband Bandpass Filter using U-shaped slots on SIW with two Notches at 8 GHz and 10 GHz 一种宽带带通滤波器,在SIW上使用u形槽,在8ghz和10ghz有两个陷波
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840774
Shameer Keelaillam, Saragadam Siva Teja, Tadikonda Jayanth, Ramapuram Ajay Reddy, Kanumetta Bhargava Ramanujam, L. Kumar
In this paper, a wideband band pass filter (BPF) with two notches is presented using a substrate integrated waveguide (SIW). The BPF consists of U-shaped slots at the top of SIW with an integrated digital capacitor (IDC). The Wideband BPF has a bandwidth of 9.25 GHz from 5.86 GHz to 15.1 GHz. In the pass band, small U-shape slots are inserted inside the bigger U-shape slots and with the help of lumped capacitors with values of 0.2 pF and 0.4 pF notches are created at 8 GHz and 10 GHz. The proposed BPF with two notch has the insertion loss of 0.5 dB, and a return loss of 10 dB in the pass band, bandwidth of 9.14 GHz from 6.46 GHz to 15.6 GHz.
本文提出了一种基于衬底集成波导的双陷波宽带通滤波器(BPF)。BPF由SIW顶部的u形槽组成,带有集成数字电容(IDC)。宽带BPF在5.86 GHz ~ 15.1 GHz范围内的带宽为9.25 GHz。在通频带中,在较大的u型槽内插入小的u型槽,并在8 GHz和10 GHz下创建值为0.2 pF和0.4 pF的集总电容器。该双陷波带在6.46 GHz至15.6 GHz范围内的通带带宽为9.14 GHz,插入损耗为0.5 dB,回波损耗为10 dB。
{"title":"A Wideband Bandpass Filter using U-shaped slots on SIW with two Notches at 8 GHz and 10 GHz","authors":"Shameer Keelaillam, Saragadam Siva Teja, Tadikonda Jayanth, Ramapuram Ajay Reddy, Kanumetta Bhargava Ramanujam, L. Kumar","doi":"10.1109/SPCOM55316.2022.9840774","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840774","url":null,"abstract":"In this paper, a wideband band pass filter (BPF) with two notches is presented using a substrate integrated waveguide (SIW). The BPF consists of U-shaped slots at the top of SIW with an integrated digital capacitor (IDC). The Wideband BPF has a bandwidth of 9.25 GHz from 5.86 GHz to 15.1 GHz. In the pass band, small U-shape slots are inserted inside the bigger U-shape slots and with the help of lumped capacitors with values of 0.2 pF and 0.4 pF notches are created at 8 GHz and 10 GHz. The proposed BPF with two notch has the insertion loss of 0.5 dB, and a return loss of 10 dB in the pass band, bandwidth of 9.14 GHz from 6.46 GHz to 15.6 GHz.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114077090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Cognitive Molecular Communication Inside a Cylindrical Diffusive Channel 圆柱形扩散通道内的认知分子通信
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840758
Shivani Dhok, P. Peshwe, Prabhat Kumar Sharma
In this paper, we consider an underlay cognitive molecular communication (MC) system inside a cylindrical diffusive-channel. We consider a primary and a secondary link operating simultaneously inside that channel, with the primary link having a higher priority over the secondary link. Based on the co-channel interference (CCI) at the primary receiver, the number of molecules transmitted by the secondary transmitter is controlled. The performance of the channel is analyzed based on the probability of error and the effects of the interference limits are analyzed. The Monte-Carlo simulations are presented for validating the derived expressions.
在本文中,我们考虑了一个圆柱形扩散通道内的底层认知分子通信(MC)系统。我们考虑在该通道内同时运行主链路和辅助链路,其中主链路比辅助链路具有更高的优先级。基于主接收机的共信道干扰(CCI),控制二次发射机发射的分子数。基于误差概率分析了信道的性能,并分析了干扰限制的影响。给出了蒙特卡罗模拟来验证推导出的表达式。
{"title":"Cognitive Molecular Communication Inside a Cylindrical Diffusive Channel","authors":"Shivani Dhok, P. Peshwe, Prabhat Kumar Sharma","doi":"10.1109/SPCOM55316.2022.9840758","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840758","url":null,"abstract":"In this paper, we consider an underlay cognitive molecular communication (MC) system inside a cylindrical diffusive-channel. We consider a primary and a secondary link operating simultaneously inside that channel, with the primary link having a higher priority over the secondary link. Based on the co-channel interference (CCI) at the primary receiver, the number of molecules transmitted by the secondary transmitter is controlled. The performance of the channel is analyzed based on the probability of error and the effects of the interference limits are analyzed. The Monte-Carlo simulations are presented for validating the derived expressions.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128249493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Instantaneous Fundamental Frequency Estimation from Speech using Fourier Decomposition Method 基于傅立叶分解方法的语音瞬时基频估计
Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840851
Pushpendra Singh, Amit Singhal, Binish Fatimah
Speech analysis and various speech processing applications use instantaneous fundamental frequency $(F_{0})$ of voiced speech signal as a prime acoustic parameter. The low frequency component of the voiced speech possesses most of the energy in F0 neighbourhood and its few harmonics. In this study, a novel approach is proposed to extract the instantaneous F0 component from a voiced speech signal using Fourier decomposition method (FDM), which decomposes the signal into its amplitude-frequency modulated (AM-FM) components. We also demonstrate that these derived AM-FM components, obtained due to desired frequency band decomposition property of FDM, provides the most suitable representation for voiced speech as compared to other AM-FM models available in the literature. Numerical results are presented to validate the adequacy of proposed method in estimating F0, when compared with existing algorithms based on empirical mode decomposition (EMD) and other speech-related algorithms.
语音分析和各种语音处理应用使用浊音信号的瞬时基频$(F_{0})$作为主要声学参数。浊音的低频分量占有F0邻域的大部分能量,其谐波较少。本研究提出了一种利用傅立叶分解方法(FDM)从语音语音信号中提取瞬时F0分量的新方法,该方法将语音信号分解为其幅频调制(AM-FM)分量。我们还证明,与文献中可用的其他AM-FM模型相比,由于FDM所需的频带分解特性而获得的这些衍生AM-FM分量为浊音语音提供了最合适的表示。通过与现有基于经验模态分解(EMD)和其他语音相关算法的比较,给出了数值结果,验证了该方法在估计F0方面的充分性。
{"title":"Instantaneous Fundamental Frequency Estimation from Speech using Fourier Decomposition Method","authors":"Pushpendra Singh, Amit Singhal, Binish Fatimah","doi":"10.1109/SPCOM55316.2022.9840851","DOIUrl":"https://doi.org/10.1109/SPCOM55316.2022.9840851","url":null,"abstract":"Speech analysis and various speech processing applications use instantaneous fundamental frequency $(F_{0})$ of voiced speech signal as a prime acoustic parameter. The low frequency component of the voiced speech possesses most of the energy in F0 neighbourhood and its few harmonics. In this study, a novel approach is proposed to extract the instantaneous F0 component from a voiced speech signal using Fourier decomposition method (FDM), which decomposes the signal into its amplitude-frequency modulated (AM-FM) components. We also demonstrate that these derived AM-FM components, obtained due to desired frequency band decomposition property of FDM, provides the most suitable representation for voiced speech as compared to other AM-FM models available in the literature. Numerical results are presented to validate the adequacy of proposed method in estimating F0, when compared with existing algorithms based on empirical mode decomposition (EMD) and other speech-related algorithms.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132842079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Conference on Signal Processing and Communications (SPCOM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1