2020 International Conference on Signal Processing and Communications (SPCOM): Latest Publications

SBL-Based Hybrid Precoder/Combiner Design for Power and Spectrally Efficient Millimeter Wave MIMO Systems
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179621
Suraj Srivastava, Amrita Mishra, A. Jagannatham, G. Ascheid
This work proposes a novel sparse Bayesian learning (SBL)-based hybrid precoder/combiner design scheme for millimeter wave (mmWave) MIMO systems. To this end, a multiple measurement vector (MMV)-based sparse signal recovery problem is formulated that maximizes the mutual information by approximating the ideal digital precoder with the hybrid precoder. A unique aspect of the proposed SBL-based scheme is that the resulting hyperparameter estimates can be used to activate the minimum number of RF chains required to approximate the ideal digital precoder/combiner, which allows the design to exploit the time-varying multipath profile of the underlying mmWave MIMO channel. This feature, coupled with the improved sparse signal recovery ability of SBL, yields significantly higher power and spectral efficiency than conventional schemes, which activate a fixed number of RF chains and data streams irrespective of the multipath profile of the mmWave MIMO channel. Simulation results demonstrate the improved efficiency of the proposed scheme relative to existing schemes, as well as the resulting reduction in the average number of RF chains employed.
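As a rough illustration of how hyperparameter estimates could drive RF chain activation, the sketch below keeps only those candidate beams whose hyperparameter exceeds a fraction of the largest one. The function name `select_rf_chains`, the relative threshold `rel_tol`, the optional cap `max_chains`, and the sample values are all assumptions made for illustration; the abstract does not specify the actual selection rule.

```python
def select_rf_chains(gammas, rel_tol=0.01, max_chains=None):
    """Return indices of candidate beams to activate, strongest first.

    gammas: hypothetical per-beam hyperparameter estimates (non-negative).
    rel_tol: assumed pruning threshold, relative to the largest estimate.
    """
    if not gammas:
        return []
    peak = max(gammas)
    if peak <= 0:
        return []
    # Keep beams whose hyperparameter is non-negligible relative to the peak.
    active = [i for i, g in enumerate(gammas) if g >= rel_tol * peak]
    active.sort(key=lambda i: gammas[i], reverse=True)
    if max_chains is not None:
        active = active[:max_chains]
    return active


# Illustrative values only: two dominant paths, the rest pruned.
example_gammas = [0.0, 2.5, 1e-6, 0.9, 0.0, 3e-4]
print(select_rf_chains(example_gammas))
```

With these made-up values only two of the six candidates survive, so the number of active RF chains follows the sparsity of the estimate rather than a fixed budget.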
Citations: 8
Coexistence of LTE-Unlicensed and WiFi: Optimization and Game-Theoretic Frameworks
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179532
Chaitanya Amballa, K. P. Naveen
In this work, we propose detailed mathematical models to address the problem of achieving coexistence between LTE and WiFi technologies. Specifically, we first formulate a utility maximization problem, where our key contribution lies in identifying the necessary and sufficient set of constraints (including an interesting clique constraint) that arise when timesharing the available WiFi spectrum among the LTE base stations. We then use game theory to model scenarios where the LTE base stations compete; the solution is characterized in terms of constrained Nash equilibria. We also obtain simplified formulations (both optimization and game-theoretic) by replacing the computationally intensive clique constraints with a simpler set of connectivity constraints. Finally, we conduct a numerical study to evaluate the performance of all the proposed schemes.
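To make the clique constraint concrete, the sketch below assumes it takes a common form: the airtime fractions of base stations that mutually interfere (a clique in an interference graph) must sum to at most one. That reading, the graph, the share values, and the function names are assumptions for illustration; the abstract does not give the exact formulation. Enumerating maximal cliques (here with the basic Bron-Kerbosch recursion) is what makes such constraints expensive on large graphs.

```python
def maximal_cliques(adj):
    """Enumerate maximal cliques of an undirected graph (basic Bron-Kerbosch)."""
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidates, x: already-processed vertices.
        if not p and not x:
            cliques.append(sorted(r))
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return sorted(cliques)


def satisfies_clique_constraints(adj, share, tol=1e-9):
    """Check that airtime shares sum to at most 1 within every maximal clique."""
    return all(sum(share[v] for v in c) <= 1.0 + tol for c in maximal_cliques(adj))


# Hypothetical interference graph: A, B, C mutually interfere; D interferes with C only.
adj = {"A": {"B", "C"}, "B": {"A", "C"}, "C": {"A", "B", "D"}, "D": {"C"}}
share = {"A": 0.3, "B": 0.3, "C": 0.4, "D": 0.6}
print(maximal_cliques(adj))
print(satisfies_clique_constraints(adj, share))
```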
Citations: 2
Gaussian Broadcast Channel with State Estimation
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179635
Viswanathan Ramachandran
A state amplification problem, where the common additive state to a Gaussian broadcast channel (BC) is to be estimated at both receivers, is considered. The state process, known non-causally at the encoder, is assumed to be independent and identically distributed (i.i.d.) Gaussian. Both receivers must estimate the state process to within acceptable squared error distortion limits. In addition to the state estimation, our setting also requires message transmission to the stronger receiver at a given rate. We are interested in the optimal trade-offs between the distortions incurred at the receivers when a message at a given rate is to be delivered from the encoder to the strong receiver. A complete characterization of the rate-distortion trade-off region is presented. Our result differs from a recent result where an additional common reconstruction constraint was imposed on the state estimates in the same setting, and it was observed that allowing the weak user to decode part of the private message to the stronger user helps the distortion trade-offs.
Citations: 0
Equalizer-Free Clock Recovery for PAM-4 Optical Interconnects
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179629
Kaushal Patel, Rakesh Ashok, Shalabh Gupta
One of the critical operations in high-speed serial link receiver design is recovering the clock embedded in the received data signal. If the received signal eye is not open, clock recovery becomes challenging. We present full-rate and half-rate clock recovery architectures for unequalized input signals with PAM-4 modulation. The proposed architectures comprise independent frequency and phase recovery loops that lock the frequency and phase of the voltage-controlled oscillator to those of the incoming signal. The architectures are validated using Verilog-A simulations of 100 Gbps PAM-4 signals extracted from optical fiber links of different lengths. The system can recover the clock from data obtained over a 1 km standard single-mode fiber link, which otherwise gives a completely closed eye at the receiver input.
Citations: 0
Dendrogram based Clustering and Separation of Individual and Simultaneously Active Incipient Discharges in Transformer Insulation
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179572
Niyas K. Haneefa, B. M. A. Desai, R. Sarathi, Manivasakan Rathinam
Partial discharges in transformer insulation are a major concern for utilities because they can cause catastrophic insulation failure. One of the major challenges is identifying discharges from multiple sources when they occur concurrently. Hence it is imperative to devise methods for identifying and separating those signals so that corrective measures can be taken. In this study, an unsupervised learning approach is proposed for clustering individual partial discharge signals and then using that information to separate the multi-source signals. Our clustering approach constructs a dendrogram by measuring the cosine similarity between the feature vectors and then computes a threshold to group the individual source signals into different clusters. The feature vectors include the relative energies from the wavelet packet decomposition tree and the Higuchi fractal dimension of the wavelet coefficients at the terminal nodes. A classifier model is then trained on the generated clusters to separate the individual and multi-source signals. The proposed approach is a simple and robust technique for individual cluster groupings and individual-to-multiclass separations, and could be used for multiclass cluster groupings.
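The dendrogram-and-threshold step described above can be sketched as a small agglomerative clustering routine. This is a minimal illustration, assuming average linkage, a fixed cut threshold, non-zero feature vectors, and toy three-dimensional features; the abstract does not state the linkage rule or how the threshold is computed, and the function names are hypothetical.

```python
import math


def cosine_distance(u, v):
    """1 - cosine similarity; assumes both vectors are non-zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def cluster_by_threshold(features, threshold):
    """Merge clusters bottom-up (average linkage) until the closest pair
    is farther apart than `threshold`, i.e. cut the dendrogram there."""
    clusters = [[i] for i in range(len(features))]

    def linkage(c1, c2):
        total = sum(cosine_distance(features[i], features[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        dist, a, b = min(
            (linkage(clusters[a], clusters[b]), a, b)
            for a in range(len(clusters))
            for b in range(a + 1, len(clusters))
        )
        if dist > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return sorted(sorted(c) for c in clusters)


# Toy feature vectors standing in for wavelet-energy / fractal-dimension features.
features = [
    [1.0, 0.1, 0.0],
    [0.9, 0.2, 0.0],
    [0.0, 1.0, 0.1],
    [0.1, 0.9, 0.0],
    [0.0, 0.1, 1.0],
]
print(cluster_by_threshold(features, threshold=0.2))
```

The resulting cluster labels could then serve as training targets for a downstream classifier, which is the role the abstract assigns to the generated clusters.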
Citations: 1
VLSI architectures for Delay Multiply and Sum Beamforming in Ultrasound Medical Imaging
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179510
Gayathri Malamal, Mahesh Raveendranatha Panicker
Ultrasound medical imaging systems typically use standard delay and sum (DAS) beamforming on reception for image reconstruction. In DAS, the echo signals returned to the transducer are aligned in time and summed to form the beamformed signal. To improve the image quality and signal-to-noise ratio of DAS, a non-linear beamforming technique named delay multiply and sum (DMAS) has been proposed in the literature, in which the signals arriving at the transducer are aligned in time and multiplied pairwise in all possible combinations before summation. This provides better coherence and a correlation-based, data-driven apodization, and consequently results in better contrast and resolution. However, the computational complexity of DMAS is higher than that of DAS, which restricts its real-time implementation. This paper presents two novel VLSI architectures for implementing DMAS whose complexity is independent of the number of transducer elements. The proposed architectures are implemented on an xc7z010clg400-1 FPGA, and the results clearly show the channel independence of the proposed architectures.
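The difference in cost between DAS and DMAS can be seen in a few lines. The sketch below works on one set of already delay-aligned channel samples and uses plain pairwise products, as described in the abstract. It also shows a standard algebraic identity, sum over i<j of s_i*s_j = ((sum s_i)^2 - sum s_i^2) / 2, which removes the explicit pairwise loop. Whether the paper's architectures rely on this identity is not stated; it is included here only as an assumption-free mathematical fact, and the function names are hypothetical.

```python
def das(aligned):
    """Delay and sum: add the time-aligned channel samples."""
    return sum(aligned)


def dmas_pairwise(aligned):
    """Delay multiply and sum: multiply every channel pair, then add.
    Cost grows quadratically with the number of channels."""
    n = len(aligned)
    total = 0.0
    for i in range(n - 1):
        for j in range(i + 1, n):
            total += aligned[i] * aligned[j]
    return total


def dmas_via_identity(aligned):
    """Same value from one pass over the channels, using
    sum_{i<j} s_i s_j = ((sum s_i)^2 - sum s_i^2) / 2."""
    s1 = sum(aligned)
    s2 = sum(s * s for s in aligned)
    return 0.5 * (s1 * s1 - s2)


samples = [1.0, 2.0, 3.0, 0.5]  # illustrative aligned samples from four channels
print(das(samples), dmas_pairwise(samples), dmas_via_identity(samples))
```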
Citations: 1
Sparse Plane-wave Decomposition for Upscaling Ambisonic Signals
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179569
Gyanajyoti Routray, R. Hegde
Lower-order ambisonics suffers from low spatial resolution, while directly recording higher-order ambisonics (HOA) requires complex hardware. This problem can be addressed by upscaling order-1 ambisonics (B-format signals). In this paper, a sparse plane-wave decomposition method using sequential matching pursuit is developed for upscaling the ambisonic order. The proposed method maintains the same sparsity level across multiple measurements and is computationally efficient. Its performance is evaluated in terms of the error in the encoded signal and the reconstructed sound field, and compared with state-of-the-art upscaling techniques. Perceptual evaluations are also conducted and indicate a significant improvement in spatial resolution.
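The idea of a sparse plane-wave decomposition can be illustrated with plain matching pursuit over a small dictionary of first-order plane-wave encodings. This is a simplified sketch, not the paper's sequential matching pursuit: it handles a single measurement vector, restricts sources to the horizontal plane, uses a coarse grid of four azimuths, and assumes the traditional B-format weighting of the W channel by 1/sqrt(2). All names and values are illustrative.

```python
import math


def plane_wave_atom(azimuth_deg):
    """Unit-norm first-order (W, X, Y) encoding of a horizontal plane wave."""
    az = math.radians(azimuth_deg)
    raw = [1.0 / math.sqrt(2.0), math.cos(az), math.sin(az)]
    norm = math.sqrt(sum(c * c for c in raw))
    return [c / norm for c in raw]


def matching_pursuit(atoms, signal, n_iter):
    """Greedy decomposition: pick the atom best correlated with the residual,
    subtract its contribution, and repeat."""
    residual = list(signal)
    coeffs = {}
    for _ in range(n_iter):
        scores = [sum(a * r for a, r in zip(atom, residual)) for atom in atoms]
        k = max(range(len(atoms)), key=lambda i: abs(scores[i]))
        coeffs[k] = coeffs.get(k, 0.0) + scores[k]
        residual = [r - scores[k] * a for r, a in zip(residual, atoms[k])]
    return coeffs, residual


azimuths = [0, 90, 180, 270]               # assumed candidate directions
atoms = [plane_wave_atom(a) for a in azimuths]
signal = [2.0 * c for c in atoms[1]]       # one plane wave arriving from 90 degrees
coeffs, residual = matching_pursuit(atoms, signal, n_iter=1)
print({azimuths[k]: round(v, 6) for k, v in coeffs.items()})
```

Once the active directions and amplitudes are known, each plane wave could be re-encoded at a higher ambisonic order, which is the upscaling step the abstract refers to.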
Citations: 1
Speech task based automatic classification of ALS and Parkinson’s Disease and their severity using log Mel spectrograms
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179503
BN Suhas, Jhansi Mallela, Aravind Illa, B. Yamini, A. Nalini, R. Yadav, D. Gope, P. Ghosh
We consider the task of speech-based classification of patients with amyotrophic lateral sclerosis (ALS), patients with Parkinson’s disease (PD), and healthy controls (HC). Recent work on convolutional neural networks (CNNs) for image classification raises the possibility of using spectral representations of speech to detect neurological diseases. In this paper, a spectrogram-based approach is used. Feeding overlapping windows to the CNN ensures that temporal aspects are taken into account, using short signal segments or wide analysis filters. A three-class (ALS, PD, or HC) dysarthria classification is performed. In addition, we perform two severity classification experiments, for ALS (5 classes) and PD (3 classes). Experiments are conducted on both baseline MFCC data [1] and log Mel spectrograms. Classification results show that, for several audio lengths, models trained on log Mel spectrograms consistently outperform those trained on MFCCs. The ability of the network to classify the different classes accurately is evaluated via the area under the receiver operating characteristic curve [2],[3]. The findings from this study could aid in better detection and monitoring of ALS and PD.
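Two small building blocks behind such a pipeline are sketched below: the widely used Hz-to-mel mapping, and the slicing of a frame sequence into overlapping windows for a CNN. The window length, hop size, and function names are assumptions chosen for illustration; the abstract does not give the values used in the paper.

```python
import math


def hz_to_mel(freq_hz):
    """Convert Hz to mels using the common 2595 * log10(1 + f / 700) form."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def overlapping_windows(frames, win_len, hop):
    """Split a sequence of spectrogram frames into windows of `win_len` frames,
    advancing by `hop` frames, so consecutive windows overlap when hop < win_len."""
    if win_len <= 0 or hop <= 0:
        raise ValueError("win_len and hop must be positive")
    return [frames[s:s + win_len] for s in range(0, len(frames) - win_len + 1, hop)]


frames = list(range(10))  # stand-ins for 10 log Mel spectrogram frames
print(overlapping_windows(frames, win_len=4, hop=2))
print(round(hz_to_mel(1000.0), 1))
```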
Citations: 19
Improving Automatic Speech Recognition by Classifying Adult and Child Speakers into Separate Groups using Speech Rate Rhythmicity Parameter
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179497
S. Shahnawazuddin, Tarun Sai Bandarupalli, R. Chakravarthy
When children’s speech is transcribed using acoustic models trained on adults’ data, recognition performance degrades severely. Similar degradation is observed when adults’ speech is recognized using an automatic speech recognition (ASR) system trained on children’s speech. This problem can be overcome by using two separate ASR systems for the two groups of speakers, but that approach requires an effective technique for detecting whether the given data is from an adult or a child speaker. In this paper, we present a very simple and novel technique for this purpose. The proposed approach is based on the speech rate rhythmicity parameter (SRRP). Since the speaking rates of adults and children differ significantly, the SRRP values are also very different for the two groups of speakers. Hence, by computing the SRRP value for a given speech utterance, it can be easily determined whether it is from an adult or a child speaker. The corresponding ASR system can then be used to achieve improved recognition performance. Alternatively, existing techniques for improving children’s speech recognition on systems trained on adult data can be applied directly once it is known that the data is from a child speaker. Both these aspects have been experimentally validated in this work.
Citations: 1
Performance Analysis of DF Relaying Assisted Underwater Visible Light Communication System
Pub Date : 2020-07-01 DOI: 10.1109/SPCOM50965.2020.9179631
Monika Jain, N. Sharma, Akash Gupta, D. Rawal, P. Garg
Underwater optical wireless communication plays an important role in oceanographic research, offshore exploration, climate monitoring, and tactical surveillance, and has attracted the interest of the military, industrial, and scientific research communities. In this paper, a mixed satellite-underwater visible light communication (UWVLC) decode-and-forward (DF) cooperative system is proposed to establish connectivity between underwater environments and terrestrial regions via satellite, for both oceanic monitoring and communication. The link between the satellite and the floating vessel (FV) is modeled as a shadowed-Rician fading channel, and the UWVLC link is characterized by the mixture exponential-generalized gamma (EGG) distribution. In particular, for the dual-hop satellite-UWVLC system, we derive exact closed-form expressions for the average bit error rate (BER) in terms of a generalized Fox-H function. Furthermore, the performance of the system is analyzed for various parameters, such as air bubble level, temperature gradient, and water salinity, to validate the feasibility of the proposed system model.
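The abstract names two ingredients that are easy to illustrate: a dual-hop DF link and a UWVLC hop whose irradiance follows a mixture EGG distribution. The sketch below is not the paper's Fox-H derivation. It assumes binary modulation (so a bit is wrong at the destination only when exactly one hop flips it) and the usual two-component EGG mixture with weight `omega`, exponential mean `lam`, and generalized-gamma parameters `a`, `b`, `c`. All function and parameter names are hypothetical.

```python
import random


def df_end_to_end_ber(p_hop1: float, p_hop2: float) -> float:
    """End-to-end BER of a dual-hop DF link, assuming binary modulation.

    The destination bit is wrong when exactly one of the two hops
    (satellite-to-relay, relay-to-underwater node) makes an error.
    """
    return p_hop1 * (1.0 - p_hop2) + p_hop2 * (1.0 - p_hop1)


def sample_egg(omega: float, lam: float, a: float, b: float, c: float,
               rng: random.Random) -> float:
    """Draw one irradiance sample from a mixture EGG distribution.

    With probability omega the sample is exponential with mean lam;
    otherwise it is generalized gamma, obtained as b * G**(1/c)
    with G ~ Gamma(shape=a, scale=1).
    """
    if rng.random() < omega:
        # expovariate takes the rate, i.e. 1 / mean.
        return rng.expovariate(1.0 / lam)
    return b * rng.gammavariate(a, 1.0) ** (1.0 / c)


if __name__ == "__main__":
    rng = random.Random(0)
    # Illustrative (made-up) mixture parameters, not values from the paper.
    draws = [sample_egg(0.2, 0.5, 2.0, 1.0, 3.0, rng) for _ in range(5)]
    print(draws)
    print(df_end_to_end_ber(1e-3, 2e-3))
```

A Monte Carlo BER estimate for the underwater hop could average a conditional error probability over many `sample_egg` draws and pass the result to `df_end_to_end_ber` together with the satellite-hop BER. The closed-form expressions mentioned in the abstract avoid that simulation step.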
Citations: 2