2018 Twenty Fourth National Conference on Communications (NCC)最新文献

英文中文

Determining the Generalized Hamming Weight Hierarchy of the Binary Projective Reed-Muller Code 确定二进制投影Reed-Muller码的广义Hamming权层次

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-06-06 DOI: 10.1109/NCC.2018.8600134

Vinayak Ramkumar, Myna Vajha, P. V. Kumar

Projective Reed-Muller codes correspond to subcodes of the Reed-Muller code in which the polynomials being evaluated to yield codewords, are restricted to be homogeneous. The Generalized Hamming Weights (GHW) of a code C, identify for each dimension v, the smallest size of the support of a sub code of $C$ of dimension u, The GHW of a code are of interest in assessing the vulnerability of a code in a wiretap channel setting. It is also of use in bounding the state complexity of the trellis representation of the code. In prior work [1] by the same authors, a code-shortening algorithm was employed to derive upper bounds on the GHW of binary projective, Reed-Muller (PRM) codes. In the present paper, we derive a matching lower bound by adapting the proof techniques used originally for Reed-Muller (RM) codes by Wei in [2]. This results in a characterization of the GHW hierarchy of binary PRM codes.

投影里德-穆勒码对应于里德-穆勒码的子码，其中多项式被评估以产生码字，被限制为齐次。代码C的广义汉明权重(GHW)，确定了每个维v，支持维u的子代码$C$的最小尺寸，代码的GHW对评估代码在窃听信道设置中的脆弱性感兴趣。它还用于限定代码的网格表示的状态复杂性。在相同作者的先前工作[1]中，使用代码缩短算法来推导二进制投影Reed-Muller (PRM)码的GHW上界。本文采用Wei在[2]中最初用于Reed-Muller (RM)码的证明技术，导出了一个匹配下界。这导致了二进制PRM码的GHW层次结构的表征。

引用次数: 3

Spectrum Sensing and Collision with Primary Users in MIMO Cognitive Radio MIMO认知无线电中的频谱感知与主用户碰撞

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600204

V. Viswanath, S. Alam, R. S. Kshetrimayum

Cognitive Radio Network is a form of communication where licensed frequency band of the Primary users (PUs) are made available to the Secondary user (SU) with constraint interference to the PUs. In this work, we have investigated a novel model considering interweave approach for spectrum access, with multiple primary users and single secondary user (SU). Multiple antennas have been considered at both the primary users as well as the secondary users. The activity of primary users are modeled as Poisson process. In addition, we also propose a new approach of sensing for a MIMO cognitive radio network using energy detection. The proposed method provides a closed form expression for probability of detection (Pd) and probability of false alarm (Pf) in a Multiple Input Multiple Output (MIMO) channel. Further, the throughput of secondary user as well as interference on primary users caused due to secondary user transmissions, has been computed for the proposed model.

认知无线网络是一种将主用户(pu)的许可频带提供给辅助用户(SU)的通信形式，对pu有约束干扰。在这项工作中，我们研究了一种考虑频谱接入交织方法的新模型，该模型具有多个主用户和单个辅助用户(SU)。在主用户和辅助用户都考虑了多天线。将主用户的活动建模为泊松过程。此外，我们还提出了一种基于能量检测的MIMO认知无线网络传感新方法。该方法给出了多输入多输出(MIMO)信道中检测概率(Pd)和虚警概率(Pf)的封闭表达式。此外，还计算了辅助用户的吞吐量以及辅助用户传输对主用户的干扰。

引用次数: 2

Automatic Segmentation of Intima Media Complex in Common Carotid Artery using Adaptive Wind Driven Optimization 应用自适应风驱动优化自动分割颈总动脉内膜中膜复合体

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600240

Pardhu Madipalli, S. Kotta, Harish Dadi, N. Y., A. C S, A. V. Narasimhadhan

Cardiovascular diseases have been one of the leading causes of death and have been increasing in much of the developing world. Atherosclerosis, the accumulation of plaque on artery walls is the major for cardiovascular diseases. This is diagnosed by measuring the thickness of IMC of common carotid artery (CCA) in ultrasound images. In this paper, we present a completely automatic technique for segmentation of IMC in ultrasound images of CCA. The image is segmented using adaptive wind driven optimization (AWDO) technique. The denoising filter based on Bayesian least square approach and a robust enhancement technique is used in the pre-processing stage. The proposed method is evaluated on 60 ultrasound images and is compared with the state-of-the-art methods. The experimental results show that the proposed method yields better results as compared to other methods.

心血管疾病一直是导致死亡的主要原因之一，并且在许多发展中国家一直在增加。动脉粥样硬化，即斑块在动脉壁上的积聚，是心血管疾病的主要诱因。这是通过测量颈总动脉(CCA)超声图像的IMC厚度来诊断的。在本文中，我们提出了一种完全自动分割CCA超声图像中IMC的技术。采用自适应风驱动优化(AWDO)技术对图像进行分割。预处理阶段采用了基于贝叶斯最小二乘法的去噪滤波器和鲁棒增强技术。所提出的方法是评估60超声图像，并与最先进的方法进行比较。实验结果表明，与其他方法相比，该方法取得了更好的效果。

引用次数: 5

Cell-Phone Identification from Recompressed Audio Recordings 从重新压缩的音频记录中识别手机

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600131

Vinay Verma, Preet Khaturia, N. Khanna

Many audio forensic applications would benefit from the ability to classify audio recordings, based on characteristics of the originating device, particularly in social media platforms where an enormous amount of data is posted every day. This paper utilizes passive signatures associated with the recording devices, as extracted from recorded audio itself, in the absence of any extrinsic security mechanism such as digital watermarking, to identify the source cell-phone of recorded audio. It uses device-specific information present in low as well as high-frequency regions of the recorded audio. On the only publicly available dataset in this field, MOBIPHONE, the proposed system gives a closed set accuracy of 97.2 % which matches the state of art accuracy reported for this dataset. On audio recordings which have undergone double compression, as typically happens for a recording posted on social media, the proposed system outperforms the existing methods (4% improvement in average accuracy).

许多音频取证应用将受益于根据原始设备的特征对录音进行分类的能力，特别是在每天发布大量数据的社交媒体平台上。本文利用与录音设备相关的被动签名，在没有任何外部安全机制(如数字水印)的情况下，从录制的音频中提取与录音设备相关的被动签名，来识别录制音频的源手机。它使用存在于录制音频的低频和高频区域的设备特定信息。在该领域唯一公开可用的数据集MOBIPHONE上，所提出的系统给出了97.2%的封闭集精度，与该数据集报告的最新精度相匹配。对于经过双重压缩的音频记录，如通常发生在社交媒体上的录音，所提出的系统优于现有的方法(平均准确率提高4%)。

引用次数: 5

Improved Epoch Extraction Using Variational Mode Decomposition Based Spectral Smoothing of Zero Frequency Filtered Emotive Speech Signals 基于变分模分解的零频率滤波情绪语音信号谱平滑改进历元提取

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600091

D. Govind, D. Pravena, S. Ajay

The objective of the present work is to improve the epoch extraction performance from emotive speech by proposing a post processing approach to the conventional zero frequency filtering (ZFF) method using variational mode decomposition (VMD) based spectral smoothing. Due to the fast uncontrolled variations of the pitch in emotive speech signals, the reliable estimation of epochs is always challenging. In the proposed method, the spectra of the short frames of zero frequency filtered signal (ZFFS) is subjected variational mode decomposition to get component spectra in five modes. A smoothed short time spectra is then obtained by excluding the spectra from the two higher VMD modes which essentially have the high spectral variations. The modified ZFFS is then reconstructed using the sinusoidal parameters corresponding to single dominant frequency present in the smoothed spectra using VMD by parameter interpolation based sinusoidal synthesis. The resulting re-synthesized ZFFS has reduced spurious zero crossings as compared to that obtained from the conventional ZFF method for emotive speech signals. The effectiveness of the proposed VMD based spectral post processing is confirmed from the improved epoch identification rate and epoch identification accuracy across all the emotive utterances (with 7 emotions) present in German emotion speech database having simultaneous speech and electroglottographic (EGG) signal recordings. The performance of the proposed method is found to be better or comparable with the other existing ZFF based post processing methods proposed for emotive speech signals in terms of the epoch identification accuracy with respect to the corresponding reference epochs estimated from EGG signals.

本文的目的是通过提出一种基于变分模态分解(VMD)的频谱平滑的后处理方法来改进从情绪语音中提取历元的性能。由于情绪语音信号中音高的快速不受控制的变化，可靠的时代估计一直是一个挑战。该方法对零频滤波信号的短帧谱进行变分模态分解，得到五种模态的分量谱。然后通过排除两个高VMD模式的光谱得到平滑的短时间光谱，这两个模式本质上具有高光谱变化。然后利用基于参数插值的正弦合成方法，利用VMD平滑谱中单个主频率对应的正弦参数重构改进后的ZFFS。与传统的ZFF方法获得的情感语音信号相比，由此产生的重新合成的ZFFS减少了虚假的过零。在德语情绪语音数据库中，同时记录语音和声门电信号的所有情绪话语(含7种情绪)的历元识别率和历元识别准确率均有所提高，从而证实了基于VMD的频谱后处理的有效性。与现有的基于ZFF的情感语音信号后处理方法相比，基于EGG信号估计的相应参考epoch的历元识别精度更好或相当。

{"title":"Improved Epoch Extraction Using Variational Mode Decomposition Based Spectral Smoothing of Zero Frequency Filtered Emotive Speech Signals","authors":"D. Govind, D. Pravena, S. Ajay","doi":"10.1109/NCC.2018.8600091","DOIUrl":"https://doi.org/10.1109/NCC.2018.8600091","url":null,"abstract":"The objective of the present work is to improve the epoch extraction performance from emotive speech by proposing a post processing approach to the conventional zero frequency filtering (ZFF) method using variational mode decomposition (VMD) based spectral smoothing. Due to the fast uncontrolled variations of the pitch in emotive speech signals, the reliable estimation of epochs is always challenging. In the proposed method, the spectra of the short frames of zero frequency filtered signal (ZFFS) is subjected variational mode decomposition to get component spectra in five modes. A smoothed short time spectra is then obtained by excluding the spectra from the two higher VMD modes which essentially have the high spectral variations. The modified ZFFS is then reconstructed using the sinusoidal parameters corresponding to single dominant frequency present in the smoothed spectra using VMD by parameter interpolation based sinusoidal synthesis. The resulting re-synthesized ZFFS has reduced spurious zero crossings as compared to that obtained from the conventional ZFF method for emotive speech signals. The effectiveness of the proposed VMD based spectral post processing is confirmed from the improved epoch identification rate and epoch identification accuracy across all the emotive utterances (with 7 emotions) present in German emotion speech database having simultaneous speech and electroglottographic (EGG) signal recordings. The performance of the proposed method is found to be better or comparable with the other existing ZFF based post processing methods proposed for emotive speech signals in terms of the epoch identification accuracy with respect to the corresponding reference epochs estimated from EGG signals.","PeriodicalId":121544,"journal":{"name":"2018 Twenty Fourth National Conference on Communications (NCC)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125576409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Disambiguation of Source and Trajectory Non-Stationarities of a Moving Acoustic Source 运动声源的声源和轨迹非平稳性消歧

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8599942

Sai Gunaranian Pelluri, T. Sreenivas

It is well known that when there is a relative motion between the transmitter (source) and receiver, a Doppler shift is observed in the spectral content of the received signal. In this paper, we investigate a scenario where the source signal itself has an innate spectral non-stationarity in addition to the non-stationarity introduced by the source motion relative to the receiver. Using only a single microphone recording, we show that these two kinds of non-stationarities are distinguishable and propose a method of separating them. Towards this, we propose a novel scheme of simulating the signal from a source traversing an arbitrary trajectory. The proposed simulation mechanism employs band-limited interpolation and nonuniform sampling to incorporate an acoustic source generating an arbitrary band-limited signal and moving along an arbitrary trajectory.

众所周知，当发射器(源)和接收器之间存在相对运动时，在接收信号的频谱内容中观察到多普勒频移。在本文中，我们研究了一种情况，即除了源相对于接收器的运动所引入的非平稳性外，源信号本身还具有固有的频谱非平稳性。仅使用一个麦克风记录，我们就证明了这两种非平稳性是可区分的，并提出了一种分离它们的方法。为此，我们提出了一种模拟任意轨迹源信号的新方案。所提出的仿真机制采用限带插值和非均匀采样来模拟产生任意限带信号并沿任意轨迹运动的声源。

引用次数: 0

Permutation Polynomial Representatives and their Matrices 置换多项式表示及其矩阵

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8599882

Megha M. Kolhekar, H. Pillai

Permutation polynomials are a topic of research due to their applications in various areas like coding theory, cryptography and combinatorial designs. The seminal paper [1] lists many open problems in this area. There are q^qpolynomials of degree < $q$ over $mathbb{F}_{q}$ and $q!$ among them are the permutation polynomials. Therefore as $q$ increases it becomes more difficult to find a permutation polynomial. In this paper, we define a notion of a “Permutation Polynomial Representative (PPR)” which can be used to reduce the search space for permutation polynomials. We give some properties of a PPR. Then we give matrix representation of a PPR; which can be used to construct the ‘compositional inverse’ of the PPR. In every application compositional inverses are required to invert the permutation established by the permutation polynomial, but finding the compositional inverse of a given permutation polynomial is not a straightforward problem. Further, we introduce a product of two vectors over $mathbb{F}_{q}$ which we call as the ‘Butterfly Product’, use it to define a $mathcal{H}$ matrix’ and provide a necessary and sufficient condition for any (q - 2) × (q - 2) matrix over $mathbb{F}_{q}$ to be the matrix representation of a permutation of non-zero elements of $mathbb{F}_{q}$. In the end we give a theorem about finding more permutation polynomials from the matrix of a PPR.

排列多项式由于其在编码理论、密码学和组合设计等各个领域的应用而成为一个研究课题。开创性的论文[1]列出了该领域的许多开放问题。有阶数< $q$ / $mathbb{F}_{q}$和$q!其中$是排列多项式。因此，随着$q$的增加，寻找置换多项式变得更加困难。在本文中，我们定义了一个“置换多项式代表(PPR)”的概念，它可以用来减少置换多项式的搜索空间。我们给出了PPR的一些性质。然后给出了PPR的矩阵表示;它可以用来构造PPR的“复合逆”。在每个应用中，组合逆都需要对由置换多项式建立的置换进行逆，但是找到给定置换多项式的组合逆并不是一个简单的问题。进一步，我们引入$mathbb{F}_{q}$上两个向量的乘积，我们称之为“蝴蝶积”，用它来定义$mathbb{H}$矩阵，并给出$mathbb{F}_{q}$上任意(q - 2) × (q - 2)矩阵是$mathbb{F}_{q}$的非零元素置换的矩阵表示的充要条件。最后给出了从PPR矩阵中求出更多置换多项式的定理。

{"title":"Permutation Polynomial Representatives and their Matrices","authors":"Megha M. Kolhekar, H. Pillai","doi":"10.1109/NCC.2018.8599882","DOIUrl":"https://doi.org/10.1109/NCC.2018.8599882","url":null,"abstract":"Permutation polynomials are a topic of research due to their applications in various areas like coding theory, cryptography and combinatorial designs. The seminal paper [1] lists many open problems in this area. There are q<sup>q</sup>polynomials of degree < <tex>$q$</tex> over <tex>$mathbb{F}_{q}$</tex> and <tex>$q!$</tex> among them are the permutation polynomials. Therefore as <tex>$q$</tex> increases it becomes more difficult to find a permutation polynomial. In this paper, we define a notion of a “Permutation Polynomial Representative (PPR)” which can be used to reduce the search space for permutation polynomials. We give some properties of a PPR. Then we give matrix representation of a PPR; which can be used to construct the ‘compositional inverse’ of the PPR. In every application compositional inverses are required to invert the permutation established by the permutation polynomial, but finding the compositional inverse of a given permutation polynomial is not a straightforward problem. Further, we introduce a product of two vectors over <tex>$mathbb{F}_{q}$</tex> which we call as the ‘Butterfly Product’, use it to define a <tex>$mathcal{H}$</tex> matrix’ and provide a necessary and sufficient condition for any (q - 2) × (q - 2) matrix over <tex>$mathbb{F}_{q}$</tex> to be the matrix representation of a permutation of non-zero elements of <tex>$mathbb{F}_{q}$</tex>. In the end we give a theorem about finding more permutation polynomials from the matrix of a PPR.","PeriodicalId":121544,"journal":{"name":"2018 Twenty Fourth National Conference on Communications (NCC)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126384546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cooperative Sensing of OFDM Signals using Heterogeneous Sensors 基于异构传感器的OFDM信号协同感知

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600266

Akhil Singh, P. B. Gohain, S. Chaudhari

In this paper, we investigate a distributed and heterogeneous cognitive radio network (CRN), comprising of secondary users (SUs) employing either energy detector (ED) or autocorrelation detector (AD) to detect the presence or absence of an orthogonal frequency-division multiplexing (OFDM) based primary user (PU). For the considered heterogeneous cooperative spectrum sensing (CSS), the optimal soft combining rule is derived. The performance of this optimal fusion rule and different hard combining schemes such as OR, AND, and MAJOR- ITY is presented for the case when the noise variance is exactly known. Later, the effect of noise uncertainty is also presented. The proposed heterogeneous CSS is shown to combine the excellent performance of the EDs (when the noise variance is exactly known) and robustness of the ADs to the noise uncertainty.

在本文中，我们研究了一个分布式和异构的认知无线电网络(CRN)，该网络由二级用户(su)组成，采用能量检测器(ED)或自相关检测器(AD)来检测基于正交频分复用(OFDM)的主用户(PU)的存在或不存在。针对考虑的异构协同频谱感知，导出了最优软组合规则。在噪声方差确切已知的情况下，给出了该最优融合规则与OR、and、MAJOR- ITY等不同硬组合方案的性能。最后分析了噪声不确定度的影响。所提出的异构CSS结合了EDs的优异性能(当噪声方差确切已知时)和ADs对噪声不确定性的鲁棒性。

引用次数: 3

Energy-Efficient Air Pollution Monitoring with Optimum Duty-Cycling on a Sensor Hub 基于传感器中心的最优占空比节能空气污染监测

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600133

M. R. Chowdhury, S. De, N. Shukla, Ranendra N. Biswas

Air pollution monitoring systems with energy-intensive sensors cannot afford to sample frequently in order to maximize time between successive recharges. In this paper, we propose an energy-efficient machine learning based sensor duty-cycling method for a sensor hub receiving data from the air-pollution sensors. In particular, we demonstrate that temporal correlation of pollutant concentration can be exploited to select optimum sampling period of an energy-intensive sensor to reduce sensing energy consumption without losing much information. Support Vector Regression is used to predict the missing samples during the period sensor is turned off.

具有能源密集型传感器的空气污染监测系统无法承受频繁采样以最大化连续充电之间的时间间隔。在本文中，我们提出了一种节能的基于机器学习的传感器占空比方法，用于从空气污染传感器接收数据的传感器集线器。特别是，我们证明污染物浓度的时间相关性可以用来选择能量密集型传感器的最佳采样周期，以减少传感能量消耗，而不会丢失太多信息。支持向量回归用于预测传感器关闭期间缺失的样本。

引用次数: 11

Manner of Articulation based Split Lattices for Phoneme Recognition 基于发音的分割格音位识别方法

2018 Twenty Fourth National Conference on Communications (NCC)

Pub Date : 2018-02-01 DOI: 10.1109/NCC.2018.8600190

P. R, K. S. Rao

Phoneme lattices have been shown to be a good choice to encode in a compact way alternative decoding hypotheses from a speech recognition system. However the optimal phoneme sequence is produced by tracing all the phoneme identities in the lattice. This not only makes the search space of the decoder huge but also the final phoneme sequence may be prone to have false substitutions or insertion errors. In this paper, we introduce the split lattice structures that is generated by splitting the speech frames based on the manner of articulation. Spectral flatness measure (SFM) is exploited to detect the two broad manner of articulation sonorants and non-sonorants. The manner of sonorants includes broadly the vowels, the semivowels and the nasals whereas the fricatives, stop consonants and closures belong to non-sonorants. The conventional way of speech decoder produces one lattice for one test utterance. In our work, we split the speech frames into sonorants and non-sonorants based on SFM knowledge and generate split lattices. The split lattice generated are modified according to the manner of articulation in each split so as to remove the irrelevant phoneme identities in the lattice. For instance, the sonorant lattice is forced to exclude the non-sonorant phoneme identities and hence minimizing false substitutions or insertion errors. The proposed split lattice structure based on sonority detection decreased the phone error rates by nearly 0.9 % when evaluated on core TIMIT test corpus as compared to the conventional decoding involved in the state-of-the-art Deep Neural Networks (DNN).

音素格已经被证明是一个很好的选择，以一种紧凑的方式编码来自语音识别系统的不同解码假设。然而，最优音素序列是通过跟踪格中的所有音素身份来产生的。这不仅使解码器的搜索空间巨大，而且最终的音素序列容易出现假替换或插入错误。本文介绍了基于发音方式对语音帧进行分割而产生的分割格结构。频谱平坦度测量(SFM)被用于检测两种广泛的发音方式的辅音和非辅音。辅音的方式大致包括元音、半元音和鼻音，而摩擦音、停辅音和闭音属于非辅音。传统的语音解码器对一个测试话语产生一个格。在我们的工作中，我们基于SFM知识将语音帧拆分为辅音和非辅音，并生成分割格。根据每个分裂中的发音方式修改生成的分裂格，以去除晶格中不相关的音素身份。例如，语音点阵被迫排除非语音音素身份，从而最大限度地减少虚假替换或插入错误。当在核心TIMIT测试语料库上评估时，与最先进的深度神经网络(DNN)中涉及的传统解码相比，所提出的基于声音检测的分裂晶格结构将电话错误率降低了近0.9%。

{"title":"Manner of Articulation based Split Lattices for Phoneme Recognition","authors":"P. R, K. S. Rao","doi":"10.1109/NCC.2018.8600190","DOIUrl":"https://doi.org/10.1109/NCC.2018.8600190","url":null,"abstract":"Phoneme lattices have been shown to be a good choice to encode in a compact way alternative decoding hypotheses from a speech recognition system. However the optimal phoneme sequence is produced by tracing all the phoneme identities in the lattice. This not only makes the search space of the decoder huge but also the final phoneme sequence may be prone to have false substitutions or insertion errors. In this paper, we introduce the split lattice structures that is generated by splitting the speech frames based on the manner of articulation. Spectral flatness measure (SFM) is exploited to detect the two broad manner of articulation sonorants and non-sonorants. The manner of sonorants includes broadly the vowels, the semivowels and the nasals whereas the fricatives, stop consonants and closures belong to non-sonorants. The conventional way of speech decoder produces one lattice for one test utterance. In our work, we split the speech frames into sonorants and non-sonorants based on SFM knowledge and generate split lattices. The split lattice generated are modified according to the manner of articulation in each split so as to remove the irrelevant phoneme identities in the lattice. For instance, the sonorant lattice is forced to exclude the non-sonorant phoneme identities and hence minimizing false substitutions or insertion errors. The proposed split lattice structure based on sonority detection decreased the phone error rates by nearly 0.9 % when evaluated on core TIMIT test corpus as compared to the conventional decoding involved in the state-of-the-art Deep Neural Networks (DNN).","PeriodicalId":121544,"journal":{"name":"2018 Twenty Fourth National Conference on Communications (NCC)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131229435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2018 Twenty Fourth National Conference on Communications (NCC)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀