首页 > 最新文献

2015 Twenty First National Conference on Communications (NCC)最新文献

英文 中文
A novel breathiness feature for analysis and classification of speech under stress 一种新的呼吸特征在语音分析和分类中的应用
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084826
S. Deb, S. Dandapat
This work explores the effect of breathiness component on speech under stress. The breathiness component in a speech signal can be estimated using different features such as period perturbation quotient (PPQ), amplitude perturbation quotient (APQ), harmonic to noise ratio (HNR), glottal to noise excitation ratio (GNER), harmonic energy (HE), harmonic energy of residue (HER) and harmonic to signal ratio (HSR). Statistical analysis of these features shows that they have different mean and variance values for speech under stress. The performance of breathiness features is evaluated using Hidden Markov Model (HMM) for classification of speech under stress. The results show that the breathiness features successfully characterize the speech under stress. The performance of breathiness features is compared with the MFCC feature. Finally, a speech under stress classification method is proposed with the combination of breathiness and MFCC features. In terms of classification rates, the proposed combined feature outperforms the MFCC feature.
本研究探讨了压力下呼吸成分对言语的影响。语音信号中的呼吸分量可以通过周期摄动商(PPQ)、幅度摄动商(APQ)、谐波噪声比(HNR)、声门噪声激励比(GNER)、谐波能量(HE)、残差谐波能量(HER)和谐波信号比(HSR)等不同特征来估计。对这些特征的统计分析表明,它们在压力下具有不同的均值和方差值。利用隐马尔可夫模型(HMM)评价呼吸特征在压力语音分类中的表现。结果表明,呼吸特征能很好地表征压力下的语音。并与MFCC特性进行了性能比较。最后,提出了一种结合呼吸特征和MFCC特征的压力下语音分类方法。在分类率方面,所提出的组合特征优于MFCC特征。
{"title":"A novel breathiness feature for analysis and classification of speech under stress","authors":"S. Deb, S. Dandapat","doi":"10.1109/NCC.2015.7084826","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084826","url":null,"abstract":"This work explores the effect of breathiness component on speech under stress. The breathiness component in a speech signal can be estimated using different features such as period perturbation quotient (PPQ), amplitude perturbation quotient (APQ), harmonic to noise ratio (HNR), glottal to noise excitation ratio (GNER), harmonic energy (HE), harmonic energy of residue (HER) and harmonic to signal ratio (HSR). Statistical analysis of these features shows that they have different mean and variance values for speech under stress. The performance of breathiness features is evaluated using Hidden Markov Model (HMM) for classification of speech under stress. The results show that the breathiness features successfully characterize the speech under stress. The performance of breathiness features is compared with the MFCC feature. Finally, a speech under stress classification method is proposed with the combination of breathiness and MFCC features. In terms of classification rates, the proposed combined feature outperforms the MFCC feature.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124566974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Equalization in amplify-forward full-duplex relay with direct link 直连放大前向全双工中继的均衡
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084888
Karra Chinmay Dheeraj, A. Thangaraj, R. Ganti
An ideal full-duplex relay doubles the achievable data rate compared to a half-duplex relay. However, in practice, the self-interference and processing delay induces an ISI channel between the source and the destination nodes. In this paper, we study the outage performance of a full-duplex relaying network with amplify-and-forward scheme. In contrast to prior work, we include the direct link from the source to destination, and analyze the distribution of the end-to-end signal-to-noise ratio (SNR) with the minimum mean squared error decision feedback equalizer. We observe that the direct link provides a significant SNR gain, and including it is particularly important for self-interference combating at the receiver.
与半双工中继相比,理想的全双工中继可以实现双倍的数据速率。然而,在实际中,自干扰和处理延迟在源节点和目标节点之间产生了一个ISI通道。本文研究了采用放大转发方案的全双工中继网络的中断性能。与之前的工作相比,我们包括了从源到目标的直接链接,并使用最小均方误差决策反馈均衡器分析了端到端信噪比(SNR)的分布。我们观察到,直接链路提供了显著的信噪比增益,并且包括它对于接收器的自干扰对抗特别重要。
{"title":"Equalization in amplify-forward full-duplex relay with direct link","authors":"Karra Chinmay Dheeraj, A. Thangaraj, R. Ganti","doi":"10.1109/NCC.2015.7084888","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084888","url":null,"abstract":"An ideal full-duplex relay doubles the achievable data rate compared to a half-duplex relay. However, in practice, the self-interference and processing delay induces an ISI channel between the source and the destination nodes. In this paper, we study the outage performance of a full-duplex relaying network with amplify-and-forward scheme. In contrast to prior work, we include the direct link from the source to destination, and analyze the distribution of the end-to-end signal-to-noise ratio (SNR) with the minimum mean squared error decision feedback equalizer. We observe that the direct link provides a significant SNR gain, and including it is particularly important for self-interference combating at the receiver.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124098941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Widespread near-field with robust H-field using NDTC antennas in multipurpose applications 广泛的近场与鲁棒的h场使用NDTC天线在多用途应用
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084872
Ashwani Sharma, I. Zuazola, J. Batchelor, A. Perallos
Non-uniformly Distributed-Turns Coil (NDTC) antennas are specifically arranged to widespread the overall reactive field in multipurpose near-field applications e.g., the reading/interrogating and wireless power transfer (charging mats), while permitting their inherent robust H-field using earlier reported NDTC antennas to form the array. The proposed array consists of five NDTC antennas cautiously arranged and will be shown to optimally widespread the effective area of the antenna by 4.3 times. Simulated results are provided and corroborate the widespreading of the reactive field with robust H-field at HF 13.56MHz. A further widening of the reactive field in principle can be obtained using more NDTC antenna elements.
非均匀分布匝线圈(NDTC)天线专门用于在多用途近场应用中扩展整体无源场,例如读取/询问和无线电力传输(充电垫),同时使用先前报道的NDTC天线来形成阵列,从而允许其固有的鲁棒h场。该阵列由5个精心布置的NDTC天线组成,可将天线的有效面积优化至4.3倍。给出了仿真结果,证实了在HF 13.56MHz波段具有鲁棒h场的无功场的广泛存在。采用更多的NDTC天线单元,原则上可以进一步扩大无功场。
{"title":"Widespread near-field with robust H-field using NDTC antennas in multipurpose applications","authors":"Ashwani Sharma, I. Zuazola, J. Batchelor, A. Perallos","doi":"10.1109/NCC.2015.7084872","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084872","url":null,"abstract":"Non-uniformly Distributed-Turns Coil (NDTC) antennas are specifically arranged to widespread the overall reactive field in multipurpose near-field applications e.g., the reading/interrogating and wireless power transfer (charging mats), while permitting their inherent robust H-field using earlier reported NDTC antennas to form the array. The proposed array consists of five NDTC antennas cautiously arranged and will be shown to optimally widespread the effective area of the antenna by 4.3 times. Simulated results are provided and corroborate the widespreading of the reactive field with robust H-field at HF 13.56MHz. A further widening of the reactive field in principle can be obtained using more NDTC antenna elements.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120919983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On analyzing Indian cellular traffic characteristics for energy efficient network operation 分析印度蜂窝通信特性,实现高效节能网络运行
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084922
Karuna Kumar, Archit Gupta, Rushabh Shah, A. Karandikar, P. Chaporkar
Recent proliferation of mobile devices and high market demand have pushed power consumption of cellular networks to high levels in India. At the same time, the marginal gains to telecom operators for providing services have dwindled. Thus, a gap is slowly building up in the demand and supply of telecom services. The effect is adverse in urban areas where the demand for throughput and other load handling capabilities are high. While the base stations are setup to meet the peak quality of service (QoS) demands, vast opportunities exist to save operational and energy cost when loads are low. For the traffic patterns of one of India's leading telecom service providers, we show that such cost saving opportunities do exist in a systematic fashion and can be tapped to lower the operational cost. We further show that these opportunities can be predicted reliably and discuss a possible scheme to cut down on both energy and operational cost.
最近移动设备的激增和高市场需求将印度蜂窝网络的功耗推到了很高的水平。与此同时,电信运营商提供服务的边际收益也在减少。因此,电信服务的供需差距正在慢慢扩大。在对吞吐量和其他负载处理能力的需求很高的城市地区,这种影响是不利的。虽然基站的设置是为了满足峰值服务质量(QoS)需求,但在低负载时,存在大量节省运营和能源成本的机会。对于印度领先的电信服务提供商之一的流量模式,我们表明这种节省成本的机会确实以系统的方式存在,并且可以用来降低运营成本。我们进一步表明,这些机会可以可靠地预测,并讨论了一种可能的方案,以降低能源和运营成本。
{"title":"On analyzing Indian cellular traffic characteristics for energy efficient network operation","authors":"Karuna Kumar, Archit Gupta, Rushabh Shah, A. Karandikar, P. Chaporkar","doi":"10.1109/NCC.2015.7084922","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084922","url":null,"abstract":"Recent proliferation of mobile devices and high market demand have pushed power consumption of cellular networks to high levels in India. At the same time, the marginal gains to telecom operators for providing services have dwindled. Thus, a gap is slowly building up in the demand and supply of telecom services. The effect is adverse in urban areas where the demand for throughput and other load handling capabilities are high. While the base stations are setup to meet the peak quality of service (QoS) demands, vast opportunities exist to save operational and energy cost when loads are low. For the traffic patterns of one of India's leading telecom service providers, we show that such cost saving opportunities do exist in a systematic fashion and can be tapped to lower the operational cost. We further show that these opportunities can be predicted reliably and discuss a possible scheme to cut down on both energy and operational cost.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131646852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
On linear subspace codes closed under intersection 交点下封闭的线性子空间码
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084870
Pranab Basu, N. Kashyap
Subspace codes are subsets of the projective space Pq(n), which is the set of all subspaces of the vector space Fqn. Koetter and Kschischang argued that subspace codes are useful for error and erasure correction in random network coding. Linearity in subspace codes was defined by Braun, Etzion and Vardy, and they conjectured that the largest cardinality of a linear subspace code in Pq(n) is 2n. In this paper, we show that the conjecture holds for linear subspace codes that are closed under intersection, i.e., codes having the property that the intersection of any pair of codewords is also a codeword. The proof is via a characterization of such codes in terms of partitions of linearly independent subsets of Fqn.
子空间码是射影空间Pq(n)的子集,Pq(n)是向量空间Fqn的所有子空间的集合。Koetter和Kschischang认为子空间码对随机网络编码中的错误和擦除校正是有用的。Braun、Etzion和Vardy定义了子空间码的线性性,并推测出Pq(n)中线性子空间码的最大基数为2n。在本文中,我们证明了对于闭于交下的线性子空间码,即具有任意码字对的交也是码字的性质的码,这个猜想成立。证明是通过用Fqn的线性独立子集的划分来描述这些码的特征。
{"title":"On linear subspace codes closed under intersection","authors":"Pranab Basu, N. Kashyap","doi":"10.1109/NCC.2015.7084870","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084870","url":null,"abstract":"Subspace codes are subsets of the projective space Pq(n), which is the set of all subspaces of the vector space Fqn. Koetter and Kschischang argued that subspace codes are useful for error and erasure correction in random network coding. Linearity in subspace codes was defined by Braun, Etzion and Vardy, and they conjectured that the largest cardinality of a linear subspace code in Pq(n) is 2n. In this paper, we show that the conjecture holds for linear subspace codes that are closed under intersection, i.e., codes having the property that the intersection of any pair of codewords is also a codeword. The proof is via a characterization of such codes in terms of partitions of linearly independent subsets of Fqn.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127006317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Building speech synthesis systems for Indian languages 为印度语言构建语音合成系统
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084931
Abhijit Pradhan, A. Prakash, S. Shanmugam, G. Kasthuri, R. Krishnan, H. Murthy
In this paper, new efforts to build text-to-speech synthesis systems (TTS) for Indian languages is presented. The synthesisers are built around both concatenative speech synthesis and statistical parametric speech synthesis frameworks. Text to speech synthesis systems require accurate segmentation. Obtaining accurate segmentation at the phone-level is a difficult task. Manual segmentation leads to human errors, while automatic segmentation using statistical approaches (hidden Markov model based approaches) leads to poor boundary information, when the amount of data used for training is small. A group delay based syllable segmentation semi-automatic tool is discussed. The tool is semi-automatic as some of the boundaries obtained are inaccurate and have to be manually corrected. Next, a segmentation algorithm that uses both HMM based segmentation and group delay based segmentation, is used to obtain accurate boundaries automatically. The boundaries obtained are used in the syllable-based synthesiser for unit selection. In the statistical phone-based synthesiser, embedded reestimation is performed at the phone level. Syllable-based and penta-phone based HMMs are used for building the synthesiser. TTS systems for 12 different Indian languages namely Tamil, Hindi, Marathi, Malayalam, Telugu, Rajasthani, Bengali, Odia, Assamese, Manipuri, Kannada and Gujarati are built using semi-automatic segmentation and synthesisers have been built for 7 Indian languages using automatic segmentation. Evaluation of the semi-automatic segmentation systems indicate that the MOS (mean opinion score) is above 3.0 for most of the languages. Pair comparison tests on semi-automatic vs. automatic segmentation show that automatic segmentation is preferred.
本文介绍了为印度语言构建文本到语音合成系统(TTS)的新努力。合成器是围绕连接语音合成和统计参数语音合成框架建立的。文本到语音合成系统需要精确的分割。在电话级获得准确的分割是一项艰巨的任务。人工分割会导致人为错误,而使用统计方法(基于隐马尔可夫模型的方法)的自动分割在用于训练的数据量很小的情况下,会导致边界信息不佳。讨论了一种基于组延迟的音节分词半自动工具。该工具是半自动的,因为获得的一些边界是不准确的,必须手动校正。其次,采用基于隐马尔可夫模型和基于群延迟的分割算法,自动获得精确的边界;获得的边界用于基于音节的合成器进行单元选择。在基于统计电话的合成器中,在电话级执行嵌入式重估计。基于音节和基于五音素的hmm用于构建合成器。12种不同的印度语言的TTS系统,即泰米尔语、印地语、马拉地语、马拉雅拉姆语、泰卢固语、拉贾斯坦语、孟加拉语、奥迪亚语、阿萨姆语、曼尼普尔语、卡纳达语和古吉拉特语,使用半自动分割,为7种印度语言建立了合成器,使用自动分割。对半自动分词系统的评价表明,大多数语言的平均意见评分都在3.0以上。对半自动和自动分割的配对比较测试表明,自动分割是首选。
{"title":"Building speech synthesis systems for Indian languages","authors":"Abhijit Pradhan, A. Prakash, S. Shanmugam, G. Kasthuri, R. Krishnan, H. Murthy","doi":"10.1109/NCC.2015.7084931","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084931","url":null,"abstract":"In this paper, new efforts to build text-to-speech synthesis systems (TTS) for Indian languages is presented. The synthesisers are built around both concatenative speech synthesis and statistical parametric speech synthesis frameworks. Text to speech synthesis systems require accurate segmentation. Obtaining accurate segmentation at the phone-level is a difficult task. Manual segmentation leads to human errors, while automatic segmentation using statistical approaches (hidden Markov model based approaches) leads to poor boundary information, when the amount of data used for training is small. A group delay based syllable segmentation semi-automatic tool is discussed. The tool is semi-automatic as some of the boundaries obtained are inaccurate and have to be manually corrected. Next, a segmentation algorithm that uses both HMM based segmentation and group delay based segmentation, is used to obtain accurate boundaries automatically. The boundaries obtained are used in the syllable-based synthesiser for unit selection. In the statistical phone-based synthesiser, embedded reestimation is performed at the phone level. Syllable-based and penta-phone based HMMs are used for building the synthesiser. TTS systems for 12 different Indian languages namely Tamil, Hindi, Marathi, Malayalam, Telugu, Rajasthani, Bengali, Odia, Assamese, Manipuri, Kannada and Gujarati are built using semi-automatic segmentation and synthesisers have been built for 7 Indian languages using automatic segmentation. Evaluation of the semi-automatic segmentation systems indicate that the MOS (mean opinion score) is above 3.0 for most of the languages. Pair comparison tests on semi-automatic vs. automatic segmentation show that automatic segmentation is preferred.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116964763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
An ultra-thin triple band polarization-insensitive metamaterial absorber for C-band applications 一种用于c波段应用的超薄三波段偏振不敏感超材料吸收器
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084816
Devkinandan Chaurasiya, Somak Bhattacharyya, Saptarshi Ghosh, Praneeth Munaga, K. V. Srivastava
An ultra-thin triple band polarization-insensitive metamaterial absorber has been presented in this paper. The unit cell of the proposed structure consists of three concentric rings in the top layer of metal-backed dielectric substrate. The simulated result shows that the proposed structure has triple band absorptivity response lying in C band. The structure exhibits polarization-insensitive behavior under normal incidence due to four-fold symmetry. It also shows high absorption under oblique incidence upto 60° for both TE polarization (above 75%) and TM polarization (above 90%), thus validating the wide angle characteristics. The absorption mechanism is explained through illustrating the electric and magnetic field along with the surface current distribution. The proposed structure has been fabricated and measured, which shows good agreement with simulated response, thus verifying the polarization-insensitivity and wide angle characteristics.
本文提出了一种超薄三波段极化不敏感超材料吸收体。所提出的结构的晶胞由位于金属背衬介质衬底顶层的三个同心圆组成。模拟结果表明,该结构在C波段具有三波段的吸光度响应。由于四重对称性,该结构在正入射下表现出极化不敏感的特性。TE偏振(75%以上)和TM偏振(90%以上)在60°斜入射下均显示出高吸收,从而验证了广角特性。通过对电场、磁场和表面电流分布的说明,解释了吸附机理。该结构的制作和测量结果与仿真结果吻合较好,从而验证了该结构的偏振无灵敏度和广角特性。
{"title":"An ultra-thin triple band polarization-insensitive metamaterial absorber for C-band applications","authors":"Devkinandan Chaurasiya, Somak Bhattacharyya, Saptarshi Ghosh, Praneeth Munaga, K. V. Srivastava","doi":"10.1109/NCC.2015.7084816","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084816","url":null,"abstract":"An ultra-thin triple band polarization-insensitive metamaterial absorber has been presented in this paper. The unit cell of the proposed structure consists of three concentric rings in the top layer of metal-backed dielectric substrate. The simulated result shows that the proposed structure has triple band absorptivity response lying in C band. The structure exhibits polarization-insensitive behavior under normal incidence due to four-fold symmetry. It also shows high absorption under oblique incidence upto 60° for both TE polarization (above 75%) and TM polarization (above 90%), thus validating the wide angle characteristics. The absorption mechanism is explained through illustrating the electric and magnetic field along with the surface current distribution. The proposed structure has been fabricated and measured, which shows good agreement with simulated response, thus verifying the polarization-insensitivity and wide angle characteristics.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122981696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Blind image quality evaluation using perception based features 基于感知特征的盲图像质量评价
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084843
Features Venkatanath, Praneeth, Maruthi Chandrasekhar Bh., Sumohana S. Channappayya, S. Medasani
This paper proposes a novel no-reference Perception-based Image Quality Evaluator (PIQUE) for real-world imagery. A majority of the existing methods for blind image quality assessment rely on opinion-based supervised learning for quality score prediction. Unlike these methods, we propose an opinion unaware methodology that attempts to quantify distortion without the need for any training data. Our method relies on extracting local features for predicting quality. Additionally, to mimic human behavior, we estimate quality only from perceptually significant spatial regions. Further, the choice of our features enables us to generate a fine-grained block level distortion map. Our algorithm is competitive with the state-of-the-art based on evaluation over several popular datasets including LIVE IQA, TID & CSIQ. Finally, our algorithm has low computational complexity despite working at the block-level.
提出了一种新的基于无参考感知的真实图像质量评估器(PIQUE)。现有的大多数盲图像质量评估方法依赖于基于意见的监督学习进行质量分数预测。与这些方法不同,我们提出了一种不考虑意见的方法,该方法试图在不需要任何训练数据的情况下量化失真。我们的方法依赖于提取局部特征来预测质量。此外,为了模仿人类行为,我们仅从感知上重要的空间区域估计质量。此外,特征的选择使我们能够生成细粒度的块级失真图。基于对LIVE IQA, TID和CSIQ等几个流行数据集的评估,我们的算法与最先进的算法具有竞争力。最后,尽管我们的算法工作在块级,但计算复杂度很低。
{"title":"Blind image quality evaluation using perception based features","authors":"Features Venkatanath, Praneeth, Maruthi Chandrasekhar Bh., Sumohana S. Channappayya, S. Medasani","doi":"10.1109/NCC.2015.7084843","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084843","url":null,"abstract":"This paper proposes a novel no-reference Perception-based Image Quality Evaluator (PIQUE) for real-world imagery. A majority of the existing methods for blind image quality assessment rely on opinion-based supervised learning for quality score prediction. Unlike these methods, we propose an opinion unaware methodology that attempts to quantify distortion without the need for any training data. Our method relies on extracting local features for predicting quality. Additionally, to mimic human behavior, we estimate quality only from perceptually significant spatial regions. Further, the choice of our features enables us to generate a fine-grained block level distortion map. Our algorithm is competitive with the state-of-the-art based on evaluation over several popular datasets including LIVE IQA, TID & CSIQ. Finally, our algorithm has low computational complexity despite working at the block-level.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128292921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 381
Different aspects of source information for limited data speaker verification 不同方面的源信息对有限数据说话人进行验证
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084846
Rohan Kumar Das, D. Pati, S. Prasanna
Limited data speaker verification has shown its significance in practical system oriented applications. The paper shows the importance of different aspects of voice source feature for limited test data scenario. A baseline speaker verification system using conventional mel frequency cepstral co-efficients (MFCC) feature is developed and performance under limited test data condition (≤10 s) is evaluated. A parallel system based on source feature mel power difference of spectrum in subband (M-PDSS) is developed in the i-vector based speaker verification framework. Both the systems were fused at the score level for the cases of short segments of test speech, which demonstrated the importance of source feature with reduction in test data duration. A comparative study of the M-PDSS feature is then made with our earlier work using discrete cosine transform of the integrated linear prediction residual (DCTILPR) feature and then fusion of two source features M-PDSS and DCTILPR along with MFCC features is carried out. An absolute improvement of 5.19% is obtained for 2 s of test data which conveys the significance of multiple source information under limited data speaker verification as it carries different aspects of source information.
有限数据说话人验证在面向系统的实际应用中具有重要意义。本文阐述了在有限的测试数据场景下,语音源特性各方面的重要性。开发了一种基于传统mel频率倒谱系数(MFCC)特征的基线扬声器验证系统,并对该系统在有限测试数据条件下(≤10 s)的性能进行了评估。在基于i向量的说话人验证框架中,提出了一种基于源特征和子带频谱功率差的并行系统(M-PDSS)。在测试语音的短片段情况下,两种系统在得分水平上融合,这表明了源特征在减少测试数据持续时间方面的重要性。然后,利用积分线性预测残差(DCTILPR)特征的离散余弦变换对M-PDSS特征进行对比研究,然后将M-PDSS和DCTILPR两个源特征与MFCC特征进行融合。2 s测试数据的绝对改进率为5.19%,在有限的数据说话人验证下,由于测试数据承载了源信息的不同方面,体现了多源信息的重要性。
{"title":"Different aspects of source information for limited data speaker verification","authors":"Rohan Kumar Das, D. Pati, S. Prasanna","doi":"10.1109/NCC.2015.7084846","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084846","url":null,"abstract":"Limited data speaker verification has shown its significance in practical system oriented applications. The paper shows the importance of different aspects of voice source feature for limited test data scenario. A baseline speaker verification system using conventional mel frequency cepstral co-efficients (MFCC) feature is developed and performance under limited test data condition (≤10 s) is evaluated. A parallel system based on source feature mel power difference of spectrum in subband (M-PDSS) is developed in the i-vector based speaker verification framework. Both the systems were fused at the score level for the cases of short segments of test speech, which demonstrated the importance of source feature with reduction in test data duration. A comparative study of the M-PDSS feature is then made with our earlier work using discrete cosine transform of the integrated linear prediction residual (DCTILPR) feature and then fusion of two source features M-PDSS and DCTILPR along with MFCC features is carried out. An absolute improvement of 5.19% is obtained for 2 s of test data which conveys the significance of multiple source information under limited data speaker verification as it carries different aspects of source information.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132582712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Enhancing speech intelligibility based on noise characteristics 基于噪声特性提高语音清晰度
Pub Date : 2015-04-16 DOI: 10.1109/NCC.2015.7084905
Mayur Jagtap, P. Rao
In degraded listening conditions, speakers are known to adapt their speech via the Lombard reflex to make it more comprehensible. This characteristic has been used in previous work to modify speech recorded in quiet before it is rendered in a noisy environment. The spectral modifications used have been found to be effective in low-pass noise such as babble noise. In this work, we investigate intelligibility enhancement of speech in completely different noise characteristics, namely aircraft noise, with its dominant high-frequency components. Natural Lombard speech elicited in aircraft noise was observed to be spectrally similar to Lombard speech in babble noise and showed no intelligibility benefit in a listening test in the presence of aircraft noise. Synthetic modifications using a data dependent optimization based on a perceptual measure are investigated to obtain intelligibility enhancement in aircraft noise.
在听力下降的情况下,说话者会通过伦巴第反射来调整他们的讲话,使其更容易被理解。这一特性在之前的工作中已经被用来修改在安静环境下录制的语音,然后再将其呈现在嘈杂的环境中。所采用的频谱修正方法在低通噪声(如杂音噪声)中是有效的。在这项工作中,我们研究了在完全不同的噪声特征下语音的可理解性增强,即飞机噪声,其主要高频成分。在飞机噪音中引出的自然伦巴第语与在牙牙学语噪音中引出的伦巴第语在频谱上相似,在飞机噪音存在的听力测试中没有显示出可理解性的好处。为了提高飞机噪声的可理解性,研究了基于感知度量的数据依赖优化的综合修改。
{"title":"Enhancing speech intelligibility based on noise characteristics","authors":"Mayur Jagtap, P. Rao","doi":"10.1109/NCC.2015.7084905","DOIUrl":"https://doi.org/10.1109/NCC.2015.7084905","url":null,"abstract":"In degraded listening conditions, speakers are known to adapt their speech via the Lombard reflex to make it more comprehensible. This characteristic has been used in previous work to modify speech recorded in quiet before it is rendered in a noisy environment. The spectral modifications used have been found to be effective in low-pass noise such as babble noise. In this work, we investigate intelligibility enhancement of speech in completely different noise characteristics, namely aircraft noise, with its dominant high-frequency components. Natural Lombard speech elicited in aircraft noise was observed to be spectrally similar to Lombard speech in babble noise and showed no intelligibility benefit in a listening test in the presence of aircraft noise. Synthetic modifications using a data dependent optimization based on a perceptual measure are investigated to obtain intelligibility enhancement in aircraft noise.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115175207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2015 Twenty First National Conference on Communications (NCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1