首页 > 最新文献

2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)最新文献

英文 中文
A discrete linear chirp transform (DLCT) for data compression 用于数据压缩的离散线性啁啾变换(dct)
Osama A S Alkishriwo, L. Chaparro
Compressive sensing attempts to simplify the frequency transformation and thresholding steps, commonly done in data compression, into one. Sparseness of the signal, in either time or frequency, is required for the convex optimization in compressive sensing to perform well. Although sparseness of certain signals, in either time or frequency, is guaranteed by the uncertainty principle signals composed of chirps are not however sparse in either domain. In this paper we propose an orthogonal linear-chirp transform, the discrete linear chirp transform (DLCT), to represent any signal in terms of linear chirps, with modulation and dual properties. Using the DLCT the sparseness of the signal in either time or frequency can be assessed, and if not sparse in neither of these domains, the modulation and dual properties of the DLCT provide a way to transform the signal into a sparse signal. The application of the proposed DLCT is in data compression. The transformation is illustrated by using sparse and not sparse test signals as well as actual signals.
压缩感知试图将通常在数据压缩中完成的频率变换和阈值处理步骤简化为一个步骤。信号在时间和频率上的稀疏性是压缩感知中的凸优化的必要条件。虽然不确定性原理保证了某些信号在时间和频率上的稀疏性,但由啁啾组成的信号在时间和频率上都不是稀疏的。在本文中,我们提出了一个正交线性啁啾变换,离散线性啁啾变换(dct),以表示任何信号的线性啁啾,具有调制和对偶性质。使用dct可以评估信号在时间或频率上的稀疏性,如果在这两个域中都不是稀疏的,dct的调制和对偶特性提供了一种将信号转换为稀疏信号的方法。提出的dct在数据压缩方面的应用。通过使用稀疏和非稀疏测试信号以及实际信号来说明该变换。
{"title":"A discrete linear chirp transform (DLCT) for data compression","authors":"Osama A S Alkishriwo, L. Chaparro","doi":"10.1109/ISSPA.2012.6310490","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310490","url":null,"abstract":"Compressive sensing attempts to simplify the frequency transformation and thresholding steps, commonly done in data compression, into one. Sparseness of the signal, in either time or frequency, is required for the convex optimization in compressive sensing to perform well. Although sparseness of certain signals, in either time or frequency, is guaranteed by the uncertainty principle signals composed of chirps are not however sparse in either domain. In this paper we propose an orthogonal linear-chirp transform, the discrete linear chirp transform (DLCT), to represent any signal in terms of linear chirps, with modulation and dual properties. Using the DLCT the sparseness of the signal in either time or frequency can be assessed, and if not sparse in neither of these domains, the modulation and dual properties of the DLCT provide a way to transform the signal into a sparse signal. The application of the proposed DLCT is in data compression. The transformation is illustrated by using sparse and not sparse test signals as well as actual signals.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"1038 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131513220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A Kalman filter approach to packet loss replacement in presence of additive noise 一种卡尔曼滤波方法用于加性噪声下的丢包替换
S. Miralavi, S. Ghorshi, Aidin Tahaei
A major problem in real-time packet-based communication systems, is misrouted or delayed packet which results in degraded perceived voice quality. If packets are not available on time, the packets are considered lost. The easiest solution in a network terminal receiver is to replace silence for the duration of lost speech segments. In a high quality communication system, to avoid degradation in speech quality due to packet loss, a suitable method or algorithm is needed to replace the missing segments of speech. In this paper, we introduce an adaptive filter for replacement of lost speech segment. In this method Kalman filter as a state-space based method will be used to predict the clean speech signal in presence of additive noise. The evaluation results show that Kalman filter has lower MSE compared to other methods in presence of White Gaussian Noise and background noise.
在基于分组的实时通信系统中,一个主要问题是分组路由错误或延迟导致感知语音质量下降。如果数据包没有及时可用,则认为数据包丢失。在网络终端接收器中,最简单的解决方案是在丢失的语音段期间替换沉默。在高质量的通信系统中,为了避免丢包导致的语音质量下降,需要一种合适的方法或算法来替换缺失的语音片段。本文介绍了一种自适应滤波器,用于替换丢失的语音片段。该方法将卡尔曼滤波作为一种基于状态空间的方法来预测存在加性噪声的干净语音信号。评价结果表明,在存在高斯白噪声和背景噪声的情况下,卡尔曼滤波具有较低的MSE。
{"title":"A Kalman filter approach to packet loss replacement in presence of additive noise","authors":"S. Miralavi, S. Ghorshi, Aidin Tahaei","doi":"10.1109/ISSPA.2012.6310566","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310566","url":null,"abstract":"A major problem in real-time packet-based communication systems, is misrouted or delayed packet which results in degraded perceived voice quality. If packets are not available on time, the packets are considered lost. The easiest solution in a network terminal receiver is to replace silence for the duration of lost speech segments. In a high quality communication system, to avoid degradation in speech quality due to packet loss, a suitable method or algorithm is needed to replace the missing segments of speech. In this paper, we introduce an adaptive filter for replacement of lost speech segment. In this method Kalman filter as a state-space based method will be used to predict the clean speech signal in presence of additive noise. The evaluation results show that Kalman filter has lower MSE compared to other methods in presence of White Gaussian Noise and background noise.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"268 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132909200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Developing a hybrid language model for open vocabulary automatic speech recognition in a lecture speech task 基于开放词汇自动语音识别的混合语言模型研究
Marc-Antoine Rondeau, R. Rose
This paper addresses the problem of open vocabulary automatic speech recognition (ASR) using hybrid statistical language models (LMs). Hybrid LMs differ from closed vocabulary LMs in that the word level lexicon is augmented with an inventory of sub-lexical units (SLUs). The procedures used for selecting these SLUs and expanding out-of-vocabulary (OOV) words according to the SLUs is presented in the paper. The open-vocabulary ASR performance obtained using these techniques is presented for a lecture speech task domain.
本文研究了基于混合统计语言模型的开放词汇自动语音识别问题。混合LMs与封闭词汇LMs的不同之处在于,单词级别的词汇库增加了子词汇单元(slu)的清单。本文介绍了选择这些语言单元和根据语言单元扩展词汇外词的程序。在一个演讲任务域,给出了使用这些技术获得的开放词汇ASR性能。
{"title":"Developing a hybrid language model for open vocabulary automatic speech recognition in a lecture speech task","authors":"Marc-Antoine Rondeau, R. Rose","doi":"10.1109/ISSPA.2012.6310464","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310464","url":null,"abstract":"This paper addresses the problem of open vocabulary automatic speech recognition (ASR) using hybrid statistical language models (LMs). Hybrid LMs differ from closed vocabulary LMs in that the word level lexicon is augmented with an inventory of sub-lexical units (SLUs). The procedures used for selecting these SLUs and expanding out-of-vocabulary (OOV) words according to the SLUs is presented in the paper. The open-vocabulary ASR performance obtained using these techniques is presented for a lecture speech task domain.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122364027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Micro-doppler radar signature classification by time-frequency and time-scale analysis 基于时频和时标分析的微多普勒雷达特征分类
A. Serir, Youcef Bouhafsi
This paper deals with the micro-Doppler (m-D) phenomenon which is a frequency modulation induced on the returned radar signal due to micro-motion. Indeed, the micro-Doppler provides an additional information for target recognition that is complementary to existing recognition methods. In this paper we present a comparative study between feature extraction based on 2 types of analysis: time frequency (Instantaneous Frequency IF) analysis and time scale analysis (Wavelet Transform WT and Multiplicative Multiresolution Decomposition MMD), associated to SVM classifier to perform target identification. These approaches are tested on RadEch Database, containing radar echoes from various targets. The data has been collected in controlled test environments at the premises of Military Academy - Republic of Serbia. Thus, the results show that good performances are provided by the association of the instantaneous frequency with the MMD descriptor.
微多普勒现象是雷达回波信号由于微运动而产生的一种调频现象。实际上,微多普勒为目标识别提供了额外的信息,是对现有识别方法的补充。本文对基于时频分析(瞬时频率IF)和时尺度分析(小波变换WT和乘法多分辨率分解MMD)两种分析方法的特征提取进行了对比研究,并结合SVM分类器进行目标识别。这些方法在RadEch数据库上进行了测试,该数据库包含来自不同目标的雷达回波。数据是在塞尔维亚共和国军事学院的受控测试环境中收集的。因此,结果表明,将瞬时频率与MMD描述符相关联可以提供良好的性能。
{"title":"Micro-doppler radar signature classification by time-frequency and time-scale analysis","authors":"A. Serir, Youcef Bouhafsi","doi":"10.1109/ISSPA.2012.6310701","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310701","url":null,"abstract":"This paper deals with the micro-Doppler (m-D) phenomenon which is a frequency modulation induced on the returned radar signal due to micro-motion. Indeed, the micro-Doppler provides an additional information for target recognition that is complementary to existing recognition methods. In this paper we present a comparative study between feature extraction based on 2 types of analysis: time frequency (Instantaneous Frequency IF) analysis and time scale analysis (Wavelet Transform WT and Multiplicative Multiresolution Decomposition MMD), associated to SVM classifier to perform target identification. These approaches are tested on RadEch Database, containing radar echoes from various targets. The data has been collected in controlled test environments at the premises of Military Academy - Republic of Serbia. Thus, the results show that good performances are provided by the association of the instantaneous frequency with the MMD descriptor.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122389154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Programmable and single-shot chirped microwave pulse compression using an optical fiber-based microwave dispersive line 利用基于光纤的微波色散线进行可编程和单次啁啾微波脉冲压缩
Ming Li, J. Azaña
An approach for programmable single-shot chirped microwave pulse compression is proposed and experimentally demonstrated based on an optical fiber-based microwave dispersive line. The microwave dispersive line is implemented based on a time-spectrum convolution system. This proposed technique has three key features. The first feature is that non-periodic input chirped microwave pulses can be single-shot compressed. The second one is that the compression system is programmable by shaping the spectrum of a broadband multi-wavelength laser or by tuning the first-order dispersion of a dispersive element. The last one is that microwave dispersive line provides an ultrahigh microwave dispersion approaching several ns2 over GHz bandwidths. The microwave dispersive line used to implement single-shot microwave pulse compression is theoretically analyzed and verified by a proof-of-concept experiment.
提出了一种基于光纤微波色散线的可编程单次啁啾微波脉冲压缩方法,并进行了实验验证。微波色散线是基于时间谱卷积系统实现的。该建议的技术有三个关键特征。第一个特点是非周期输入啁啾微波脉冲可以被单次压缩。第二个是压缩系统是可编程的,通过塑造宽带多波长激光器的频谱或通过调整色散元件的一阶色散。最后是微波色散线在GHz带宽上提供了接近几个ns2的超高微波色散。对用于实现单次微波脉冲压缩的微波色散线进行了理论分析,并通过概念验证实验进行了验证。
{"title":"Programmable and single-shot chirped microwave pulse compression using an optical fiber-based microwave dispersive line","authors":"Ming Li, J. Azaña","doi":"10.1109/ISSPA.2012.6310513","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310513","url":null,"abstract":"An approach for programmable single-shot chirped microwave pulse compression is proposed and experimentally demonstrated based on an optical fiber-based microwave dispersive line. The microwave dispersive line is implemented based on a time-spectrum convolution system. This proposed technique has three key features. The first feature is that non-periodic input chirped microwave pulses can be single-shot compressed. The second one is that the compression system is programmable by shaping the spectrum of a broadband multi-wavelength laser or by tuning the first-order dispersion of a dispersive element. The last one is that microwave dispersive line provides an ultrahigh microwave dispersion approaching several ns2 over GHz bandwidths. The microwave dispersive line used to implement single-shot microwave pulse compression is theoretically analyzed and verified by a proof-of-concept experiment.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127882137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Key point reduction in SIFT descriptor used by subtractive clustering 用减法聚类方法减少SIFT描述符中的关键点
Reza Javanmard Alitappeh, Kossar Jeddi Saravi, F. Mahmoudi
The SIFT descriptor is one of the most widely used descriptors and is very stable in regard to changes in rotation, scale, affine, illumination, etc. This method is based on key points extracted from the image. If there are many such points, a lot of time will be needed in the matching and recognition phases. For this reason, we have tried in this article to use the clustering technique in order to reduce the number of key points by omitting similar points. In other words, subtractive clustering is used to select key points which are more distinct from and less similar to other points. In the section on conclusions, a successful implementation of this method is presented. The efficiencies of the proposed algorithm and of the base SIFT algorithm on the data set ALOI were investigated and it was observed that by adding this method to the base SIFT descriptor the rate of recognition increases by two percent and the time complexity decreases by 1.035728 seconds.
SIFT描述子是应用最广泛的描述子之一,在旋转、尺度、仿射、光照等变化方面都非常稳定。该方法基于从图像中提取的关键点。如果有很多这样的点,在匹配和识别阶段将需要大量的时间。出于这个原因,我们在本文中尝试使用聚类技术,通过省略相似的点来减少关键点的数量。换句话说,使用减法聚类来选择与其他点区别更大,相似度更低的关键点。在结论部分,介绍了该方法的成功实施。研究了该算法和基本SIFT算法在ALOI数据集上的效率,发现将该方法加入到基本SIFT描述符中,识别率提高了2%,时间复杂度降低了1.035728秒。
{"title":"Key point reduction in SIFT descriptor used by subtractive clustering","authors":"Reza Javanmard Alitappeh, Kossar Jeddi Saravi, F. Mahmoudi","doi":"10.1109/ISSPA.2012.6310683","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310683","url":null,"abstract":"The SIFT descriptor is one of the most widely used descriptors and is very stable in regard to changes in rotation, scale, affine, illumination, etc. This method is based on key points extracted from the image. If there are many such points, a lot of time will be needed in the matching and recognition phases. For this reason, we have tried in this article to use the clustering technique in order to reduce the number of key points by omitting similar points. In other words, subtractive clustering is used to select key points which are more distinct from and less similar to other points. In the section on conclusions, a successful implementation of this method is presented. The efficiencies of the proposed algorithm and of the base SIFT algorithm on the data set ALOI were investigated and it was observed that by adding this method to the base SIFT descriptor the rate of recognition increases by two percent and the time complexity decreases by 1.035728 seconds.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129195448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Design of a high-resolution separable-kernel quadratic TFD for improving newborn health outcomes using fetal movement detection 设计高分辨率可分离核二次TFD,通过胎儿运动检测改善新生儿健康结果
B. Boashash, T. B. Jabeur
Prior to birth, fetus health can be monitored by the variety and scale of its movements. In addition, at birth, EEG signals are recorded from at-risk newborns. Studies have shown that both fetal movements and newborn EEGs are non-stationary signals. This paper aims to represent both newborn EEG and fetal movement signals in a time-frequency domain using a specifically designed time-frequency distribution (TFD) that is well adapted to these types of data for the purpose of analysis, detection and classification. The approach to design the quadratic TFDS is based on relating separable-kernel TFDS to DSP spectral window and digital filter design. To reach this goal, we compared recently proposed TFDs such as the Modified B distribution, a separable Gaussian distribution and the B distribution. Then, an extension of the modified B distribution (MBD) is proposed, referred to as the extended separable-kernel MBD. This new TFD uses a separable kernel based on an extension of the modified B kernel in both time and frequency domain with different windows for each domain. Simulation results are provided to compare the proposed Method with different TFDs and to assess its performance. The new TFD is then first applied to real fetal movement data recorded using accelerometers.
在出生前,胎儿的健康可以通过其运动的种类和规模来监测。此外,在出生时,记录高危新生儿的脑电图信号。研究表明,胎儿运动和新生儿脑电图都是非平稳信号。本文旨在使用专门设计的时频分布(TFD)在时频域表示新生儿脑电图和胎儿运动信号,该时频分布(TFD)很好地适应了这些类型的数据,用于分析,检测和分类。将可分核TFDS与DSP频谱窗和数字滤波器设计相结合,设计了二次型TFDS。为了达到这一目标,我们比较了最近提出的tfd,如修正B分布、可分离高斯分布和B分布。然后,提出了改进B分布(MBD)的一种扩展,称为扩展可分离内核MBD。这种新的TFD在时域和频域使用基于改进B核扩展的可分离核,每个域具有不同的窗口。仿真结果与不同的tfd进行了比较,并对其性能进行了评价。然后,新的TFD首先应用于使用加速度计记录的真实胎儿运动数据。
{"title":"Design of a high-resolution separable-kernel quadratic TFD for improving newborn health outcomes using fetal movement detection","authors":"B. Boashash, T. B. Jabeur","doi":"10.1109/ISSPA.2012.6310574","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310574","url":null,"abstract":"Prior to birth, fetus health can be monitored by the variety and scale of its movements. In addition, at birth, EEG signals are recorded from at-risk newborns. Studies have shown that both fetal movements and newborn EEGs are non-stationary signals. This paper aims to represent both newborn EEG and fetal movement signals in a time-frequency domain using a specifically designed time-frequency distribution (TFD) that is well adapted to these types of data for the purpose of analysis, detection and classification. The approach to design the quadratic TFDS is based on relating separable-kernel TFDS to DSP spectral window and digital filter design. To reach this goal, we compared recently proposed TFDs such as the Modified B distribution, a separable Gaussian distribution and the B distribution. Then, an extension of the modified B distribution (MBD) is proposed, referred to as the extended separable-kernel MBD. This new TFD uses a separable kernel based on an extension of the modified B kernel in both time and frequency domain with different windows for each domain. Simulation results are provided to compare the proposed Method with different TFDs and to assess its performance. The new TFD is then first applied to real fetal movement data recorded using accelerometers.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128480265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
A study of applying subspace based pronunciation modeling in verifying pronunciation accuracy 基于子空间的语音建模在语音准确性验证中的应用研究
Shou-Chun Yin, R. Rose, Yun Tang
This paper investigates a new approach for detecting phoneme level mispronunciations from utterances obtained from impaired children with neuromuscular disorders. This new pronunciation verification (PV) approach is obtained from the subspace based Gaussian mixture model (SGMM) based pronunciation model, where a set of state level projection vectors is applied for representing phonetic variability. SGMM models are trained from disabled speakers' utterances and PV scores are computed directly from distances between disabled and reference speaker projection vectors. An experimental study was performed to evaluate the performance of the SGMM based approach with respect to an approach based on the lattice posterior probabilities. A reduction in equal error rate (EER) of approximately 15% was obtained when the SGMM based scores were combined with lattice posterior probabilities.
本文研究了一种从神经肌肉障碍患儿的话语中检测音素水平发音错误的新方法。这种发音验证方法是基于子空间高斯混合模型(SGMM)的发音模型,使用一组状态级投影向量来表示语音变异。SGMM模型从残疾人说话者的话语中训练,PV分数直接从残疾人和参考说话者投影向量之间的距离计算。进行了一项实验研究,以评估基于SGMM的方法相对于基于晶格后验概率的方法的性能。当基于SGMM的分数与晶格后验概率相结合时,等效错误率(EER)降低了约15%。
{"title":"A study of applying subspace based pronunciation modeling in verifying pronunciation accuracy","authors":"Shou-Chun Yin, R. Rose, Yun Tang","doi":"10.1109/ISSPA.2012.6310622","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310622","url":null,"abstract":"This paper investigates a new approach for detecting phoneme level mispronunciations from utterances obtained from impaired children with neuromuscular disorders. This new pronunciation verification (PV) approach is obtained from the subspace based Gaussian mixture model (SGMM) based pronunciation model, where a set of state level projection vectors is applied for representing phonetic variability. SGMM models are trained from disabled speakers' utterances and PV scores are computed directly from distances between disabled and reference speaker projection vectors. An experimental study was performed to evaluate the performance of the SGMM based approach with respect to an approach based on the lattice posterior probabilities. A reduction in equal error rate (EER) of approximately 15% was obtained when the SGMM based scores were combined with lattice posterior probabilities.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128699517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian analysis of co-training algorithm with insufficient views 视图不足情况下协同训练算法的贝叶斯分析
Luca Didaci, F. Roli
The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.
如果一个数据集允许两个不同的特征集(两个视图)表示,则可以应用协同训练算法。然而,它的最优性仅在a)每个视图的充分性和b)给定类的条件独立性的条件下被证明。我们处理条件a)不成立的情况,这在具体应用中经常发生。在这种情况下,协同训练无法收敛到最优贝叶斯分类器,因为添加到训练集中的样本不按照类条件分布分布,即使它们分配的标签是正确的。这些结果有助于更好地理解当类仅在“统计”上可分离时,协同训练算法的行为。
{"title":"A Bayesian analysis of co-training algorithm with insufficient views","authors":"Luca Didaci, F. Roli","doi":"10.1109/ISSPA.2012.6310456","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310456","url":null,"abstract":"The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115956866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Digital processing in the optical communications 光通信中的数字处理
L. Potí
Digital processing role in the optical communications is analyzed together with its historical evolution and perspectives. If optical systems at their early stage included weak digital processing in the electrical domain, a fast evolution towards completely all-optical technologies arises due to the bottleneck caused by the electronics bandwidth limitation. With the return of coherent communications, digital processing migrated again back to the electronic domain. Nowadays, algorithm complexity capable of mitigating for linear and nonlinear fibre impairments is becoming unsustainable due to huge level of parallelization and power demand. For such a reason one possible evolution will include processing both in the electronic and optical domain.
分析了数字处理技术在光通信中的作用及其发展历史和前景。如果早期的光学系统在电领域包含弱数字处理,那么由于电子带宽限制造成的瓶颈,将迅速向完全全光技术发展。随着相干通信的回归,数字处理再次迁移回电子领域。目前,由于并行化和功率需求巨大,能够减轻线性和非线性光纤损伤的算法复杂度变得不可持续。由于这样的原因,一个可能的发展将包括电子和光学领域的处理。
{"title":"Digital processing in the optical communications","authors":"L. Potí","doi":"10.1109/ISSPA.2012.6310512","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310512","url":null,"abstract":"Digital processing role in the optical communications is analyzed together with its historical evolution and perspectives. If optical systems at their early stage included weak digital processing in the electrical domain, a fast evolution towards completely all-optical technologies arises due to the bottleneck caused by the electronics bandwidth limitation. With the return of coherent communications, digital processing migrated again back to the electronic domain. Nowadays, algorithm complexity capable of mitigating for linear and nonlinear fibre impairments is becoming unsustainable due to huge level of parallelization and power demand. For such a reason one possible evolution will include processing both in the electronic and optical domain.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"532 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116707988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1