首页 > 最新文献

2013 6th International Congress on Image and Signal Processing (CISP)最新文献

英文 中文
Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining 非排他性音频分割和索引作为音频信息挖掘的预处理
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743930
Francis F. Li
Much content related information can be extracted from recorded soundtracks, such as those of multimedia files. The soundtracks might be heuristically classified into three categories namely speech, music and ambient or event sounds. Research in the past focused on algorithms to classify audio clips in an exclusive manner. However, soundtracks from media content are often presented as overlapped mixtures of all these three types of sounds. Nonexclusive segmentation and indexing are therefore essential pre-processors for effective audio information mining and metadata generation. This paper emphasizes the importance of nonexclusive indexing and segmentation methods, identifies the challenges and proposes a universal architecture for nonexclusive segmentation and indexing as a pre-processor for audio information mining, metadata extraction and scene analysis. Related feature selection, pattern recognition and signal processing algorithms are presented and testing results discussed.
许多与内容相关的信息可以从录制的音轨中提取出来,例如多媒体文件的音轨。原声可以分为三类,即语音、音乐和环境或事件声音。过去的研究主要集中在以排他性的方式对音频片段进行分类的算法上。然而,来自媒体内容的音轨通常呈现为这三种类型声音的重叠混合物。因此,非排他分割和索引是有效的音频信息挖掘和元数据生成必不可少的预处理。本文强调了非排他性索引和分词方法的重要性,指出了存在的问题,提出了一种通用的非排他性索引和分词体系结构,作为音频信息挖掘、元数据提取和场景分析的预处理。给出了相关的特征选择、模式识别和信号处理算法,并讨论了测试结果。
{"title":"Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining","authors":"Francis F. Li","doi":"10.1109/CISP.2013.6743930","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743930","url":null,"abstract":"Much content related information can be extracted from recorded soundtracks, such as those of multimedia files. The soundtracks might be heuristically classified into three categories namely speech, music and ambient or event sounds. Research in the past focused on algorithms to classify audio clips in an exclusive manner. However, soundtracks from media content are often presented as overlapped mixtures of all these three types of sounds. Nonexclusive segmentation and indexing are therefore essential pre-processors for effective audio information mining and metadata generation. This paper emphasizes the importance of nonexclusive indexing and segmentation methods, identifies the challenges and proposes a universal architecture for nonexclusive segmentation and indexing as a pre-processor for audio information mining, metadata extraction and scene analysis. Related feature selection, pattern recognition and signal processing algorithms are presented and testing results discussed.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124200316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Complexity pursuit for unifying time series 统一时间序列的复杂性追求
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743982
Yumin Yang
Complexity pursuit is a recently developed algorithm using the gradient descent for separating interesting components from time series. It is an extension of projection pursuit to time series data and the method is closely related to blind separation of time-dependent source signals and independent component analysis. The goal is to find projections of time series that have interesting structure, defined using criteria related to Kolmogoroff complexity or coding length. In this paper, we derived a simple approximation of coding length that takes into account the nongaussianity, the autocorrelations and the variance nonstationary of the time series. We give a simple algorithm for its approximative optimization.
复杂度追踪是最近发展起来的一种利用梯度下降从时间序列中分离出感兴趣分量的算法。投影寻踪是对时间序列数据的扩展,该方法与时间相关源信号的盲分离和独立分量分析密切相关。目标是找到具有有趣结构的时间序列的投影,这些结构使用与Kolmogoroff复杂度或编码长度相关的标准定义。在本文中,我们推导了一个简单的编码长度近似值,该近似值考虑了时间序列的非高斯性、自相关性和方差非平稳性。给出了一种简单的近似优化算法。
{"title":"Complexity pursuit for unifying time series","authors":"Yumin Yang","doi":"10.1109/CISP.2013.6743982","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743982","url":null,"abstract":"Complexity pursuit is a recently developed algorithm using the gradient descent for separating interesting components from time series. It is an extension of projection pursuit to time series data and the method is closely related to blind separation of time-dependent source signals and independent component analysis. The goal is to find projections of time series that have interesting structure, defined using criteria related to Kolmogoroff complexity or coding length. In this paper, we derived a simple approximation of coding length that takes into account the nongaussianity, the autocorrelations and the variance nonstationary of the time series. We give a simple algorithm for its approximative optimization.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124554557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Error feedback based lexical entity extraction for Chinese language modeling 基于错误反馈的汉语词汇实体抽取
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743873
Yi Liu, Jing Hua, Xiangang Li, Xihong Wu
Chinese, which is quite different from western languages, has no standard definition of word. Therefore, choosing suitable lexicon plays an important role in Chinese language modeling. This paper proposes a novel method of constructing the lexicon automatically. Other than depending on statistical measures of text features, this method is directly based on the feedback of errors from the corresponding task, such as phoneme-to-grapheme conversion in this paper. The whole process consists of two iterative phases: selection of individual words from a large manual lexicon and further extraction of compound words based on Phase One. Experiments implemented on phoneme-to-grapheme conversion show that this method can achieve 1.09% and 0.38% absolute reduction in character error rate respectively for Phase One and Phase Two compared with baseline lexicons in the same size generated by the conventional method based on word frequency.
汉语与西方语言有很大的不同,它没有标准的词的定义。因此,选择合适的词汇在汉语语言建模中起着重要的作用。本文提出了一种自动构建词典的新方法。该方法不依赖于文本特征的统计度量,而是直接基于相应任务的错误反馈,例如本文中的音素-字素转换。整个过程包括两个迭代阶段:从大型人工词典中选择单个单词和在阶段一的基础上进一步提取复合词。音素-字素转换实验表明,与基于词频的常规方法生成的相同大小的基线词汇相比,该方法在第一阶段和第二阶段的字符错误率分别降低了1.09%和0.38%。
{"title":"Error feedback based lexical entity extraction for Chinese language modeling","authors":"Yi Liu, Jing Hua, Xiangang Li, Xihong Wu","doi":"10.1109/CISP.2013.6743873","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743873","url":null,"abstract":"Chinese, which is quite different from western languages, has no standard definition of word. Therefore, choosing suitable lexicon plays an important role in Chinese language modeling. This paper proposes a novel method of constructing the lexicon automatically. Other than depending on statistical measures of text features, this method is directly based on the feedback of errors from the corresponding task, such as phoneme-to-grapheme conversion in this paper. The whole process consists of two iterative phases: selection of individual words from a large manual lexicon and further extraction of compound words based on Phase One. Experiments implemented on phoneme-to-grapheme conversion show that this method can achieve 1.09% and 0.38% absolute reduction in character error rate respectively for Phase One and Phase Two compared with baseline lexicons in the same size generated by the conventional method based on word frequency.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114813839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast head pose estimation using depth data 快速头部姿态估计使用深度数据
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6745249
Ti-zhou Qiao, S. Dai
In order to estimate head pose precisely in real time with computer vision technology, an enhanced framework using depth data and random regression forest is implemented for head pose estimation. This framework bases on head position and direction point recognition to accomplish head pose estimation. When training random forest, a decision function derived from Haar-like features is used as the binary test and this test uses some data features like Gaussian Curvature and Mean Curvature besides depth value and normal vector. We also generate a large training dataset of range images of heads by virtual structured light scanning. All votes of patches are filtered by clustering and mean shift, and then mean of them are used to estimate position of feature points. Performance evaluation shows accurate pose estimation (success rate above 90%) when running at real-time speed.
为了利用计算机视觉技术实时准确估计头部姿态,提出了一种基于深度数据和随机回归森林的增强头部姿态估计框架。该框架基于头部位置和方向点识别来完成头部姿态估计。在训练随机森林时,使用类似haar特征的决策函数作为二值测试,该测试除了使用深度值和法向量外,还使用高斯曲率和均值曲率等数据特征。我们还通过虚拟结构光扫描生成了一个大型的头部距离图像训练数据集。通过聚类和mean shift对所有patch的投票进行过滤,然后使用它们的平均值来估计特征点的位置。性能评估显示,当以实时速度运行时,准确的姿态估计(成功率在90%以上)。
{"title":"Fast head pose estimation using depth data","authors":"Ti-zhou Qiao, S. Dai","doi":"10.1109/CISP.2013.6745249","DOIUrl":"https://doi.org/10.1109/CISP.2013.6745249","url":null,"abstract":"In order to estimate head pose precisely in real time with computer vision technology, an enhanced framework using depth data and random regression forest is implemented for head pose estimation. This framework bases on head position and direction point recognition to accomplish head pose estimation. When training random forest, a decision function derived from Haar-like features is used as the binary test and this test uses some data features like Gaussian Curvature and Mean Curvature besides depth value and normal vector. We also generate a large training dataset of range images of heads by virtual structured light scanning. All votes of patches are filtered by clustering and mean shift, and then mean of them are used to estimate position of feature points. Performance evaluation shows accurate pose estimation (success rate above 90%) when running at real-time speed.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114712480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Biometric-Kerberos authentication scheme for secure mobile computing services 安全移动计算服务的Biometric-Kerberos认证方案
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743949
F. Han, M. Alkhathami, R. van Schyndel
Kerberos is an authentication protocol in which client and server can mutually authenticate each other across an insecure network connection. After the identity authentication, client and server can encrypt all of subsequent communications to ensure privacy and data integrity. In this paper, a biometric Kerberos-based user identity authentication scheme is presented. In the scheme, smart phones having computing capability and an internal mobile camera are the only device required at the user-end. The combination of owner biometrics and device information will be used for identity authentication. A watermark links the device to its user. The watermark is produced and embedded by using the internal functions of smart phones entirely and the watermark embedding key is the by-product in Kerberos authentication. Only the trusted key distribution center has enough knowledge to detect and remove the watermark. The ticket for the permission to access an application resource will only be issued upon successful biometric authentication. The watermark also offers forensic traceability in a resource constraint environment. As a result, cost effective strong security can be attained in mobile computing services.
Kerberos是一种身份验证协议,客户端和服务器可以通过不安全的网络连接相互进行身份验证。通过身份认证后,客户端和服务器可以对后续的所有通信进行加密,以确保隐私和数据的完整性。提出了一种基于生物识别kerberos的用户身份认证方案。在该方案中,具有计算能力和内置移动摄像头的智能手机是用户端唯一需要的设备。车主生物特征和设备信息的结合将用于身份认证。水印将设备与其用户连接起来。水印的产生和嵌入完全利用了智能手机的内部功能,水印嵌入密钥是Kerberos认证的副产品。只有受信任的密钥分发中心才有足够的知识来检测和删除水印。访问应用程序资源的权限票据只有在生物识别认证成功后才会发出。水印还提供了资源约束环境下的取证跟踪。因此,在移动计算服务中可以获得经济有效的强安全性。
{"title":"Biometric-Kerberos authentication scheme for secure mobile computing services","authors":"F. Han, M. Alkhathami, R. van Schyndel","doi":"10.1109/CISP.2013.6743949","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743949","url":null,"abstract":"Kerberos is an authentication protocol in which client and server can mutually authenticate each other across an insecure network connection. After the identity authentication, client and server can encrypt all of subsequent communications to ensure privacy and data integrity. In this paper, a biometric Kerberos-based user identity authentication scheme is presented. In the scheme, smart phones having computing capability and an internal mobile camera are the only device required at the user-end. The combination of owner biometrics and device information will be used for identity authentication. A watermark links the device to its user. The watermark is produced and embedded by using the internal functions of smart phones entirely and the watermark embedding key is the by-product in Kerberos authentication. Only the trusted key distribution center has enough knowledge to detect and remove the watermark. The ticket for the permission to access an application resource will only be issued upon successful biometric authentication. The watermark also offers forensic traceability in a resource constraint environment. As a result, cost effective strong security can be attained in mobile computing services.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"315 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123552722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A novel resource allocation scheme based on multi-satellite terminals in MF-TDMA satellite systems 一种基于多卫星终端的MF-TDMA卫星系统资源分配新方案
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743909
D. Qiu, Jiancheng Yu, X. Lu
The future satellite communication systems should be able to accommodate the integrated services with a variety of applications and fulfill Quality of Service requirements. Regarding to the limited and costly resources, a novel resource allocation in a multi-frequency time-division multiple access (MF-TDMA) is proposed. In order to optimize packing performance, the users with more time slots are packed first. The algorithm's performance of delay and channel utilization can be validated through the simulation for different types of traffic. We further compare the performances of classical schemes. The theoretical analysis and simulation results show that that new algorithm can efficiently improve channel utilization in satellite resource allocation, especially when the number of satellite terminals is large. The algorithm is simple and easy to implement in satellite communication system.
未来的卫星通信系统应能够适应多种应用的综合业务,并满足服务质量要求。针对资源有限和昂贵的问题,提出了一种新的多频时分多址(MF-TDMA)资源分配方法。为了优化打包性能,优先打包时隙较多的用户。通过对不同类型通信量的仿真,验证了该算法的时延性能和信道利用率。我们进一步比较了经典方案的性能。理论分析和仿真结果表明,该算法能有效提高卫星资源分配中的信道利用率,特别是在卫星终端数量较大的情况下。该算法简单,易于在卫星通信系统中实现。
{"title":"A novel resource allocation scheme based on multi-satellite terminals in MF-TDMA satellite systems","authors":"D. Qiu, Jiancheng Yu, X. Lu","doi":"10.1109/CISP.2013.6743909","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743909","url":null,"abstract":"The future satellite communication systems should be able to accommodate the integrated services with a variety of applications and fulfill Quality of Service requirements. Regarding to the limited and costly resources, a novel resource allocation in a multi-frequency time-division multiple access (MF-TDMA) is proposed. In order to optimize packing performance, the users with more time slots are packed first. The algorithm's performance of delay and channel utilization can be validated through the simulation for different types of traffic. We further compare the performances of classical schemes. The theoretical analysis and simulation results show that that new algorithm can efficiently improve channel utilization in satellite resource allocation, especially when the number of satellite terminals is large. The algorithm is simple and easy to implement in satellite communication system.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122026873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin 基于普通话时频特性的子带检测语音活动算法
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743871
Yinfeng Wang, S. Huang, Ying Wei
Voice activity detection algorithms are widely used in the areas of voice compression, speech synthesis, speech recognition, speech enhancement, and etc. In this paper, an efficient voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin is proposed. The proposed sub-band detection consists of two parts: crosswise detection and lengthwise detection. Energy detection and pitch detection are in the range of considerations. For a better performance, double-threshold criterion is used to reduce the misjudgment rate of the detection. Performance evaluation is based on six noise environments with different SNRs. Experiment results indicate that the proposed algorithm can detect the area of voice effectively in non-stationary environment and low SNR environment and has the potential to progress.
语音活动检测算法广泛应用于语音压缩、语音合成、语音识别、语音增强等领域。本文提出了一种基于普通话时频特性的子带检测语音活动的高效检测算法。提出的子带检测包括两部分:横向检测和纵向检测。能量检测和基音检测都在考虑范围内。为了获得更好的检测性能,采用双阈值准则来降低检测的误判率。性能评估基于6种不同信噪比的噪声环境。实验结果表明,该算法可以在非平稳环境和低信噪比环境下有效检测语音区域,具有发展潜力。
{"title":"A voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin","authors":"Yinfeng Wang, S. Huang, Ying Wei","doi":"10.1109/CISP.2013.6743871","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743871","url":null,"abstract":"Voice activity detection algorithms are widely used in the areas of voice compression, speech synthesis, speech recognition, speech enhancement, and etc. In this paper, an efficient voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin is proposed. The proposed sub-band detection consists of two parts: crosswise detection and lengthwise detection. Energy detection and pitch detection are in the range of considerations. For a better performance, double-threshold criterion is used to reduce the misjudgment rate of the detection. Performance evaluation is based on six noise environments with different SNRs. Experiment results indicate that the proposed algorithm can detect the area of voice effectively in non-stationary environment and low SNR environment and has the potential to progress.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116803998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Design and verification of a non-invasive oil temperature measurement instrument 一种非侵入式油温测量仪的设计与验证
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743883
W. Cai, Hui Li, W. Xu, M. Dai
In this paper, a non-invasive oil temperature measurement instrument, together with the oil temperature calculation model based on the pipe surface and ambient temperatures, is presented. In order to validate the model, a verification device was developed which can measure the oil, pipe surface and ambient temperatures simultaneously. A series of model verification and optimization tests under different working conditions were carried out on the device. It can be concluded from the experiment results that the measurement instrument based on the optimized calculation model can perform non-invasive oil temperature measurement accurately.
本文提出了一种非侵入式油温测量仪,并建立了基于管道表面和环境温度的油温计算模型。为了对模型进行验证,研制了一种能同时测量油液、管道表面温度和环境温度的验证装置。对该装置进行了不同工况下的模型验证和优化试验。实验结果表明,基于优化计算模型的测量仪能够准确地进行无创油温测量。
{"title":"Design and verification of a non-invasive oil temperature measurement instrument","authors":"W. Cai, Hui Li, W. Xu, M. Dai","doi":"10.1109/CISP.2013.6743883","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743883","url":null,"abstract":"In this paper, a non-invasive oil temperature measurement instrument, together with the oil temperature calculation model based on the pipe surface and ambient temperatures, is presented. In order to validate the model, a verification device was developed which can measure the oil, pipe surface and ambient temperatures simultaneously. A series of model verification and optimization tests under different working conditions were carried out on the device. It can be concluded from the experiment results that the measurement instrument based on the optimized calculation model can perform non-invasive oil temperature measurement accurately.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123928978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Soft decision based Laplacian model factor estimation for noisy speech enhancement 基于拉普拉斯模型因子估计的软决策噪声语音增强
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6743878
S. Ou, Haidong Sun, Yanqin Zhang, Ying Gao
The Laplacian model factor estimation is a critical link for noisy speech enhancement technique employing Laplacian statistical model priori of clean speech. In this letter, we propose a novel estimation algorithm for this parameter based on soft decision in discrete cosine transform domain. As the speech signal is not always present in the noisy speech signal at all components, we first compute the speech presence probability which is decided in each discrete cosine transform component, and then based on the minimum mean square error estimation theory, the Laplacian model factor is estimated in the speech presence stage. Simulation experiment results demonstrate that the proposed algorithm possesses improved performance than that of the conventional method under different noisy conditions and levels.
拉普拉斯模型因子估计是利用干净语音的拉普拉斯先验统计模型进行噪声语音增强技术的关键环节。在本文中,我们提出了一种新的基于离散余弦变换域软判决的参数估计算法。由于语音信号并不总是存在于噪声语音信号的所有分量中,我们首先计算在每个离散余弦变换分量中确定的语音存在概率,然后基于最小均方误差估计理论,在语音存在阶段估计拉普拉斯模型因子。仿真实验结果表明,在不同的噪声条件和噪声水平下,该算法比传统方法具有更好的性能。
{"title":"Soft decision based Laplacian model factor estimation for noisy speech enhancement","authors":"S. Ou, Haidong Sun, Yanqin Zhang, Ying Gao","doi":"10.1109/CISP.2013.6743878","DOIUrl":"https://doi.org/10.1109/CISP.2013.6743878","url":null,"abstract":"The Laplacian model factor estimation is a critical link for noisy speech enhancement technique employing Laplacian statistical model priori of clean speech. In this letter, we propose a novel estimation algorithm for this parameter based on soft decision in discrete cosine transform domain. As the speech signal is not always present in the noisy speech signal at all components, we first compute the speech presence probability which is decided in each discrete cosine transform component, and then based on the minimum mean square error estimation theory, the Laplacian model factor is estimated in the speech presence stage. Simulation experiment results demonstrate that the proposed algorithm possesses improved performance than that of the conventional method under different noisy conditions and levels.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123941907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Polarimetric detection for vector-sensor array in quaternion Gaussian proper noise 四元数高斯固有噪声中矢量传感器阵列的偏振检测
Pub Date : 2013-12-01 DOI: 10.1109/CISP.2013.6745232
Yikai Wang, W. Xia, Zishu He
The passive polarimetric detection problem for vector-sensor array is formulated based on quaternion. The quaternion-formed detectors are deduced and analyzed in the presence of different quaternion proper noise. The properness of quaternion Gaussian noise, especially for the C-proper and Q-proper cases in this paper, is discussed based on the second order statistics (SOS) of the complex components of quaternion noise. Numerical simulations and statistical analysis demonstrate that the C-proper, Q-proper and complex detectors are equivalent in the Q-proper case, and that the Q-proper detector behaves poorer than C-proper and complex detectors in the C-proper case.
提出了基于四元数的矢量传感器阵列被动极化检测问题。推导并分析了不同四元数固有噪声存在下的四元数探测器。基于四元数高斯噪声复分量的二阶统计量(SOS),讨论了四元数高斯噪声的性质,特别是c -固有和q -固有情况。数值模拟和统计分析表明,在q -固有情况下,c -固有、q -固有和复探测器是等效的,而在c -固有情况下,q -固有探测器的性能比c -固有和复探测器差。
{"title":"Polarimetric detection for vector-sensor array in quaternion Gaussian proper noise","authors":"Yikai Wang, W. Xia, Zishu He","doi":"10.1109/CISP.2013.6745232","DOIUrl":"https://doi.org/10.1109/CISP.2013.6745232","url":null,"abstract":"The passive polarimetric detection problem for vector-sensor array is formulated based on quaternion. The quaternion-formed detectors are deduced and analyzed in the presence of different quaternion proper noise. The properness of quaternion Gaussian noise, especially for the C-proper and Q-proper cases in this paper, is discussed based on the second order statistics (SOS) of the complex components of quaternion noise. Numerical simulations and statistical analysis demonstrate that the C-proper, Q-proper and complex detectors are equivalent in the Q-proper case, and that the Q-proper detector behaves poorer than C-proper and complex detectors in the C-proper case.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124605722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2013 6th International Congress on Image and Signal Processing (CISP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1