An Auditory-Based Monaural Feature for Noisy and Reverberant Speech Enhancement

Yi-jiao Jiang, Runsheng Liu, Ya Bai
{"title":"An Auditory-Based Monaural Feature for Noisy and Reverberant Speech Enhancement","authors":"Yi-jiao Jiang, Runsheng Liu, Ya Bai","doi":"10.1109/CIIS.2017.23","DOIUrl":null,"url":null,"abstract":"The deep neural networks (DNN) based speech enhancements is a hot topic in machine learning and speech enhancement application. Even with deep neural network, it is still hard to improve the speech quality on noisy and reverberant conditions. For machine learning based system, auditory feature extraction becomes the key point in speech enhancement and recognition. In this paper, we proposed a speech enhancement framework based on an auditory-based monaural feature, which model the function of human hearing auditory system. The auditory based feature is extracted from the data passing the gammatone filter banks, which has more detail on low frequency than normal filters. Systemic tests show the better performance of the proposed auditory based monaural feature than the mel-frequency cepstral coefficients (MFCC) in noise and reverberant environment.","PeriodicalId":254342,"journal":{"name":"2017 International Conference on Computing Intelligence and Information System (CIIS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing Intelligence and Information System (CIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIIS.2017.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The deep neural networks (DNN) based speech enhancements is a hot topic in machine learning and speech enhancement application. Even with deep neural network, it is still hard to improve the speech quality on noisy and reverberant conditions. For machine learning based system, auditory feature extraction becomes the key point in speech enhancement and recognition. In this paper, we proposed a speech enhancement framework based on an auditory-based monaural feature, which model the function of human hearing auditory system. The auditory based feature is extracted from the data passing the gammatone filter banks, which has more detail on low frequency than normal filters. Systemic tests show the better performance of the proposed auditory based monaural feature than the mel-frequency cepstral coefficients (MFCC) in noise and reverberant environment.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于听觉的噪声和混响语音增强单声特征
基于深度神经网络(DNN)的语音增强是机器学习和语音增强应用中的一个热点。即使使用深度神经网络,在噪声和混响条件下仍然难以提高语音质量。对于基于机器学习的系统,听觉特征提取成为语音增强和识别的关键。本文提出了一种基于听觉的单声特征的语音增强框架,该框架模拟了人类听觉系统的功能。基于听觉的特征是从通过伽马酮滤波器组的数据中提取出来的,它比普通滤波器在低频上有更多的细节。系统测试表明,在噪声和混响环境下,基于听觉的单声特征比基于梅尔频倒谱系数(MFCC)的单声特征表现更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Network Traffic Anomaly Detection Based on Dynamic Programming Study on the Robustness Based on PID Fuzzy Controller The Best Performance Evaluation of Encryption Algorithms to Reduce Power Consumption in WSN Non-redundant Distributed Database Allocation Technology Research Research and Implementation Based on Three-Dimensional Model Watermarking Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1