首页 > 最新文献

Journal of the Acoustical Society of Korea最新文献

英文 中文
Design of piezoelectric micro-machined ultrasonic transducer for wideband ultasonic radiation in air 用于空气中宽带超声辐射的压电微机械超声换能器设计
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.2.087
Hongmin Ahn, Jaehyeok Jin, W. Moon
In this paper, the design of piezoelectric Micro-machined Ultrasonic Transducer (pMUT) for wideband ultrasonic radiation in air was investigated. One of the methods to achieve wide frequency bandwidth in single device is modeling the transducer to multi-resonance system. The new pMUT was designed as a multi-resonance system with the addition of a suitable acoustic structure to the front and back of a thin film structure. A new pMUT consisting of thin film parts, radiation parts, and packaging parts is designed with a Lumped Parameter Model (L.P.M). Finally, it was validated as a Finite Element Method (FEM) simulation. The final designed pMUT achieved a frequency band of 102 kHz ~ 132 kHz (-3 dB).
本文研究了用于空气中宽带超声辐射的压电微机械超声换能器的设计。实现单器件宽频带的方法之一是将换能器建模为多谐振系统。新的pMUT被设计成一个多共振系统,在薄膜结构的前后增加了合适的声学结构。采用集总参数模型(l.p.m.)设计了一种由薄膜部件、辐射部件和封装部件组成的新型pMUT。最后,通过有限元仿真对其进行了验证。最终设计的pMUT实现了102 kHz ~ 132 kHz (-3 dB)的频段。
{"title":"Design of piezoelectric micro-machined ultrasonic transducer for wideband ultasonic radiation in air","authors":"Hongmin Ahn, Jaehyeok Jin, W. Moon","doi":"10.7776/ASK.2020.39.2.087","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.087","url":null,"abstract":"In this paper, the design of piezoelectric Micro-machined Ultrasonic Transducer (pMUT) for wideband ultrasonic radiation in air was investigated. One of the methods to achieve wide frequency bandwidth in single device is modeling the transducer to multi-resonance system. The new pMUT was designed as a multi-resonance system with the addition of a suitable acoustic structure to the front and back of a thin film structure. A new pMUT consisting of thin film parts, radiation parts, and packaging parts is designed with a Lumped Parameter Model (L.P.M). Finally, it was validated as a Finite Element Method (FEM) simulation. The final designed pMUT achieved a frequency band of 102 kHz ~ 132 kHz (-3 dB).","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"87-97"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
L 1 norm-recursive least squares algorithm for the robust sparse acoustic communication channel estimation 鲁棒稀疏声通信信道估计的l1范数递归最小二乘算法
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.1.032
Jun-Seok Lim, Yonggook Pyeon, Sungil Kim
{"title":"L 1 norm-recursive least squares algorithm for the robust sparse acoustic communication channel estimation","authors":"Jun-Seok Lim, Yonggook Pyeon, Sungil Kim","doi":"10.7776/ASK.2020.39.1.032","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.032","url":null,"abstract":"","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"32-37"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A study on the broadband beam pattern synthesis using spatial response variation 基于空间响应变化的宽带波束方向图合成研究
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.3.200
Jun-Seok Lim, Keunhwa Lee, J. Ahn
In this paper, we propose a broadband beamforming method using the Spatial Response Variation (SRV) which is defined to measure the fluctuation of the array spatial response within the desired frequency band. By applying the SRV to regularization term, we achieve a good quality main beam width variation less than 1 degree within the desired frequency band. In design experiments, we show that the proposed method is better than the existing method.
在本文中,我们提出了一种利用空间响应变化(SRV)的宽带波束形成方法,该方法被定义为测量阵列在期望频带内空间响应的波动。通过将SRV应用于正则化项,我们在期望的频带内获得了小于1度的高质量主波束宽度变化。在设计实验中,我们证明了该方法优于现有方法。
{"title":"A study on the broadband beam pattern synthesis using spatial response variation","authors":"Jun-Seok Lim, Keunhwa Lee, J. Ahn","doi":"10.7776/ASK.2020.39.3.200","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.3.200","url":null,"abstract":"In this paper, we propose a broadband beamforming method using the Spatial Response Variation (SRV) which is defined to measure the fluctuation of the array spatial response within the desired frequency band. By applying the SRV to regularization term, we achieve a good quality main beam width variation less than 1 degree within the desired frequency band. In design experiments, we show that the proposed method is better than the existing method.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"200-206"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acoustic parabolic equation model with a directional source 有方向性声源的声波抛物方程模型
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.1.001
Keunhwa Lee
: The acoustic parabolic equation method in the ocean is an efficient technique to calculate the acoustic field in the range-dependent environment, emanating from a point source. However, we often need to use the directional source with a main beam in the practical problem. In this paper, we present two methods to implement the directional source in the acoustic parabolic equation code easily. One is simply to filter the Delta function idealized as an omni-directional point source. Another method is based on the rational filtering of the self-starter solution. It has a limitation not to separate the up-going and the down-going wave for the depth, but would be useful in implementing the mode propagation. Numerical examples for validation are given in the Pekeris environment and the deep sea environment
海洋声抛物方程法是一种有效的计算距离相关环境中由点源发出的声场的方法。然而,在实际问题中,我们经常需要使用带主波束的定向源。本文提出了两种易于实现声抛物方程代码中定向源的方法。一种是简单地过滤作为全向点源的理想函数。另一种方法是基于自启动解的理性滤波。虽然不能对深度进行上行波和下行波的分离,但对实现模态传播是有帮助的。给出了Pekeris环境和深海环境下的数值算例进行验证。
{"title":"Acoustic parabolic equation model with a directional source","authors":"Keunhwa Lee","doi":"10.7776/ASK.2020.39.1.001","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.001","url":null,"abstract":": The acoustic parabolic equation method in the ocean is an efficient technique to calculate the acoustic field in the range-dependent environment, emanating from a point source. However, we often need to use the directional source with a main beam in the practical problem. In this paper, we present two methods to implement the directional source in the acoustic parabolic equation code easily. One is simply to filter the Delta function idealized as an omni-directional point source. Another method is based on the rational filtering of the self-starter solution. It has a limitation not to separate the up-going and the down-going wave for the depth, but would be useful in implementing the mode propagation. Numerical examples for validation are given in the Pekeris environment and the deep sea environment","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"1-7"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Signal synchronization method for depth information transmission of high-speed underwater vehicle 高速水下航行器深度信息传输的信号同步方法
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.2.069
Joohyeong Lee, Guen‑Hyeok Lee, An Jeongha, Ki-Man Kim, M. Han, S. Kim
This paper deals with a method of transmitting depth information of a high-speed underwater vehicle. The depth information signal transmitted from the high-speed mobile object is received with high frequency variability. In the proposed method, we apply not only frequency synchronization but also additional synchronization on the time axis like the existing method. In the case of a Doppler frequency bank with less resolution than the conventional method through simulations performed in the environment moving up to 50 kn, and the depth information is recovered using the proposed method, the error rate of 6 % ~ 9 % is reduced to 0.2 % ~ 1 %.
本文研究了一种高速水下航行器的深度信息传输方法。高速移动物体发射的深度信息信号具有高频变异性。在该方法中,我们不仅采用频率同步,而且像现有方法一样在时间轴上进行额外的同步。在多普勒频率组分辨率低于常规方法的情况下,通过在移动至50kn的环境中进行仿真,利用该方法恢复深度信息,将6% ~ 9%的错误率降低到0.2% ~ 1%。
{"title":"Signal synchronization method for depth information transmission of high-speed underwater vehicle","authors":"Joohyeong Lee, Guen‑Hyeok Lee, An Jeongha, Ki-Man Kim, M. Han, S. Kim","doi":"10.7776/ASK.2020.39.2.069","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.069","url":null,"abstract":"This paper deals with a method of transmitting depth information of a high-speed underwater vehicle. The depth information signal transmitted from the high-speed mobile object is received with high frequency variability. In the proposed method, we apply not only frequency synchronization but also additional synchronization on the time axis like the existing method. In the case of a Doppler frequency bank with less resolution than the conventional method through simulations performed in the environment moving up to 50 kn, and the depth information is recovered using the proposed method, the error rate of 6 % ~ 9 % is reduced to 0.2 % ~ 1 %.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"69-76"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization 利用归一化空间协方差矩阵和多通道非负矩阵分解进行欠定盲源分离
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.2.120
Son‑Mook Oh, Jung Han Kim
This paper solves the problem in underdetermined convolutive mixture by improving the disadvantages of the multichannel nonnegative matrix factorization technique widely used in blind source separation. In conventional researches based on Spatial Covariance Matrix (SCM), each element composed of values such as power gain of single channel and correlation tends to degrade the quality of the separated sources due to high variance. In this paper, level and frequency normalization is performed to effectively cluster the estimated sources. Therefore, we propose a novel SCM and an effective distance function for cluster pairs. In this paper, the proposed SCM is used for the initialization of the spatial model and used for hierarchical agglomerative clustering in the bottom-up approach. The proposed algorithm was experimented using the ‘Signal Separation Evaluation Campaign 2008 development dataset’. As a result, the improvement in most of the performance indicators was confirmed by utilizing the ‘Blind Source Separation Eval toolbox’, an objective source separation quality verification tool, and especially the performance superiority of the typical SDR of 1 dB to 3.5 dB was verified.
本文通过改进多通道非负矩阵分解技术在盲源分离中的缺点,解决了欠定卷积混合问题。在传统的基于空间协方差矩阵(SCM)的研究中,由单通道功率增益和相关系数等组成的各分量由于方差大,容易降低分离源的质量。本文采用水平归一化和频率归一化对估计的源进行有效聚类。因此,我们提出了一种新的SCM和有效的簇对距离函数。在本文中,本文提出的SCM用于空间模型的初始化,并在自下而上的方法中用于分层聚集聚类。使用“信号分离评估运动2008开发数据集”对所提出的算法进行了实验。结果,利用客观的信源分离质量验证工具“盲源分离评估工具箱”证实了大部分性能指标的改善,特别是验证了典型SDR在1 dB ~ 3.5 dB范围内的性能优势。
{"title":"Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization","authors":"Son‑Mook Oh, Jung Han Kim","doi":"10.7776/ASK.2020.39.2.120","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.120","url":null,"abstract":"This paper solves the problem in underdetermined convolutive mixture by improving the disadvantages of the multichannel nonnegative matrix factorization technique widely used in blind source separation. In conventional researches based on Spatial Covariance Matrix (SCM), each element composed of values such as power gain of single channel and correlation tends to degrade the quality of the separated sources due to high variance. In this paper, level and frequency normalization is performed to effectively cluster the estimated sources. Therefore, we propose a novel SCM and an effective distance function for cluster pairs. In this paper, the proposed SCM is used for the initialization of the spatial model and used for hierarchical agglomerative clustering in the bottom-up approach. The proposed algorithm was experimented using the ‘Signal Separation Evaluation Campaign 2008 development dataset’. As a result, the improvement in most of the performance indicators was confirmed by utilizing the ‘Blind Source Separation Eval toolbox’, an objective source separation quality verification tool, and especially the performance superiority of the typical SDR of 1 dB to 3.5 dB was verified.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"120-130"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discrete-time approximation and modeling of a broadband underwater propagation channel based on eigenray analysis 基于本征射线分析的宽带水下传播信道离散时间逼近与建模
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.3.216
Donghoon Shin, Hyeon‑Deok Cho, Taek-ik Kwon, J. Ahn
In this paper, broadband underwater propagation channel modeling based on eigenray analysis is discussed. Underwater channels are often formulated in frequency domain time-harmonic signals, which are impractical for simulating broadband signals in time domain. In this regard, time domain modeling of the underwater propagation channel is required for the simulation of broadband signals, for which the eigenray analysis based on ray tracing, resulting in multipath propagation delays in time-domain, is used in this paper. For discrete time system application, the phase, frequency-dependent loss and non-integer sample delays for each eigenray, are approximated by the finite impulse response of the broadband propagation channel.
本文讨论了基于本征射线分析的宽带水下传播信道建模方法。水下信道通常用频域时谐信号来表示,这对于时域模拟宽带信号是不现实的。为此,宽带信号的仿真需要对水下传播信道进行时域建模,本文采用基于光线追踪的特征射线分析,在时域上产生多径传播延迟。对于离散时间系统的应用,每个特征射线的相位、频率相关损耗和非整数样本延迟都近似于宽带传播信道的有限脉冲响应。
{"title":"Discrete-time approximation and modeling of a broadband underwater propagation channel based on eigenray analysis","authors":"Donghoon Shin, Hyeon‑Deok Cho, Taek-ik Kwon, J. Ahn","doi":"10.7776/ASK.2020.39.3.216","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.3.216","url":null,"abstract":"In this paper, broadband underwater propagation channel modeling based on eigenray analysis is discussed. Underwater channels are often formulated in frequency domain time-harmonic signals, which are impractical for simulating broadband signals in time domain. In this regard, time domain modeling of the underwater propagation channel is required for the simulation of broadband signals, for which the eigenray analysis based on ray tracing, resulting in multipath propagation delays in time-domain, is used in this paper. For discrete time system application, the phase, frequency-dependent loss and non-integer sample delays for each eigenray, are approximated by the finite impulse response of the broadband propagation channel.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"216-225"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Noise distribution analysis and noise barrier measures of thermal power plant 火电厂噪声分布分析及隔声措施
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.2.105
J. Yun, Won-Jin Kim
: An analysis model of noise map is proposed to evaluate and reduce the acoustical noise of power plant and its surroundings. The sound powers of many noise sources are estimated by measuring the sound levels of major equipments in the power plant. The analysis of noise has been made by using ENPro that is a commercial program for environmental noise prediction. The proposed model is verified by comparing the results from noise analysis and measurement at several points of the power plant units 1 through 4, and residential areas. It is shown that noise map simulation using the proposed model has a reliability, since the overall noise level approximates within the error of ±2 dB. Furthermore, through noise analysis, the increasing effect of noise due to newly established units 5 and 6 on residential areas is also analyzed. Consequently, the noise barrier is designed to meet an environmental noise standard and satisfy low cost and safety conditions.
{"title":"Noise distribution analysis and noise barrier measures of thermal power plant","authors":"J. Yun, Won-Jin Kim","doi":"10.7776/ASK.2020.39.2.105","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.105","url":null,"abstract":": An analysis model of noise map is proposed to evaluate and reduce the acoustical noise of power plant and its surroundings. The sound powers of many noise sources are estimated by measuring the sound levels of major equipments in the power plant. The analysis of noise has been made by using ENPro that is a commercial program for environmental noise prediction. The proposed model is verified by comparing the results from noise analysis and measurement at several points of the power plant units 1 through 4, and residential areas. It is shown that noise map simulation using the proposed model has a reliability, since the overall noise level approximates within the error of ±2 dB. Furthermore, through noise analysis, the increasing effect of noise due to newly established units 5 and 6 on residential areas is also analyzed. Consequently, the noise barrier is designed to meet an environmental noise standard and satisfy low cost and safety conditions.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"105-112"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of environmental sound classification performance of convolutional neural networks according to audio preprocessing methods 基于音频预处理方法的卷积神经网络环境声音分类性能比较
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.3.143
W. Oh
: This paper presents the effect of the feature extraction methods used in the audio preprocessing on the classification performance of the Convolutional Neural Networks (CNN). We extract mel spectrogram, log mel spectrogram, Mel Frequency Cepstral Coefficient (MFCC)
本文研究了音频预处理中使用的特征提取方法对卷积神经网络(CNN)分类性能的影响。我们从urbanansound8k数据集中提取了mel谱图、对数mel谱图、mel频倒系数(MFCC)和delta MFCC,这些数据被广泛用于环境声音分类研究。然后我们将数据缩放到3个分布。利用这些数据,我们测试了四种cnn、VGG16和MobileNetV2网络,根据音频特征和缩放进行性能评估。当使用未缩放的对数谱作为音频特征时,识别率最高。虽然这个结果并不适用于所有的音频识别问题,但对于Urbansound8K中包含的环境声音分类是有用的。
{"title":"Comparison of environmental sound classification performance of convolutional neural networks according to audio preprocessing methods","authors":"W. Oh","doi":"10.7776/ASK.2020.39.3.143","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.3.143","url":null,"abstract":": This paper presents the effect of the feature extraction methods used in the audio preprocessing on the classification performance of the Convolutional Neural Networks (CNN). We extract mel spectrogram, log mel spectrogram, Mel Frequency Cepstral Coefficient (MFCC)","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"143-149"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Applying feature normalization based on pole filtering to short-utterance speech recognition using deep neural network 基于极点滤波的特征归一化在深度神经网络短话语语音识别中的应用
IF 0.4 Q4 ACOUSTICS Pub Date : 2020-01-01 DOI: 10.7776/ASK.2020.39.1.064
J. Han, M. Kim, H. S. Kim
In a conventional speech recognition system using Gaussian Mixture Model-Hidden Markov Model (GMM-HMM), the cepstral feature normalization method based on pole filtering was effective in improving the performance of recognition of short utterances in noisy environments. In this paper, the usefulness of this method for the state-of-the-art speech recognition system using Deep Neural Network (DNN) is examined. Experimental results on AURORA 2 DB show that the cepstral mean and variance normalization based on pole filtering improves the recognition performance of very short utterances compared to that without pole filtering, especially when there is a large mismatch between the training and test conditions.
在基于高斯混合模型-隐马尔可夫模型(GMM-HMM)的传统语音识别系统中,基于极点滤波的倒谱特征归一化方法可以有效地提高噪声环境下短话语的识别性能。在本文中,研究了该方法在使用深度神经网络(DNN)的最新语音识别系统中的实用性。在AURORA 2db上的实验结果表明,基于极点滤波的倒谱均值和方差归一化方法在训练条件和测试条件不匹配较大的情况下,对极短语音的识别性能明显优于无极点滤波的归一化方法。
{"title":"Applying feature normalization based on pole filtering to short-utterance speech recognition using deep neural network","authors":"J. Han, M. Kim, H. S. Kim","doi":"10.7776/ASK.2020.39.1.064","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.064","url":null,"abstract":"In a conventional speech recognition system using Gaussian Mixture Model-Hidden Markov Model (GMM-HMM), the cepstral feature normalization method based on pole filtering was effective in improving the performance of recognition of short utterances in noisy environments. In this paper, the usefulness of this method for the state-of-the-art speech recognition system using Deep Neural Network (DNN) is examined. Experimental results on AURORA 2 DB show that the cepstral mean and variance normalization based on pole filtering improves the recognition performance of very short utterances compared to that without pole filtering, especially when there is a large mismatch between the training and test conditions.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"64-68"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Acoustical Society of Korea
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1