Pub Date : 2020-03-01DOI: 10.7776/ASK.2020.39.2.131
Ki-mu Yoon and Wooil Kim
{"title":"A study on user defined spoken wake-up word recognition system using deep neural network-hidden Markov model hybrid model","authors":"Ki-mu Yoon and Wooil Kim","doi":"10.7776/ASK.2020.39.2.131","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.131","url":null,"abstract":"","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"131-136"},"PeriodicalIF":0.4,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48582542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-03-01DOI: 10.7776/ASK.2020.39.2.137
Ara Bae, Wooil Kim
{"title":"Speaker verification system combining attention-long short term memory based speaker embedding and I-vector in far-field and noisy environments","authors":"Ara Bae, Wooil Kim","doi":"10.7776/ASK.2020.39.2.137","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.137","url":null,"abstract":"","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"137-142"},"PeriodicalIF":0.4,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47105107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.1.024
Seokjin Lee, Minhan Kim, Youngho Jeong
{"title":"A study on the waveform-based end-to-end deep convolutional neural network for weakly supervised sound event detection","authors":"Seokjin Lee, Minhan Kim, Youngho Jeong","doi":"10.7776/ASK.2020.39.1.024","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.024","url":null,"abstract":"","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"24-31"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.1.057
S. Kim
: The binaural beat is a subjective beating phenomenon due to the frequency difference of sounds heard in both ears, sensed by the human brain. The beating magnitude physically depends on the amplitudes of the two signals with slight different frequencies. Therefore, the binaural beat strength is expected to be affected by the difference in left and right hearing ability. In this study, 34 healthy subjects without hearing loss were chosen. They hear a pure tone sound (450 Hz) at 10 different Sound Pressure Levels (SPLs) (from 50 dB to 27.4 dB) through the one side of ears, while they hear the reference sound (440 Hz, 50 dB) through the other side of ears. Their subjective assessment using a semantic differential method reveals that the binaural beat strength decreases as SPL differences between the two sides of ears increases, if the difference is greater than 2.2 dB. The result suggests that the hearing loss difference between the two sides of ears should be less than 2.2 dB to maximize binaural
{"title":"Changes in binaural beat strength to the difference of right and left hearing ability","authors":"S. Kim","doi":"10.7776/ASK.2020.39.1.057","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.057","url":null,"abstract":": The binaural beat is a subjective beating phenomenon due to the frequency difference of sounds heard in both ears, sensed by the human brain. The beating magnitude physically depends on the amplitudes of the two signals with slight different frequencies. Therefore, the binaural beat strength is expected to be affected by the difference in left and right hearing ability. In this study, 34 healthy subjects without hearing loss were chosen. They hear a pure tone sound (450 Hz) at 10 different Sound Pressure Levels (SPLs) (from 50 dB to 27.4 dB) through the one side of ears, while they hear the reference sound (440 Hz, 50 dB) through the other side of ears. Their subjective assessment using a semantic differential method reveals that the binaural beat strength decreases as SPL differences between the two sides of ears increases, if the difference is greater than 2.2 dB. The result suggests that the hearing loss difference between the two sides of ears should be less than 2.2 dB to maximize binaural","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"57-63"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.1.016
J. Song, J. Chang, Y. Yoo
In High-Intensity Focused Ultrasound (HIFU) treatment, effective localization of HIFU focus is important for developing a safe treatment plan. While Magnetic Resonance Imaging guided HIFU (MRIgHIFU) can visualize the ultrasound path during the treatment for localizing HIFU focus, it is challenging in ultrasound imaging guided HIFU (USIgHIFU). In the present study, a real-time ultrasound beam visualization technique capable of localizing HIFU focus is presented for USIgHIFU. In the proposed method, a short pulse, with the same center frequency of an imaging ultrasound transducer below the regulated acoustic intensity (i.e., Ispta < 720 mW/cm(2)), was transmitted through a HIFU transducer whereupon backscattered signals were received by the imaging transducer. To visualize the HIFU beam path, the backscattered signals underwent dynamic receive focusing and subsequent echo processing. From in vitro experiments with bovine serum albumin gel phantoms, the HIFU beam path was clearly depicted with low acoustic intensity (i.e., Ispta of 94.8 mW/cm(2)) and the HIFU focus was successfully localized before any damages were produced. This result indicates that the proposed ultrasound beam path visualization method can be used for localizing the HIFU focus in real time while minimizing unwanted tissue damage in USIgHIFU treatment.
{"title":"High-intensity focused ultrasound beam path visualization using ultrasound imaging","authors":"J. Song, J. Chang, Y. Yoo","doi":"10.7776/ASK.2020.39.1.016","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.016","url":null,"abstract":"In High-Intensity Focused Ultrasound (HIFU) treatment, effective localization of HIFU focus is important for developing a safe treatment plan. While Magnetic Resonance Imaging guided HIFU (MRIgHIFU) can visualize the ultrasound path during the treatment for localizing HIFU focus, it is challenging in ultrasound imaging guided HIFU (USIgHIFU). In the present study, a real-time ultrasound beam visualization technique capable of localizing HIFU focus is presented for USIgHIFU. In the proposed method, a short pulse, with the same center frequency of an imaging ultrasound transducer below the regulated acoustic intensity (i.e., Ispta < 720 mW/cm(2)), was transmitted through a HIFU transducer whereupon backscattered signals were received by the imaging transducer. To visualize the HIFU beam path, the backscattered signals underwent dynamic receive focusing and subsequent echo processing. From in vitro experiments with bovine serum albumin gel phantoms, the HIFU beam path was clearly depicted with low acoustic intensity (i.e., Ispta of 94.8 mW/cm(2)) and the HIFU focus was successfully localized before any damages were produced. This result indicates that the proposed ultrasound beam path visualization method can be used for localizing the HIFU focus in real time while minimizing unwanted tissue damage in USIgHIFU treatment.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"16-23"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.1.047
In-Jee Jung, J. Ih
: The sound source localization technique has various application fields in the era of internet-of-things, for which the probe size becomes critical. The localization methods using the acoustic intensity vector has an advantage of downsizing the layout of the array owing to a small finite-difference error for the short distance between adjacent microphones. In this paper, the acoustic intensity vector and the Time Difference of Arrival (TDoA) method are compared in the viewpoint of the localization error in the far-field. The comparison is made according to the change of spacing between adjacent microphones of the three-dimensional microphone array arranged in a tetrahedral shape. An additional test is conducted in the reverberant field by varying the reverberation time to verify the effectiveness of the methods applied to the actual environments. For estimating the TDoA, the Generalized Cross Correlation-Phase transform (GCC-PHAT) algorithm is adopted in the computation. It is found that the mean localization error of the acoustic intensimetry is 2.9° and that of the GCC-PHAT is 7.3° for T 60 = 0.4 s, while the error increases as 9.9°, 13.0° for T 60 = 1.0 s, respectively. The data supports that a compact array employing the acoustic intensimetry can localize of the sound source in the actual environment with the moderate reflection conditions.
{"title":"Comparison of the sound source localization methods appropriate for a compact microphone array","authors":"In-Jee Jung, J. Ih","doi":"10.7776/ASK.2020.39.1.047","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.047","url":null,"abstract":": The sound source localization technique has various application fields in the era of internet-of-things, for which the probe size becomes critical. The localization methods using the acoustic intensity vector has an advantage of downsizing the layout of the array owing to a small finite-difference error for the short distance between adjacent microphones. In this paper, the acoustic intensity vector and the Time Difference of Arrival (TDoA) method are compared in the viewpoint of the localization error in the far-field. The comparison is made according to the change of spacing between adjacent microphones of the three-dimensional microphone array arranged in a tetrahedral shape. An additional test is conducted in the reverberant field by varying the reverberation time to verify the effectiveness of the methods applied to the actual environments. For estimating the TDoA, the Generalized Cross Correlation-Phase transform (GCC-PHAT) algorithm is adopted in the computation. It is found that the mean localization error of the acoustic intensimetry is 2.9° and that of the GCC-PHAT is 7.3° for T 60 = 0.4 s, while the error increases as 9.9°, 13.0° for T 60 = 1.0 s, respectively. The data supports that a compact array employing the acoustic intensimetry can localize of the sound source in the actual environment with the moderate reflection conditions.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"10 1","pages":"47-56"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.2.077
Ga-ram Ku, C. Cheong, Hanshin Seol
In this study, the Eulerian/Lagrangian one-way coupling method is proposed to predict flow noise due to Blade-Tip Vortex Cavitation (BTVC). The proposed method consists of four sequential steps: flow field simulation using Computational Fluid Dynamics (CFD) techniques, reconstruction of wing-tip vortex using vortex model, generation of BTVC using bubble dynamics model and acoustic wave prediction using the acoustic analogy. Because the CFD prediction of tip vortex structure generally suffers from severe under-prediction of its strength along the steamwise direction due to the intrinsic numerical damping of CFD schemes and excessive turbulence intensity, the wing-tip vortex along the freestream direction is regenerated by using the vortex modeling. Then, the bubble dynamics model based on the Rayleigh-Plesset equation was employed to simulate the generation and variation of BTVC. Finally, the flow noise due to BTVC is predicted by modeling each of spherical bubbles as a monople source whose strength is proportional to the rate of time-variation of bubble volume. The validity of the proposed numerical methods is confirmed by comparing the predicted results with the measured data.
{"title":"Numerical investigation of blade tip vortex cavitation noise using Reynolds-averaged Navier-Stokes simulation and bubble dynamics model","authors":"Ga-ram Ku, C. Cheong, Hanshin Seol","doi":"10.7776/ASK.2020.39.2.077","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.077","url":null,"abstract":"In this study, the Eulerian/Lagrangian one-way coupling method is proposed to predict flow noise due to Blade-Tip Vortex Cavitation (BTVC). The proposed method consists of four sequential steps: flow field simulation using Computational Fluid Dynamics (CFD) techniques, reconstruction of wing-tip vortex using vortex model, generation of BTVC using bubble dynamics model and acoustic wave prediction using the acoustic analogy. Because the CFD prediction of tip vortex structure generally suffers from severe under-prediction of its strength along the steamwise direction due to the intrinsic numerical damping of CFD schemes and excessive turbulence intensity, the wing-tip vortex along the freestream direction is regenerated by using the vortex modeling. Then, the bubble dynamics model based on the Rayleigh-Plesset equation was employed to simulate the generation and variation of BTVC. Finally, the flow noise due to BTVC is predicted by modeling each of spherical bubbles as a monople source whose strength is proportional to the rate of time-variation of bubble volume. The validity of the proposed numerical methods is confirmed by comparing the predicted results with the measured data.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"77-86"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.2.098
Kangtae Kim, Won-Jin Kim
{"title":"An effective method to reduce the contraction and expansion noise of air conditioner","authors":"Kangtae Kim, Won-Jin Kim","doi":"10.7776/ASK.2020.39.2.098","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.2.098","url":null,"abstract":"","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"79 1","pages":"98-104"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.4.227
Cheol-Soo Park, S. Jeong, Gun-Do Kim, I. Moon, G. Yim
This paper derives the insertion loss for the bubble layer of an air bubble curtain and an air masker which are used to reduce ocean anthropogenic noise such as the piling noise and the ship noise. The air bubble curtain is considered as a ‘fluid-air bubble layer-fluid’ model and the environment for the air masker is simplified as an ‘vacuum-thin plate-fluid-air bubble layer-fluid’ model. The air bubble layer in each model is assumed as the effective medium which has the complex wavenumber and the complex impedance corresponding to the bubble population distribution. The numerical simulations are performed to examine the insertion loss depending on the bubble population, the void fraction, and the thickness of the layer.
{"title":"Acoustic insertion loss by a bubble layer for the application to air bubble curtain and air masker","authors":"Cheol-Soo Park, S. Jeong, Gun-Do Kim, I. Moon, G. Yim","doi":"10.7776/ASK.2020.39.4.227","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.4.227","url":null,"abstract":"This paper derives the insertion loss for the bubble layer of an air bubble curtain and an air masker which are used to reduce ocean anthropogenic noise such as the piling noise and the ship noise. The air bubble curtain is considered as a ‘fluid-air bubble layer-fluid’ model and the environment for the air masker is simplified as an ‘vacuum-thin plate-fluid-air bubble layer-fluid’ model. The air bubble layer in each model is assumed as the effective medium which has the complex wavenumber and the complex impedance corresponding to the bubble population distribution. The numerical simulations are performed to examine the insertion loss depending on the bubble population, the void fraction, and the thickness of the layer.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"227-236"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-01-01DOI: 10.7776/ASK.2020.39.1.038
J. Ahn, Hyeon‑Deok Cho, Donghoon Shin, Taek-ik Kwon, Gwangtae Kim
LOw Frequency Analysis Recording (LOFAR) and Demodulation of Envelop Modulation On Noise (DEMON) grams are bearing-time-frequency plots of underwater acoustic signals, to visualize features for passive sonar. Those grams are characterized by tonal components, for which conventional data coding methods are not suitable. In this work, a novel LOFAR/DEMON gram compression algorithm based on binary map and prediction methods is proposed. We first generate a binary map, from which prediction for each frequency bin is determined, and then divide a frame into several macro blocks. For each macro block, we apply intra and inter prediction modes and compute residuals. Then, we perform the prediction of available bins in the binary map and quantize residuals for entropy coding. By transmitting the binary map and prediction modes, the decoder can reconstructs grams using the same process. Simulation results show that the proposed algorithm provides significantly better compression performance on LOFAR and DEMON grams than conventional data coding methods.
{"title":"LOFAR/DEMON grams compression method for passive sonars","authors":"J. Ahn, Hyeon‑Deok Cho, Donghoon Shin, Taek-ik Kwon, Gwangtae Kim","doi":"10.7776/ASK.2020.39.1.038","DOIUrl":"https://doi.org/10.7776/ASK.2020.39.1.038","url":null,"abstract":"LOw Frequency Analysis Recording (LOFAR) and Demodulation of Envelop Modulation On Noise (DEMON) grams are bearing-time-frequency plots of underwater acoustic signals, to visualize features for passive sonar. Those grams are characterized by tonal components, for which conventional data coding methods are not suitable. In this work, a novel LOFAR/DEMON gram compression algorithm based on binary map and prediction methods is proposed. We first generate a binary map, from which prediction for each frequency bin is determined, and then divide a frame into several macro blocks. For each macro block, we apply intra and inter prediction modes and compute residuals. Then, we perform the prediction of available bins in the binary map and quantize residuals for entropy coding. By transmitting the binary map and prediction modes, the decoder can reconstructs grams using the same process. Simulation results show that the proposed algorithm provides significantly better compression performance on LOFAR and DEMON grams than conventional data coding methods.","PeriodicalId":42689,"journal":{"name":"Journal of the Acoustical Society of Korea","volume":"39 1","pages":"38-46"},"PeriodicalIF":0.4,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71370549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}