
Phonetics and Speech Sciences: Latest Articles

Acoustic analysis of Korean affricates produced by dysarthric speakers with cerebral palsy*
Pub Date : 2021-06-01 DOI: 10.13064/ksss.2021.13.2.045
Jihyun Mun, Sunhee Kim, Minhwa Chung
This study analyzes the acoustic characteristics of Korean affricates produced by dysarthric speakers with cerebral palsy. Korean fricatives and affricates are consonants prone to errors in dysarthric speech, but previous studies have focused only on fricatives. For this study, the three affricates /tɕ, tɕʰ, t͈ɕ/ in word-initial and intervocalic positions, produced by six male speakers with mild-to-moderate spastic dysarthria, were selected from the QOLT database constructed in 2014. The following acoustic parameters were extracted using Praat: frication duration, closure duration, center of gravity, variance, skewness, kurtosis, and central moment. The results are as follows: 1) the frication duration of intervocalic affricates produced by dysarthric speakers was significantly longer than that of non-disordered speakers; 2) the closure duration of dysarthric speakers was significantly longer; 3) there was no significant difference in center of gravity between the two groups; 4) the skewness of dysarthric speakers was significantly larger; and 5) the central moment of dysarthric speakers was significantly larger. The study thus characterizes the affricates produced by dysarthric speakers and how they differ from those of non-disordered speakers.
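The spectral measures listed in this abstract (center of gravity, variance, skewness, kurtosis) are moments of the frication spectrum. As a minimal sketch, assuming a power spectrum is already available as parallel lists of frequencies and power values, they could be computed as follows. The function name `spectral_moments` and the toy spectrum are illustrative assumptions and not the authors' Praat procedure; kurtosis here follows the common convention of subtracting 3.

```python
def spectral_moments(freqs_hz, power):
    """Return spectral moments, treating normalised power as a
    probability distribution over frequency."""
    total = sum(power)
    if total <= 0:
        raise ValueError("spectrum has no energy")
    weights = [p / total for p in power]

    # First moment: centre of gravity (power-weighted mean frequency).
    cog = sum(f * w for f, w in zip(freqs_hz, weights))

    def central_moment(order):
        # Power-weighted mean of (f - cog) ** order.
        return sum(((f - cog) ** order) * w for f, w in zip(freqs_hz, weights))

    variance = central_moment(2)
    if variance == 0:
        # All energy sits at one frequency: shape measures are undefined.
        skewness = kurtosis = float("nan")
    else:
        skewness = central_moment(3) / variance ** 1.5
        kurtosis = central_moment(4) / variance ** 2 - 3.0  # excess kurtosis
    return {"cog": cog, "variance": variance,
            "skewness": skewness, "kurtosis": kurtosis}


# Hypothetical three-bin spectrum, symmetric around 2000 Hz.
moments = spectral_moments([1000.0, 2000.0, 3000.0], [1.0, 2.0, 1.0])
print(moments)
```

A positive skewness means the energy is concentrated below the center of gravity with a tail toward higher frequencies, which is one way to read the group difference reported in result 4.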
Citations: 0
Development and effects of Nanta program using speech rhythm for children with limited speech sound production*
Pub Date : 2021-06-01 DOI: 10.13064/ksss.2021.13.2.067
Y. Park, Seong Hee Choi
Nanta means “tapping” on percussion instruments such as drums, which is the rhythm of Samulnori, a form of traditional Korean music. A Nanta speech rhythm intervention program was developed and applied to children with limited speech sound production, and its effect was investigated. The Nanta program provided audible stimulation with various levels of loudness, beats, and rhythms. It consists of three stages: respiration, phonation, and articulation with the rhythm. Six children with language development delay participated in this study. The children were encouraged to explore and freely express sounds and beats. Along with the rhythm, they were also encouraged to produce speech sounds by increasing the length of syllables in mimetic and imitative words. A total of 15 sessions were conducted twice a week, for 40 minutes per session. To examine effectiveness, raw scores from the preschool receptive-expressive scales (PRES) and the receptive-expressive vocabulary test (REVT) were obtained and compared before and after therapy. Wilcoxon signed-rank tests showed significantly improved receptive (p = .027) and expressive (p = .024) language scores on the PRES, and significantly improved receptive (p = .028) and expressive (p = .028) vocabulary scores, following the intervention. These findings suggest that the Nanta rhythm program can be useful for improving language development and vocabulary in children with limited speech sound production.
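The pre/post comparison in this abstract relies on the Wilcoxon signed-rank test. The sketch below shows one way the statistic and an exact two-sided p-value could be computed for a small paired sample by enumerating all sign assignments. The function name `wilcoxon_signed_rank` and the six score pairs are hypothetical illustrations, not data from the study, and the authors' software may have used a different variant of the test (for example, a normal approximation).

```python
from itertools import product


def wilcoxon_signed_rank(before, after):
    """Return (W+, W-, exact two-sided p) for paired samples.

    Zero differences are dropped; tied absolute differences share
    their average rank. Intended for small samples only (2**n cases).
    """
    diffs = [a - b for b, a in zip(before, after) if a != b]
    n = len(diffs)

    # Rank the absolute differences, averaging ranks within ties.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    observed = min(w_plus, w_minus)

    # Exact null distribution: every difference is equally likely
    # to be positive or negative.
    rank_total = sum(ranks)
    extreme = 0
    for signs in product((0, 1), repeat=n):
        plus = sum(r for r, s in zip(ranks, signs) if s)
        if min(plus, rank_total - plus) <= observed:
            extreme += 1
    return w_plus, w_minus, extreme / 2 ** n


# Hypothetical raw scores for six children, before and after therapy.
pre = [10, 12, 9, 14, 11, 13]
post = [13, 17, 10, 20, 13, 17]
print(wilcoxon_signed_rank(pre, post))
```

With six pairs that all change in the same direction, the exact two-sided p-value of this sketch is 2/64, the smallest value the exact test can produce at that sample size.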
Citations: 0
A study of /l/ velarization in American English based on the Buckeye Corpus
Pub Date : 2021-06-01 DOI: 10.13064/ksss.2021.13.2.019
Jae-Jin Sa
It has been widely recognized that there are two varieties of the lateral liquid /l/: light /l/ (a non-velarized allophone) and dark /l/ (a velarized allophone). However, this categorical view has been challenged in recent studies on both articulatory and acoustic grounds. The purpose of this study is to investigate whether /l/ velarization in American English should be considered a continuum and to provide supporting data. The Buckeye Speech Corpus, a database of spontaneous American English speech, was used as the material. The formant frequencies of /l/ in each syllable position were measured and analyzed statistically. The formant frequencies, especially F2 values, differed significantly across syllable positions. The results showed that there are additional, significantly different varieties of /l/ in American English, which supports the continuum view of /l/ velarization. Regarding the effect of the adjacent vowel, the backness of adjacent vowels was shown to affect the degree of /l/ velarization regardless of the syllable position of the lateral liquid. This result helps provide solid ground for the continuum view.
Citations: 0
Longitudinal music perception performance of postlingual deaf adults with cochlear implants using acoustic and/or electrical stimulation
Pub Date : 2021-06-01 DOI: 10.13064/ksss.2021.13.2.103
Son-A Chang, S. Sujin, Sungkeong Kim, Yeabitna Lee, Eun Young Lee, Hanee Kim, You-Ree Shin, Young-Myoung Chun
In this study, we investigated the longitudinal music perception of adult cochlear implant (CI) users and how acoustic stimulation combined with a CI affects their music performance. Data from a total of 163 participants were analyzed retrospectively: 96 participants used acoustic stimulation with a CI, and 67 used electrical stimulation only via a CI. Music performance data (melody identification, appreciation, and satisfaction) were collected pre-implantation and at 1 and 2 years post-implantation. Mixed repeated-measures ANOVA and Tukey-adjusted pairwise comparisons were used for the statistical analysis. In both groups, melody identification, music appreciation, and music satisfaction were significantly better at 1 and 2 years post-implantation than pre-implantation, but there was no significant difference between 1 and 2 years in any of the variables. The group using acoustic stimulation with a CI also showed better melody identification than the CI-only group. However, no differences were found in music appreciation and satisfaction between the two groups, and possible explanations are discussed. In conclusion, acoustic and/or electrical hearing devices benefit recipients' music performance over time. Although acoustic stimulation accompanied by electrical stimulation could benefit recipients in terms of listening skills, those benefits may not extend to the subjective acceptance of music. These results suggest the need for improved sound processing mechanisms and music rehabilitation.
Citations: 0
Development and clinical application of Korean-version nonword intervention to improve speech motor programming*
Pub Date : 2021-06-01 DOI: 10.13064/ksss.2021.13.2.077
Da-Hee Oh, Ji-Wan Ha
This study develops a Korean version of a nonword intervention by modifying and supplementing Rapid Syllable Transition Treatment (ReST), and determines its effect by applying it to a child with childhood apraxia of speech (CAS). Ultimately, the purpose of this study is to investigate whether nonword interventions are effective for nonword production ability and for generalization to real words. Single-subject research using an ABA design was performed with a child aged five years and six months who showed diagnostic features of CAS. The nonwords used in the intervention were tailored to the individual child. The intervention was provided in one-hour sessions, twice a week for six weeks. In all cases, performance on the treated three-syllable nonwords improved, and untreated three-syllable words, four-syllable words, and nonwords showed a generalization effect. However, the generalization of treatment effects to words was smaller than to nonwords. The nonword intervention was effective in improving the subject's speech motor programming skills. As a result, transition errors due to impaired speech motor programming were greatly reduced, and the ability to produce untreated nonwords greatly increased. However, strongly habituated word errors did not improve fully; fuller improvement would be expected with a more intensive and repetitive intervention schedule.
Citations: 1
An analysis of listening errors by Korean EFL learners from self-paced passage dictation
Pub Date : 2021-03-01 DOI: 10.13064/KSSS.2021.13.1.017
Hyesun Cho
In this study, listening errors by Korean EFL learners are comprehensively analyzed using self-paced passage dictation tasks. Fifty-five Korean EFL learners participated in the study. Listeners were asked to write down dictation passages as accurately as possible, listening to the audio as many times as they needed. The results show that (i) low-proficiency learners tend to misperceive longer phrases than high-proficiency learners, (ii) function words are more often omitted or misheard than content words, and (iii) low-proficiency learners have more difficulty with content words than high-proficiency learners do. The most frequent suffix errors were omissions of past or plural suffixes. Among the function words, the most frequent errors were found with auxiliary contractions, the infinitive marker to, and articles, mostly in environments of linking and elision. It is also shown that C-V linking, C-C linking, and elision are the primary sources of the most frequent errors. C-V linking led to errors in locating the word boundary, while C-C linking and elision resulted in omission. These errors show that Korean EFL listeners have difficulty detecting fine-grained phonetic details to the extent that native speakers can.
Citations: 2
Age and gender differences in the spectral characteristics of Korean sibilants*
Pub Date : 2021-03-01 DOI: 10.13064/KSSS.2021.13.1.037
E. Kong, Jieun Kang
While recent acoustic studies have reported associations of fronted sibilants (fricatives /s s*/ and affricates /tɕ tɕ*/) with gender in Seoul Korean, no studies have examined the relationship of these variants with adult speakers’ ages. The current study analyzes sibilant productions from 39 adult speakers born between 1942 and 2008 (19 females) in terms of spectral peak frequencies (SPFs) in frication, an acoustic index of place of articulation (POA). The results indicate some phonetic contexts where higher sibilant SPFs, i.e., more fronted POAs, are associated with younger adults, and those fronted variants are realized in a gender-differentiated manner: tense affricates and word-initial tense fricatives before /i/ in the females’ productions, and word-medial tense fricatives before /a/ in the males’ productions. The findings confirm that the distributions of the fronted sibilants are accounted for not only by the speakers’ gender but also by their ages, indicating that the fronted variants are innovative forms of realizing sibilants in Seoul Korean. In addition, the current results show that the fronted sibilant variants are not mere reflections of individuals’ physiological differences, since they are not observed across all of the examined phonetic contexts.
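The spectral peak frequency used in this abstract is the frequency at which the frication spectrum has the most energy. A minimal sketch of that idea, assuming a short frication segment is available as a list of samples, is given below; it uses a plain discrete Fourier transform for clarity rather than speed. The function name `spectral_peak_frequency`, the `min_hz` cutoff parameter, and the synthetic test signal are assumptions for illustration and do not reproduce the authors' analysis settings.

```python
import cmath
import math


def spectral_peak_frequency(samples, sample_rate_hz, min_hz=0.0):
    """Return the frequency (Hz) of the strongest DFT bin at or above min_hz."""
    n = len(samples)
    if n == 0:
        raise ValueError("no samples given")
    best_freq, best_power = None, -1.0
    # Only bins up to the Nyquist frequency carry independent information.
    for k in range(n // 2 + 1):
        freq = k * sample_rate_hz / n
        if freq < min_hz:
            continue
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(samples))
        power = abs(coeff) ** 2
        if power > best_power:
            best_freq, best_power = freq, power
    return best_freq


# Synthetic stand-in for frication: a strong 4000 Hz component
# plus a weaker 1000 Hz component, sampled at 16 kHz for 10 ms.
rate = 16000
signal = [math.sin(2 * math.pi * 4000 * t / rate)
          + 0.3 * math.sin(2 * math.pi * 1000 * t / rate)
          for t in range(160)]
print(spectral_peak_frequency(signal, rate))
```

In practice a windowed FFT over the frication interval would replace the explicit loop, and a low-frequency cutoff such as `min_hz` is one way to keep voicing energy from being picked as the peak.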
Citations: 0
Lexico-semantic interactions during the visual and spoken recognition of homonymous Korean Eojeols*
Pub Date : 2021-03-01 DOI: 10.13064/KSSS.2021.13.1.001
Joonwoo Kim, Kathleen Kang, Doyoung Yoo, Inseo Jeon, Hyun Kyung Kim, Hyeomin Nam, Jiyoung Shin, K. Nam
The present study investigated the mental representation and processing of an ambiguous word in the bimodal processing system by manipulating the lexical ambiguity of a visually or auditorily presented word. Homonyms (e.g., ‘물었다’) with more than two meanings and control words (e.g., ‘고통을’) with a single meaning were used in the experiments. The lemma frequency of words was manipulated while the relative frequency of multiple meanings of each homonym was balanced. In both experiments using the lexical decision task, a robust frequency effect and a critical interaction of word type by frequency were found. In Experiment 1, spoken homonyms yielded faster latencies relative to control words (i.e., ambiguity advantage) in the low frequency condition, while ambiguity disadvantage was found in the high frequency condition. A similar interactive pattern was found in visually presented homonyms in the subsequent Experiment 2. Taken together, the first key finding is that interdependent lexico-semantic processing can be found both in the visual and auditory processing system, which in turn suggests that semantic processing is not modality dependent, but rather takes place on the basis of general lexical knowledge. The second is that multiple semantic candidates provide facilitative feedback only when the lemma frequency of the word is relatively low.
Citations: 0
Change in lip movement during speech by aging: Based on a double vowel*
Pub Date : 2021-03-01 DOI: 10.13064/KSSS.2021.13.1.073
Hee-June Park
This study investigated changes in lip movement during speech with aging. Fifteen elderly women with an average age of 69 years and 15 young women with an average age of 22 years were selected. To measure lip movement, the ratio between the minimum and maximum points of movement when pronouncing a double vowel was analyzed in pixel units using image analysis software. For clinical utility, the software was built around an automated algorithm, and its results were compared with those of manual measurement. The study found that the range of lip width and length in double vowel tasks was smaller for the elderly group than for the young group. A strong positive correlation was found between the manual and automated methods, indicating that both are useful for extracting lip contours. These results show that the range of lip movement during speech decreases as aging progresses. Therefore, monitoring lip performance by simply measuring lip movement before aging progresses, and performing exercises to maintain lip range, will prevent pronunciation problems caused by aging.
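The two quantities the abstract relies on, a minimum-to-maximum movement ratio in pixels and the correlation between manual and automated measurements, can be illustrated with a minimal Python sketch. This is not the authors' software: the function names, the choice of minimum divided by maximum as the ratio, and the use of Pearson's r are all assumptions made for illustration.

```python
import math


def movement_ratio(measurements_px):
    """Ratio of the smallest to the largest lip measurement (in pixels).

    Assumption: the ratio is taken as min / max; the abstract does not
    state the direction of the ratio.
    """
    lo, hi = min(measurements_px), max(measurements_px)
    if hi <= 0:
        raise ValueError("measurements must contain a positive maximum")
    return lo / hi


def pearson_r(xs, ys):
    """Pearson correlation between two equally long measurement series."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two series of equal length (at least 2 values)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / (sd_x * sd_y)
```

Here `movement_ratio` would be applied to the per-frame lip widths (or lengths) of one utterance, and `pearson_r` to the paired manual and automated values across speakers.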
Citations: 1
Hyperparameter experiments on end-to-end automatic speech recognition*
Pub Date : 2021-03-01 DOI: 10.13064/KSSS.2021.13.1.045
Hyungwon Yang, Hosung Nam
End-to-end (E2E) automatic speech recognition (ASR) has achieved promising performance gains with the introduction of the self-attention network, Transformer. However, because of the training time and the number of hyperparameters, finding the optimal hyperparameter set is computationally expensive. This paper investigates the impact of hyperparameters in the Transformer network to answer two questions: which hyperparameters play a critical role in task performance, and which in training speed. The Transformer network used for training consists of an encoder network and a decoder network, combined with Connectionist Temporal Classification (CTC). We trained the model on Wall Street Journal (WSJ) SI-284 and tested it on dev93 and eval92. Seventeen hyperparameters were selected from the ESPnet training configuration, and varying ranges of values were used in the experiments. The results show that the “num blocks” and “linear units” hyperparameters in the encoder and decoder networks reduce Word Error Rate (WER) significantly. However, the performance gain is more prominent when they are altered in the encoder network. Training duration also increased linearly as the values of “num blocks” and “linear units” grew. Based on the experimental results, we collected the optimal value of each hyperparameter and reduced the WER by up to 2.9/1.9 on dev93 and eval92, respectively. and 2.6/2.5 respectively, but 3.4/3.5, and 0.8/0.6 in the decoder network. The “dropout rate” hyperparameter in the decoder network does not behave like the one in the encoder network: it reaches the lowest WER at a value of 0.1 and maintains a high WER at the other values. No meaningful results were found for “attention heads” and “self attention dropout rate”.
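Since the abstract reports every result as Word Error Rate, a minimal Python sketch of how WER is conventionally computed may help: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and this is not the scoring code used by the authors or by ESPnet.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, scoring the hypothesis "the cat sit on mat" against the reference "the cat sat on the mat" counts one substitution and one deletion over six reference words. WER is usually reported as a percentage, so this value would be multiplied by 100.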
Citations: 0