Varieeruva vältega sõnade hääldusuuringud kõnesünteesi teenistuses

Eesti Rakenduslingvistika Ühingu Aastaraamat (Q2, Arts and Humanities) · Pub Date: 2017-04-19 · DOI: 10.5128/ERYA13.08
L. Piits, Mari-Liis Kalvik
Publication type: Journal Article. Pages: 123-140.

Abstract

In this article we present a reading experiment that we use to study the pronunciation of so-called words of variable quantity degree. In defining such words we rely on the norms of the Dictionary of Standard Estonian (ÕS 2013): we study words that may be pronounced with either the second or the third quantity degree. We analyse the duration ratios of the primary-stressed syllable and the syllable following it, and compare the resulting data with the results of auditory assessment. We examine whether words with a similar syllable structure and the same part of speech share pronunciation features. From an applied perspective, the study is motivated by the need to solve the problems that this variation causes in text-to-speech synthesis. The study revealed the main trends in pronunciation preferences for words of variable quantity degree. Both syllable duration ratios and auditory assessment yielded distinct word groups in which one or the other quantity degree dominated. Analysis by word type also made it possible to identify trends in quantity variation by type; for example, all trisyllabic adjectives with the suffix -lik, with one exception, were pronounced with the third quantity degree.

"Words of variable quantity degrees as a problem for speech synthesis"

Estonian text-to-speech synthesis relies on the Dictionary of Standard Estonian (ÕS 2013), the basis of standard Estonian, to determine pronunciation. However, for roughly 300 words, this dictionary allows pronunciation with both the second and third quantity degree. This causes problems in the text-to-speech synthesis system, since the automatic text analysis cannot handle multiple outputs. One of the pronunciation variants has to be preferred in the text analysis process, so it is important to identify which variant is more common among language users in actual speech. The words chosen for the study are those that ÕS 2013 lists as being pronounced with both the second and third quantity degree.

This study is based on a reading experiment conducted with 23 informants (15 women and 8 men), in which each informant read 52 sentences aloud. These sentences contained 47 target words, i.e. words of variable quantity degrees; in total, the study yielded 1080 pronunciation instances to examine. The group of target words includes those with varying vowel quantity degrees as well as those with varying consonant quantity degrees. The duration ratios characteristic of each quantity degree were calculated on the basis of the primary-stress syllable and the unstressed syllable following it. The average duration ratio for the second quantity degree is 1.8, and for the third quantity degree 2.9. These results are similar to those obtained in previous studies.

On the basis of the informants' pronunciation, the words were grouped into three categories: second quantity degree, variable quantity degree (where neither the second nor the third quantity degree accounted for more than 2/3 of all pronunciations) and third quantity degree. Based on the duration ratio, 8 words fell into the second quantity degree group; based on auditory assessment, however, this group increased to 17 words. The variable quantity degree group contained 15 words based on duration ratios, but only 5 words based on auditory assessment. The third quantity degree group contained 24 words by duration ratio and 25 words by auditory assessment.

Finding trends in word structure among words in the same quantity degree group would make it possible to draw inferences about other words of the same type as well, which would increase the applied value of the study. Generally, though, words of the same syllable structure and part of speech did not exhibit the same pronunciation patterns. However, it can at least be stated that the third quantity degree dominated among both two- and three-syllable adjectives formed with the suffix -lik. Of the 15 such words analysed in the study, only two were pronounced predominantly with the second quantity degree.
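
The grouping rule described in the abstract can be illustrated with a minimal Python sketch. It assumes that each pronunciation instance has already been labelled as second or third quantity degree (by duration ratio or by ear); the abstract does not state the cutoff used for that labelling, so none is invented here. The function names `duration_ratio` and `classify_word` and the group labels "Q2", "Q3" and "variable" are hypothetical and do not come from the article.

```python
from fractions import Fraction


def duration_ratio(stressed_ms: float, following_ms: float) -> float:
    """Ratio of the primary-stressed syllable's duration to that of the
    unstressed syllable following it (both in the same time unit)."""
    if following_ms <= 0:
        raise ValueError("following syllable duration must be positive")
    return stressed_ms / following_ms


def classify_word(q2_count: int, q3_count: int) -> str:
    """Assign a word to a group from its per-informant labels.

    A word belongs to the Q2 or Q3 group only if that quantity degree
    accounts for MORE than 2/3 of all pronunciations; otherwise it is
    treated as variable. Fractions avoid floating-point edge cases at
    exactly 2/3.
    """
    total = q2_count + q3_count
    if total == 0:
        raise ValueError("no pronunciations recorded for this word")
    threshold = Fraction(2, 3)
    if Fraction(q2_count, total) > threshold:
        return "Q2"
    if Fraction(q3_count, total) > threshold:
        return "Q3"
    return "variable"


if __name__ == "__main__":
    # Illustrative counts for 23 informants, not data from the study.
    print(classify_word(16, 7))
    print(classify_word(15, 8))
    print(classify_word(7, 16))
```

With 23 informants, a word needs at least 16 pronunciations in one quantity degree to leave the variable group under this reading of the rule, since 15/23 falls just below 2/3.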