Muhammad Swaileh A. Alzaidi , Yi Xu , Anqi Xu , Marta Szreder
{"title":"Analysis and computational modelling of Emirati Arabic intonation – A preliminary study","authors":"Muhammad Swaileh A. Alzaidi , Yi Xu , Anqi Xu , Marta Szreder","doi":"10.1016/j.wocn.2023.101236","DOIUrl":null,"url":null,"abstract":"<div><p>This study is a preliminary investigation of intonation in Emirati Arabic (EA) (an under-researched Arabic dialect), using systematic acoustic analysis and computational modelling. First, we investigated the prosodic realisation of information focus and contrastive focus at sentence-initial, -penultimate and -final positions. The analysis of 1980 EA utterances produced by eleven EA native speakers revealed that (1) in focused words, only contrastive focus is realised with expanded excursion size, longer duration, and stronger intensity relative to their neutral focus counterparts, (2) post-focus words have a lower <em>f</em><sub>0</sub> and weaker intensity in both contrastive focus and information focus, and (3) pre-focus words have compressed excursion size and relatively short duration. We then used computational modelling to test how much of the EA intonation could be captured by the PENTA model, with focus-defined functional categories and a number of other, putative categories. PENTAtrainer was trained on syllable-sized multi-functional targets from a subset of the production data. The model then generated <em>f</em><sub>0</sub> contours with the learned targets and imposed them on resynthesised speech for perceptual evaluation. A comparison of the model-generated <em>f</em><sub>0</sub> contours with the natural <em>f</em><sub>0</sub> contours showed that not only focus but also weight, stress, position of word-level stressed syllable and prosodic word are important factors determining the fine details of EA intonation. A perceptual test with native EA listeners showed that the synthetic EA <em>f</em><sub>0</sub> contours sounded nearly as natural as the original intonation, and could convey focus nearly as accurately as natural intonation.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"98 ","pages":"Article 101236"},"PeriodicalIF":1.9000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Phonetics","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0095447023000256","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This study is a preliminary investigation of intonation in Emirati Arabic (EA) (an under-researched Arabic dialect), using systematic acoustic analysis and computational modelling. First, we investigated the prosodic realisation of information focus and contrastive focus at sentence-initial, -penultimate and -final positions. The analysis of 1980 EA utterances produced by eleven EA native speakers revealed that (1) in focused words, only contrastive focus is realised with expanded excursion size, longer duration, and stronger intensity relative to their neutral focus counterparts, (2) post-focus words have a lower f0 and weaker intensity in both contrastive focus and information focus, and (3) pre-focus words have compressed excursion size and relatively short duration. We then used computational modelling to test how much of the EA intonation could be captured by the PENTA model, with focus-defined functional categories and a number of other, putative categories. PENTAtrainer was trained on syllable-sized multi-functional targets from a subset of the production data. The model then generated f0 contours with the learned targets and imposed them on resynthesised speech for perceptual evaluation. A comparison of the model-generated f0 contours with the natural f0 contours showed that not only focus but also weight, stress, position of word-level stressed syllable and prosodic word are important factors determining the fine details of EA intonation. A perceptual test with native EA listeners showed that the synthetic EA f0 contours sounded nearly as natural as the original intonation, and could convey focus nearly as accurately as natural intonation.
期刊介绍:
The Journal of Phonetics publishes papers of an experimental or theoretical nature that deal with phonetic aspects of language and linguistic communication processes. Papers dealing with technological and/or pathological topics, or papers of an interdisciplinary nature are also suitable, provided that linguistic-phonetic principles underlie the work reported. Regular articles, review articles, and letters to the editor are published. Themed issues are also published, devoted entirely to a specific subject of interest within the field of phonetics.