首页 > 最新文献

Workshop on Spoken Language Technologies for Under-resourced Languages最新文献

英文 中文
A Corpus of the Sorani Kurdish Folkloric Lyrics 索拉尼族库尔德民歌歌词语料库
Pub Date : 2020-05-16 DOI: 10.13025/1YDH-EW61
Sina Ahmadi, Hossein Hassani, K. Abedi
Kurdish poetry and prose narratives were historically transmitted orally and less in a written form. Being an essential medium of oral narration and literature, Kurdish lyrics have had a unique attribute in becoming a vital resource for different types of studies, including Digital Humanities, Computational Folkloristics and Computational Linguistics. As an initial study of its kind for the Kurdish language, this paper presents our efforts in transcribing and collecting Kurdish folk lyrics as a corpus that covers various Kurdish musical genres, in particular Beyt, Gorani, Bend, and Heyran. We believe that this corpus contributes to Kurdish language processing in several ways, such as compensation for the lack of a long history of written text by incorporating oral literature, presenting an unexplored realm in Kurdish language processing, and assisting the initiation of Kurdish computational folkloristics. Our corpus contains 49,582 tokens in the Sorani dialect of Kurdish. The corpus is publicly available in the Text Encoding Initiative (TEI) format for non-commercial use.
库尔德诗歌和散文叙事在历史上是口头传播的,很少以书面形式传播。作为口头叙述和文学的重要媒介,库尔德歌词具有独特的属性,成为不同类型研究的重要资源,包括数字人文、计算民俗学和计算语言学。作为对库尔德语的初步研究,本文介绍了我们在转录和收集库尔德民间歌词方面所做的努力,这些歌词作为一个语料库,涵盖了各种库尔德音乐流派,特别是Beyt, Gorani, Bend和Heyran。我们相信这个语料库在几个方面有助于库尔德语的处理,比如通过结合口头文学来弥补缺乏长期的书面文本历史,在库尔德语处理中呈现一个未开发的领域,并协助库尔德计算民俗学的启动。我们的语料库包含49,582个库尔德语索拉尼方言的符号。该语料库以文本编码倡议(TEI)格式公开提供,供非商业用途。
{"title":"A Corpus of the Sorani Kurdish Folkloric Lyrics","authors":"Sina Ahmadi, Hossein Hassani, K. Abedi","doi":"10.13025/1YDH-EW61","DOIUrl":"https://doi.org/10.13025/1YDH-EW61","url":null,"abstract":"Kurdish poetry and prose narratives were historically transmitted orally and less in a written form. Being an essential medium of oral narration and literature, Kurdish lyrics have had a unique attribute in becoming a vital resource for different types of studies, including Digital Humanities, Computational Folkloristics and Computational Linguistics. As an initial study of its kind for the Kurdish language, this paper presents our efforts in transcribing and collecting Kurdish folk lyrics as a corpus that covers various Kurdish musical genres, in particular Beyt, Gorani, Bend, and Heyran. We believe that this corpus contributes to Kurdish language processing in several ways, such as compensation for the lack of a long history of written text by incorporating oral literature, presenting an unexplored realm in Kurdish language processing, and assisting the initiation of Kurdish computational folkloristics. Our corpus contains 49,582 tokens in the Sorani dialect of Kurdish. The corpus is publicly available in the Text Encoding Initiative (TEI) format for non-commercial use.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"331 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133433160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Sentiment Analysis Dataset for Code-Mixed Malayalam-English 马来语-英语混合语码情感分析数据集
Pub Date : 2020-05-11 DOI: 10.5281/ZENODO.4015234
Bharathi Raja Chakravarthi, Navya Jose, Shardul Suryawanshi, E. Sherly, John P. McCrae
There is an increasing demand for sentiment analysis of text from social media which are mostly code-mixed. Systems trained on monolingual data fail for code-mixed data due to the complexity of mixing at different levels of the text. However, very few resources are available for code-mixed data to create models specific for this data. Although much research in multilingual and cross-lingual sentiment analysis has used semi-supervised or unsupervised methods, supervised methods still performs better. Only a few datasets for popular languages such as English-Spanish, English-Hindi, and English-Chinese are available. There are no resources available for Malayalam-English code-mixed data. This paper presents a new gold standard corpus for sentiment analysis of code-mixed text in Malayalam-English annotated by voluntary annotators. This gold standard corpus obtained a Krippendorff’s alpha above 0.8 for the dataset. We use this new corpus to provide the benchmark for sentiment analysis in Malayalam-English code-mixed texts.
对社交媒体文本情感分析的需求越来越大,这些文本大多是代码混合的。由于在文本的不同层次上混合的复杂性,单语言数据上训练的系统在代码混合数据上失败。然而,很少有资源可用于代码混合数据来创建特定于该数据的模型。尽管在多语言和跨语言情感分析方面的许多研究都使用了半监督或非监督方法,但监督方法仍然表现更好。只有一些流行语言的数据集可用,如英语-西班牙语、英语-印地语和英语-汉语。没有马来亚拉姆-英语代码混合数据可用的资源。本文提出了一种新的金标准语料库,用于马拉雅拉姆-英语语码混合文本的情感分析。这个黄金标准语料库的数据集的Krippendorff alpha值高于0.8。我们使用这个新的语料库为马来语-英语代码混合文本的情感分析提供了基准。
{"title":"A Sentiment Analysis Dataset for Code-Mixed Malayalam-English","authors":"Bharathi Raja Chakravarthi, Navya Jose, Shardul Suryawanshi, E. Sherly, John P. McCrae","doi":"10.5281/ZENODO.4015234","DOIUrl":"https://doi.org/10.5281/ZENODO.4015234","url":null,"abstract":"There is an increasing demand for sentiment analysis of text from social media which are mostly code-mixed. Systems trained on monolingual data fail for code-mixed data due to the complexity of mixing at different levels of the text. However, very few resources are available for code-mixed data to create models specific for this data. Although much research in multilingual and cross-lingual sentiment analysis has used semi-supervised or unsupervised methods, supervised methods still performs better. Only a few datasets for popular languages such as English-Spanish, English-Hindi, and English-Chinese are available. There are no resources available for Malayalam-English code-mixed data. This paper presents a new gold standard corpus for sentiment analysis of code-mixed text in Malayalam-English annotated by voluntary annotators. This gold standard corpus obtained a Krippendorff’s alpha above 0.8 for the dataset. We use this new corpus to provide the benchmark for sentiment analysis in Malayalam-English code-mixed texts.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116413784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 191
Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text 代码混合泰米尔语-英语文本情感分析的语料库创建
Pub Date : 2020-05-11 DOI: 10.5281/ZENODO.4015253
Bharathi Raja Chakravarthi, V. Muralidaran, R. Priyadharshini, John P. McCrae
Understanding the sentiment of a comment from a video or an image is an essential task in many applications. Sentiment analysis of a text can be useful for various decision-making processes. One such application is to analyse the popular sentiments of videos on social media based on viewer comments. However, comments from social media do not follow strict rules of grammar, and they contain mixing of more than one language, often written in non-native scripts. Non-availability of annotated code-mixed data for a low-resourced language like Tamil also adds difficulty to this problem. To overcome this, we created a gold standard Tamil-English code-switched, sentiment-annotated corpus containing 15,744 comment posts from YouTube. In this paper, we describe the process of creating the corpus and assigning polarities. We present inter-annotator agreement and show the results of sentiment analysis trained on this corpus as a benchmark.
在许多应用程序中,理解来自视频或图像的评论的情感是一项基本任务。文本的情感分析对各种决策过程都很有用。其中一个应用程序是根据观众的评论分析社交媒体上视频的流行情绪。然而,社交媒体上的评论并不遵循严格的语法规则,而且它们包含不止一种语言的混合,通常是用非母语文字写成的。对于像泰米尔这样资源匮乏的语言,带注释的代码混合数据的不可用性也增加了这个问题的难度。为了克服这个问题,我们创建了一个黄金标准的泰米尔语-英语代码转换,情感注释语料库,其中包含来自YouTube的15,744条评论帖子。在本文中,我们描述了创建语料库和分配极性的过程。我们提出了注释者之间的协议,并展示了在该语料库上训练的情感分析结果作为基准。
{"title":"Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text","authors":"Bharathi Raja Chakravarthi, V. Muralidaran, R. Priyadharshini, John P. McCrae","doi":"10.5281/ZENODO.4015253","DOIUrl":"https://doi.org/10.5281/ZENODO.4015253","url":null,"abstract":"Understanding the sentiment of a comment from a video or an image is an essential task in many applications. Sentiment analysis of a text can be useful for various decision-making processes. One such application is to analyse the popular sentiments of videos on social media based on viewer comments. However, comments from social media do not follow strict rules of grammar, and they contain mixing of more than one language, often written in non-native scripts. Non-availability of annotated code-mixed data for a low-resourced language like Tamil also adds difficulty to this problem. To overcome this, we created a gold standard Tamil-English code-switched, sentiment-annotated corpus containing 15,744 comment posts from YouTube. In this paper, we describe the process of creating the corpus and assigning polarities. We present inter-annotator agreement and show the results of sentiment analysis trained on this corpus as a benchmark.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123509723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 242
Automatic Detection of Palatalized Consonants in Kashmiri 克什米尔语舌化辅音的自动检测
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-25
Ramakrishna Thirumuru, K. Gurugubelli, A. Vuppala
In this study, the acoustic-phonetic attributes of palatalization in the Kashmiri speech is investigated. It is a unique phonetic feature of Kashmiri in the Indian context. An automated approach is proposed to detect this unique phonetic feature from the continuous Kashmiri speech. The i-matra vowel has the impact of palatalizing the consonant connected to it. Therefore, these consonants investigated in synchronous with vowel regions, which are spotted using the instantaneous energy computed from the envelope-derivative of the speech signal. The resonating characteristics of the vocal-tract system framework that reflect the formant dynamics are used to differentiate palatalized consonants from the other consonants. In this regard, the Hilbert envelope of the numerator of the group-delay function that provides good time-frequency resolution used to extract formants. The palatalization detection experimentation carried out in various vowel contexts using the acoustic cues, and it produced a promising result with a detection accuracy of 92.46 %.
本研究探讨了喀什米尔语中舌化的声学-语音属性。这是克什米尔语在印度语境中独特的语音特征。提出了一种自动检测连续克什米尔语语音中这一独特语音特征的方法。i-matra元音的作用是使与它相连的辅音有舌化的效果。因此,这些辅音与元音区域同步研究,使用从语音信号的包络导数计算的瞬时能量来发现元音区域。声道系统框架的共振特征反映了形成峰的动态,用来区分腭化辅音和其他辅音。在这方面,希尔伯特包络的分子群延迟函数,提供了良好的时频分辨率用于提取共振峰。利用声学线索在不同的元音语境下进行了舌化检测实验,检测准确率达到92.46%,结果令人满意。
{"title":"Automatic Detection of Palatalized Consonants in Kashmiri","authors":"Ramakrishna Thirumuru, K. Gurugubelli, A. Vuppala","doi":"10.21437/SLTU.2018-25","DOIUrl":"https://doi.org/10.21437/SLTU.2018-25","url":null,"abstract":"In this study, the acoustic-phonetic attributes of palatalization in the Kashmiri speech is investigated. It is a unique phonetic feature of Kashmiri in the Indian context. An automated approach is proposed to detect this unique phonetic feature from the continuous Kashmiri speech. The i-matra vowel has the impact of palatalizing the consonant connected to it. Therefore, these consonants investigated in synchronous with vowel regions, which are spotted using the instantaneous energy computed from the envelope-derivative of the speech signal. The resonating characteristics of the vocal-tract system framework that reflect the formant dynamics are used to differentiate palatalized consonants from the other consonants. In this regard, the Hilbert envelope of the numerator of the group-delay function that provides good time-frequency resolution used to extract formants. The palatalization detection experimentation carried out in various vowel contexts using the acoustic cues, and it produced a promising result with a detection accuracy of 92.46 %.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"269 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115209418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems 孟加拉语、高棉语、尼泊尔语、爪哇语、僧伽罗语和巽他语文本到语音系统的文本规范化
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-31
Keshan Sanjaya Sodimana, Pasindu De Silva, R. Sproat, T. Wattanavekin, Alexander Gutkin, Knot Pipatsrisawat
Text normalization is the process of converting non-standard words (NSWs) such as numbers, and abbreviations into standard words so that their pronunciations can be derived by a typical means (usually lexicon lookups). Text normalization is, thus, an important component of any text-to-speech (TTS) system. Without text normalization, the resulting voice may sound unintelligent. In this paper, we describe an approach to develop rule-based text normalization. We also describe our open source repository containing text normalization grammars and tests for Bangla, Javanese, Khmer, Nepali, Sinhala and Sundanese. Fi-nally, we present a recipe for utilizing the grammars in a TTS system.
文本规范化是将数字和缩写等非标准单词转换为标准单词的过程,以便通过典型方法(通常是词典查找)推导出它们的发音。因此,文本规范化是任何文本到语音(TTS)系统的重要组成部分。如果没有文本规范化,生成的声音可能听起来很不智能。在本文中,我们描述了一种开发基于规则的文本规范化的方法。我们还描述了包含孟加拉语、爪哇语、高棉语、尼泊尔语、僧伽罗语和巽他语文本规范化语法和测试的开源存储库。最后,我们给出了在TTS系统中使用语法的方法。
{"title":"Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems","authors":"Keshan Sanjaya Sodimana, Pasindu De Silva, R. Sproat, T. Wattanavekin, Alexander Gutkin, Knot Pipatsrisawat","doi":"10.21437/SLTU.2018-31","DOIUrl":"https://doi.org/10.21437/SLTU.2018-31","url":null,"abstract":"Text normalization is the process of converting non-standard words (NSWs) such as numbers, and abbreviations into standard words so that their pronunciations can be derived by a typical means (usually lexicon lookups). Text normalization is, thus, an important component of any text-to-speech (TTS) system. Without text normalization, the resulting voice may sound unintelligent. In this paper, we describe an approach to develop rule-based text normalization. We also describe our open source repository containing text normalization grammars and tests for Bangla, Javanese, Khmer, Nepali, Sinhala and Sundanese. Fi-nally, we present a recipe for utilizing the grammars in a TTS system.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114286624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Investigating the Use of Mixed-Units Based Modeling for Improving Uyghur Speech Recognition 混合单元建模在维吾尔语语音识别中的应用研究
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-45
Pengfei Hu, Shen Huang, Zhiqiang Lv
Uyghur is a highly agglutinative language with a large number of words derived from the same root. For such languages the use of subwords in speech recognition becomes a natural choice, which can solve the OOV issues. However, short units in subword modeling will weaken the constraint of linguistic context. Besides, vowel weakening and reduction occur frequently in Uyghur language, which may lead to high deletion errors for short unit sequence recognition. In this paper, we investigate using mixed units in Uyghur speech recognition. Subwords and whole-words are mixed together to build a hybrid lexicon and language models for recognition. We also introduce an interpolated LM to further improve the performance. Experiment results show that the mixed-unit based modeling do outperform word or subword based modeling. About 10% relative reduction in Word Error Rate and 8% reduction in Character Error Rate have been achieved for test datasets compared with baseline system.
维吾尔语是一种具有高度黏着性的语言,有大量同根衍生词。对于这些语言,在语音识别中使用子词成为一种自然的选择,它可以解决OOV问题。然而,子词建模中的短单位会削弱语言语境的约束。此外,维吾尔语中元音弱化和重读现象频繁,这可能导致短单元序列识别的高缺失率。本文研究了混合单元在维吾尔语语音识别中的应用。将子词和全词混合在一起,构建混合词汇和语言模型进行识别。我们还引入了插值LM来进一步提高性能。实验结果表明,基于混合单元的建模确实优于基于词或子词的建模。与基线系统相比,测试数据集的词错误率相对降低了10%,字符错误率相对降低了8%。
{"title":"Investigating the Use of Mixed-Units Based Modeling for Improving Uyghur Speech Recognition","authors":"Pengfei Hu, Shen Huang, Zhiqiang Lv","doi":"10.21437/SLTU.2018-45","DOIUrl":"https://doi.org/10.21437/SLTU.2018-45","url":null,"abstract":"Uyghur is a highly agglutinative language with a large number of words derived from the same root. For such languages the use of subwords in speech recognition becomes a natural choice, which can solve the OOV issues. However, short units in subword modeling will weaken the constraint of linguistic context. Besides, vowel weakening and reduction occur frequently in Uyghur language, which may lead to high deletion errors for short unit sequence recognition. In this paper, we investigate using mixed units in Uyghur speech recognition. Subwords and whole-words are mixed together to build a hybrid lexicon and language models for recognition. We also introduce an interpolated LM to further improve the performance. Experiment results show that the mixed-unit based modeling do outperform word or subword based modeling. About 10% relative reduction in Word Error Rate and 8% reduction in Character Error Rate have been achieved for test datasets compared with baseline system.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133056426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Language Identification of Assamese, Bengali and English Speech 阿萨姆语、孟加拉语和英语语音的语言识别
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-37
Joyshree Chakraborty, Shikhamoni Nath, R. NirmalaS., K. Samudravijaya
Machine identification of the language of input speech is of practical interest in regions where people are either bilingual or multi-lingual. Here, we present the development of automatic language identification system that identifies the language of input speech as one of Assamese or Bengali or English spoken by them. The speech databases comprise of sentences read by multiple speakers using their mobile phones. Kaldi toolkit was used to train acoustic models based on hidden Markov model in conjunction with Gaussian mixture models and deep neural networks. The accuracy of the implemented language identification system for test data is 99.3%.
机器识别输入语音的语言在人们使用双语或多语言的地区具有实际意义。在这里,我们提出了自动语言识别系统的开发,该系统将输入语音识别为他们所说的阿萨姆语或孟加拉语或英语中的一种。语音数据库由多个说话者使用手机朗读的句子组成。利用Kaldi工具集,结合高斯混合模型和深度神经网络,训练基于隐马尔可夫模型的声学模型。所实现的语言识别系统对测试数据的准确率为99.3%。
{"title":"Language Identification of Assamese, Bengali and English Speech","authors":"Joyshree Chakraborty, Shikhamoni Nath, R. NirmalaS., K. Samudravijaya","doi":"10.21437/SLTU.2018-37","DOIUrl":"https://doi.org/10.21437/SLTU.2018-37","url":null,"abstract":"Machine identification of the language of input speech is of practical interest in regions where people are either bilingual or multi-lingual. Here, we present the development of automatic language identification system that identifies the language of input speech as one of Assamese or Bengali or English spoken by them. The speech databases comprise of sentences read by multiple speakers using their mobile phones. Kaldi toolkit was used to train acoustic models based on hidden Markov model in conjunction with Gaussian mixture models and deep neural networks. The accuracy of the implemented language identification system for test data is 99.3%.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127060256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Dialect Identification Using Tonal and Spectral Features in Two Dialects of Ao 两种敖族方言的声谱特征识别
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-29
Moakala Tzudir, Priyankoo Sarmah, S. Prasanna
Ao is an under-resourced Tibeto-Burman tone language spoken in Nagaland, India, with three lexical tones, namely, high, mid and low. There are three dialects of the language namely, Chungli, Mongsen and Changki, differing in tone assignment in lexical words. This work investigates if the idiosyncratic tone assignment in the Ao dialects can be utilized for dialect identification of two Ao dialects, namely, Changki and Mongsen. A perception test confirmed that Ao speakers identified the two dialects based on their dialect-specific tone assignment. To confirm that tone is the primary cue in dialect identification, F0 was neutralized in the speech data before subjecting them to a Gaussian Mixture Model (GMM) based dialect identification system. The low dialect recognition accuracy confirmed the significance of tones in Ao dialect identification. Finally, a GMM-based dialect identification system was built with tonal and spectral features, resulting in better dialect recognition accuracy.
奥语是印度那加兰邦一种资源贫乏的藏缅语,有高、中、低三个词音。汉语有三种方言,即中隶、蒙森和昌基,它们在词汇上的声调分配不同。本研究探讨了奥方言的特异调性是否可以用于两种奥方言的方言识别,即昌基方言和蒙森方言。一项感知测试证实,说Ao的人是根据方言特有的音调分配来识别这两种方言的。为了确认声调是方言识别的主要线索,在将语音数据中和F0后,将其置于基于高斯混合模型的方言识别系统中。方言识别准确率低,证实了声调在敖族方言识别中的重要性。最后,结合声调特征和谱特征,构建了基于gmm的方言识别系统,提高了方言识别的准确率。
{"title":"Dialect Identification Using Tonal and Spectral Features in Two Dialects of Ao","authors":"Moakala Tzudir, Priyankoo Sarmah, S. Prasanna","doi":"10.21437/SLTU.2018-29","DOIUrl":"https://doi.org/10.21437/SLTU.2018-29","url":null,"abstract":"Ao is an under-resourced Tibeto-Burman tone language spoken in Nagaland, India, with three lexical tones, namely, high, mid and low. There are three dialects of the language namely, Chungli, Mongsen and Changki, differing in tone assignment in lexical words. This work investigates if the idiosyncratic tone assignment in the Ao dialects can be utilized for dialect identification of two Ao dialects, namely, Changki and Mongsen. A perception test confirmed that Ao speakers identified the two dialects based on their dialect-specific tone assignment. To confirm that tone is the primary cue in dialect identification, F0 was neutralized in the speech data before subjecting them to a Gaussian Mixture Model (GMM) based dialect identification system. The low dialect recognition accuracy confirmed the significance of tones in Ao dialect identification. Finally, a GMM-based dialect identification system was built with tonal and spectral features, resulting in better dialect recognition accuracy.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122036903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Sinhala G2P Conversion for Speech Processing 僧伽罗语G2P转换语音处理
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-24
Thilini Nadungodage, Chamila Liyanage, Amathri Prerera, Randil Pushpananda, R. Weerasinghe
Grapheme-to-phoneme (G2P) conversion plays an important role in speech processing applications and other fields of computational linguistics. Sinhala must have a grapheme-to-phoneme conversion for speech processing because Sinhala writing system does not always reflect its actual pronunciations. This paper describes a rule basedG2P conversion method to convert Sinhala text strings into phonemic representations. We use a previously defined rule set and enhance it to get a more accurate G2P conversion. The performance of our rule-based system shows that the rulebased sound patterns are effective on Sinhala G2P conversion.
字素到音素的转换在语音处理应用和计算语言学的其他领域中起着重要的作用。僧伽罗语在语音处理中必须有字素到音素的转换,因为僧伽罗语的书写系统并不总是反映其实际发音。本文描述了一种基于规则的g2p转换方法,将僧伽罗语文本字符串转换为音位表示。我们使用先前定义的规则集并对其进行增强以获得更精确的G2P转换。基于规则的系统的性能表明,基于规则的声音模式在僧伽罗语G2P转换中是有效的。
{"title":"Sinhala G2P Conversion for Speech Processing","authors":"Thilini Nadungodage, Chamila Liyanage, Amathri Prerera, Randil Pushpananda, R. Weerasinghe","doi":"10.21437/SLTU.2018-24","DOIUrl":"https://doi.org/10.21437/SLTU.2018-24","url":null,"abstract":"Grapheme-to-phoneme (G2P) conversion plays an important role in speech processing applications and other fields of computational linguistics. Sinhala must have a grapheme-to-phoneme conversion for speech processing because Sinhala writing system does not always reflect its actual pronunciations. This paper describes a rule basedG2P conversion method to convert Sinhala text strings into phonemic representations. We use a previously defined rule set and enhance it to get a more accurate G2P conversion. The performance of our rule-based system shows that the rulebased sound patterns are effective on Sinhala G2P conversion.","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130253828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Acoustic Characretistics of Schwa Vowel in Punjabi 旁遮普语弱读音的声学特征
Pub Date : 2018-08-29 DOI: 10.21437/SLTU.2018-18
Swaran Lata, Prashant Verma, S. Kaur
{"title":"Acoustic Characretistics of Schwa Vowel in Punjabi","authors":"Swaran Lata, Prashant Verma, S. Kaur","doi":"10.21437/SLTU.2018-18","DOIUrl":"https://doi.org/10.21437/SLTU.2018-18","url":null,"abstract":"","PeriodicalId":190269,"journal":{"name":"Workshop on Spoken Language Technologies for Under-resourced Languages","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130159499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Workshop on Spoken Language Technologies for Under-resourced Languages
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1