
Laboratory Phonology: Latest Articles

On the origin of post-aspirated stops: production and perception of /s/ + voiceless stop sequences in Andalusian Spanish
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2016-03-16 DOI: 10.5334/LABPHON.2
Hanna Ruch, S. Peters
The present study investigates the role of articulatory and perceptual factors in the change from pre- to post-aspiration in two varieties of Andalusian Spanish. In an acoustic study, the influence of stop type, speaker age, and variety on the production of pre- and post-aspiration was analyzed in isolated words produced by 24 speakers of a Western and 24 of an Eastern variety, both divided into two age groups. The results confirmed previous findings of a sound change from pre- to post-aspiration in both varieties. Velar stops showed the longest, bilabials the shortest, and dental stops intermediate pre- and post-aspiration durations. The observed universal VOT-pattern was not found for younger Western Andalusian speakers who showed a particularly long VOT in /st/-sequences. A perception experiment with the same subjects as listeners showed that post-aspiration was used as a cue for distinguishing the minimal pair /pata/-/pasta/ by almost all listeners. Production-perception comparisons suggested a relationship between production and perception: subjects who produced long post-aspiration were also more sensitive to this cue. In sum, the results suggest that the sound change has first been actuated in the dental context, possibly due to a higher perceptual prominence of post-aspiration in this context, and that post-aspirated stops in Andalusian Spanish are on their way to being phonologized.
Citations: 21
Resyllabification Reconsidered: On the Durational Properties of Word-Final /s/ in Spanish
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2016-03-16 DOI: 10.5334/LABPHON.5
Patrycja Strycharczuk, Martin Kohlberger
Word-final consonants in Spanish are commonly assumed to undergo resyllabification across a word boundary before a following vowel, e.g., /los#otros/ 'the others' is realised as [lo.so.tros]. However, in many dialects of Spanish, word-final pre-vocalic consonants (‘derived onsets’) pattern phonologically with canonical codas and distinctly from canonical onsets. This property of derived onsets has been the subject of much interest in the phonological literature, and has led some linguists to question whether resyllabification indeed applies in all Spanish dialects. In this paper, we evaluate evidence for resyllabification based on acoustic data from 11 speakers of Peninsular Spanish. The results show that word-final pre-vocalic /s/ has increased duration compared to coda /s/, but at the same time, it is shorter compared to word-initial or word-medial pre-vocalic /s/. This result challenges an analysis where derived onsets become phonologically indistinguishable from canonical onsets. We consider an alternative in the form of partial resyllabification, and we further discuss the role of the syllable as a relevant unit in explaining /s/-sandhi in Spanish.
Citations: 16
Phonological and Semantic Cues to Learning from Word-Types.
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2016-01-01 Epub Date: 2016-08-09 DOI: 10.5334/labphon.39
Peter Richtsmeier

Word-types represent the primary form of data for many models of phonological learning, and they often predict performance in psycholinguistic tasks. Word-types are often tacitly defined as phonologically unique words. Yet, an explicit test of this definition is lacking, and natural language patterning suggests that word meaning could also act as a cue to word-type status. This possibility was tested in a statistical phonotactic learning experiment in which phonological and semantic properties of word-types varied. During familiarization, the learning targets, word-medial consonant sequences, were instantiated either by four related word-types or by just one word-type (the experimental frequency factor). The expectation was that more word-types would lead participants to generalize the target sequences. Regarding semantic cues, related word-types were either associated with different referents or all with a single referent. Regarding phonological cues, related word-types differed from each other by one, two, or more phonemes. At test, participants rated novel wordforms for their similarity to the familiarization words. When participants heard four related word-types, they gave higher ratings to test words with the same consonant sequences, irrespective of the phonological and semantic manipulations. The results support the existing phonological definition of word-types.

Citations: 0
Infant-directed speech as a window into the dynamic nature of phonology
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2015-10-01 DOI: 10.1515/lp-2015-0009
R. Mazuka, Yosuke Igarashi, Andrew Martin, Akira Utsugi
Theoretical frameworks of phonology are built largely on the basis of idealized speech, typically recorded in a laboratory under static conditions. Natural speech, in contrast, occurs in a variety of communicative contexts where speakers and hearers dynamically adjust their speech to fit their needs. The present paper demonstrates that phonologically informed analysis of specialized speech registers, such as infant-directed speech, can reveal specific ways segmental and supra-segmental aspects of phonology are modulated dynamically to accommodate the specific communicative needs of speakers and hearers. Data for the analyses come from a corpus of Japanese mothers’ spontaneous speech directed to their infant child (infant-directed speech, IDS) and an adult (adult-directed speech, ADS), as well as read speech (RS). The speech samples in the corpus are annotated with segmental, morphological, and intonational information. We will show that the way intonation is exaggerated in Japanese IDS reflects the intonational structure of Japanese, which is different from that of English. We will also demonstrate that rules of phonological grammar, such as devoicing of high vowels and non-high vowels in Japanese, can be differently affected by the needs of the speaker to accommodate the specific characteristics of the listener.
Citations: 2
Pointed and plateau-shaped pitch accents in North Frisian
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2015-10-01 DOI: 10.1515/lp-2015-0013
O. Niebuhr, J. Hoekstra
Our study presents the initial results of an analysis of North Frisian intonation, based on a spontaneous interview corpus of Fering, the dialect of the island of Föhr off the west coast of the German state of Schleswig-Holstein. The corpus was recorded more than 50 years ago during fieldwork for language documentation and conservation purposes. We selected a small part of this corpus – interviews of 10 elderly speakers – and conducted multiparametric F0 and duration measurements, focusing on nuclear rising-falling pitch accent patterns. We found strong evidence for a phonological pitch-accent distinction that relies on the difference between a pointed and a plateau-shaped F0 peak. We suggest that the two pitch accents be represented as L+H* and H*+L, and we discuss our findings with regard to possible communicative functions, implications for intonational typology, and the trade-off between F0 range and F0 peak extension in conveying pitch height.
Citations: 3
Frequency effects on the vowel length contrast merger in Seoul Korean
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2015-10-01 DOI: 10.1515/lp-2015-0014
Yoonjung Kang, Taejin Yoon, Sungwoo Han
This paper presents an apparent-time study of the vowel length contrast merger in Seoul Korean based on duration measurements of over 370,000 vowels in word-initial syllables in a read-speech corpus. The effects of word frequency on vowel duration and the lexical diffusion of long-vowel shortening are also examined. The findings confirm the observation made in the previous literature that the vowel length contrast is on its way out in the language, and that this sound change is nearing completion. We also find a significant effect of frequency on long-vowel duration: other things being equal, these vowels are shorter in high-frequency words than in low-frequency words. The rate of change does not differ significantly depending on the frequency of words, apart from the high-frequency words reaching the endpoint of change and bottoming out in the change earlier than mid- and low-frequency words. The observed frequency effect is compatible with a model in which the frequency effect on duration comes from on-line factors that affect phonetic implementation of speech sounds, along with an across-the-board lenition bias that drives the sound change, not from stored tokens of word-specific variants.
Citations: 12
Models of dataset size, question design, and cross-language speech perception for speech crowdsourcing applications
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2015-10-01 DOI: 10.1515/lp-2015-0012
M. Hasegawa-Johnson, J. Cole, P. Jyothi, L. Varshney
Transcribers make mistakes. Workers recruited in a crowdsourcing marketplace, because of their varying levels of commitment and education, make more mistakes than workers in a controlled laboratory setting. Methods for compensating for transcriber mistakes are desirable because, with such methods available, crowdsourcing has the potential to significantly increase the scale of experiments in laboratory phonology. This paper provides a brief tutorial on statistical learning theory, introducing the relationship between dataset size and estimation error, then presents a theoretical description and preliminary results for two new methods that control labeler error in laboratory phonology experiments. First, we discuss the method of crowdsourcing over error-correcting codes. In the error-correcting-code method, each difficult labeling task is first factored, by the experimenter, into the product of several easy labeling tasks (typically binary). Factoring increases the total number of tasks; nevertheless, it results in faster completion and higher accuracy, because workers unable to perform the difficult task may be able to meaningfully contribute to the solution of each easy task. Second, we discuss the use of explicit mathematical models of the errors made by a worker in the crowd. In particular, we introduce the method of mismatched crowdsourcing, in which workers transcribe a language they do not understand, and an explicit mathematical model of second-language phoneme perception is used to learn and then compensate for their transcription errors. Though introduced as technologies that increase the scale of phonology experiments, both methods have implications beyond increased scale. The method of easy questions permits us to probe the perception, by untrained listeners, of complicated phonological models; examples are provided from the prosody of English and Hindi. The method of mismatched crowdsourcing permits us to probe, in more detail than ever before, the perception of phonetic categories by listeners with a different phonological system.
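The error-correcting-code idea described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the four labels, the five-bit codebook, and the function names `majority_bit` and `decode` are all hypothetical, chosen only to show how one hard multi-way label might be factored into easy binary questions and then recovered even when the crowd gets one question wrong.

```python
from collections import Counter

# Hypothetical codebook (an assumption, not taken from the paper):
# four labels, each encoded as the answers to five binary questions.
# Any two codewords differ in at least 3 positions, so one wrong bit
# after voting still decodes to the intended label.
CODEBOOK = {
    "A": (0, 0, 0, 0, 0),
    "B": (0, 1, 0, 1, 1),
    "C": (1, 0, 1, 0, 1),
    "D": (1, 1, 1, 1, 0),
}


def majority_bit(votes):
    """Return the most common answer (0 or 1) among the votes for one question.

    Use an odd number of votes per question to avoid ties.
    """
    return Counter(votes).most_common(1)[0][0]


def decode(votes_per_question):
    """Vote on each binary question, then return the label whose codeword
    is nearest (in Hamming distance) to the voted bit pattern."""
    bits = tuple(majority_bit(votes) for votes in votes_per_question)

    def hamming(codeword):
        return sum(a != b for a, b in zip(codeword, bits))

    return min(CODEBOOK, key=lambda label: hamming(CODEBOOK[label]))


# Three hypothetical workers answer each of the five questions for an item
# whose intended label is "C" = (1, 0, 1, 0, 1). The majority answer to the
# second question is wrong, yet the nearest codeword is still "C".
example_votes = [[1, 1, 0], [1, 1, 0], [1, 1, 1], [0, 0, 1], [1, 0, 1]]
print(decode(example_votes))
```

The sketch uses nearest-codeword decoding after a per-question majority vote; other decoding rules, such as weighting workers by estimated reliability, would fit the same framing.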
Citations: 13
Perception of incompletely neutralized voicing cues in word-final obstruents: The role of differences in production context
IF 1.5 Zone 2 Literature Q1 Health Professions Pub Date: 2015-05-01 DOI: 10.1515/lp-2015-0005
V. Kharlamov
Experimental data on final devoicing in languages such as German and Russian usually show that speakers produce incompletely neutralized acoustic differences between words ending in phonologically voiced versus voiceless obstruents (e.g., /kod/ ‘code’ vs. /kot/ ‘cat’ in Russian) and that listeners can use these differences to identify the underlying specification of final consonants at an above-chance level. The current study examines how the seemingly successful perceptual identification of voicing varies across stimulus items recorded in reading vs. non-reading procedures and with and without full minimal pairs present in the experimental list. Results of a series of identification tasks reveal that Russian listeners’ identification responses are more in line with underlying voicing for the stimuli recorded during word-reading and with minimal pairs included among the experimental items. This shows that voicing judgments are strongly influenced by the acoustic differences produced when speakers encounter orthographic forms or lexical competition. At the same time, perceptual neutralization is also not complete for the items recorded without such exposure, which indicates that listeners’ ability to recover underlying voicing is not limited to the production contexts involving written forms or minimal pairs.
Citations: 9
The aerodynamic puzzle of nasalized fricatives: Aerodynamic and perceptual evidence from Scottish Gaelic
IF 1.5 Zone 2 (Literature) Q1 Health Professions Pub Date: 2015-05-01 DOI: 10.1515/lp-2015-0007
N. Warner, Dan Brenner, Jessamyn Schertz, A. Carnie, Muriel Fisher, Michael Hammond
Abstract Scottish Gaelic is sometimes described as having nasalized fricatives (/ṽ/ distinctively, and [f̃, x̃, h̃], etc. through assimilation). However, there are claims that it is not aerodynamically possible to open the velum for nasalization while maintaining frication noise. We present aerodynamic data from 14 native Scottish Gaelic speakers to determine how the posited nasalized fricatives in this language are realized. Most tokens demonstrate loss of nasalization, but nasalization does occur in some contexts without aerodynamic conflict, e.g., nasalization with the consonant realized as an approximant, nasalization of [h̃], nasalization on the preceding vowel, or sequential frication and nasalization. Furthermore, a very few tokens do contain simultaneous nasalization and frication with a trade-off in airflow. We also present perceptual evidence showing that Gaelic listeners can hear this distinction slightly better than chance. Thus, instrumental data from one of the few languages in the world described as having nasalized fricatives confirms that the claimed sounds are not made by producing strong nasalization concurrently with clear frication noise. Furthermore, although speakers most often neutralize the nasalization, when they maintain it, they do so through a variety of phonetic mechanisms, even within a single language.
Citations: 12
Categorical and gradient homophony avoidance: Evidence from Japanese
IF 1.5 Zone 2 (Literature) Q1 Health Professions Pub Date: 2015-05-01 DOI: 10.1515/lp-2015-0006
A. Kaplan, Y. Muratani
Abstract Many languages have been claimed to have phonological patterns that are sensitive to the need to avoid homophony – for example, a rule that is blocked if it would create a surface form that is identical to another word in the language. Such patterns always involve comparisons between words in the same morphological paradigm (e.g., singular and plural forms with the same stem). The lone exception to this generalization is Ichimura (2006), who argues that a nasal contraction pattern in Japanese is blocked by potential homophony between verbs with different stems. We present experimental evidence that homophony avoidance is not part of the correct synchronic description of the environment in which this pattern applies; rather, nasal contraction does not productively delete stem-final vowels. However, homophony avoidance does appear to affect the probability with which contraction applies. We conclude that homophony avoidance affects phonological behavior, but that absolute homophony-related blocking is restricted to morphological paradigms.
Citations: 3