Robert E Remez, Kathryn R Dubowski, Daria F Ferro, Emily F Thomas
{"title":"Primitive audiovisual integration of speech.","authors":"Robert E Remez, Kathryn R Dubowski, Daria F Ferro, Emily F Thomas","doi":"10.3758/s13414-025-03038-1","DOIUrl":null,"url":null,"abstract":"<p><p>An unintelligible video recording of a face uttering a sentence and an unintelligible acoustic sinusoid following the frequency variation of a single vocal resonance of the utterance were intelligible when presented together at their veridical synchrony. The intelligibility resulted from audiovisual sensory integration and phonetic perceptual analysis, which depended neither on the separate resolution of linguistic impressions in each modality nor on closed-set reports about a single pair of minimal phonemic contrast features. Likewise, audiovisual integration could not be attributed to Gestalt-derived similarity principles applied unimodally or bimodally. A single sinusoid lacks natural vocal quality, yet was perceptually coherent with the visible form of a natural articulating face. In other tests, integration also survived desynchronization of the two sensory streams, though only if the visible component led the audible component. These measures expose the robust action and temporal dynamic of early non-phonetic sensory integration which can occur in audiovisual speech perception.</p>","PeriodicalId":55433,"journal":{"name":"Attention Perception & Psychophysics","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Attention Perception & Psychophysics","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13414-025-03038-1","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PSYCHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
An unintelligible video recording of a face uttering a sentence and an unintelligible acoustic sinusoid following the frequency variation of a single vocal resonance of the utterance were intelligible when presented together at their veridical synchrony. The intelligibility resulted from audiovisual sensory integration and phonetic perceptual analysis, which depended neither on the separate resolution of linguistic impressions in each modality nor on closed-set reports about a single pair of minimal phonemic contrast features. Likewise, audiovisual integration could not be attributed to Gestalt-derived similarity principles applied unimodally or bimodally. A single sinusoid lacks natural vocal quality, yet was perceptually coherent with the visible form of a natural articulating face. In other tests, integration also survived desynchronization of the two sensory streams, though only if the visible component led the audible component. These measures expose the robust action and temporal dynamic of early non-phonetic sensory integration which can occur in audiovisual speech perception.
期刊介绍:
The journal Attention, Perception, & Psychophysics is an official journal of the Psychonomic Society. It spans all areas of research in sensory processes, perception, attention, and psychophysics. Most articles published are reports of experimental work; the journal also presents theoretical, integrative, and evaluative reviews. Commentary on issues of importance to researchers appears in a special section of the journal. Founded in 1966 as Perception & Psychophysics, the journal assumed its present name in 2009.