{"title":"人工合成语音的信噪比和语速控制对火车站广播可懂度的影响","authors":"Mizuki Maruoka, Sohei Tsujimura, Takumi Asakura","doi":"10.1007/s40857-023-00306-8","DOIUrl":null,"url":null,"abstract":"<div><p>An experimental study on the effect of the speech characteristics of the signal-to-noise ratio (SNR) and speech rate on the intelligibility of announcements at railway stations was conducted using an artificial synthetic voice. Synthesized speech has recently been used in noisy environments both indoors and outdoors, but unlike its use in quiet environments, when the environment is noisy, the intelligibility of announcements may be reduced. For railway station announcements, while natural spoken voices are currently used for multilingual announcements and disaster response broadcasts, deep neural network synthesized voices, which use deep learning, have also been adopted. However, the effect of the acoustic characteristics such as the SNR and speech rate on the intelligibility of reproduced announcements in noisy public spaces such as railway stations has not yet been clarified from a practical viewpoint. In this paper, in order to determine the appropriate SNR and speech rate for synthetic voice announcements in railway stations, auditory impressions of announcements with varying SNR and speech rate were evaluated by participants using a five-point scale. Based on the evaluations, the appropriate conditions for the broadcast of synthetic voice announcements at the ticket gate and on the platform of a station are discussed.</p></div>","PeriodicalId":54355,"journal":{"name":"Acoustics Australia","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s40857-023-00306-8.pdf","citationCount":"0","resultStr":"{\"title\":\"Effects of Artificial Synthetic Speech Control of SNR and Speech Rate on the Intelligibility of Train Station Announcements\",\"authors\":\"Mizuki Maruoka, Sohei Tsujimura, Takumi Asakura\",\"doi\":\"10.1007/s40857-023-00306-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>An experimental study on the effect of the speech characteristics of the signal-to-noise ratio (SNR) and speech rate on the intelligibility of announcements at railway stations was conducted using an artificial synthetic voice. Synthesized speech has recently been used in noisy environments both indoors and outdoors, but unlike its use in quiet environments, when the environment is noisy, the intelligibility of announcements may be reduced. For railway station announcements, while natural spoken voices are currently used for multilingual announcements and disaster response broadcasts, deep neural network synthesized voices, which use deep learning, have also been adopted. However, the effect of the acoustic characteristics such as the SNR and speech rate on the intelligibility of reproduced announcements in noisy public spaces such as railway stations has not yet been clarified from a practical viewpoint. In this paper, in order to determine the appropriate SNR and speech rate for synthetic voice announcements in railway stations, auditory impressions of announcements with varying SNR and speech rate were evaluated by participants using a five-point scale. Based on the evaluations, the appropriate conditions for the broadcast of synthetic voice announcements at the ticket gate and on the platform of a station are discussed.</p></div>\",\"PeriodicalId\":54355,\"journal\":{\"name\":\"Acoustics Australia\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s40857-023-00306-8.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acoustics Australia\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s40857-023-00306-8\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustics Australia","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s40857-023-00306-8","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Effects of Artificial Synthetic Speech Control of SNR and Speech Rate on the Intelligibility of Train Station Announcements
An experimental study on the effect of the speech characteristics of the signal-to-noise ratio (SNR) and speech rate on the intelligibility of announcements at railway stations was conducted using an artificial synthetic voice. Synthesized speech has recently been used in noisy environments both indoors and outdoors, but unlike its use in quiet environments, when the environment is noisy, the intelligibility of announcements may be reduced. For railway station announcements, while natural spoken voices are currently used for multilingual announcements and disaster response broadcasts, deep neural network synthesized voices, which use deep learning, have also been adopted. However, the effect of the acoustic characteristics such as the SNR and speech rate on the intelligibility of reproduced announcements in noisy public spaces such as railway stations has not yet been clarified from a practical viewpoint. In this paper, in order to determine the appropriate SNR and speech rate for synthetic voice announcements in railway stations, auditory impressions of announcements with varying SNR and speech rate were evaluated by participants using a five-point scale. Based on the evaluations, the appropriate conditions for the broadcast of synthetic voice announcements at the ticket gate and on the platform of a station are discussed.
期刊介绍:
Acoustics Australia, the journal of the Australian Acoustical Society, has been publishing high quality research and technical papers in all areas of acoustics since commencement in 1972. The target audience for the journal includes both researchers and practitioners. It aims to publish papers and technical notes that are relevant to current acoustics and of interest to members of the Society. These include but are not limited to: Architectural and Building Acoustics, Environmental Noise, Underwater Acoustics, Engineering Noise and Vibration Control, Occupational Noise Management, Hearing, Musical Acoustics.