Speech intelligibility and talker identification with non-telephone frequencies.

IF 1.2 Q3 ACOUSTICS JASA express letters Pub Date : 2024-07-01 DOI:10.1121/10.0027938
Xianhui Wang, Jonathan Ge, Leo Meller, Ye Yang, Fan-Gang Zeng
{"title":"Speech intelligibility and talker identification with non-telephone frequencies.","authors":"Xianhui Wang, Jonathan Ge, Leo Meller, Ye Yang, Fan-Gang Zeng","doi":"10.1121/10.0027938","DOIUrl":null,"url":null,"abstract":"<p><p>Although the telephone band (0.3-3 kHz) provides sufficient information for speech recognition, the contribution of the non-telephone band (<0.3 and >3 kHz) is unclear. To investigate its contribution, speech intelligibility and talker identification were evaluated using consonants, vowels, and sentences. The non-telephone band produced relatively good intelligibility for consonants (76.0%) and sentences (77.4%), but not vowels (11.5%). The non-telephone band supported good talker identification only with sentences (74.5%), but not vowels (45.8%) or consonants (10.8%). Furthermore, the non-telephone band cannot produce satisfactory speech intelligibility in noise at the sentence level, suggesting the importance of full-band access in realistic listening.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JASA express letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/10.0027938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Although the telephone band (0.3-3 kHz) provides sufficient information for speech recognition, the contribution of the non-telephone band (<0.3 and >3 kHz) is unclear. To investigate its contribution, speech intelligibility and talker identification were evaluated using consonants, vowels, and sentences. The non-telephone band produced relatively good intelligibility for consonants (76.0%) and sentences (77.4%), but not vowels (11.5%). The non-telephone band supported good talker identification only with sentences (74.5%), but not vowels (45.8%) or consonants (10.8%). Furthermore, the non-telephone band cannot produce satisfactory speech intelligibility in noise at the sentence level, suggesting the importance of full-band access in realistic listening.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用非电话频率的语音清晰度和通话者识别。
虽然电话频段(0.3-3 kHz)为语音识别提供了足够的信息,但非电话频段(3 kHz)的贡献尚不清楚。为了研究非电话频段的贡献,我们使用辅音、元音和句子对语音清晰度和通话者识别进行了评估。非耳机频段对辅音(76.0%)和句子(77.4%)的可懂度相对较好,但对元音(11.5%)的可懂度较差。非耳机频段只在句子(74.5%)、元音(45.8%)和辅音(10.8%)方面支持良好的说话者识别。此外,非耳机频段无法在噪音中产生令人满意的句子级语音清晰度,这表明全频段接入在实际听力中的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.70
自引率
0.00%
发文量
0
期刊最新文献
Effect of hearing aids on the externalization of everyday sounds. Tests of human auditory temporal resolution: Simulations of Bayesian threshold estimation for auditory gap detection. Hearing aid evaluation for music: Accounting for acoustical variability of music stimuli. Minima in cubic distortion-product otoacoustic emission input/output functions due to distributed primary sources. Investigating muscle coordination patterns with Granger causality analysis in protrusive motion from tagged and diffusion MRI.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1