Speech intelligibility and talker identification with non-telephone frequencies.

IF 1.5 Q3 ACOUSTICS JASA express letters Pub Date : 2024-07-01 DOI:10.1121/10.0027938

Xianhui Wang, Jonathan Ge, Leo Meller, Ye Yang, Fan-Gang Zeng

引用次数: 0

Abstract

Although the telephone band (0.3-3 kHz) provides sufficient information for speech recognition, the contribution of the non-telephone band (<0.3 and >3 kHz) is unclear. To investigate its contribution, speech intelligibility and talker identification were evaluated using consonants, vowels, and sentences. The non-telephone band produced relatively good intelligibility for consonants (76.0%) and sentences (77.4%), but not vowels (11.5%). The non-telephone band supported good talker identification only with sentences (74.5%), but not vowels (45.8%) or consonants (10.8%). Furthermore, the non-telephone band cannot produce satisfactory speech intelligibility in noise at the sentence level, suggesting the importance of full-band access in realistic listening.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用非电话频率的语音清晰度和通话者识别。

虽然电话频段（0.3-3 kHz）为语音识别提供了足够的信息，但非电话频段（3 kHz）的贡献尚不清楚。为了研究非电话频段的贡献，我们使用辅音、元音和句子对语音清晰度和通话者识别进行了评估。非耳机频段对辅音（76.0%）和句子（77.4%）的可懂度相对较好，但对元音（11.5%）的可懂度较差。非耳机频段只在句子（74.5%）、元音（45.8%）和辅音（10.8%）方面支持良好的说话者识别。此外，非耳机频段无法在噪音中产生令人满意的句子级语音清晰度，这表明全频段接入在实际听力中的重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

JASA express letters

CiteScore

1.70

自引率

0.00%

发文量