An automatic non-native speaker recognition system

2010 IEEE International Conference on Technologies for Homeland Security (HST) Pub Date : 2010-12-03 DOI:10.1109/THS.2010.5655088

Bozhao Tan, Qi Li, Robert Foresta

{"title":"An automatic non-native speaker recognition system","authors":"Bozhao Tan, Qi Li, Robert Foresta","doi":"10.1109/THS.2010.5655088","DOIUrl":null,"url":null,"abstract":"Identification of non-native personnel is a critical piece of information for making crucial on-the-spot decisions for security purposes. Identification of a non-native speaker is often readily apparent in normal conversation with a native speaker through speech content and accent. Such identification which requires familiarity with language nuances may not be possible for a non-native interrogator or intelligence analyst or when conversing or listening through a machine language translator. Developing an automatic system to identify speakers as native or non-native, as well as their native language, including dialect, within input audio streams, is the major goal of this project. Such a system may be used alone or with other downstream applications such as machine language translation systems. In this paper we present four approaches to identify native and non-native speakers as a binary recognition problem. The approaches can be further categorized into phonetic-based approaches and non-phonetic-based approaches. These approaches were tested on two separate databases, including text-dependent read speech and text-independent spontaneous speech. The results show that our system is competitive in comparison with other published, state-of-the-art non-native speaker recognition systems. Key metrics for automated non-native recognition systems include: 1) positive identification rates, 2) false alarm/identification rates, and 3) length of captured speech sample required to reach a decision.","PeriodicalId":106557,"journal":{"name":"2010 IEEE International Conference on Technologies for Homeland Security (HST)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Technologies for Homeland Security (HST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/THS.2010.5655088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Identification of non-native personnel is a critical piece of information for making crucial on-the-spot decisions for security purposes. Identification of a non-native speaker is often readily apparent in normal conversation with a native speaker through speech content and accent. Such identification which requires familiarity with language nuances may not be possible for a non-native interrogator or intelligence analyst or when conversing or listening through a machine language translator. Developing an automatic system to identify speakers as native or non-native, as well as their native language, including dialect, within input audio streams, is the major goal of this project. Such a system may be used alone or with other downstream applications such as machine language translation systems. In this paper we present four approaches to identify native and non-native speakers as a binary recognition problem. The approaches can be further categorized into phonetic-based approaches and non-phonetic-based approaches. These approaches were tested on two separate databases, including text-dependent read speech and text-independent spontaneous speech. The results show that our system is competitive in comparison with other published, state-of-the-art non-native speaker recognition systems. Key metrics for automated non-native recognition systems include: 1) positive identification rates, 2) false alarm/identification rates, and 3) length of captured speech sample required to reach a decision.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

非母语人士自动识别系统

非本国人员的身份是为安全目的作出关键的现场决定的一项关键信息。在与母语人士的正常对话中，通过讲话内容和口音可以很容易地识别出非母语人士。这种需要熟悉语言细微差别的识别，对于非母语的审讯人员或情报分析员来说，或者通过机器语言翻译器进行交谈或聆听时，可能是不可能的。开发一个自动系统，在输入的音频流中识别说话者是母语还是非母语，以及他们的母语，包括方言，是这个项目的主要目标。这样的系统可以单独使用或与其他下游应用程序(如机器语言翻译系统)一起使用。在本文中，我们提出了四种方法来识别母语和非母语人士作为一个二值识别问题。这些方法可以进一步分为基于语音的方法和非基于语音的方法。这些方法在两个独立的数据库上进行了测试，包括文本依赖的阅读语音和文本独立的自发语音。结果表明，与其他已发表的、最先进的非母语识别系统相比，我们的系统具有竞争力。自动非原生识别系统的关键指标包括:1)正识别率，2)假警报/识别率，以及3)达到决策所需的捕获语音样本长度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2010 IEEE International Conference on Technologies for Homeland Security (HST)

自引率

0.00%

发文量

期刊最新文献

Wireless systems of threats monitoring Goal-based assessment for the cybersecurity of critical infrastructure Securing IPv6 network infrastructure: A new security model Risks of unrecognized commonalities in information technology supply chains Cryptkeeper: Improving security with encrypted RAM