Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications最新文献

英文中文

A voice-activated telephone exchange system and its field trial 一种声控电话交换系统及其现场试验

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341551

S. Yamamoto, K. Takeda, N. Inoue, S. Kuroiwa, M. Naitoh

Speaker-independent speech recognition systems that can accept telephone quality speech may open opportunities for introducing new user-friendly services over the public switched telephone network (PSTN). The authors are currently engaged in a project to introduce an automatic speech recognizer over PSTN. They have developed a voice-activated telephone exchange system by combining a continuous speech recognizer and a private branch exchange system (PBX), and conducted field trials. The system has been installed in the R&D laboratories for daily use since June 1993, in order to investigate its performance in a real environment and collect man-machine dialogues. More than 5,000 man-machine dialogues have been collected, and incorrect recognitions have been analyzed and categorized into three categories such as (1) incorrect detection of speech, (2) out-of-vocabulary responses, (3) incorrect recognition with inadequate hidden Markov models of speech and noise. The authors have improved system performance by mainly attacking the issues (1) and (3). They have just developed a new version of the system, using the improved scheme obtained by analyzing the collected speech data. In order to collect more man-machine dialogues, they are planning to carry out the second phase field trial in which the new system will be installed in branch offices.<>

能够接受电话质量语音的独立于扬声器的语音识别系统可能为通过公共交换电话网(PSTN)引入新的用户友好服务提供机会。作者目前正致力于通过PSTN引入自动语音识别器的项目。他们把连续语音识别器和专用分机系统(PBX)结合在一起，开发了声控电话交换系统，并进行了现场试验。自1993年6月以来，该系统已安装在研发实验室供日常使用，以便在真实环境中调查其性能并收集人机对话。收集了超过5000个人机对话，并对错误识别进行了分析，并将其分为三类，即(1)语音检测错误，(2)词汇外响应，(3)语音和噪声的隐马尔可夫模型不充分的错误识别。作者主要通过解决问题(1)和(3)来提高系统性能。他们刚刚开发了一个新版本的系统，使用通过分析收集的语音数据得到的改进方案。为了收集更多的人机对话，他们计划在分公司安装新系统，进行第2阶段现场试验。

{"title":"A voice-activated telephone exchange system and its field trial","authors":"S. Yamamoto, K. Takeda, N. Inoue, S. Kuroiwa, M. Naitoh","doi":"10.1109/IVTTA.1994.341551","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341551","url":null,"abstract":"Speaker-independent speech recognition systems that can accept telephone quality speech may open opportunities for introducing new user-friendly services over the public switched telephone network (PSTN). The authors are currently engaged in a project to introduce an automatic speech recognizer over PSTN. They have developed a voice-activated telephone exchange system by combining a continuous speech recognizer and a private branch exchange system (PBX), and conducted field trials. The system has been installed in the R&D laboratories for daily use since June 1993, in order to investigate its performance in a real environment and collect man-machine dialogues. More than 5,000 man-machine dialogues have been collected, and incorrect recognitions have been analyzed and categorized into three categories such as (1) incorrect detection of speech, (2) out-of-vocabulary responses, (3) incorrect recognition with inadequate hidden Markov models of speech and noise. The authors have improved system performance by mainly attacking the issues (1) and (3). They have just developed a new version of the system, using the improved scheme obtained by analyzing the collected speech data. In order to collect more man-machine dialogues, they are planning to carry out the second phase field trial in which the new system will be installed in branch offices.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122525430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

VoiceDialing-the first speech recognition based telephone service delivered to customer's home 语音拨号——第一个基于语音识别的电话服务送到客户家中

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341523

G.J. Vysotsky

The paper is an overview of NYNEX VoiceDialing service-a first introduction of speech recognition based technology to the mass market of residential and business customers. It is a network based service which allows telephone callers to make calls by simply saying the name of the person or place they wish to reach. VoiceDialing is compatible with both Touch Tone and rotary service and is designed to work on all existing telephone sets. The paper focuses on the network architecture, user interface, and speech recognition issues.<>

本文概述了NYNEX语音拨号服务-首次将基于语音识别的技术引入住宅和商业客户的大众市场。这是一种基于网络的服务，它允许打电话者通过简单地说出他们想要联系的人或地方的名字来打电话。VoiceDialing兼容Touch Tone和rotary服务，并且可以在所有现有的电话机上工作。本文重点研究了网络架构、用户界面和语音识别问题。

引用次数: 3

A system for field performance assessment of a speech recognition based telephone service 基于语音识别的电话业务现场性能评估系统

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341522

L.A. Zreik

A data collection and analysis system has been designed, developed, and deployed as an integral part of VoiceDialing service, in order to measure the "true" recognition performance of the service in the field and other statistics, and to identify potential areas of improvement. The data collection system collects relevant training, recognition data and timing information, in addition to utterance recordings, on selected lines. Complementing the collection system, the data analysis system provides the capability to study, analyze and classify the collected data, to display and listen to collected utterances, generate statistics and enter analysis results in a database. A graphical user interface enhances the analysis system providing easy access to the database, and simplifying the analysis process. Methods used in measuring field recognition performance are presented, along with field results. A system, using field collected and analyzed data to test recognizers is proposed.<>

设计、开发和部署了一个数据收集和分析系统，作为语音拨号业务的一个组成部分，以衡量该业务在现场和其他统计中的“真实”识别性能，并确定潜在的改进领域。数据采集系统在选定的线路上，除了语音记录外，还收集相关的训练、识别数据和定时信息。作为采集系统的补充，数据分析系统提供了对采集到的数据进行研究、分析和分类、显示和收听采集到的话语、生成统计数据并将分析结果输入数据库的功能。图形用户界面增强了分析系统，方便访问数据库，简化了分析过程。介绍了用于测量现场识别性能的方法，并给出了现场结果。提出了一种利用现场采集和分析的数据对识别器进行测试的系统。

引用次数: 4

Telephone speech data corpus and performances of speaker independent recognition system using the corpus 电话语音数据语料库及使用该语料库的说话人独立识别系统的性能

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341535

T. Isobe, K. Murakami

The authors first describe the speech data corpus they collected from 400 male and 400 female subjects over the phone. They then compare the performances of two types of triphone model based speaker independent recognition systems, in which they used the corpus for training models and testing. One system uses a normal continuous mixture density HMM, and the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of a normal one. As a result, the system with the tree-structure CDHMM performed as well as 3% less than the system using the normal CDHMM. This shows that tree-structure CDHMM are useful for telephone speech recognition.<>

作者首先描述了他们通过电话从400名男性和400名女性受试者中收集的语音数据语料库。然后，他们比较了两种基于三联音模型的独立说话人识别系统的性能，在这两种系统中，他们使用语料库来训练模型和测试。一种系统使用正态连续混合密度HMM，另一种系统使用具有2064个高斯分布的树结构的CDHMM，其所需的高斯计算量仅为正态混合密度HMM的三十分之一。结果表明，使用树状结构CDHMM的系统比使用普通CDHMM的系统性能低3%。这表明树状结构CDHMM在电话语音识别中是有用的。

引用次数: 3

Database query generation from spoken sentences 从口语句子生成数据库查询

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341525

H. Aust, M. Oerder

In the context of our spoken language inquiry system, we present the component which extracts the values needed for a database query from the textual representation of an utterance in the form of a word graph. A stochastic attributed grammar is used as a language model, to identify the relevant parts of the sentence, and to compute their meaning. High understanding rates, low computational costs and practically no restrictions of the usable language are important features of our system.<>

在我们的口语查询系统中，我们提出了一个组件，该组件以词图的形式从话语的文本表示中提取数据库查询所需的值。使用随机属性语法作为语言模型，识别句子的相关部分，并计算它们的意义。高理解率，低计算成本和几乎没有可用语言的限制是我们系统的重要特点。

引用次数: 10

Interactive speech and language systems for telecommunications applications at NYNEX 用于NYNEX电信应用的交互式语音和语言系统

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341546

H. Leung, J. Spitz

As information plays an ever-increasing role in our lives, users are demanding more in terms of their capability to retrieve and manipulate information. The paper is concerned with NYNEX's development of interactive speech and language systems for the provision of automated information services. While the long-term goal is to develop total system solutions to interact with users and assist them to search and retrieve information, progress is made in such a way that each component technology can be deployed by itself in various telecommunications applications. The authors discuss some of their findings, and draw from experience in technology development, lessons that have been learnt from service trials, and benefits that have been derived from others in the research community. The authors believe that advanced speech and language technologies can be quite acceptable to users, as long as a graceful and friendly human computer interface is also provided.<>

随着信息在我们的生活中扮演着越来越重要的角色，用户对检索和操作信息的能力要求越来越高。这篇论文是关于NYNEX为提供自动化信息服务而开发的交互式语音和语言系统。虽然长期目标是开发与用户交互并帮助他们搜索和检索信息的整体系统解决方案，但目前取得的进展是，每个组件技术都可以单独部署在各种电信应用程序中。作者讨论了他们的一些发现，并借鉴了技术开发方面的经验、从服务试验中学到的教训以及从研究界其他人那里获得的好处。作者认为，只要提供优雅友好的人机界面，先进的语音和语言技术是可以被用户所接受的。

引用次数: 2

Enhanced voice services in the telecommunication network using the Texas Instruments multiserve 在电信网络中使用德州仪器多服务增强语音服务

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341540

L. Netsch, R. Rajasekaran, B. Price

The paper presents efforts that Texas Instruments is pursuing to place enhanced voice services in the telecommunications network. The authors describe the capabilities of the Texas Instruments multiserve platform, which is a system designed to implement enhanced telecommunication services. The paper discusses an example of some of the technology challenges involved in design of the system. The authors provide results of performance evaluation of the platform on important voice service tasks.<>

本文介绍了德州仪器在电信网络中为增强语音服务所做的努力。作者描述了德州仪器多服务平台的功能，该平台是一个旨在实现增强电信服务的系统。本文讨论了系统设计中涉及的一些技术挑战的一个例子。作者提供了该平台在重要语音业务任务上的性能评估结果。

引用次数: 2

An experimental comparison of different feature extraction and classification methods for telephone speech 电话语音不同特征提取与分类方法的实验比较

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341537

Tilo Schiirer

Robust speech recognition over telephone lines severely depends on the choice of the feature extraction and classification methods. In order to get the highest possible performance of the speech recognizer a number of commonly used feature extraction methods (MFCC, LPC, PLP, RASTA-PLP) and classification methods (MLP, LVQ, HMM) were tested on the same telephone speech data. All combinations of feature extraction and classification methods were computed and several parameters of both methods where changed in order to find a non-local maximum of recognition accuracy. The paper does not describe a comparison of classification but of feature extraction methods because it is clear that an HMM would outperform both LVQ and MLP. The big question is if the same feature extraction methods always lead to the best results, no matter which classifier is used!.<>

电话语音识别的鲁棒性很大程度上取决于特征提取和分类方法的选择。为了获得语音识别器的最高性能，在同一电话语音数据上测试了几种常用的特征提取方法(MFCC、LPC、PLP、RASTA-PLP)和分类方法(MLP、LVQ、HMM)。计算所有特征提取和分类方法的组合，并改变两种方法的几个参数，以找到识别精度的非局部最大值。本文没有描述分类的比较，而是描述特征提取方法的比较，因为很明显HMM优于LVQ和MLP。最大的问题是，无论使用哪种分类器，相同的特征提取方法是否总是能得到最好的结果!

引用次数: 8

Dialog design for automatic speech recognition of telephone numbers and account numbers 对话框设计，用于电话号码和帐号的自动语音识别

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341531

D.J. Brens, B.L. Wattenbager

The ultimate success of automatic speech recognition (ASR) depends not only performance characteristics of the technology but also on user behaviors. User behaviors are, in turn, affected by the prompts, reprompts, and user interface strategies that we use when designing a service. In one project, we have designed and tested modular elements of a user interface for automatic speech recognition (ASR). We describe a human factors study of connected digits transactions, including "telephone number" and "account number" transactions. In this study, candidate prompt/reprompt arrangements were tested using samples of the American consumer population. We consider some of the results from our study.<>

自动语音识别(ASR)的最终成功不仅取决于技术的性能特征，还取决于用户的行为。反过来，用户行为受到我们在设计服务时使用的提示、重新提示和用户界面策略的影响。在一个项目中，我们设计并测试了用于自动语音识别(ASR)的用户界面的模块化元素。我们描述了连接数字交易的人为因素研究，包括“电话号码”和“账号”交易。在这项研究中，候选提示/重新提示安排使用美国消费者群体的样本进行测试。我们考虑了我们研究的一些结果

引用次数: 2

The role of voice processing in telecommunications 语音处理在电信中的作用

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341554

L. Rabiner

During the decade of the 1990s, the fields of communications, computing, and networking are coming together in the form of personal information/communication terminals, and in the associated services (so-called personal communications services, PCS). Several technologies will play major roles in this communications revolution, but one of the key ones will be voice processing. The authors review several voice processing technologies, discuss current capabilities and the associated applications, and try to forecast where they see progress being achieved in the next decade and what applications will become commonplace as a result of the increased capabilities. They show how progress in voice processing is accompanied and stimulated by progress in microelectronics (memory and processing power of single chip architectures), and how, by the 21st century, telecommunications will have made major advances as a result of the use of voice processing.<>

在20世纪90年代的十年中，通信、计算和网络领域以个人信息/通信终端的形式和相关服务(所谓的个人通信服务，PCS)的形式结合在一起。有几种技术将在这场通信革命中发挥重要作用，但其中一个关键技术将是语音处理。作者回顾了几种语音处理技术，讨论了当前的能力和相关的应用，并试图预测他们在未来十年中所取得的进展，以及随着能力的增加，哪些应用将变得司空见惯。它们展示了语音处理的进步是如何伴随着微电子技术(单芯片架构的存储和处理能力)的进步而受到刺激的，以及到21世纪，由于语音处理的使用，电信将如何取得重大进展

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀