首页 > 最新文献

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications最新文献

英文 中文
A voice-activated telephone exchange system and its field trial 一种声控电话交换系统及其现场试验
S. Yamamoto, K. Takeda, N. Inoue, S. Kuroiwa, M. Naitoh
Speaker-independent speech recognition systems that can accept telephone quality speech may open opportunities for introducing new user-friendly services over the public switched telephone network (PSTN). The authors are currently engaged in a project to introduce an automatic speech recognizer over PSTN. They have developed a voice-activated telephone exchange system by combining a continuous speech recognizer and a private branch exchange system (PBX), and conducted field trials. The system has been installed in the R&D laboratories for daily use since June 1993, in order to investigate its performance in a real environment and collect man-machine dialogues. More than 5,000 man-machine dialogues have been collected, and incorrect recognitions have been analyzed and categorized into three categories such as (1) incorrect detection of speech, (2) out-of-vocabulary responses, (3) incorrect recognition with inadequate hidden Markov models of speech and noise. The authors have improved system performance by mainly attacking the issues (1) and (3). They have just developed a new version of the system, using the improved scheme obtained by analyzing the collected speech data. In order to collect more man-machine dialogues, they are planning to carry out the second phase field trial in which the new system will be installed in branch offices.<>
能够接受电话质量语音的独立于扬声器的语音识别系统可能为通过公共交换电话网(PSTN)引入新的用户友好服务提供机会。作者目前正致力于通过PSTN引入自动语音识别器的项目。他们把连续语音识别器和专用分机系统(PBX)结合在一起,开发了声控电话交换系统,并进行了现场试验。自1993年6月以来,该系统已安装在研发实验室供日常使用,以便在真实环境中调查其性能并收集人机对话。收集了超过5000个人机对话,并对错误识别进行了分析,并将其分为三类,即(1)语音检测错误,(2)词汇外响应,(3)语音和噪声的隐马尔可夫模型不充分的错误识别。作者主要通过解决问题(1)和(3)来提高系统性能。他们刚刚开发了一个新版本的系统,使用通过分析收集的语音数据得到的改进方案。为了收集更多的人机对话,他们计划在分公司安装新系统,进行第2阶段现场试验。
{"title":"A voice-activated telephone exchange system and its field trial","authors":"S. Yamamoto, K. Takeda, N. Inoue, S. Kuroiwa, M. Naitoh","doi":"10.1109/IVTTA.1994.341551","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341551","url":null,"abstract":"Speaker-independent speech recognition systems that can accept telephone quality speech may open opportunities for introducing new user-friendly services over the public switched telephone network (PSTN). The authors are currently engaged in a project to introduce an automatic speech recognizer over PSTN. They have developed a voice-activated telephone exchange system by combining a continuous speech recognizer and a private branch exchange system (PBX), and conducted field trials. The system has been installed in the R&D laboratories for daily use since June 1993, in order to investigate its performance in a real environment and collect man-machine dialogues. More than 5,000 man-machine dialogues have been collected, and incorrect recognitions have been analyzed and categorized into three categories such as (1) incorrect detection of speech, (2) out-of-vocabulary responses, (3) incorrect recognition with inadequate hidden Markov models of speech and noise. The authors have improved system performance by mainly attacking the issues (1) and (3). They have just developed a new version of the system, using the improved scheme obtained by analyzing the collected speech data. In order to collect more man-machine dialogues, they are planning to carry out the second phase field trial in which the new system will be installed in branch offices.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122525430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
VoiceDialing-the first speech recognition based telephone service delivered to customer's home 语音拨号——第一个基于语音识别的电话服务送到客户家中
G.J. Vysotsky
The paper is an overview of NYNEX VoiceDialing service-a first introduction of speech recognition based technology to the mass market of residential and business customers. It is a network based service which allows telephone callers to make calls by simply saying the name of the person or place they wish to reach. VoiceDialing is compatible with both Touch Tone and rotary service and is designed to work on all existing telephone sets. The paper focuses on the network architecture, user interface, and speech recognition issues.<>
本文概述了NYNEX语音拨号服务-首次将基于语音识别的技术引入住宅和商业客户的大众市场。这是一种基于网络的服务,它允许打电话者通过简单地说出他们想要联系的人或地方的名字来打电话。VoiceDialing兼容Touch Tone和rotary服务,并且可以在所有现有的电话机上工作。本文重点研究了网络架构、用户界面和语音识别问题。
{"title":"VoiceDialing-the first speech recognition based telephone service delivered to customer's home","authors":"G.J. Vysotsky","doi":"10.1109/IVTTA.1994.341523","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341523","url":null,"abstract":"The paper is an overview of NYNEX VoiceDialing service-a first introduction of speech recognition based technology to the mass market of residential and business customers. It is a network based service which allows telephone callers to make calls by simply saying the name of the person or place they wish to reach. VoiceDialing is compatible with both Touch Tone and rotary service and is designed to work on all existing telephone sets. The paper focuses on the network architecture, user interface, and speech recognition issues.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115680696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A system for field performance assessment of a speech recognition based telephone service 基于语音识别的电话业务现场性能评估系统
L.A. Zreik
A data collection and analysis system has been designed, developed, and deployed as an integral part of VoiceDialing service, in order to measure the "true" recognition performance of the service in the field and other statistics, and to identify potential areas of improvement. The data collection system collects relevant training, recognition data and timing information, in addition to utterance recordings, on selected lines. Complementing the collection system, the data analysis system provides the capability to study, analyze and classify the collected data, to display and listen to collected utterances, generate statistics and enter analysis results in a database. A graphical user interface enhances the analysis system providing easy access to the database, and simplifying the analysis process. Methods used in measuring field recognition performance are presented, along with field results. A system, using field collected and analyzed data to test recognizers is proposed.<>
设计、开发和部署了一个数据收集和分析系统,作为语音拨号业务的一个组成部分,以衡量该业务在现场和其他统计中的“真实”识别性能,并确定潜在的改进领域。数据采集系统在选定的线路上,除了语音记录外,还收集相关的训练、识别数据和定时信息。作为采集系统的补充,数据分析系统提供了对采集到的数据进行研究、分析和分类、显示和收听采集到的话语、生成统计数据并将分析结果输入数据库的功能。图形用户界面增强了分析系统,方便访问数据库,简化了分析过程。介绍了用于测量现场识别性能的方法,并给出了现场结果。提出了一种利用现场采集和分析的数据对识别器进行测试的系统。
{"title":"A system for field performance assessment of a speech recognition based telephone service","authors":"L.A. Zreik","doi":"10.1109/IVTTA.1994.341522","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341522","url":null,"abstract":"A data collection and analysis system has been designed, developed, and deployed as an integral part of VoiceDialing service, in order to measure the \"true\" recognition performance of the service in the field and other statistics, and to identify potential areas of improvement. The data collection system collects relevant training, recognition data and timing information, in addition to utterance recordings, on selected lines. Complementing the collection system, the data analysis system provides the capability to study, analyze and classify the collected data, to display and listen to collected utterances, generate statistics and enter analysis results in a database. A graphical user interface enhances the analysis system providing easy access to the database, and simplifying the analysis process. Methods used in measuring field recognition performance are presented, along with field results. A system, using field collected and analyzed data to test recognizers is proposed.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115793852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Telephone speech data corpus and performances of speaker independent recognition system using the corpus 电话语音数据语料库及使用该语料库的说话人独立识别系统的性能
T. Isobe, K. Murakami
The authors first describe the speech data corpus they collected from 400 male and 400 female subjects over the phone. They then compare the performances of two types of triphone model based speaker independent recognition systems, in which they used the corpus for training models and testing. One system uses a normal continuous mixture density HMM, and the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of a normal one. As a result, the system with the tree-structure CDHMM performed as well as 3% less than the system using the normal CDHMM. This shows that tree-structure CDHMM are useful for telephone speech recognition.<>
作者首先描述了他们通过电话从400名男性和400名女性受试者中收集的语音数据语料库。然后,他们比较了两种基于三联音模型的独立说话人识别系统的性能,在这两种系统中,他们使用语料库来训练模型和测试。一种系统使用正态连续混合密度HMM,另一种系统使用具有2064个高斯分布的树结构的CDHMM,其所需的高斯计算量仅为正态混合密度HMM的三十分之一。结果表明,使用树状结构CDHMM的系统比使用普通CDHMM的系统性能低3%。这表明树状结构CDHMM在电话语音识别中是有用的。
{"title":"Telephone speech data corpus and performances of speaker independent recognition system using the corpus","authors":"T. Isobe, K. Murakami","doi":"10.1109/IVTTA.1994.341535","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341535","url":null,"abstract":"The authors first describe the speech data corpus they collected from 400 male and 400 female subjects over the phone. They then compare the performances of two types of triphone model based speaker independent recognition systems, in which they used the corpus for training models and testing. One system uses a normal continuous mixture density HMM, and the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of a normal one. As a result, the system with the tree-structure CDHMM performed as well as 3% less than the system using the normal CDHMM. This shows that tree-structure CDHMM are useful for telephone speech recognition.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124143892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Database query generation from spoken sentences 从口语句子生成数据库查询
H. Aust, M. Oerder
In the context of our spoken language inquiry system, we present the component which extracts the values needed for a database query from the textual representation of an utterance in the form of a word graph. A stochastic attributed grammar is used as a language model, to identify the relevant parts of the sentence, and to compute their meaning. High understanding rates, low computational costs and practically no restrictions of the usable language are important features of our system.<>
在我们的口语查询系统中,我们提出了一个组件,该组件以词图的形式从话语的文本表示中提取数据库查询所需的值。使用随机属性语法作为语言模型,识别句子的相关部分,并计算它们的意义。高理解率,低计算成本和几乎没有可用语言的限制是我们系统的重要特点。
{"title":"Database query generation from spoken sentences","authors":"H. Aust, M. Oerder","doi":"10.1109/IVTTA.1994.341525","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341525","url":null,"abstract":"In the context of our spoken language inquiry system, we present the component which extracts the values needed for a database query from the textual representation of an utterance in the form of a word graph. A stochastic attributed grammar is used as a language model, to identify the relevant parts of the sentence, and to compute their meaning. High understanding rates, low computational costs and practically no restrictions of the usable language are important features of our system.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121610269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Interactive speech and language systems for telecommunications applications at NYNEX 用于NYNEX电信应用的交互式语音和语言系统
H. Leung, J. Spitz
As information plays an ever-increasing role in our lives, users are demanding more in terms of their capability to retrieve and manipulate information. The paper is concerned with NYNEX's development of interactive speech and language systems for the provision of automated information services. While the long-term goal is to develop total system solutions to interact with users and assist them to search and retrieve information, progress is made in such a way that each component technology can be deployed by itself in various telecommunications applications. The authors discuss some of their findings, and draw from experience in technology development, lessons that have been learnt from service trials, and benefits that have been derived from others in the research community. The authors believe that advanced speech and language technologies can be quite acceptable to users, as long as a graceful and friendly human computer interface is also provided.<>
随着信息在我们的生活中扮演着越来越重要的角色,用户对检索和操作信息的能力要求越来越高。这篇论文是关于NYNEX为提供自动化信息服务而开发的交互式语音和语言系统。虽然长期目标是开发与用户交互并帮助他们搜索和检索信息的整体系统解决方案,但目前取得的进展是,每个组件技术都可以单独部署在各种电信应用程序中。作者讨论了他们的一些发现,并借鉴了技术开发方面的经验、从服务试验中学到的教训以及从研究界其他人那里获得的好处。作者认为,只要提供优雅友好的人机界面,先进的语音和语言技术是可以被用户所接受的。
{"title":"Interactive speech and language systems for telecommunications applications at NYNEX","authors":"H. Leung, J. Spitz","doi":"10.1109/IVTTA.1994.341546","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341546","url":null,"abstract":"As information plays an ever-increasing role in our lives, users are demanding more in terms of their capability to retrieve and manipulate information. The paper is concerned with NYNEX's development of interactive speech and language systems for the provision of automated information services. While the long-term goal is to develop total system solutions to interact with users and assist them to search and retrieve information, progress is made in such a way that each component technology can be deployed by itself in various telecommunications applications. The authors discuss some of their findings, and draw from experience in technology development, lessons that have been learnt from service trials, and benefits that have been derived from others in the research community. The authors believe that advanced speech and language technologies can be quite acceptable to users, as long as a graceful and friendly human computer interface is also provided.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131481533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhanced voice services in the telecommunication network using the Texas Instruments multiserve 在电信网络中使用德州仪器多服务增强语音服务
L. Netsch, R. Rajasekaran, B. Price
The paper presents efforts that Texas Instruments is pursuing to place enhanced voice services in the telecommunications network. The authors describe the capabilities of the Texas Instruments multiserve platform, which is a system designed to implement enhanced telecommunication services. The paper discusses an example of some of the technology challenges involved in design of the system. The authors provide results of performance evaluation of the platform on important voice service tasks.<>
本文介绍了德州仪器在电信网络中为增强语音服务所做的努力。作者描述了德州仪器多服务平台的功能,该平台是一个旨在实现增强电信服务的系统。本文讨论了系统设计中涉及的一些技术挑战的一个例子。作者提供了该平台在重要语音业务任务上的性能评估结果。
{"title":"Enhanced voice services in the telecommunication network using the Texas Instruments multiserve","authors":"L. Netsch, R. Rajasekaran, B. Price","doi":"10.1109/IVTTA.1994.341540","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341540","url":null,"abstract":"The paper presents efforts that Texas Instruments is pursuing to place enhanced voice services in the telecommunications network. The authors describe the capabilities of the Texas Instruments multiserve platform, which is a system designed to implement enhanced telecommunication services. The paper discusses an example of some of the technology challenges involved in design of the system. The authors provide results of performance evaluation of the platform on important voice service tasks.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130451814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An experimental comparison of different feature extraction and classification methods for telephone speech 电话语音不同特征提取与分类方法的实验比较
Tilo Schiirer
Robust speech recognition over telephone lines severely depends on the choice of the feature extraction and classification methods. In order to get the highest possible performance of the speech recognizer a number of commonly used feature extraction methods (MFCC, LPC, PLP, RASTA-PLP) and classification methods (MLP, LVQ, HMM) were tested on the same telephone speech data. All combinations of feature extraction and classification methods were computed and several parameters of both methods where changed in order to find a non-local maximum of recognition accuracy. The paper does not describe a comparison of classification but of feature extraction methods because it is clear that an HMM would outperform both LVQ and MLP. The big question is if the same feature extraction methods always lead to the best results, no matter which classifier is used!.<>
电话语音识别的鲁棒性很大程度上取决于特征提取和分类方法的选择。为了获得语音识别器的最高性能,在同一电话语音数据上测试了几种常用的特征提取方法(MFCC、LPC、PLP、RASTA-PLP)和分类方法(MLP、LVQ、HMM)。计算所有特征提取和分类方法的组合,并改变两种方法的几个参数,以找到识别精度的非局部最大值。本文没有描述分类的比较,而是描述特征提取方法的比较,因为很明显HMM优于LVQ和MLP。最大的问题是,无论使用哪种分类器,相同的特征提取方法是否总是能得到最好的结果!
{"title":"An experimental comparison of different feature extraction and classification methods for telephone speech","authors":"Tilo Schiirer","doi":"10.1109/IVTTA.1994.341537","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341537","url":null,"abstract":"Robust speech recognition over telephone lines severely depends on the choice of the feature extraction and classification methods. In order to get the highest possible performance of the speech recognizer a number of commonly used feature extraction methods (MFCC, LPC, PLP, RASTA-PLP) and classification methods (MLP, LVQ, HMM) were tested on the same telephone speech data. All combinations of feature extraction and classification methods were computed and several parameters of both methods where changed in order to find a non-local maximum of recognition accuracy. The paper does not describe a comparison of classification but of feature extraction methods because it is clear that an HMM would outperform both LVQ and MLP. The big question is if the same feature extraction methods always lead to the best results, no matter which classifier is used!.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131640213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Dialog design for automatic speech recognition of telephone numbers and account numbers 对话框设计,用于电话号码和帐号的自动语音识别
D.J. Brens, B.L. Wattenbager
The ultimate success of automatic speech recognition (ASR) depends not only performance characteristics of the technology but also on user behaviors. User behaviors are, in turn, affected by the prompts, reprompts, and user interface strategies that we use when designing a service. In one project, we have designed and tested modular elements of a user interface for automatic speech recognition (ASR). We describe a human factors study of connected digits transactions, including "telephone number" and "account number" transactions. In this study, candidate prompt/reprompt arrangements were tested using samples of the American consumer population. We consider some of the results from our study.<>
自动语音识别(ASR)的最终成功不仅取决于技术的性能特征,还取决于用户的行为。反过来,用户行为受到我们在设计服务时使用的提示、重新提示和用户界面策略的影响。在一个项目中,我们设计并测试了用于自动语音识别(ASR)的用户界面的模块化元素。我们描述了连接数字交易的人为因素研究,包括“电话号码”和“账号”交易。在这项研究中,候选提示/重新提示安排使用美国消费者群体的样本进行测试。我们考虑了我们研究的一些结果
{"title":"Dialog design for automatic speech recognition of telephone numbers and account numbers","authors":"D.J. Brens, B.L. Wattenbager","doi":"10.1109/IVTTA.1994.341531","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341531","url":null,"abstract":"The ultimate success of automatic speech recognition (ASR) depends not only performance characteristics of the technology but also on user behaviors. User behaviors are, in turn, affected by the prompts, reprompts, and user interface strategies that we use when designing a service. In one project, we have designed and tested modular elements of a user interface for automatic speech recognition (ASR). We describe a human factors study of connected digits transactions, including \"telephone number\" and \"account number\" transactions. In this study, candidate prompt/reprompt arrangements were tested using samples of the American consumer population. We consider some of the results from our study.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130996359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The role of voice processing in telecommunications 语音处理在电信中的作用
L. Rabiner
During the decade of the 1990s, the fields of communications, computing, and networking are coming together in the form of personal information/communication terminals, and in the associated services (so-called personal communications services, PCS). Several technologies will play major roles in this communications revolution, but one of the key ones will be voice processing. The authors review several voice processing technologies, discuss current capabilities and the associated applications, and try to forecast where they see progress being achieved in the next decade and what applications will become commonplace as a result of the increased capabilities. They show how progress in voice processing is accompanied and stimulated by progress in microelectronics (memory and processing power of single chip architectures), and how, by the 21st century, telecommunications will have made major advances as a result of the use of voice processing.<>
在20世纪90年代的十年中,通信、计算和网络领域以个人信息/通信终端的形式和相关服务(所谓的个人通信服务,PCS)的形式结合在一起。有几种技术将在这场通信革命中发挥重要作用,但其中一个关键技术将是语音处理。作者回顾了几种语音处理技术,讨论了当前的能力和相关的应用,并试图预测他们在未来十年中所取得的进展,以及随着能力的增加,哪些应用将变得司空见惯。它们展示了语音处理的进步是如何伴随着微电子技术(单芯片架构的存储和处理能力)的进步而受到刺激的,以及到21世纪,由于语音处理的使用,电信将如何取得重大进展
{"title":"The role of voice processing in telecommunications","authors":"L. Rabiner","doi":"10.1109/IVTTA.1994.341554","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341554","url":null,"abstract":"During the decade of the 1990s, the fields of communications, computing, and networking are coming together in the form of personal information/communication terminals, and in the associated services (so-called personal communications services, PCS). Several technologies will play major roles in this communications revolution, but one of the key ones will be voice processing. The authors review several voice processing technologies, discuss current capabilities and the associated applications, and try to forecast where they see progress being achieved in the next decade and what applications will become commonplace as a result of the increased capabilities. They show how progress in voice processing is accompanied and stimulated by progress in microelectronics (memory and processing power of single chip architectures), and how, by the 21st century, telecommunications will have made major advances as a result of the use of voice processing.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"213 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132334819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1